ggKbase home page

SCNpilot_expt_300_bf_scaffold_383_curated_28

Organism: scnpilot_dereplicated_Nitrosospira_1

near complete RP 52 / 55 BSCG 51 / 51 ASCG 12 / 38
Location: comp(24762..25781)

Top 3 Functional Annotations

Value Algorithm Source
hypothetical protein n=1 Tax=Thiothrix disciformis RepID=UPI000362A18C similarity UNIREF
DB: UNIREF100
  • Identity: 31.1
  • Coverage: 312.0
  • Bit_score: 145
  • Evalue 8.30e-32
Chondroitin 4-O-sulfotransferase {ECO:0000313|EMBL:AJY51404.1}; TaxID=1504981 species="Bacteria; Proteobacteria; Gammaproteobacteria; Oceanospirillales; Halomonadaceae; Halomonas.;" source="Halomonas sp. KO116.;" similarity UNIPROT
DB: UniProtKB
  • Identity: 30.3
  • Coverage: 294.0
  • Bit_score: 140
  • Evalue 4.90e-30
hypothetical protein similarity KEGG
DB: KEGG
  • Identity: 35.6
  • Coverage: 216.0
  • Bit_score: 124
  • Evalue 8.10e-26

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Halomonas sp. KO116 → Halomonas → Oceanospirillales → Gammaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 1020
ATGAAAAAAAATCATATTATTTTTGCTCATATACCCAAAACTGCTGGTACAAGCTTTAGGTTATCGCTAGAGCACATTGTTCCTAAAGATAAAATAGCACTGGACTATGGGCGCACCTCGAAAGAAACATCGAATTTAATTCGCAATACTGTTTATCAGAAAGCTGAGCAAAAGTTTCTACAACATATGGATTCAGTGAGGGTTTTATTTGGACACTTTCAAGATAGCTGCATCAGGCTTCGCGGCTATCGCAAGTTGTTGCCGAAAGCTATGCTATGTACGGTCCTGCGTGATCCAATCACGCGGATTATTTCAGAGTATTATCATTTTCGGCATTATTTCGGTTATCAAGAGTCGTTCGATACATTCTTCCATCAGGTACCATTTATCAACAGGCAGTCAAAATGCATTGATTCCATTCCGTTATGTGCTTTCGATTTTGTAGGGATTACCGAACATTATTCCGAATCCCTAAAATTGTTTACAAAAATTTCCAGCTTAGAGCTTATTGAAAACTTTGTGAACCTTCGTGCTAACACCAATTTGGATGCATTGATATCTCAAGCGGATCTCGATAATTTTGCCCGATTGAATATTGAGGATATCGCTATATACAATGAAGCTCTCTACAGATTTCATTACCAGTCAGGCCTGAGGTTCCCATCCTCCGCGCACAGGCGCTTTGAGGGAAGTATCGGGCCAGGCAACAACGGTATGGTTGCAGGCTGGGCGGTTGACCATTCTTCATATCGCCCTGTCGAGATAAAAGTGTTTCGTGGCAGGCGTATGATCTTTAGTGAGGTTGCATGTCTGTACAGACCAGACGTGAGAAAGGCTGGTTTGCACATATCAGGTTATTGCGGGTTCAGCATTCCCTTCGAAAAATTGCAGGAAAATAATAAAAATAGCTTGCTAGTTGTTCACGTAGAGGGATGCGGAGTCCTTGGAAAAATCCCGGCAGTCGTCGAGCCCGGCTTGTTACACAAGCAAGCTCGACGATCAGCAACTCAAACACAATAA
PROTEIN sequence
Length: 340
MKKNHIIFAHIPKTAGTSFRLSLEHIVPKDKIALDYGRTSKETSNLIRNTVYQKAEQKFLQHMDSVRVLFGHFQDSCIRLRGYRKLLPKAMLCTVLRDPITRIISEYYHFRHYFGYQESFDTFFHQVPFINRQSKCIDSIPLCAFDFVGITEHYSESLKLFTKISSLELIENFVNLRANTNLDALISQADLDNFARLNIEDIAIYNEALYRFHYQSGLRFPSSAHRRFEGSIGPGNNGMVAGWAVDHSSYRPVEIKVFRGRRMIFSEVACLYRPDVRKAGLHISGYCGFSIPFEKLQENNKNSLLVVHVEGCGVLGKIPAVVEPGLLHKQARRSATQTQ*