ggKbase home page

ar4r2_scaffold_3580_10

Organism: ALUMROCK_MS4_Beggiotoa_37_524_curated

near complete RP 52 / 55 MC: 1 BSCG 51 / 51 MC: 2 ASCG 14 / 38 MC: 1
Location: 8738..12064

Top 3 Functional Annotations

Value Algorithm Source
putative glycosaminoglycan synthase (EC:2.4.1.-) similarity KEGG
DB: KEGG
  • Identity: 44.8
  • Coverage: 451.0
  • Bit_score: 380
  • Evalue 8.30e-103
  • rbh
Putative glycosaminoglycan synthase n=1 Tax=Ilumatobacter coccineus YM16-304 RepID=M5AQ48_9ACTN similarity UNIREF
DB: UNIREF100
  • Identity: 44.8
  • Coverage: 451.0
  • Bit_score: 380
  • Evalue 2.90e-102
  • rbh
Uncharacterized protein {ECO:0000313|EMBL:KHD11410.1}; Flags: Fragment;; TaxID=1003181 species="Bacteria; Proteobacteria; Gammaproteobacteria; Thiotrichales; Thiotrichaceae; Thiomargarita.;" source="Candidatus Thiomargarita nelsonii.;" similarity UNIPROT
DB: UniProtKB
  • Identity: 60.1
  • Coverage: 414.0
  • Bit_score: 522
  • Evalue 1.10e-144

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Candidatus Thiomargarita nelsonii → Thiomargarita → Thiotrichales → Gammaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 3327
ATGACTACTCACATGTATAGAATACATGAGATAAAACTCCATACTCTACAGAATGGCTATTCTCATGTTTTAAATGCACCGATGCTGGGGCAAGTTGTAGAAGAAAGCTTAGTTTTACGTGGAAAAATAAAAAATGTTCCATTAAAACAAATAAAAGTATTGCATCTCAAGAAAGGTAGCAATCCTAAAGGTCATTTTAGTAATAATAAAAGTGTACTAGAATACCAAATATCAGTGGCGACTAACGAACTATTAGATTTTAATAATCCATTGCAATTAGCAATTGTTTTGCAAGATAATCAATTATGTATTTTGGCAGAAATTGAAATTGCAGTTTTACGACGAGAACAAGAATTAGTCGAATTAGCTACTAAGATATTACAAGCTAATGATCTTGCAAGCTGCAAAGCAGGATGTGAAAGAATCTTAGCTGCCTACCCTAATACTCCGAGCGCATCTGTTTTATTAAATAATTTATCTTTTTCTTCGACCCCAATAACACTAGAGAATCAAAATCTCTTAATTACTGTTCTTAGCCATGAAGGATTTGGTTGTGAATATATCGCAGAATTAGCGCAGAAATATCAATTGTGGGAACTTGCTTTAACCTATTGGAATGAAATTATAAGCCGAACAAAGCTTGTTGATATTCAATGGTTGGTCGCTAAAGGAGATGTGTTAATTAGAGCTAAGAGGTATACAGAAGCAGAGCAATTTTTCAAGCAAATGAGAGAGCAAGTACCCCAACTCTCTTTAGGATATGTAGGTCTGGCTTACTTGGCACAACAACAAGAAAATTGGGAATTAGCGTTGATTCATTGGGACGAGTGTATTCAACGTTTTTCTCACGAAAAAGAGTTAATTCCCAATTGGTTAACTAGAAAGATAGAGATTTATACAGCATCTAAAGAATTTGAACAAGCTACAACACTTTTACAGTCACTCATTCAACAATATCCAGAATTAGCTTTAATTCCTTATCATCGATTAATTGAATTAGCACTTAAACAACGAAATTGGGATAAAGCAATAGAATACTGTAACTTGGGGATAGAACAATTTCCCACTACTGAGTCTTTGTATCAGCAGCTAATAAATGTATTGCATGAACAAAATAAATTAGATGAGGCAATCCAATATTATCAAAATCAAATAGTTGTTAAACCATTTCAGGTTAGTTTACTAATGAGATTGGCCTATGTGTTACAGGAAAAAAGAGAATTAGAAGCTGCTTTAAGTACATATCAACAAATTAAACAAATGCATGCTTTACCTCATTGGTTGCACAGTAAATTTATTCAGGTATTAGTGGAACTCGGTAAACTACAGGAGGCAGAAACTGAATTACAGCAGCAATTTGCAACTCCTGAGCAATATATTCAGCTATTAGAAGGGTTGGCTGAGTTGGCAATGTATGCTGCGCAGTTTGATTTAGCTTTAGAACGTTGGCAGGAATTAATTATTTTACAACCAAACTCTATGGCTGCCTATTTAGGTAAAGCTAATGCCTTGCTTTCTCTCAAACAGTATAAGGCTGCAAGAACTGTTTTTGAACAACTTTTAGTTGATTATCCAAATAGTAACATTGGTCCAGACAATCTGGCAAAGATAGCAGTACAACAGGGAAATTTTGAATCAGCACTATATTACTATAATATTGCTATACTCAGAAGTCCAGATAATACTTGGTTACAAATTAACAAAGCTGATATTTTAGTGAAGTTATCTCGTTTAGAAGAGGCAAAACCTATTTATGAGCGATTTGTTGATAAATCTCAAGGGTTAGCAGGTTTAGCTAAAATAGCCCGATTATCTGGAAAATTTGAGGAATGTTTAGAGTTTTGTCAGCAATTGATGGAACATTTTCCAAACTTGGCAATAGGGTATCAAGAGGCGGAACAAGCTTATATTGAATTGGGGGAATTTGCAAAAGCCAAACAAGCTTTTCTTGCTTACAGCAGTCATCAAGCTAATCAAGCAGCAATTCCTAAAAGAACTTCTTCTATCATGTTGCCAGATGGTTTAATCTTGCCAGAAATAAAAGGTAAGAATAATGATTACACTTTTGTAGAAGAAAAGCTGGAAGCGTTTATTAAAAGTGGACGAGCTTATAGCTTGCCTGTTTCTATCATCATTCCTGTTTATAACAGAAAAGTCTTATTAGCAAAAACGTTAGCCGCACTTACTCATCAGACTTATCCAAAAGAACTGATTGAAGTCGTTGTTGCAGATGATGGTAGTTCTGATGGTGTGGAAGAAGTTATTGAAAAATATAGAAGGTTTTTGAACTTACAATATGTTTATCAACCTGATGAAGGTTTTCGTCTTTCAGCAGTCAGAAATTTGGGAATGAAAGTAGCAAAACATGATTATTTTATTTTCTTAGATTGCGATGTTTTACCTGTTCCTCAATTAGTAGAAGCGTATATGAAGTATTTTCATGTTTCTGATCGTATTGCTATGATGGGTACTTTACGTTTTGTATGTTCAGATACGATTTCTGATGATGATATTCTCAAAGATGCTACCGTATTTTTAGAACTTCCTGATATAAAAAGCAGTAATGATGTTTCTACTAGAACTTTAGTGTTGGGTACAACTATTGATTGGCGTTTACCACTGTTTGTGCGAACTAATAATGGAAAAGAGGAGAGATGGCCTTTTAGAGGTTTTGTTGGGGCCAATATGGTACACACACGTAAAGCAATAGATGAAATAGGTGGATATGATGAAGAATTTCAAGCATGGGGACATGAAGACGTTGAAATGGGCTATCGTTTATATAATGCTGGTTACTATTTTATTCCTGTCATGGAGGCAATAGTCCTACATCAAGAACCTGAAAACAATAAAAATGATTCAGATAGAATGGGTGGAAAAACTCAAACTGATTTATTATTTGAGGAAAAGTGTCCCGTTCTCCTATATCGTAAATATCAAAAAGGTAGAATCTATAAAATACCTAAAGTATCTATTTACATTCCCGCGTACAATGTAGAAAAGTATATCAAGGCAGCAATAGACAGTGCACTCAATCAAACTTATACAGATTTGGAAGTATGTATTTGTAATGACGGTTCCACTGATAATACCTTAAAGATATTAGAAGAAAATTACACTAATAACCCAAGAGTACGTTGGTTATCACAACCCAATGGTGGAACAGCAAAAGCCTCTAACACGGCTGTTAGAATGTGCCGTGGTATGTATATTGGACAACTTGATGCTGATGATATACTAAAACCTATGGCTGTAGAATTAGCAGTCAATTATTTAGACAATCATGATGTAGGTTGTGTTTATAGTGATCTTGAAATGGTAGATGCT
PROTEIN sequence
Length: 1109
MTTHMYRIHEIKLHTLQNGYSHVLNAPMLGQVVEESLVLRGKIKNVPLKQIKVLHLKKGSNPKGHFSNNKSVLEYQISVATNELLDFNNPLQLAIVLQDNQLCILAEIEIAVLRREQELVELATKILQANDLASCKAGCERILAAYPNTPSASVLLNNLSFSSTPITLENQNLLITVLSHEGFGCEYIAELAQKYQLWELALTYWNEIISRTKLVDIQWLVAKGDVLIRAKRYTEAEQFFKQMREQVPQLSLGYVGLAYLAQQQENWELALIHWDECIQRFSHEKELIPNWLTRKIEIYTASKEFEQATTLLQSLIQQYPELALIPYHRLIELALKQRNWDKAIEYCNLGIEQFPTTESLYQQLINVLHEQNKLDEAIQYYQNQIVVKPFQVSLLMRLAYVLQEKRELEAALSTYQQIKQMHALPHWLHSKFIQVLVELGKLQEAETELQQQFATPEQYIQLLEGLAELAMYAAQFDLALERWQELIILQPNSMAAYLGKANALLSLKQYKAARTVFEQLLVDYPNSNIGPDNLAKIAVQQGNFESALYYYNIAILRSPDNTWLQINKADILVKLSRLEEAKPIYERFVDKSQGLAGLAKIARLSGKFEECLEFCQQLMEHFPNLAIGYQEAEQAYIELGEFAKAKQAFLAYSSHQANQAAIPKRTSSIMLPDGLILPEIKGKNNDYTFVEEKLEAFIKSGRAYSLPVSIIIPVYNRKVLLAKTLAALTHQTYPKELIEVVVADDGSSDGVEEVIEKYRRFLNLQYVYQPDEGFRLSAVRNLGMKVAKHDYFIFLDCDVLPVPQLVEAYMKYFHVSDRIAMMGTLRFVCSDTISDDDILKDATVFLELPDIKSSNDVSTRTLVLGTTIDWRLPLFVRTNNGKEERWPFRGFVGANMVHTRKAIDEIGGYDEEFQAWGHEDVEMGYRLYNAGYYFIPVMEAIVLHQEPENNKNDSDRMGGKTQTDLLFEEKCPVLLYRKYQKGRIYKIPKVSIYIPAYNVEKYIKAAIDSALNQTYTDLEVCICNDGSTDNTLKILEENYTNNPRVRWLSQPNGGTAKASNTAVRMCRGMYIGQLDADDILKPMAVELAVNYLDNHDVGCVYSDLEMVDA