
ar4r2_scaffold_1963_10

Organism: ALUMROCK_MS4_Thiomonas_64_15

near complete
  • RP: 41 / 55, MC: 1
  • BSCG: 43 / 51, MC: 2
  • ASCG: 10 / 38, MC: 2
Location: 8763..11813
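
The location above can be converted to a feature length with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive on both ends, which is an assumption about this page rather than something it states; `span_length` is a hypothetical helper name used only for illustration.

```python
def span_length(location: str) -> int:
    """Length in bases of a 'start..end' location.

    Assumes 1-based coordinates with both ends included.
    """
    start, end = (int(part) for part in location.split(".."))
    return end - start + 1


# Location string copied from this record.
print(span_length("8763..11813"))  # 3051
```

Under that assumption the span is 3051 bases, which agrees with the DNA sequence length reported for this feature.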

Top 3 Functional Annotations

Value: LmbE family protein n=1 Tax=Nitrosomonas sp. (strain Is79A3) RepID=F8GHU7_NITSI
Algorithm: similarity
Source: UNIREF
DB: UNIREF100
  • Identity: 38.1
  • Coverage: 423.0
  • Bit_score: 259
  • Evalue: 1.50e-65

Value: Putative glycsyltransferase {ECO:0000313|EMBL:CDW92743.1}; TaxID=554131 species="Bacteria; Proteobacteria; Betaproteobacteria; Burkholderiales; Thiomonas.;" source="Thiomonas sp. CB2.;"
Algorithm: similarity
Source: UNIPROT
DB: UniProtKB
  • Identity: 67.9
  • Coverage: 999.99
  • Bit_score: 1410
  • Evalue: 0.0

Value: LmbE family protein
Algorithm: similarity
Source: KEGG
DB: KEGG
  • Identity: 38.1
  • Coverage: 423.0
  • Bit_score: 259
  • Evalue: 4.40e-66

Taxonomy

Thiomonas sp. CB2 → Thiomonas → Burkholderiales → Betaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 3051
ATGCCTCCAGAAGTCAATTTACACCCACATCTTCACGACTCTCAGACTGCTAGCGGCTCTGATACAAATGGCGCTAGCAACAAGATTCCACTTGTCAGCGTCATCGTCCGCAGCATTGGGCGCGCTTCGTTGCAAGAGGCGTTGGACTCGGTCGCACGCCAGAGTCACCCGGCGATTGAAGTCATCGTCGTCAATGCCAAAGGCCAAGGGCACCCGAAAACCCCGGAAAGCGTTGGAAGCTTGCCTCTGCGCTTTATCGACCGAGGACATCCGCTGCCACGTAGTGCCGCGGCGAACGCGGGGATCGACGCGGCGCAGGGCGCGTACCTCATCTTCCTAGACGACGACGATCTATTCGAAGCCACGCATATCGCAGCGCTAATCGCGACGCTTGCAGAACACCCCGAGGCCAAGCTTGCCTACGCTGATGTTCGCGTCGTGTCTGCAGACAACGCGACCATCGGCTACTTTAATCAGGACTTTGTCTCTGCACGCCTTTGGAGCGGCAATTTTCTACCGATTCATGCTGTTCTTTTCCATCGCAGCATAGTCCAGCGCGGCTGCCGTTTCGATGAAACACTAGATAGTTATGAAGACTGGGATTTCTGGCTCCAAGCAAGCCAAATCTCAACTTTCGTCCATTGCCCCAGTGTGGGCGCGATCTACCGCATCGCTTTGAGCCAGTCTGGAATGGCGACCAACGCACCTGATCTCGTGCCTCGGCAGCGGGAGGCTAGAAAGCTGGTGTGGCGAAAATGGTGGCCACGTTTCAACCTCGAAACCTTCGGCGAAGCCATTGACGACCTGAAAAATCAGCTCGTGACCCTCAATCAATCCGCAGCGGCTGCAACAAAGGAGCATGATCAGCTGTGCAAAGCGCATGATGCACTGCACGAGAAATTTCTACAGCTTGAGCAAATGGCCGCCCAGCAAGCACATCAGATTACGCACCTTGAAGCAGCGCTGGATGACAGCCGTCATCAGAAGACCCAACATGAGCAAACGATCAATGCTCTGCTGAGTTCCACCAGTTGGCGCGTCACAGCACCATTGCGCGCTGCGGGGAGGCCCATCGCACGGTACCGACACTTGCTTCAAACCTGGCGTGCGCGTCATGCTGCGACAGGTGGCGACGCACCCTCGTTCCCGACCCTGCTGTTCAAAACTGCGCGCACCTTGCTGCACGAAGGGCCTGCTGGCATTCGGCTGCGCACCACGGCCTTGCAGTCCATTGCCCAGCCTCCTGGGCCCTTGGCTGCACGCGTCAACGCACAGCCAACCAAAACGCCCCCAGACCAGCTCTCGTTGATCAATATTGATCAGTACCAAACATTTTTTGTCGATGTATTCGACACCGCCGTCATTCGCACCTGGCGAGCACCCACCGATGTTTTCGGCCTTCTCTCCACGACGAAAAAAGAGGCCGGTTTCGCCAAGCGCCGCATTGAACGCGAGACGAAGACACGGGCCGAGTTCAGTGCACAACGCGAGATCACACTCACGCAAATCTATGCTGGACTGCCGGACGACGACATCAACAGCGAAATCGAAGCGGAACTTCGCTATTGCGTCGCTAATCCAGCATTTCGCCACTTTTATGAATGCCTCATTAATAAGGGTAAGACAGTTTATTTTGTCTCTGACATGTACCTTGACAAAGCCACCGTTGGTCGCATTTTGAACAACTGCGGTTACAGCCAATATACTGATTTGTTTATTTCCTCTGAAGATCAACTGCTCAAAGGGGATGGAAGCCGATTCATATGGCTAAAAAACAAATTTCCGAACTGCGAAGAACAGGCCATCCACATTGGCGATCATCCCATAGCCGACCATGCACAACCAAGCGCGCATGGGTTTGCGACACACCGCCTGCCAATCGCCTCGGAATGGTTCGCATACGATGACTTCATCTCTTCCAAATGGCCTGCGCTACAAGAAAACAGCTCCCTCGGTCAATCCGCCATTCTTGGCCTCTTCCGCCTTTGGAAATCAGGTTT
CAACAGCGCAAACAAACCATCCTATTGGCACCAGTTTGGCTTTTTTTACGGCGGCGCCCTTGTCAGCGCATTTTGCGGCCACATTCATGCAGAACTGAACCGGCGCGGCCTCAACGTCTCCAGGCTATTTTTTCTAGCACGCGACGGCGACATCCTGTCACGCGTCTACCGTGAGCTGTATGAGCGCCCAGAACCTGTTTACACACTTGCATCACGACGCTGCATGAGTTTTGCTGCGCTCTACAGCCTGAACGAAGCAGACGACAAGGAGCAAATGCGTTTGTTTACAACCTCGCTCGGCGTCTCTGAGGCAAAAGATATTTTTGAGCGATTCGGCTATCCAGACCTCACCAACCTCGAAAATGATCTGCTGGCGCAGCAATCGCTTGGCGAACCTTGGACGGACGACAACATCCTCGCGGTCATGCAGCGACATCGCGACGCGTTACTAGACAAAGCATCCGCAGAACGGGCCACGCTACTCGATTATTTGCGACACATTGGTTTTTTCGACGAAACCGATGCGGTCATCGTTGATGTTGGCTGGTCAGGATCAATACAAAACGCCTTGCACAAGATTCTCGACCGTGAAGCCGCTGGCGTTCCACGGTTACATGGCATGTATTTAGGCGTTTATCGTGAAGCACTGCACAAACAAAACAAATCGGGCTTTCTGTTCGACGGCGCTCCCGCCGCATTTGCCCCTTACCTCAACCTCATCGAATTGCTGACTGCATCGCCGCAAGATGGTGTGATACGCATCCAGCGCAATGGCGACACTTATGAATCCGTTCCGGCGCGCCGCACCGAGCATGAAGCCAAACGTCAGGAAATTTCAGCGCAAATTCAGCGCGGCATTCTCGATTTTGCGATGTTGGCACGTCAATACTTTGACGCCGATCTTGGCTTTTTCACGCCGAAAGATTTCGAACACCTGTTCTCCGTCCTCCGCACGCACACCAGCGAGACAGATGCAGCAGAACTTGGAAGTTTGCGCCATGCGATGACTCTGGGCAATCAATTTGATCAGTACGTCCTCACGGGAACTTGA
Protein sequence
Length: 1017
MPPEVNLHPHLHDSQTASGSDTNGASNKIPLVSVIVRSIGRASLQEALDSVARQSHPAIEVIVVNAKGQGHPKTPESVGSLPLRFIDRGHPLPRSAAANAGIDAAQGAYLIFLDDDDLFEATHIAALIATLAEHPEAKLAYADVRVVSADNATIGYFNQDFVSARLWSGNFLPIHAVLFHRSIVQRGCRFDETLDSYEDWDFWLQASQISTFVHCPSVGAIYRIALSQSGMATNAPDLVPRQREARKLVWRKWWPRFNLETFGEAIDDLKNQLVTLNQSAAAATKEHDQLCKAHDALHEKFLQLEQMAAQQAHQITHLEAALDDSRHQKTQHEQTINALLSSTSWRVTAPLRAAGRPIARYRHLLQTWRARHAATGGDAPSFPTLLFKTARTLLHEGPAGIRLRTTALQSIAQPPGPLAARVNAQPTKTPPDQLSLINIDQYQTFFVDVFDTAVIRTWRAPTDVFGLLSTTKKEAGFAKRRIERETKTRAEFSAQREITLTQIYAGLPDDDINSEIEAELRYCVANPAFRHFYECLINKGKTVYFVSDMYLDKATVGRILNNCGYSQYTDLFISSEDQLLKGDGSRFIWLKNKFPNCEEQAIHIGDHPIADHAQPSAHGFATHRLPIASEWFAYDDFISSKWPALQENSSLGQSAILGLFRLWKSGFNSANKPSYWHQFGFFYGGALVSAFCGHIHAELNRRGLNVSRLFFLARDGDILSRVYRELYERPEPVYTLASRRCMSFAALYSLNEADDKEQMRLFTTSLGVSEAKDIFERFGYPDLTNLENDLLAQQSLGEPWTDDNILAVMQRHRDALLDKASAERATLLDYLRHIGFFDETDAVIVDVGWSGSIQNALHKILDREAAGVPRLHGMYLGVYREALHKQNKSGFLFDGAPAAFAPYLNLIELLTASPQDGVIRIQRNGDTYESVPARRTEHEAKRQEISAQIQRGILDFAMLARQYFDADLGFFTPKDFEHLFSVLRTHTSETDAAELGSLRHAMTLGNQFDQYVLTGT*
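
As a quick consistency check between the two sequences above, the first and last codons of the DNA sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch that assumes the standard genetic code; the `translate` helper and the variable names are illustrative and not part of ggKbase.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; any trailing partial codon is ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 36 and last 24 bases of the DNA sequence in this record.
first_bases = "ATGCCTCCAGAAGTCAATTTACACCCACATCTTCAC"
last_bases = "CAGTACGTCCTCACGGGAACTTGA"

print(translate(first_bases))  # MPPEVNLHPHLH
print(translate(last_bases))   # QYVLTGT*
```

Both fragments match the start and end of the protein sequence, and 3051 bases correspond to 1017 codons, which suggests the reported protein length of 1017 counts the terminal `*` stop symbol.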