ggKbase home page

ar4r2_scaffold_1121_9

Organism: ALUMROCK_MS4_Beggiotoa_37_524_curated

near complete; RP 52 / 55 (MC: 1); BSCG 51 / 51 (MC: 2); ASCG 14 / 38 (MC: 1)
Location: comp(3926..7228)
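
The location string above encodes strand and coordinates. As a minimal sketch, assuming the coordinates are 1-based and inclusive and that `comp(...)` marks the reverse strand (an interpretation consistent with the 3303 nt DNA length listed on this page, not something the page states), the feature length can be recovered as follows. The function name `parse_location` is hypothetical.

```python
import re

def parse_location(loc: str):
    """Parse a location such as 'comp(3926..7228)' or '3926..7228'.

    Assumes 1-based inclusive coordinates; 'comp(...)' is taken to mean
    the reverse (complement) strand. Returns (strand, start, end, length).
    """
    text = loc.strip()
    m = re.fullmatch(r"comp\((\d+)\.\.(\d+)\)", text)
    strand = "-"
    if m is None:
        # Fall back to a plain forward-strand range.
        m = re.fullmatch(r"(\d+)\.\.(\d+)", text)
        strand = "+"
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    start, end = int(m.group(1)), int(m.group(2))
    # Inclusive coordinates, so add 1 to the difference.
    return strand, start, end, end - start + 1

print(parse_location("comp(3926..7228)"))  # ('-', 3926, 7228, 3303)
```

The computed length of 3303 agrees with the DNA sequence length reported under Sequences.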

Top 3 Functional Annotations

Value | Algorithm | Source

Glycosyl transferase, family 2 n=1 Tax=Beggiatoa sp. PS RepID=A7BXP2_9GAMM | similarity | UNIREF
DB: UNIREF100
  • Identity: 58.4
  • Coverage: 503.0
  • Bit_score: 641
  • Evalue: 9.10e-181
  • rbh

glycosyl transferase, family 2 | similarity | KEGG
DB: KEGG
  • Identity: 52.9
  • Coverage: 1095.0
  • Bit_score: 1198
  • Evalue: 0.0

Glycosyl transferase, family 2 {ECO:0000313|EMBL:BAP54482.1}; TaxID=40754 species="Bacteria; Proteobacteria; Gammaproteobacteria; Thiotrichales; Thiotrichaceae; Thioploca.;" source="Thioploca ingrica.;" | similarity | UNIPROT
DB: UniProtKB
  • Identity: 52.9
  • Coverage: 1095.0
  • Bit_score: 1198
  • Evalue: 0.0

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Thioploca ingrica → Thioploca → Thiotrichales → Gammaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 3303
ATGCAATTAATTAAACAATTAAAACGCCTTTTTACTAAAAAATCTAATTTAACACATATGACGGAAACTGATTGCATTTATCATAAAATTCCTGCATTAATATTACTCAATGGACAACAACTTGTTAGTAAATTAAATTCTGAAGGTAATATACCTATTCGCCTGACTAAAGGCACTATTGCGCATTTTTCCATTATTCCACAATCAAACAAAATCACAGGTATCCGTTTAAGATTAGGTACTTATTGTAGAACAAATCATTCACATCTAACTATTAAAATAGATAACTTTATTCAACATTTTTCAGTCAATCAAATTGTTGATAATCAATATACAGATTTTACTTTTTCTACTCCTTATTCATGTATCTCAGGAAAACCTATCAATATTTCAATTTATTCTGAAGATGCTGATGAAAACAATGCTATTGCCTTGTGGTGTAGCGACAAATACCTACCTTTTCATGATCAATTAGATTTCAAATTACTCCATTTACCTACCGTTTCAAATCCGCGAGTGAGCATTATTATTCCTATATTTAACAATGTGTTATACACATACAATTGCTTACTGAGTTTGCAACAATGCGATCAACATATTGCTCAAGAAATTATTCTAGTTGATAACGCCTCAACAGATGAGACTGAACAACTACTTTCTCATCTTACCTCTCAATGTAAAATTGTACAAAATCAAGAAAATACAGGATTTGTTCAAGCCTGTCGTCAAGGCGCAGAACTGGCAACTGGAGAATTTATCGTTTTACTCAACAACGATACTCAAGTGACCCCAAACTGGTTAGAAAACCTGTTAAATATCATGCAACAACACCCTAAAGTTGGCGTTGTGGGTTCAAAACTGATTTACCCCGATGGTCGTTTACAAGAAGCAGGCGGTATTATTTTTAATGATGCCAGCGGTTGGAATTATGGGCGTTTTCAATCACCTACCCATAATTTATACAACCAAGATCGACCTGTAGATTATTGCTCAGGTGCCAGCTTAATGATTCGGCGTGATTTATGGCAACACATCGGTGGTTTTGACTTACGTTATGCACCCGCTTATTACGAAGACACAGACCTATGTTTTGCAGTGCGTCAAGCGGGTTACCAAGTGATTTATTGTCACAACTCCATTGTCGTACATCATGAAGGCATTACTGCGGGCACTGATGTATCGAGTGGATACAAACGTTATCAATTGATTAATCAACAAAAATTTCAACAAAAATGGCAAACAGTACTCAATACGCACTATCCACCACCACCGCATTGCTCACCGGATGCTGCAGCTTTTCGACTTGCTACCGATAAATTGACCTTTCGTTTACCACAACAAAAAATCATTGCAACCCACTTTTTAGCTGAAGGATGGGCACCTAATTTCTGGAGCTACCTAAATATTCATCAAATCGACGAAGAATTACATTTTATTCAATCACTTGACTTTAACACAATTATACTTTTAGTTACTTGGGTTGGGTTTCAAACTAGTATTGAGCCTATTTGTTATTATGAACCTTATTTTGAATTATTTGATCAATTATTGGCAAAAATACAAAAAACAGGATTACAAGTCATATTACGCATTGGTTATACCCACGATAACGGGCCTGTCAGTACACCAGAAGGTTATTTACGTCAAGTAGTGATTGCAGAAGATAAACCTACCTTAGTTGCTTGGTGTGATTACTTAGATAGATTATGGCAAATCTGTAAAAAATACAATAATTTATTGGGAGGATTTATCGCTTGGGAGGATTTCTTTTTCATGGATTTAACACATATTCCATTTGCAGCGCGATTAGAATATGCACAAAAAACGGGTTATCAACGTTATTTAGAGAAAAAGTACAGTTTAACTGATATTTCACAAAAATATGGTCAACAATTCCTGTCTTATCAAGATGTTCCTATTCCCGCTTTTAAAACTCAAGGAATACATTTATTTTGTGAATTCTGGGATCATGTACTGATTCATACTATTTTTAGCACAAC
CAAATTATATTTTCCCTTATTAAGCATGGAAGTACGAATGGATTGTGATCCTGAAGGAAATCAGGGTATTCATATTTGTCATGAAAAAACCTTTGATTTAACAGAAGATAGTCATATTTCTATGATTTACTATTCACCTGCTTGGGGAAGTCCTAATCAAGGGCAACCAGAAGCTGCTCAAACTATTTTAACGCGTATGCAGTTTATGTTTGATCATATTCGAGCGTATACGCAAAATAGTATTTTTATCGATCAATTTAACTTTATTGATAATACGCCCGGATTTGAACATAATACTGGAATTTTACCTTCAGAACTACCTGATTTTTTAGACAATGTAGCTACGGTGTTACAAAATAATACGATTGGTTATAGTTTATGGACATTACGTGACGTGCCAGCCAATTTATTAAGAAATGGTTTATTTATACATCACTATCCTAGTTGGGAAATGGAACAAGGACATGTAACATTTGATGCCAATAGTAAAAAACAGATGGTTTGTTTACACTCTAATGGTCAATTAAGTCAATTAGTTACTCAAAGTTTTGGTGTACCATTAGTTAAAAATAAACCATTTCAACTAAATTTTCAAGTGAAAGCTGAAAAATTAACAGAAATAGAAGTGAGTGTTATTCGTGATGATGTTGCAGTATTTCAGAAGAAAATCAGTGTAAAAACAAATGAATGGCACACTCAATCTTTAGAACAAGTTCCTTTTAATATAGGTTGTCGTTTACAAGTAACTAATCTTGGAAAATCAGAGATTTATCTAAGTCATTTTTATTTGTATCAAATTTGTCAAGAAAATGGAATCATTGATGTAAAAGGACAAGTTAAATCTTTTTATCCCAATTTAGTTAATTTAAACAAGAAATTGAAAGCCACTTATCAAGTTGAATTAAAAAGTTTCTTAGATAAGAGTGAATTAGCGACACAACGGTTACATGGTCTTTACAGTGATTTATGGATGGGCGCGCAATTATTAGGAAAATTGGCTATTTTACAAACAGAACAGACTGAAATTTACTTTTTAGTAAAAGTGTATATTCCTGATCATTGGGAAAATTATACCAATAAACTGACTTTAATGGTTAATGAACAAAAAATTGGTAATCAATACCACATAAATTCTGGTTATCAAGAGATACATTGGACAATGCCGAACCAATTTGTGGCATCATGGGTCGCTTTTCAACTGCAAGCGGAGAAAATTTATTCAATTAAACAATATGATAGCCAATCGCAAGACAATCGAGCGGTGAGTATGCAAATCATAGGATTAGGATTTTCTTTATCTTAA
PROTEIN sequence
Length: 1101
MQLIKQLKRLFTKKSNLTHMTETDCIYHKIPALILLNGQQLVSKLNSEGNIPIRLTKGTIAHFSIIPQSNKITGIRLRLGTYCRTNHSHLTIKIDNFIQHFSVNQIVDNQYTDFTFSTPYSCISGKPINISIYSEDADENNAIALWCSDKYLPFHDQLDFKLLHLPTVSNPRVSIIIPIFNNVLYTYNCLLSLQQCDQHIAQEIILVDNASTDETEQLLSHLTSQCKIVQNQENTGFVQACRQGAELATGEFIVLLNNDTQVTPNWLENLLNIMQQHPKVGVVGSKLIYPDGRLQEAGGIIFNDASGWNYGRFQSPTHNLYNQDRPVDYCSGASLMIRRDLWQHIGGFDLRYAPAYYEDTDLCFAVRQAGYQVIYCHNSIVVHHEGITAGTDVSSGYKRYQLINQQKFQQKWQTVLNTHYPPPPHCSPDAAAFRLATDKLTFRLPQQKIIATHFLAEGWAPNFWSYLNIHQIDEELHFIQSLDFNTIILLVTWVGFQTSIEPICYYEPYFELFDQLLAKIQKTGLQVILRIGYTHDNGPVSTPEGYLRQVVIAEDKPTLVAWCDYLDRLWQICKKYNNLLGGFIAWEDFFFMDLTHIPFAARLEYAQKTGYQRYLEKKYSLTDISQKYGQQFLSYQDVPIPAFKTQGIHLFCEFWDHVLIHTIFSTTKLYFPLLSMEVRMDCDPEGNQGIHICHEKTFDLTEDSHISMIYYSPAWGSPNQGQPEAAQTILTRMQFMFDHIRAYTQNSIFIDQFNFIDNTPGFEHNTGILPSELPDFLDNVATVLQNNTIGYSLWTLRDVPANLLRNGLFIHHYPSWEMEQGHVTFDANSKKQMVCLHSNGQLSQLVTQSFGVPLVKNKPFQLNFQVKAEKLTEIEVSVIRDDVAVFQKKISVKTNEWHTQSLEQVPFNIGCRLQVTNLGKSEIYLSHFYLYQICQENGIIDVKGQVKSFYPNLVNLNKKLKATYQVELKSFLDKSELATQRLHGLYSDLWMGAQLLGKLAILQTEQTEIYFLVKVYIPDHWENYTNKLTLMVNEQKIGNQYHINSGYQEIHWTMPNQFVASWVAFQLQAEKIYSIKQYDSQSQDNRAVSMQIIGLGFSLS*
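
The two reported lengths are consistent with each other: 3303 nucleotides correspond to 1101 codons. As a rough sketch, assuming the listed protein length of 1101 counts the terminal `*` (the DNA ends in the stop codon TAA and the protein string ends in `*`), the relationship can be checked as below. The helper name `codon_counts` is hypothetical.

```python
def codon_counts(dna_length: int, has_stop: bool = True):
    """Return (codons, residues) for a coding sequence of dna_length nt.

    Assumes the sequence is a complete reading frame; if has_stop is True,
    the final codon is a stop codon and does not encode a residue.
    """
    codons, remainder = divmod(dna_length, 3)
    if remainder != 0:
        raise ValueError("length is not a multiple of 3")
    residues = codons - 1 if has_stop else codons
    return codons, residues

codons, residues = codon_counts(3303)
print(codons, residues)  # 1101 1100
```

Under this assumption, the translated product has 1100 amino acid residues followed by the stop symbol.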