ggKbase home page

L3_106_000M1_scaffold_2083_10

Organism: dasL3_106_000M1_concoct_82_fa

near complete RP 48 / 55 MC: 1 BSCG 51 / 51 ASCG 12 / 38 MC: 1
Location: comp(12684..13817)

Top 3 Functional Annotations

Value Algorithm Source
Thiazole biosynthesis protein ThiH n=116 Tax=Escherichia coli RepID=E1PAV4_ECOAB similarity UNIREF
DB: UNIREF100
  • Identity: 100.0
  • Coverage: 377.0
  • Bit_score: 771
  • Evalue 3.40e-220
thiH; thiamine biosynthesis protein ThiH similarity KEGG
DB: KEGG
  • Identity: 100.0
  • Coverage: 377.0
  • Bit_score: 771
  • Evalue 9.70e-221
Thiamine biosynthesis protein {ECO:0000313|EMBL:CBG37181.1}; TaxID=216592 species="Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales; Enterobacteriaceae; Escherichia.;" source="Escherichia coli O44:H18 (strain 042 / EAEC).;" similarity UNIPROT
DB: UniProtKB
  • Identity: 98.4
  • Coverage: 377.0
  • Bit_score: 758
  • Evalue 4.20e-216

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Escherichia coli → Escherichia → Enterobacteriales → Gammaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 1134
ATGAAAACCTTCAGCGATCGCTGGCGACAACTGGACTGGGACGACATTCGCCTGCGTATCAACAGCAAAACGGCTGCTGACGTAGAGCGGGCGCTAAATGCCTCGCAACTCACCCGCGACGATATGATGGCATTGCTATCACCTGCCGCCAGTGAATACCTGGAACCACTGGCTCAACGGGCGCAGCGTCTGACCCGTCAGCGATTTGGCAACGTCGTCAGCTTCTACGTCCCGCTTTATCTTTCCAATCTTTGCGCTAACGACTGCACGTACTGCGGATTTTCCATGAGCAATCGCATCAAGCGCAAAACGCTGGATGAAGCGGATATTGCCAGGGAAAGCGCCGCTATACGGGAGATGGGCTTTGAACATCTGCTATTAGTCACTGGTGAACATCAGGCGAAAGTGGGGATGGATTACTTTCGTCGTCATCTCCCTGCCCTGCGTGAACAGTTCTCTTCACTACAGATGGAAGTGCAACCGCTGGCGGAGACGGAATACGCCGAGTTAAAGCAACTAGGTCTGGATGGCGTGATGGTTTATCAGGAGACATATCACGAGGCGACTTATGCCCGCCATCATCTGAAAGGTAAAAAACAGGACTTCTTCTGGCGGCTGGAAACGCCGGATCGGCTAGGGCGTGCGGGGATTGATAAGATAGGCCTCGGCGCGCTAATCGGCCTTTCCGACAACTGGCGAGTTGACTGCTATACGGTTGCCGAACATTTGCTATGGCTGCAACAGCATTACTGGCAAAGCCGCTACTCTGTCTCCTTCCCACGCCTGCGTCCATGTACTGGCAGCATTGAGCCTGCGTCGATTATGGATGAACGCCAGTTAGTGCAAGCCATCTGCGCTTTCCGGCTGCTTGCACCGGAGATTGAACTGTCACTCTCCACGCGGGAATCACCGTGGTTTCGCGATCGCGTAATTCCGCTGGCAATTAATAACGTCAGCGCTTTTTCGAAAACGCAGCCAGGTGGCTATGCCGACAACCACCCCGAGCTGGAACAGTTCTCACCGCACGACGATCGCAGACCGGAAGCGGTTGCTGCCGCGTTAACCGCTCAGGGTTTGCAGCCGGTATGGAAAGACTGGGACAGCTATCTGGGACGCCCATCGCAAAGGCCATGA
PROTEIN sequence
Length: 378
MKTFSDRWRQLDWDDIRLRINSKTAADVERALNASQLTRDDMMALLSPAASEYLEPLAQRAQRLTRQRFGNVVSFYVPLYLSNLCANDCTYCGFSMSNRIKRKTLDEADIARESAAIREMGFEHLLLVTGEHQAKVGMDYFRRHLPALREQFSSLQMEVQPLAETEYAELKQLGLDGVMVYQETYHEATYARHHLKGKKQDFFWRLETPDRLGRAGIDKIGLGALIGLSDNWRVDCYTVAEHLLWLQQHYWQSRYSVSFPRLRPCTGSIEPASIMDERQLVQAICAFRLLAPEIELSLSTRESPWFRDRVIPLAINNVSAFSKTQPGGYADNHPELEQFSPHDDRRPEAVAAALTAQGLQPVWKDWDSYLGRPSQRP*