ggKbase home page

NECEvent2014_8_5_scaffold_743_3

Organism: NECEvent2014_8_5_Escherichia_coli_50_14_partial

partial RP 22 / 55 MC: 3 BSCG 17 / 51 MC: 2 ASCG 9 / 38 MC: 2
Location: comp(3887..5020)

Top 3 Functional Annotations

Value Algorithm Source
2-iminoacetate synthase n=141 Tax=Enterobacteriaceae RepID=THIH_ECOLI similarity UNIREF
DB: UNIREF100
  • Identity: 100.0
  • Coverage: 377.0
  • Bit_score: 769
  • Evalue 1.70e-219
  • rbh
thiH; thiamine biosynthesis protein ThiH similarity KEGG
DB: KEGG
  • Identity: 100.0
  • Coverage: 377.0
  • Bit_score: 769
  • Evalue 4.80e-220
Thiazole biosynthesis protein ThiH {ECO:0000313|EMBL:AEE59313.1}; TaxID=696406 species="Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales; Enterobacteriaceae; Escherichia.;" source="Esc similarity UNIPROT
DB: UniProtKB
  • Identity: 100.0
  • Coverage: 377.0
  • Bit_score: 769
  • Evalue 2.40e-219

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Escherichia coli → Escherichia → Enterobacteriales → Gammaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 1134
ATGAAAACCTTCAGCGATCGCTGGCGACAACTGGACTGGGACGACATCCGCCTGCGTATCAACGGCAAAACGGCTGCTGACGTAGAGCGGGCGCTAAATGCCTCGCAACTCACCCGCGACGACATGATGGCGCTGTTATCGCCTGCCGCCAGTGGCTATCTGGAACAACTGGCCCAACGGGCGCAGCGTCTGACCCGTCAGCGATTTGGCAACACAGTTAGTTTCTACGTCCCGCTTTATCTTTCCAATCTTTGCGCTAACGACTGCACGTACTGTGGATTTTCCATGAGTAATCGCATCAAGCGCAAAACGCTGGATGAAGCGGATATTGCCAGGGAAAGTGCCGCTATACGGGAGATGGGCTTTGAACATCTGCTGTTAGTCACTGGTGAACATCAGGCGAAAGTGGGGATGGATTACTTTCGTCGTCATCTCCCTGCCCTTCGTGAACAGTTCTCTTCACTACAGATGGAAGTGCAACCGCTGGCGGAGACGGAATACGCCGAGTTAAAGCAACTTGGTCTGGATGGCGTGATGGTTTATCAGGAGACATATCACGAGGCGACTTATGCCCGCCATCATCTGAAAGGCAAAAAACAGGACTTCTTCTGGCGGCTGGAAACGCCGGATCGGCTGGGGCGTGCGGGGATTGATAAGATAGGCCTCGGCGCGCTAATTGGCCTTTCCGACAACTGGCGCGTTGACAGCTATATGGTTGCCGAACATTTGCTATGGCTGCAACAGCATTACTGGCAAAGCCGTTACTCTGTCTCCTTTCCGCGCCTGCGCCCGTGTACTGGCGGCATTGAGCCTGCGTCGATTATGGATGAACGCCAGTTAGTGCAAACCATCTGCGCCTTCCGACTGCTTGCACCGGAGATTGAACTGTCACTCTCCACGCGGGAATCACCGTGGTTTCGCGATCGCGTTATTCCGCTGGCGATCAATAACGTCAGCGCCTTCTCGAAAACGCAGCCAGGTGGCTATGCCGATAATCACCCCGAGTTGGAACAGTTCTCACCGCACGACGATCGCAGACCGGAAGCGGTTGCTGCCGCGTTAACCGCTCAGGGTTTGCAGCCGGTATGGAAAGACTGGGACAGCTATCTGGGACGCGCCTCGCAAAGACTATGA
PROTEIN sequence
Length: 378
MKTFSDRWRQLDWDDIRLRINGKTAADVERALNASQLTRDDMMALLSPAASGYLEQLAQRAQRLTRQRFGNTVSFYVPLYLSNLCANDCTYCGFSMSNRIKRKTLDEADIARESAAIREMGFEHLLLVTGEHQAKVGMDYFRRHLPALREQFSSLQMEVQPLAETEYAELKQLGLDGVMVYQETYHEATYARHHLKGKKQDFFWRLETPDRLGRAGIDKIGLGALIGLSDNWRVDSYMVAEHLLWLQQHYWQSRYSVSFPRLRPCTGGIEPASIMDERQLVQTICAFRLLAPEIELSLSTRESPWFRDRVIPLAINNVSAFSKTQPGGYADNHPELEQFSPHDDRRPEAVAAALTAQGLQPVWKDWDSYLGRASQRL*