ggKbase home page

NECEvent2014_7_3_scaffold_366_4

Organism: NECEvent2014_7_3_Escherichia_coli_51_37_partial

partial RP 23 / 55 MC: 4 BSCG 24 / 51 MC: 1 ASCG 10 / 38 MC: 3
Location: comp(2789..5881)

Top 3 Functional Annotations

Value Algorithm Source
Beta-galactosidase {ECO:0000256|RuleBase:RU361154, ECO:0000256|SAAS:SAAS00046613}; EC=3.2.1.23 {ECO:0000256|RuleBase:RU361154, ECO:0000256|SAAS:SAAS00046613};; Lactase {ECO:0000256|RuleBase:RU361154}; similarity UNIPROT
DB: UniProtKB
  • Identity: 99.9
  • Coverage: 999.99
  • Bit_score: 2156
  • Evalue 0.0
Glycoside hydrolase family 2 TIM barrel n=6 Tax=Escherichia coli RepID=B1IRP1_ECOLC similarity UNIREF
DB: UNIREF100
  • Identity: 99.9
  • Coverage: 999.99
  • Bit_score: 2156
  • Evalue 0.0
  • rbh
beta-galactosidase subunit alpha similarity KEGG
DB: KEGG
  • Identity: 99.9
  • Coverage: 999.99
  • Bit_score: 2156
  • Evalue 0.0

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Escherichia coli → Escherichia → Enterobacteriales → Gammaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 3093
ATGAATCGCTGGGAAAACATTCAGCTCACCCACGAAAACCGACTTGCGCCGCGTGCGTACTTTTTTTCATATGATTCTGTTGCGCAAGCGCGTACCTTTGCCCGCGAAACCAGCAGCCTGTTTCTGCCCTTAAGCGGTCAGTGGAATTTCCACTTTTTTGACCATCCGCTGCAAGTACCAGAAGCCTTCACCTCTGAGTTAATGGCTGACTGGGGGCATATTGCCGTCCCCGCCATGTGGCAAATGGAAGGTCACGGCAAACTGCAATATACCGACGAAGGTTTTCCGTTCCCCATCGATGTGCCGTTTGTCCCCAGCGATAACCCAACCGGTGCCTATCAACGTATTTTCACCCTCAGCGACGGCTGGCAGGGTAAACAGACGCTGATTAAATTTGACGGCGTCGAAACCTATTTTGAAGTCTACGTTAACGGTCAGTATGTGGGTTTCAGCAAGGGCAGTCGCCTGACCGCAGAGTTTGACATCAGCGCGATGGTTAAAACCGGCGACAACCTGTTGTGTGTGCGCGTGATGCAGTGGGCGGACTCTACCTACGTGGAAGACCAGGATATGTGGTGGTCAGCGGGGATCTTCCGCGATGTTTATCTGGTCGGAAAACAACTAACGCATATTAACGATTTCTCCGTGCGTACCGACTTTGACGAAGCCTATTGCGATGCCACGCTTTCCTGCGAAGTGGTGCTGGAAAATCTCGCCGCCTCCCCTGTCGTAACGACGCTGGAATATACCCTGTTTGATGGCGAACGCGTGGTGCACAGCAGCGCCATTGATCATTTGGCAATTGAAAAACTGACCAGCGCCAGCTTTGCTTTTACTGTCGAACAGCCGCAGCAATGGTCAGCAGAATCCCCTTATCTTTACCATCTGGTCATGACGCTGAAAGACGCCGACGGCAACGTTCTGGAAGTGGTACCACAACGCGTTGGCTTCCGTGATATCAAAGTGCGCGACGGTCTGTTCTGGATCAATAACCGTTATGTGATGCTGCACGGCGTCAACCGTCACGACAACGATCATCGCAAAGGCCGCGCCGTTGGAATGGATCGCGTCGAGAAAGATCTCCAGTTGATGAAGCAGCACAACATCAACTCCGTGCGTACCGCTCACTACCCGAACGATCCGCGTTTTTACGAACTGTGTGATATCTACGGCCTGTTTGTGATGGCGGAAACCGACGTCGAATCGCACGGCTTTGCTAATGTCGGCGATATCAGCCGTATTACCGACGATCCGCAGTGGGAAAAAGTCTACGTCGAGCGCATTGTTCGCCATATCCACGCGCAGAAAAACCATCCGTCGATCATCATCTGGTCGCTGGGCAATGAATCCGGCTATGGCTGTAACATCCGCGCGATGTACCACGCAGCGAAGGCGCTGGATGACACGCGACTGGTGCATTACGAAGAAGATCGCGATGCTGAAGTGGTCGATATTATTTCCACCATGTACACCCGCGTGCCGCTGATGAATGAGTTTGGTGAATACCCGCATCCGAAGCCGCGCATCATCTGTGAATATGCTCATGCGATGGGGAACGGACCGGGCGGGCTGACGGAGTACCAGAACGTCTTCTATAAGCACGATTGTATTCAGGGACATTATGTTTGGGAGTGGTGCGACCACGGAATCCAGGCGCAGGATGACAACGGCAATGTCTGGTATAAATTCGGCGGCGACTACGGCGACTATCCCAACAACTATAACTTCTGTCTTGATGGTTTGATCTATTCCGATCAGACGCCGGGACCAGGCCTGAAAGAGTACAAACAGGTTATCGCGCCGGTAAAAATCCACGCGCTGGATCTGACTCGCGGCGAGCTGAAAGTCGAAAATAAACTGTGGTTTACCACGCTTGATGACTACACCCTGCACGCAGAGGTGCGCGCCGAAGGTGAAACGCTCGCAACGCAGCAGATTAAACTGCGCGACGTTGCGCCGAACAGCGAAGCCCCCTTGCAGATCACGCTGCCGCAGCTGGACGCCCGCGAAGCGTTCCTCAACATTACGGTGACCAAAGATTCCCGCACCCGCTACAGCGAAGCCGGGCATTCTATCGCCACTTATCAGTTCCCGCTGAAGGAAAACACCGCGCAGCCAGTGCCTTTCGCACCAAATAATGCGCGTCCGCTGACGCTGGAAGACGATCGTTTGAGCTGCACCGTTCGCGGCTACAACTTCGCGATCACCTTCTCAAAAATGAGTGGCAAACCGACATCCTGGCAGGTAAATGGCGAGTCGCTGCTGACCCGCGAGCCAAAGATCAACTTCTTCAAGCCGATGATCGACAACCACAAGCAGGAGTACGAAGGGCTGTGGCAACCGAATCATTTGCAGATCATGCAGGAACATCTGCGCGACTTTGCCGTAGAACAGAGCGATGGTGAAGTGTTGATCATCAGCCGCACGGTTATAGCACCGCCGGTGTTTGACTTCGGGATGCGCTGCACCTACATCTGGCGCATCGCTGCAGATGGCCAGGTTAACGTGGCGCTTTCCGGCGAGCGTTACGGCGACTATCCGCACATCATTCCGTGCATCGGTTTCACCATGGGGATTAACGGCGAATACGATCAGGTGGCGTATTACGGTCGTGGACCGGGCGAAAACTACGCCGACAGCCAGCAGGCTAACATCATCGATATCTGGCGCAGCACCGTCGATGCCATGTTCGAGAACTATCCCTTCCCGCAGAACAACGGCAACCGTCAGCATGTCCGCTGGACGGCACTGACTAACCGCCACGGCAACGGTCTGCTGGTGGTTCCGCAGCGCCCAATTAACTTCAGCGCCTGGCACTATACCCAGGAAAACATCCACGCTTCCCAGCACTGTAACGAGCTGCAGCGCAGTGATGACATCACCCTGAATCTCGACCACCAGCTGCTTGGCCTCGGCTCCAACTCCTGGGGCAGCGAGGTGCTGGACTCCTGGCGCGTCTGGTTCCGTGACTTCAGCTACGGCTTTACGTTGCTGCCGGTTTCTGGCGGAGAAGCTACCGCGCAAAGCCTGGCGTCGTATGAGTTCGGCGCAGGGTTCTTTTCCACGAATTTGCACAGCGAGAATAAGCAATGA
PROTEIN sequence
Length: 1031
MNRWENIQLTHENRLAPRAYFFSYDSVAQARTFARETSSLFLPLSGQWNFHFFDHPLQVPEAFTSELMADWGHIAVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGWQGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYLVGKQLTHINDFSVRTDFDEAYCDATLSCEVVLENLAASPVVTTLEYTLFDGERVVHSSAIDHLAIEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWEKVYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYEEDRDAEVVDIISTMYTRVPLMNEFGEYPHPKPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALDLTRGELKVENKLWFTTLDDYTLHAEVRAEGETLATQQIKLRDVAPNSEAPLQITLPQLDAREAFLNITVTKDSRTRYSEAGHSIATYQFPLKENTAQPVPFAPNNARPLTLEDDRLSCTVRGYNFAITFSKMSGKPTSWQVNGESLLTREPKINFFKPMIDNHKQEYEGLWQPNHLQIMQEHLRDFAVEQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAADGQVNVALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWHYTQENIHASQHCNELQRSDDITLNLDHQLLGLGSNSWGSEVLDSWRVWFRDFSYGFTLLPVSGGEATAQSLASYEFGAGFFSTNLHSENKQ*