ggKbase home page

SCNpilot_expt_1000_bf_scaffold_5142_curated_13

Organism: scnpilot_dereplicated_Rhizobiales_12

near complete RP 44 / 55 MC: 2 BSCG 46 / 51 MC: 5 ASCG 10 / 38 MC: 2
Location: 12211..13686

Top 3 Functional Annotations

Value Algorithm Source
Endoglucanase {ECO:0000256|RuleBase:RU361166}; EC=3.2.1.4 {ECO:0000256|RuleBase:RU361166};; TaxID=106592 species="Bacteria; Proteobacteria; Alphaproteobacteria; Rhizobiales; Rhizobiaceae; Sinorhizobium/Ensifer group; Ensifer.;" source="Ensifer adhaerens (Sinorhizobium morelense).;" similarity UNIPROT
DB: UniProtKB
  • Identity: 47.9
  • Coverage: 466.0
  • Bit_score: 413
  • Evalue 4.30e-112
cellulose 1,4-beta-cellobiosidase (EC:3.2.1.4 3.2.1.91); K01179 endoglucanase [EC:3.2.1.4]; K01225 cellulose 1,4-beta-cellobiosidase [EC:3.2.1.91] similarity KEGG
DB: KEGG
  • Identity: 45.3
  • Coverage: 492.0
  • Bit_score: 402
  • Evalue 2.20e-109
hypothetical protein n=1 Tax=Rhizobium giardinii RepID=UPI0003674BBB similarity UNIREF
DB: UNIREF100
  • Identity: 49.7
  • Coverage: 487.0
  • Bit_score: 449
  • Evalue 3.90e-123

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Ensifer adhaerens → Ensifer → Rhizobiales → Alphaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 1476
ATGACTTCCCCAATCCTCGCGGCGCTCGGTCTGGTGCTGGCGTTCGGCACGTCCGCCTTTGCGGCCCCTTCCTCCAGCCTCGTGCGTGGCGGCGATTTTTCCAATGGAAAGGCCGGTTTCTGGTCGACGGACAATGTGCGTCTCGCCGTGCGCTCCGGCCAGCTCTGCGGCGATGTGTCCGCCGGCGTCGAAAAGCCGTGGGATGCGCTGATCGGCGTGGATACGGGCGAGCTGAAGGCCGGCCAGTCCTATGTGCTCGGTTTCGAGGCGCGCGCCACCGGTAATGACGCGCCCGACAGGATCAAGGTGATGCTGCAGGATCCGAAGGCGCCATGGGCGGAGCGGTTTGCCGCGCCGGTGGAGGTAACAGAGGCGCTGGAGCCGGCCAGCCTGCCGTTTTCCAATGCGCGCGCCGGCCGCGCCCAGCTCGTCTTCCAGCTCGGCGGCCAGCAGGGCGCCTGGCGTTTCTGCATCGACAATGTGAAGCTTTCGCCCGCGGGCGAGGACCAGGCGGAGACTCGCATGAACCAGAACCGCGCCGCACCCGTGCTGGAGCCGGTCGCCGATCCGGTGCGGCTCAACCAGGCCGGCTTCCTGCCCGGTGGGCCGAAGCGGGCGACGATCATATCCTTGTCGAAGACGCCGTTGCGCTTCCAGATCGTCGATGGCGCCGGCAAGCTGTTCGGCGAGGGTGAGACGGAAGTGCGCGGCGTCGATCCCGCCTCCGGTTACAGCCTGCACGTCGCCGATTTCACGCCGCTGACGCGGCCGGGCGGCGATTACCGGCTGGTCGCCGGCGAGGCGGCAGGCCCGGCCTTTGCTATCGACAGCGACCTTTACCGGCGGCTCTCCGTCGACGCGCTCTCCTGGTTCTATCCGCAGCGCAGCGGCATTGTCATCGACGGCGCGATCGCCGGCGCGGCCTATGCCAGCCCGGCCGGCCATGTCGGTGTCGATCCCAATCGCGGCGATGGCGCGGTTGCCTGCCTGACGGGACCGGTGGCGGCCGAGCTTTACGGCAAGGACTGGCAATGCAAAGGCACGCGCGACGTCAGCGGCGGCTGGTACGATGCCGGCGACCACGGCAAATATGTCGTCAATGGCGGCATCTCGGTGGCGCAGATGATGGCCGCTTTCGAGCGCGCCAAGCGTTTTGCACCGAAATCGCCGCTGCTCTCGGATGGTTTCGCACGTCTGCCGGAGCGGGGCAACGGCGTGCCGGACGGTCTCGACGAGGCGCGCTGGGAGCTGGAATTCCTGCTGAAGATGATCGTGCCGGACGGCGAACCGCTTTCCGGCATGGCCTATCACANNNNNNNNNNNNNNNNNNNNNNNNNNNTGCCGATGCTGCCGCATCTCGACCCGAAGGAGAGGGCGCTGCACCGGCCCTCGACGGCGGCGACGCTGAACGTCGCGGCGGTGGCCGCGCAAGGGGCGCGGCTGTTCCGTCCCTATGACGCCGCCTCGCCGCCTTCG
PROTEIN sequence
Length: 492
MTSPILAALGLVLAFGTSAFAAPSSSLVRGGDFSNGKAGFWSTDNVRLAVRSGQLCGDVSAGVEKPWDALIGVDTGELKAGQSYVLGFEARATGNDAPDRIKVMLQDPKAPWAERFAAPVEVTEALEPASLPFSNARAGRAQLVFQLGGQQGAWRFCIDNVKLSPAGEDQAETRMNQNRAAPVLEPVADPVRLNQAGFLPGGPKRATIISLSKTPLRFQIVDGAGKLFGEGETEVRGVDPASGYSLHVADFTPLTRPGGDYRLVAGEAAGPAFAIDSDLYRRLSVDALSWFYPQRSGIVIDGAIAGAAYASPAGHVGVDPNRGDGAVACLTGPVAAELYGKDWQCKGTRDVSGGWYDAGDHGKYVVNGGISVAQMMAAFERAKRFAPKSPLLSDGFARLPERGNGVPDGLDEARWELEFLLKMIVPDGEPLSGMAYHXXXXXXXXXXPMLPHLDPKERALHRPSTAATLNVAAVAAQGARLFRPYDAASPPS