ggKbase home page

L2_026_000M1_scaffold_343_20

Organism: L2_026_000M1_public_UNK

megabin RP 52 / 55 MC: 52 BSCG 51 / 51 MC: 51 ASCG 17 / 38 MC: 16
Location: comp(15737..19027)

Top 3 Functional Annotations

Value Algorithm Source
Uncharacterized protein (Fragment) n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=E2N889_9BACE similarity UNIREF
DB: UNIREF100
  • Identity: 42.6
  • Coverage: 462.0
  • Bit_score: 355
  • Evalue 1.00e-94
Uncharacterized protein {ECO:0000313|EMBL:AEV68858.1}; TaxID=720554 species="Bacteria; Firmicutes; Clostridia; Clostridiales; Ruminococcaceae; Ruminiclostridium.;" source="Clostridium clariflavum (strain DSM 19732 / NBRC 101661 / EBR45).;" similarity UNIPROT
DB: UniProtKB
  • Identity: 41.8
  • Coverage: 469.0
  • Bit_score: 351
  • Evalue 3.40e-93
dockerin type I cellulosome protein similarity KEGG
DB: KEGG
  • Identity: 37.7
  • Coverage: 546.0
  • Bit_score: 341
  • Evalue 5.50e-91

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

[Clostridium] clariflavum → Ruminiclostridium → Clostridiales → Clostridia → Firmicutes → Bacteria

Sequences

DNA sequence
Length: 3291
ATGAAAACAAGAAGCATGTATCGGGGCATCGTTCTGATAGCCGCGGCACTGGCAACGGTAGCCTGCCAGAATGAACTGAAGGAAGAGTACAACGAACCCAAACCGGGTGAGAAGATAACCATGACCATCCGGGCAACACAAGGCGCGGCCTCACAGACGCGCACCGACTATGAAGACAATCTGGGAATAACCGGTATAGACAACATAGCAGTGAAGTGGGAAGGCGGAAGTACAGACGGTGCACCGGTGGAGAAGATTAAAGTATTCGGAGTCGATGCAAATGAGCTGGATTACTCGGTAGATTTCAATAGCCTGCCAAGCAGCCTCAGCCAGGACGGGACGAGTATCAGCTTCGAAGGAACCATTAATGCAAAGAGCCTCTACTTTGCCATGTATCCTGCCGATAACTGCAACTACAACACTACGGTTCAAGCCATCTACACCTCCTTCTCAGACCAGACACAAGACTGCGCCAAGCCCATGGCGCACCTCAAAGGCTTCGACCTTATGGTGGGACGAGCAGCGACAACGGGCTCGTATGATAAACTCACATTCAGCCACGAGGCCGCGATGATACGCTTCACCCTCAATGGCGTTCCTTCTTCAGAGAAAATCACCCGTGTAAGCCTCGCTGCCGCCAATAACAAACTGAGCTCCCGGATGTGCGCATCACTCGCCGGCTCGGAAATAGGAGGGTTAACCGTCGAAGCGGATACGGAATACGCCCCGGTGTCAAGTCTCGGCCTCGACATCACTAACCATACCCCCTCCACAGAGCCGCTGAAAGCCTACATGATGATACCTCCATGCGATCTGAGCAACGACCAGCTCACCGTCACCGTAAACACGGAAAGCGGCAATACCTACACCGGCGATCTGACAACCGGAGCCAGCACCTTGCTGAAGGCCGGTCTGTGCTATACCTTAGAACCCACACTAGCGCTCGGCAAAACCATCAGCCTGCCACCCGCTACGGCGGGGAGTTTGGGAAACGCCTTGAACAATATAACCCCGGCCCAAGGACAAACCGAACTGGCCGTGACGGGCGCGGTAAACACCGACGACATCACCGCCCTGGCCGCTTTCTTGAAAGAAACCAAGGCTGAGAATATCACCGCCATCGACCTGTCCGGCATCAGCGGCATAACCGACGTGACAGGCTTTGCAGGCTGCGCAAAGATAGAGAAAGTCATACTGCCCGACGCCGCGGAAGCCATTGGCGACAATGCCTTTGAAGGCTGCACGGCACTGACCACAGTCATCCAGAACGACCCGATACCCGCCGATGTAGCACCCGCCACCCGCAGTATCTCCAAAAGAATAAAAAGAATAGGCCACAGCGCCTTTAAAAATTGCACCTCGATGACCGAAATGTTCCTGCACGCCGATATACAAAGCGTAGGAAACAGCGCCTTTGAAGGGTGCACGGCAATGACAGCCCTCATATTCGAAGGCACAAAAGCGGTCAACGAAACCGACGGTATAAGCTTAGGAACCGGCATCATAACCGGAACGCACGCGGACATCAAAATATTCCTGCCCGCCATCACCGATCTCGCAGTGGGCACCGCATATAAGACAATCCTGGAAGAAAAGCCCACCTACTACAACTTCGCGGGCTACGGCAGCGCCACTACCACTGAAGAGAAAACGAATCCCGCATCGTACACACTCATCCCCACGGTTCCAGTTGATACAATGCGGTTCACCGTGAAGGTGGAAAGTGGCAATCTGGGATTCTGCATTCCCTTCCCCGACTCCGGCAATACTCCCGCGACTATCACGGTAAGTTGGGGTGACGGTACACCCGCTGTCGTAGTGCCCAAAGGCACGACGCTTGCAACGGGTGACAAATTCGAGTACACGTATGCCGAAGCGGGCACATACACCATCACCATCGGCTCGGGTGCGACGGCGGATAAACAGCAAATACCGGTACTGAATTTTAACCAAAGAGGCAGCTCTTACAACCCGAATAAACTGGTGAGCCTTGAAACGCCATTGCTCAATATGAATTGCTCATCTTTGAGCAAAGCGTTTAGAATTTGCGAAAACTTAACCACAATCCCGGGAAATCTTTTCGAAAAGAACACAGCGGTTACAAACTTCAGCAATTGCTTCGATTATTGTAAAGCATTAACCGCAATCCCGGGAAATCTTTTCGAAAAGAACACAGCGGCTACAAACTTCAGCTTTTGCTTCTTTAATTGCGAACTATTAAAAGAGATTCCTAACGAGCTTTTCGCAAGCAACACAGCGGCTACAAACTTCAGCGGTTGCTTCGCTAATTGTAAAGGATTAACCACAATCCCGGGAAATCTTTTCGAAAAGAACACAGCGGCTACAGACTTCAGCAATTGCTTCTATTATTGTAAAGAATTACAGTCAATTCCAGGAGGGCTTTTCGCAAGCAACACAGCGGCCATAAACTTTAGCACTTGCTTCAATCACTGTGACGCATTAACCACAATACCGGAATCACTTTTCGCGAACAACACAGAGGCTACAAAATTTAGTCAATGCTTCGCCGATTGTACCGCTTTAACCACAATCGAGGCAAGACTTTTCGCGAACAACGCAAATATAAACATTAGTTATTGCTTCTCTGGCTGTACCGCATTAACCACAATTTCGGCAGATCTTTTTGCGAATAACACAGCTATTAAAAGCTTCAACTATTGCTTCTATGAGTGTACCGCATTAAAGGCAATCCCGGAAGGGCTTTTCGCGAAGAACGCAGAGGCTACAAGCTTCAGCTATTGCTTCGCTAATTGTAAAGGATTAACCGCAATCCCGGAAAATCTTTTCGAAAAGAACACAGCGGCTACAGACTTCAAAAATTGCTTCCAATCGTGTAGCGCATTAAAGGCAATCCCGGGAAATCTTTTCGAAAAGAACACAGCGGCTACAGACTTCAGCTATTGCTTCTATGACTGTAGTAGTACCCAATTAACCACAATCCCGGAAGGGCTTTTCGCGAAGAACGCAGAAGCTACAAACTTCAACAGTTGCTTCTATGGGTGTACCTATATGATGTTCAATCCAAATATATTCGTCGATCCCACCGCGGCCGAACAGGATAAATTAAACCGCTTCATAGATAAAGACATGGACTTTAGGAATTGCTTCTACCAAGTCAATCTGCATAACAATTCAGGTACCGCCCCCGCGCTGTGGAAGTATGAGAAAGGTTCGGGCCAGTGGAAAACGACAAATTGCTTCAAAGGCTGCATAATGTCAAATTCCGAAGATATCACGGATTATTCAGCTTGGGGCACTCCTAAATTCTAA
PROTEIN sequence
Length: 1097
MKTRSMYRGIVLIAAALATVACQNELKEEYNEPKPGEKITMTIRATQGAASQTRTDYEDNLGITGIDNIAVKWEGGSTDGAPVEKIKVFGVDANELDYSVDFNSLPSSLSQDGTSISFEGTINAKSLYFAMYPADNCNYNTTVQAIYTSFSDQTQDCAKPMAHLKGFDLMVGRAATTGSYDKLTFSHEAAMIRFTLNGVPSSEKITRVSLAAANNKLSSRMCASLAGSEIGGLTVEADTEYAPVSSLGLDITNHTPSTEPLKAYMMIPPCDLSNDQLTVTVNTESGNTYTGDLTTGASTLLKAGLCYTLEPTLALGKTISLPPATAGSLGNALNNITPAQGQTELAVTGAVNTDDITALAAFLKETKAENITAIDLSGISGITDVTGFAGCAKIEKVILPDAAEAIGDNAFEGCTALTTVIQNDPIPADVAPATRSISKRIKRIGHSAFKNCTSMTEMFLHADIQSVGNSAFEGCTAMTALIFEGTKAVNETDGISLGTGIITGTHADIKIFLPAITDLAVGTAYKTILEEKPTYYNFAGYGSATTTEEKTNPASYTLIPTVPVDTMRFTVKVESGNLGFCIPFPDSGNTPATITVSWGDGTPAVVVPKGTTLATGDKFEYTYAEAGTYTITIGSGATADKQQIPVLNFNQRGSSYNPNKLVSLETPLLNMNCSSLSKAFRICENLTTIPGNLFEKNTAVTNFSNCFDYCKALTAIPGNLFEKNTAATNFSFCFFNCELLKEIPNELFASNTAATNFSGCFANCKGLTTIPGNLFEKNTAATDFSNCFYYCKELQSIPGGLFASNTAAINFSTCFNHCDALTTIPESLFANNTEATKFSQCFADCTALTTIEARLFANNANINISYCFSGCTALTTISADLFANNTAIKSFNYCFYECTALKAIPEGLFAKNAEATSFSYCFANCKGLTAIPENLFEKNTAATDFKNCFQSCSALKAIPGNLFEKNTAATDFSYCFYDCSSTQLTTIPEGLFAKNAEATNFNSCFYGCTYMMFNPNIFVDPTAAEQDKLNRFIDKDMDFRNCFYQVNLHNNSGTAPALWKYEKGSGQWKTTNCFKGCIMSNSEDITDYSAWGTPKF*