ggKbase home page

L3_072_000M1_scaffold_2235_10

Organism: dasL3_072_000M1_concoct_101_fa

near complete RP 47 / 55 MC: 1 BSCG 50 / 51 MC: 4 ASCG 15 / 38 MC: 1
Location: 10843..14166

Top 3 Functional Annotations

Value Algorithm Source
Ig domain protein group 2 domain protein n=1 Tax=Clostridium thermocellum (strain ATCC 27405 / DSM 1237) RepID=A3DGE8_CLOTH similarity UNIREF
DB: UNIREF100
  • Identity: 31.8
  • Coverage: 556.0
  • Bit_score: 210
  • Evalue 5.20e-51
dockerin type I cellulosome protein similarity KEGG
DB: KEGG
  • Identity: 31.8
  • Coverage: 556.0
  • Bit_score: 210
  • Evalue 1.50e-51
Ig domain protein group 2 domain protein {ECO:0000313|EMBL:ABN53027.1}; TaxID=203119 species="Bacteria; Firmicutes; Clostridia; Clostridiales; Ruminococcaceae; Ruminiclostridium.;" source="Clostridium thermocellum (strain ATCC 27405 / DSM 1237 / NBRC 103400 /; NCIMB 10682 / NRRL B-4536 / VPI 7372) (Ruminiclostridium; thermocellum).;" similarity UNIPROT
DB: UniProtKB
  • Identity: 31.8
  • Coverage: 556.0
  • Bit_score: 210
  • Evalue 7.30e-51

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Ruminiclostridium thermocellum → Ruminiclostridium → Clostridiales → Clostridia → Firmicutes → Bacteria

Sequences

DNA sequence
Length: 3324
ATGAAAAAACTATTAATGATTATTGCAATTGCGACGCTCGCGTTTACGTGCGTCTTTTCGACGTGTAACAATGCCGGCGAGACAAAAAAACACGAGCATGTTTTCGGCGATTATTTCGTCACGAAAGAAGCGACATGCACGGAGAACGGCGAAAAGGTCCGCTATTGCACCGAATGCAAACAGCCCGACGATATTGTCGTAATACCTGCACTCGGTCACGATATAGTAAAGGACGAAGCAGTGTCGGCAACCTGTCTTAAAACAGGCTTGACAGCGGGAGAACATTGCGCCCGTTGCGATTACAAAGTAGAACAAACCGAAGTACCCGCGCTCGGTCACGATATAGTAAAGGACGAAGCAGTGTCGGCAACCTGTCTTAAAACAGGCTTGACAGCGGGAGAACATTGCACCCGTTGCGACTACAAAGTAGAACAAACAGAAATACCCGCGCTCGGACACGACATAGTAAAGGACGAAGCAGTGTCGGCAACATGTCTTAAAACAGGCTTGACAGCGGGAGAACATTGTTCTCGTTGTGATTATAAGATGGAACAAACCGAAATACCTGCGCTGGGACACGATATAGTAAAGGACGAAGCGGTTTCCGCAACCTGCCTTAAAACAGGCTTGACAGCGGGAGAACATTGTTCTCGTTGCGATTATAAGGTGGAACAAACAGAAGTACCTGCACTCGGTCACGATATAGTAAAGGACGAAGCAGTTTCCGCAACCTGTCTTACGACAGGACTTGCAGCGGGCGAGCATTGTTCTCGTTGCGACTACAAAGTAGAACAAACAAAAGTACCTGCACTCGGTCACGATATAATCAAAGACGAAGCAGTATCGGCAACCTGTCTTAAAACAGGCTTGACAGCGGGCGAACATTGTTCTCGTTGCGACTACAAAGTAGAACAAACAGAAATACCTGCACTCGGTCACGATATAATCAAAGACGAAGCAGTATCGGCAACCTGCCTTACGACAGGACTTACAGCGGGAGAACATTGCACCCGTTGCGATTATAGGGTAGAACAAACAGAAGTGCCTGCACTCGGACACGATATAGTAAAAGACGAAGCAGTATCGGCAACCTGCCTTAAAACGGGACTTACAGCGGGAGAACATTGCACCCGTTGCGACTACAAAGTGGAACAAACAGAAGTACCTGCGCTCGGTCACGATATAGTAAAGGACGAGGCGGTTTCCGCAACATGTCTTACGACAGGACTTACAGCAGGCGAACATTGCACCCGTTGCGACTACAAAGTAGAACAAACAGAAGTACCCGCACTCGGTCACGATATAATAAAGGACGAGGCAGTATCGGCAACATGTCTTAAAACAGGACTTACAGCGGGCGAACATTGTTCTCGTTGCGATTATAAAATAGCGCAACAGATCGTTCCTAAAACCGATCACGTCTACGGCACCGACGGAACGTGCAAATTCTGCGGCGAAAGTAAGTACGTAGTGACGTTTGAACTCAATTCGGACGGCAAGGGTTACACGGTTGCCGGGCTTAAAAAGAAAGCGACGATTACGAACGGGATACTGATTATTCCCGGCATGTATAACAACCTGCCGGTCACCGAAATCAAAGCGACGGCGTTCCGCCCCGAGTACGACTACGGAACCGAACATAAAATCAAAAAGGTAATAATAGAAGAGGGCGTGTTGCAGATAAACGGCGGCGCGTTTGAAAACTGCGACAATATAGAAGAGGTTTCGCTGCCGAAAAGTCTTACGCGCATTTGTTATTATGCGTTTGCTAACTGCGAAAAGCTTAAAGAAATAGTCATTCCGGCCAAAGTCGTGTACCTCGAATCGCATACTTTCAACTACTGCACGTCGCTTAAAAAAGTCACTATTCTGGGCGATATAAACACAAGCGAGGCTGCGGGAACGGCGTTTGAAGGATGTCTTAACATCGAAGAATGGTACGGCACGCCCTGCACGTTCAGATTTTTCGACAAAACAAATCTTAAAAAAGCGACCATGAAAACGCTTGGCTCCAATTCGTATTTCGTTGACAGAGGCGTTTTGTACGGAGCAAACAATCTCGTTGAACTTTCGATTCCGCAAATAGGAGATTCCGCATCCGACGGCTACGACGGTTCGACGGAGTATTTCCTCGGTAAAATATTCTGGTCTCAGGCGAAAAGCAACTCGATAGTACCGGCTTCGCTTAAAAAAGTAACGGTGGAAACGGGATCGCTGCCGGCAACCGCTTTTTCCGATTGTTCTCACATCGAAGAAATAATTCTGCCCGACGATATTTACATTATAGAAGCCAAGGCCTTTTACAACTGTGCGTCGCTTGTAAAACTTAACGTCCCGTCCAAGGTCGCTTTGGTTGCCGCCGACGCGTTTACGGGCGCGGATAAACTGCCGTACACGGTAAGCGACAACTGCCGATATCTTAAAAAAGGCGACAATAAGTACGGTTTGCTGGTCGGCGTTGTAAATAAATCGGCGAAAACCGTGATAAACGACGTCGCCGAAATAATTGCCTGCGGGGCTTTCGACGGAGTGCAGACATCCGTGATGAATAAATACGACAACGCTTACTATCTGCCTTCATCCGGTAACGCATATTTCGCTCTTGTAAAATCGGCTTTTCAGCCGATTGCATCGTGCAAATTGAACGCCGCTACAAAAATAATATGCGCACGCGCGTTCCGCAATTGCTATATGCTTAAAGCGTTTGAAGTTCCCGCGTCGGTCATAAGCATAGGCGAAGGCGCGATAGAGGGCTGCGCCGCGCTCGAAACTCTGTCCGTGCCGTTTGCGGGAGTAGTAAGACAGCAAAAAACCGACTATGTACAACTTACCGAAGGCTTTTTGTTTGGCAGAACGTCCGACAGGGCTCAGACGCGTCAGTTTGTATGCGTCGGCAAATATTCTCAGGATGTGTTTTTCAATATTCCGTCAACCATAACGCTTATAAAAACGGGCGGAAAATATATTCAGAAAGACGCGTTCATGAATATGAAAGGGAATAACCCTACTACTCGCCTTATGACCGTGATAATAGGCGACGACGTGGAGGAAATAGACGAAACGTCATTTATAGATAACTACTTAAAAGCGGTGGTAATCGGCAAAAACGTCAAAAAAATCGGATACAACGTGTTTAATTCGTATATGGCAAGCACTGCGCCGTCGTACGTCTACTATAACGGCAGCGAAAGCGAATTCGCGCAGATAAAAATAAACGCGGACGACAAGACGCTTACCGCTCCTCGCTATTATTACAGCGAGCAAAAGCCCGCGGCAGAAGGACAGTATTGGCATTACGTGGACGGTATGCCCGTAGCGTGGTAG
PROTEIN sequence
Length: 1108
MKKLLMIIAIATLAFTCVFSTCNNAGETKKHEHVFGDYFVTKEATCTENGEKVRYCTECKQPDDIVVIPALGHDIVKDEAVSATCLKTGLTAGEHCARCDYKVEQTEVPALGHDIVKDEAVSATCLKTGLTAGEHCTRCDYKVEQTEIPALGHDIVKDEAVSATCLKTGLTAGEHCSRCDYKMEQTEIPALGHDIVKDEAVSATCLKTGLTAGEHCSRCDYKVEQTEVPALGHDIVKDEAVSATCLTTGLAAGEHCSRCDYKVEQTKVPALGHDIIKDEAVSATCLKTGLTAGEHCSRCDYKVEQTEIPALGHDIIKDEAVSATCLTTGLTAGEHCTRCDYRVEQTEVPALGHDIVKDEAVSATCLKTGLTAGEHCTRCDYKVEQTEVPALGHDIVKDEAVSATCLTTGLTAGEHCTRCDYKVEQTEVPALGHDIIKDEAVSATCLKTGLTAGEHCSRCDYKIAQQIVPKTDHVYGTDGTCKFCGESKYVVTFELNSDGKGYTVAGLKKKATITNGILIIPGMYNNLPVTEIKATAFRPEYDYGTEHKIKKVIIEEGVLQINGGAFENCDNIEEVSLPKSLTRICYYAFANCEKLKEIVIPAKVVYLESHTFNYCTSLKKVTILGDINTSEAAGTAFEGCLNIEEWYGTPCTFRFFDKTNLKKATMKTLGSNSYFVDRGVLYGANNLVELSIPQIGDSASDGYDGSTEYFLGKIFWSQAKSNSIVPASLKKVTVETGSLPATAFSDCSHIEEIILPDDIYIIEAKAFYNCASLVKLNVPSKVALVAADAFTGADKLPYTVSDNCRYLKKGDNKYGLLVGVVNKSAKTVINDVAEIIACGAFDGVQTSVMNKYDNAYYLPSSGNAYFALVKSAFQPIASCKLNAATKIICARAFRNCYMLKAFEVPASVISIGEGAIEGCAALETLSVPFAGVVRQQKTDYVQLTEGFLFGRTSDRAQTRQFVCVGKYSQDVFFNIPSTITLIKTGGKYIQKDAFMNMKGNNPTTRLMTVIIGDDVEEIDETSFIDNYLKAVVIGKNVKKIGYNVFNSYMASTAPSYVYYNGSESEFAQIKINADDKTLTAPRYYYSEQKPAAEGQYWHYVDGMPVAW*