ggKbase home page

SCNpilot_cont_500_bf_scaffold_693_2

Organism: SCNpilot_BF_INOC_TM7_49_20

near complete RP 48 / 55 MC: 1 BSCG 48 / 51 MC: 1 ASCG 11 / 38 MC: 2
Location: comp(1213..4239)

Top 3 Functional Annotations

Value Algorithm Source
Glycosyltransferase sugar-binding region containing DXD motif family protein n=1 Tax=Firmicutes bacterium CAG:24 RepID=R5H3J4_9FIRM similarity UNIREF
DB: UNIREF100
  • Identity: 49.1
  • Coverage: 220.0
  • Bit_score: 230
  • Evalue 9.90e-57
Uncharacterized protein {ECO:0000313|EMBL:EMZ21795.1}; TaxID=97139 species="Bacteria; Firmicutes; Clostridia; Clostridiales; Clostridiaceae; Clostridium.;" source="Clostridium sp. ASF502.;" similarity UNIPROT
DB: UniProtKB
  • Identity: 46.1
  • Coverage: 230.0
  • Bit_score: 230
  • Evalue 1.40e-56
hypothetical protein similarity KEGG
DB: KEGG
  • Identity: 49.1
  • Coverage: 222.0
  • Bit_score: 221
  • Evalue 1.10e-54

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Clostridium sp. ASF502 → Clostridium → Clostridiales → Clostridia → Firmicutes → Bacteria

Sequences

DNA sequence
Length: 3027
ATGAAAAAAATTCCTAAAAAAATTCACTATTGTTGGTTTGGGGGAAATCCACTCACACCTTTAGCTCAAGAGTGTATTGAAAGTTGGAAGAAATATTGCCCTGACTTTGAAATTATTGAGTGGAATGAGACAAACGTTGATATAGAAGGCTCGCCATTTATGAAGGCGGCGTATGAAAACAAAAAATGGGCGTTTGTGTCTGACTATGCACGCTACAAGGTTATACATGACAACGGTGGAGTATATCTAGACGTCGATGTCGAGGTTATCAAGAGTATTGATGATCTTCTTGAGCTTGGTGCATATATGGGCTTTGAAGGCAATGAGTATGTCAATTCAGGTTTGGGATTTGGAGCTACTGCTGGTGATGAAATGCTCGGAGAAATCTTGAGTGTATATGACAGCATAAACTACGAGGATCATAAAGACAATCCAGCGGCAATATCAACGCCTATAATTGTCACAGATATATTATCAAAGCACGGATTTCAGAAGAACGGTGAACTACAAAAGGTTGGTAGTATGACTATTCTTCCTGAGGACTACTTGTGTCCAAAAAATCCACTTACACGACTCACCAACATTACAGAGAACTCACGATCAATTCACCACTACGATGCAAGTTGGGTTAATTCAAAAGAACGAGAGCATATTGATGCCCTTGAGCTTAAAAGTAAAGATGTTGCTGCACAATTTCCTGATTTAATATCAATCATCATCCCTGTCTATAACGGTGAGGATTATCTGGCGCACGCGATTGAGAGCGCGCTCAACCAATCATATAAGAATCTTGAGGTCATTGTTGTTGATGATGGGAGTAGCGATGGTACCGAGAGTATTGCACGTTCATACGGTGCAAGACTAAGGTATATTCGTAAGGCGAATGGTGGTGTTTCGTCAGCTCTCAACACCGGTATTAAGAGCATGCGTGGTAAGTATTTTGCTTGGCTGTCTCATGATGATATGTACCACGAGAAAAAGCTAGAGCTGTTACACAAGGAGATTAAAAATTCCGACAAAACTATCGCTATATCCGACTGGGATATTGTAGATGCGAGTGGAACATTCCTTAGAAAAGCCACGCTGGATAGTCGACTTGAAACTTCACCTAAATCTTTTTTGGCGTTCGATAGAAACACTTGGCTTAACGCATGTGCCATGCTTATACCTAAGAGTCTTTTTGATGAAGTGGGTGTCTTTGATGAAAGCCTTAGAACAACTCAAGATTACGACATGCATTTGCGCATGATTGAAGCGGGCGCTCGCTATAAAATCGTTCACAAAAGCCTGTTTTACTCTAGGGCTCATGCTAACCAGGGGTCGCTCACAATTGGTGACGATACATTCAAGAACTCAGATAAAATGCACGAATTAATCATTGCTTCACTGGAAAAAAATGATTATGAAACGTACTTTAATGGGAGTGTGAGTGAATTTTATAAAGCGTATCACTCATTTGTAAAAAATGGGTATTCACTTACCCCGTCAGCCATGGTGTCAAGGATGCTTAATTTGTATCCTGCGGATAGCGTATTCCTTAGAAAAGTTATTGATAAGACGCTCCTCTCGCTACATGGTGATGCTCCAGATAGTGTCATCGATAGCTTAATCAATGAAGTGTCGACAAAAAAAGGCAAAAAACCGCGACTAATGTTCACTTCTGGTACATGGATGACCGGCGGTATGGAGCGGGTGATGTCGAATCTATTTATTCATATTGCGAAGAAGTACGACATCTATCTCGTTACGCCAAGTGGGTTTACGCAGAACGACGCTACCATCCCTCTTCCTGATGCGGTAACGCACATTAGCATAGACGAGTCGCTGTATTTTGATAGGTTTGATATTGTTGGATTCACTCTTGCAAAGCTATTGGAAGTCGACATAGTTATAGGTTTCCTTAATCTTAATGAAAAACAATTAAAGTTATACGAACATTGTGCGGATAATGGTATTAAGACGATCGCTTCTAATCATGAGTATTACTTCTATCCGTATAGAAGCTTATACGCGCCAATGAGGCGCCTCGCGTTACGTCGCAAGGAAGTCTATAAAAGACTCGATGCCGTACTTTGGTTAACTAACTTCAACACAGCTATCGGCAGGCTTGATGCGAGTAATGCTCATCTAATGCCAAATCCAAATACATATGACGTTGCTTCTGGAATAAAAAATAAAAAGAATACCAAGAATATTCTTTGCGTCGGAAGATTTAATGATCATGTTAAGCGAGTTGACCGGATGCTCGCTGCCTATAGCCTAGTAAGCAAGCAGGCTCCAGATGCAACCCTTACCTTAGTTGGGTCTATAGATCTAGATAAGCCTACGGCGGATGCGTCAGACAGAACAATTAACCAGCTTATAGATTTCCATGGCCTTGATCGCGCGAAGATTACTATTGTAGGTGAGACGAAAGACACACTTCAGTACTATCGAAATGCCGATGTGTTACTCATGACCTCAGAGTCGGAAGGCTTTCCTATGGTTGTAACAGAAGCGATGAGCCAAGGTGTTCCAGTTGTATGTGGGGACATACCGGGTGTGGAGGATGTGGTTGTTGATGGTGTTACTGGATATATTGTGGCGCAGGATGACTACGATAGCTATGCTGCAAGAATATTGAGTATCATTCTTAGCAATAAAGAGCGAGTAACTCTTGGTGAGAATGCTATCTCACATGTAAAGCAATATGACGGAAAAAATATTGCAACTGGATGGATGAGTATCATCGATTCTATTCTTGGTGGGCGAGAGGTGGCGTTTACTGCTTTCGACGCTGATGTGAGTAAAGCTAACGTAAAGTTCGAGAAACGACTACTTGACGAGCTCGATAAGTCGCTCCATCAATCTGTAAGAATGTGGGAAGGGCATGACCTTACATTATTCATCGAGAGTAAGAAGGAAAAGATTAAGCGTCTAATGAGAGGTGTTCAAAGGGATTACAAAGCCCTGGGGCCTTCTAAGACTGGAAAAAAGGCAATTAAAAAAATTGCCAAAAAAATTAAGAGCAAATTACGGATGGATTAA
PROTEIN sequence
Length: 1009
MKKIPKKIHYCWFGGNPLTPLAQECIESWKKYCPDFEIIEWNETNVDIEGSPFMKAAYENKKWAFVSDYARYKVIHDNGGVYLDVDVEVIKSIDDLLELGAYMGFEGNEYVNSGLGFGATAGDEMLGEILSVYDSINYEDHKDNPAAISTPIIVTDILSKHGFQKNGELQKVGSMTILPEDYLCPKNPLTRLTNITENSRSIHHYDASWVNSKEREHIDALELKSKDVAAQFPDLISIIIPVYNGEDYLAHAIESALNQSYKNLEVIVVDDGSSDGTESIARSYGARLRYIRKANGGVSSALNTGIKSMRGKYFAWLSHDDMYHEKKLELLHKEIKNSDKTIAISDWDIVDASGTFLRKATLDSRLETSPKSFLAFDRNTWLNACAMLIPKSLFDEVGVFDESLRTTQDYDMHLRMIEAGARYKIVHKSLFYSRAHANQGSLTIGDDTFKNSDKMHELIIASLEKNDYETYFNGSVSEFYKAYHSFVKNGYSLTPSAMVSRMLNLYPADSVFLRKVIDKTLLSLHGDAPDSVIDSLINEVSTKKGKKPRLMFTSGTWMTGGMERVMSNLFIHIAKKYDIYLVTPSGFTQNDATIPLPDAVTHISIDESLYFDRFDIVGFTLAKLLEVDIVIGFLNLNEKQLKLYEHCADNGIKTIASNHEYYFYPYRSLYAPMRRLALRRKEVYKRLDAVLWLTNFNTAIGRLDASNAHLMPNPNTYDVASGIKNKKNTKNILCVGRFNDHVKRVDRMLAAYSLVSKQAPDATLTLVGSIDLDKPTADASDRTINQLIDFHGLDRAKITIVGETKDTLQYYRNADVLLMTSESEGFPMVVTEAMSQGVPVVCGDIPGVEDVVVDGVTGYIVAQDDYDSYAARILSIILSNKERVTLGENAISHVKQYDGKNIATGWMSIIDSILGGREVAFTAFDADVSKANVKFEKRLLDELDKSLHQSVRMWEGHDLTLFIESKKEKIKRLMRGVQRDYKALGPSKTGKKAIKKIAKKIKSKLRMD*