ggKbase home page

GWF2_Bacteroidetes_38_335_gwf2_scaffold_3005_4

Organism: Bacteroidetes bacterium GWF2_38_335

near complete RP 53 / 55 MC: 2 BSCG 51 / 51 ASCG 12 / 38
Location: comp(10667..13960)

Top 3 Functional Annotations

Value Algorithm Source
kgp; lysine-specific cysteine proteinase Kgp Tax=GWF2_Bacteroidetes_38_335_curated similarity UNIPROT
DB: UniProtKB
  • Identity: 100.0
  • Coverage: 999.99
  • Bit_score: 2222
  • Evalue 0.0
Putative Gingipain R id=3981480 bin=GWF2_Bacteroidetes_38_335 species=unknown genus=unknown taxon_order=unknown taxon_class=unknown phylum=unknown tax=GWF2_Bacteroidetes_38_335 organism_group=Bacteroidetes organism_desc=a11 similarity UNIREF
DB: UNIREF100
  • Identity: 100.0
  • Coverage: 999.99
  • Bit_score: 2222
  • Evalue 0.0
kgp; lysine-specific cysteine proteinase Kgp similarity KEGG
DB: KEGG
  • Identity: 29.5
  • Coverage: 999.99
  • Bit_score: 443
  • Evalue 2.30e-121

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

GWF2_Bacteroidetes_38_335_curated → Bacteroidia → Bacteroidetes → Bacteria

Sequences

DNA sequence
Length: 3294
ATGAAAGCAATCTTTACTTATCTCATTTTCATTTTGGTTTCAGCATCCCTGATGGCCAATAAAAATGAAATTAAATTGAAAAATGCACAGGAAAGCGATCTCAAAATGGTCAGCAAATCAACAACCGGTTTTCAGGTTGTTAGTACTCTTTCCAAGCTGGATTACTTTGTTGTTAAAACTGAAAAAGGTTCATTTGTTCAGTTAAAAATTAACGGCTATACCCGTAACAACGATTTTGGTAGTCCACTTCTTCCCATTCAGAAAAGAATGATTGAAATTCCTCAGAATGCCAATGTTAAAGTTAATATTCTGAGTTATGATGAAGAAATCATCAATTTGAACGATTTGGGTTTGTCAATGAAACTGATGCCATCACAACCTTCGGTTTCTAAAAGCGCCGATCCTTCAGAAATTGAATTTGTTTATAACGAAAGTGTATATAAGCTTAATGATTTTAATAAACGCCCCATGGCCGAGGTAATTGAATTGGGAACCATGAGAGGTGTTAAGCTTGGTCGTCTGGAAATTGCTCCTTTCCGTTATAACCCTGTAACCAATGAATTGAAGGTGTACAACAATATTCAGATTGAAGTTGTTTTTGAAAATGCAAATCTGTCAAAAACCAATGAAATCAAGAACAATGCTTATTCACCGGCTTTTGAAAATAATTTCAGCAAACTTCTGAATTATTCACCTGTCAGATCTAAAGACGCTATCACTCAATATCCTATCAAAATGGTTATTGTGTCAGATCCTATGTTCGAAACCATTCTTCAGCCCTTTATTGAATGGAAAACAAGAAAAGGTTTTACTGTTGTAGAAGCATACACAAATGACGCTTCTGTTGGAACCACCACAACCTCAATAAAAAATTATATTCAAGGTTTGTACGACGCTGGCACTCCGACAGACCCGGCTCCGACTTATGTTCTTTTTGTAGGTGATTTGGCTCAGATTCCTGCTTTTAACGCAGGTGCCCATTATACCGATTTATATTATTTTACCTATGATGGTACCAGTGATATTTATCCTGAAATTTACTATGGAAGATTTTCTGCATCAAACACCAATCAACTTCAGCCTCAGCTTGATAAAACACTTGAATACGAACAGTATTTATTTCCGGATGATGCATTTCTGAATGATGTGATTTTAGTTTCAGGTGTTGATGCTTCATGGGCACCTGTAAATGGCAACGGACAAATTAATTACGGAACAGATAATTACTTCAATGCTGCTCACGGAGTAACTGATTATACCTGGCTTTACCCTGCTTCAGATGCAGCCGGTGTTGATGTAACCATCAGAAATCAGTTTAGTCAGGGTGCTTCATTGGTTAACTATACCGCACATTGCAGCGAAGATGGCTGGGCTGATCCCTCATTTACAGTAGCAAATGTTGCCTCGCTTACCAATGCACATGAATATCCATTATCAATTGGAAACTGCTGCCTTTCAAATAAATTTGATGTGGCAGTTTGTTTTGGAGAAGCACTCCTGAGAGCTGAAAACAAAGGCGCTATCGGCCATATCGGAGGATCAAACAGCACATACTGGGACGAAGATTACTGGTGGGGTGTTGGTTCAGAAGCTGTTTCTGCAACACCTGCCTATGTTGCTGGAAAACTTGGTGCATACGACTGCCTTTTCCATGATCATGGTGAAGCTGAAATTGACTGGTTTGTGACAAACGGACAAATAATGAACGCCGGAAATCTCGCTGTAACTGAGGCCGGTGGTGATGAACAATATTACTGGGAAATATACCATTTGATGGGCGATCCTTCAGTAATGACTTACATCTCTGTTCCTCCCGCACTCACTGTTTCTTACGCTGAACCTCTTCCGGTTGGAATTGGTACCCTTGATGTTACCACAGAACCTTATTCTTATGTGGCAATTTCTCAGAATAATGTGTTGCTTGATGCAAAATATACCGGATCAGGCACCACTGTAACACTTGAATTTACTCCGTTTACTGTTCCTGGTACAGCTGATATCGTAATCACCAAGCAGTTCAGACAACCTTATATTGCTACAATTGATATTTTTGCTGCCAATACCGATAATGATGCCATGATGGCTTCAATCAGTGTTCCTGCAACTTATGAAAGTGTGACTGCTCCTGATGTTACTCCCACCGTTACAATTAAAAATCTTGGAAACTTGAATCTGACTTCAGCTCAGGTTGGTTATGAAATTGACGGCGGATCGGTTGTTTCACAGGCATGGTCAGGTAACCTGGCTCAATACGAAGATGATGTGGTTACTTTCCCGCTTATTACCTTAGCCAGTGGTACTCATACTATTACTGCTTTTGTATCATGGCCAAACGGAGTGGAAGATGAGTTTCATCCCGGTGATACCCTCGAAAAAGTTTTCCATGTTACTGCCGGTGATGCTGCAACTGTTGGAGTGTATGAATTTCTTGATGTTTATTGTGCCGCAGAAACAATTACTCCTACCGTTACCATCAGTAATAAAGGTGATGTTGACCTTCTTTCATGTGACGTAAATTATCAGATTGATGCCGGTGCTGTTCAAACAATTAACTGGACAGGCACTCTTGCTCCCAATGCTGAAACAACTGTTTCCTTCCCCGCAATTACATTATCTCCAGGTGACCATACATTTAATGCTTTCACTTCTAATCCCAATGGAGGTACAGATGAAAATGCATCCAATGACAATATGGCTGTTAACTTCTCAGTATTTGCTGCTGCTCAAACCGTTTCGGTTGATATTTTAACCGATGACTATGGAAGTGAAACTTCCTGGGAAATTACTGATGATGATTCAGGTGATGTTTTATATTCGGGCGGACCCTATGCGGATTGGGATTCACAGCATTTTATTACAGAGTATTGCTTTGGTGAAGGATGTTATACATTTACCATTATGGACTCCTATGGTGATGGTCAGGACGGCTGGTCAAATGATGGCTCTTATGCTGTTGTTAACGTGACTACTTCTACAGCTTTAGGCTCAGGTAGCGGTAACTGGGGTTCGGATGATGTGGTTAATTTCTGCATTACCGGTGTGGGAATTAATGAGGTTGCGCAATCAAAAGTTGCAGTTTACCCCAATCCTACAAACGGAGAGCTTTTTGTGAAAAAAGGGGCAGGCAATGCCCATGTCGAAATTACCAATGTTATTGGTGAATTAATTTATTCTCAGGAAGTTTCAGATGAACTGATCAGAATAGATTTGAAAGGAAATTCAAAAGGTGTGTATTTTGTTTCTGTCGATCTGAATGGACAAATAATTACTGAAAAAATTGTACTTGTCAGATAA
PROTEIN sequence
Length: 1098
MKAIFTYLIFILVSASLMANKNEIKLKNAQESDLKMVSKSTTGFQVVSTLSKLDYFVVKTEKGSFVQLKINGYTRNNDFGSPLLPIQKRMIEIPQNANVKVNILSYDEEIINLNDLGLSMKLMPSQPSVSKSADPSEIEFVYNESVYKLNDFNKRPMAEVIELGTMRGVKLGRLEIAPFRYNPVTNELKVYNNIQIEVVFENANLSKTNEIKNNAYSPAFENNFSKLLNYSPVRSKDAITQYPIKMVIVSDPMFETILQPFIEWKTRKGFTVVEAYTNDASVGTTTTSIKNYIQGLYDAGTPTDPAPTYVLFVGDLAQIPAFNAGAHYTDLYYFTYDGTSDIYPEIYYGRFSASNTNQLQPQLDKTLEYEQYLFPDDAFLNDVILVSGVDASWAPVNGNGQINYGTDNYFNAAHGVTDYTWLYPASDAAGVDVTIRNQFSQGASLVNYTAHCSEDGWADPSFTVANVASLTNAHEYPLSIGNCCLSNKFDVAVCFGEALLRAENKGAIGHIGGSNSTYWDEDYWWGVGSEAVSATPAYVAGKLGAYDCLFHDHGEAEIDWFVTNGQIMNAGNLAVTEAGGDEQYYWEIYHLMGDPSVMTYISVPPALTVSYAEPLPVGIGTLDVTTEPYSYVAISQNNVLLDAKYTGSGTTVTLEFTPFTVPGTADIVITKQFRQPYIATIDIFAANTDNDAMMASISVPATYESVTAPDVTPTVTIKNLGNLNLTSAQVGYEIDGGSVVSQAWSGNLAQYEDDVVTFPLITLASGTHTITAFVSWPNGVEDEFHPGDTLEKVFHVTAGDAATVGVYEFLDVYCAAETITPTVTISNKGDVDLLSCDVNYQIDAGAVQTINWTGTLAPNAETTVSFPAITLSPGDHTFNAFTSNPNGGTDENASNDNMAVNFSVFAAAQTVSVDILTDDYGSETSWEITDDDSGDVLYSGGPYADWDSQHFITEYCFGEGCYTFTIMDSYGDGQDGWSNDGSYAVVNVTTSTALGSGSGNWGSDDVVNFCITGVGINEVAQSKVAVYPNPTNGELFVKKGAGNAHVEITNVIGELIYSQEVSDELIRIDLKGNSKGVYFVSVDLNGQIITEKIVLVR*