GWF1_CP_41_5_gwf1_scaffold_506_13

Organism: Spirochaetes bacterium GWF1_41_5

partial; RP: 37 / 55; BSCG: 38 / 51; ASCG: 9 / 38; MC: 1
Location: comp(20091..23462)

Top 3 Functional Annotations

Value: Glycosyl hydrolase family 20, catalytic domain protein n=1 Tax=Bacteroides coprophilus DSM 18228 = JCM 13818 RepID=S0F6P1_9BACE
Algorithm: similarity
Source: UNIREF
DB: UNIREF100
  • Identity: 21.4
  • Coverage: 477.0
  • Bit_score: 97
  • Evalue: 6.60e-17

Value: N-acetyl-beta-hexosaminidase Tax=GWF1_Spirochaetes_41_5_curated
Algorithm: similarity
Source: UNIPROT
DB: UniProtKB
  • Identity: 100.0
  • Coverage: 999.99
  • Bit_score: 2275
  • Evalue: 0.0

Value: N-acetyl-beta-hexosaminidase
Algorithm: similarity
Source: KEGG
DB: KEGG
  • Identity: 24.4
  • Coverage: 495.0
  • Bit_score: 114
  • Evalue: 1.90e-22

Taxonomy

GWF1_Spirochaetes_41_5_curated → Spirochaetes → Bacteria

Sequences

DNA sequence
Length: 3372
ATGAAAAAAAATAAAACATCAATACGCCTGATAACCCTGCTTCTTCTGGCAACAGATATGTATTTAATAAATCCTGCATTTTCTGCTGATGCAACCCAATATAATGCAGTTATTGAAAATCTACAGCCTAAAGAATGGAAAATTGAAGATTTAACGCCGGGATTTTTAATTTATAACAAGTATCCTGTTGTTTCCTTTTCGGAAATACTTAAAAACAGTATTGTTGTTTGCAATCCTGCTTTGAAAAAAACTGATGCCTTTAAATTTAGCTGCGTAGCAAAAGAAGATATTACTATTTATATGTTTTTATACTCCAATCCGGAAGCGGAATGGAATTCCTGGGATAAAAGCAAGGCGGAAATTAAATGGGAAATAAGCAAGACTGACGGACAAAAATCAATATCACAATATAAAAATATTCCATACAGGCATTTCCCCAAAGGAAATATTGTTCTGGATTATACTCCACCAGCCAAGCCTGCGACTATCCCCTTTTTTATTATTACAGCGAAGTCGCTTGTTATGGGTAAAAATGATACGCAGCCGAGCGCGCAAGTTTCTAATTTTTTAAAATTTATTTTACTGCCGGAAAGTACTGGGAAGGTTACCGTCGACGGTATAATTAATTCTGAAGAATGGGCAGCTGTTAAAAAAGAAAGTATTATGCTCTATCAGACAGATGCGAGTAAAATCAGAACAGGTTTTATTTTACCCGCAGAAAAAACCGAAGTATTCCTGGCACGAAACAGCAATACATTTTTTCTTGCTGTTAAAGCGTATAAAAATGATATGAACACCATACGCTCCGAAATAAAAACGAACAGCTACAGCACGCTTTGGGACGATGAAAGCATTGAGATTTATTTTGATTCTCCGCGCCTGGCAAATAACATAATGCATCTGGTAGTGAATTCGCCCGGATACTGCGGCATTAGCAAAGCAAACAGGGAAACCAATGTACAGCTTATGGTTAAAACATCTGTACAGAAAGATCACTGGGAACTCGAAATAGCAATCCCGATGCATCAGATAATGAATGGAAACGTTAAAAATGATATTATCGGATTCAATATGGTCCGCAATACATATCTTGATAATAAAATCAGCGAACGTACCGGATTCGCCACAGTAAAATCGGAATCATATAAAAACTTCGCGCCATTGTTTTTAGAGACCAGAGACAAAATATATAATAAAGAAGTGCAAAATAAGCTAAAAGATGAATTAACCGGAGCAAATGCTCCCAAATCGGAAATGCCGTATAATTCATTCGGGATTTATCCGCTTCCCAAGATTAAAAAAGATTTAAACGGTTACATTATACTGAAGGATTTTTCTATAACAGATCTGGCCAGGGCGAATAATACCAGTGCACTTCTAAAAAGCGAATTATTGAGGAAGTATGCTTTTTCGTTTCCAACCAAAGCAGTCAAAGAGATTATGATAGGATTAATTGAAAATGAGGAAGTAAAAAAAAAATTAACAGAGAGAAATCTTTCCGGTGCGCTGAAAAACGATAACCCGGATGGATTTATTTTAAGCATTCGCGATAGCGGCATTCTCATTGCCGCACCCAATGATCGGGGGATATATTATGCTGTACGCGCGTTTTTGAAACTCGCCGATGTTGACACTCCTGCCGGAGAAATGCCGCGAATAAAATGCCGCGATATTATTGACTGGCCGGATCAGAAATTTCGGGCTTTTTTTACTCATGTCATTGGGCCTGGATGGATGATGAGAATGGCAAAACCGGTTGCCGTTAACGATATTGTTTTTTTCAAAAAATTTATTTTTGACACTATAGCGGGAGCGCGCTATAATGCCATGGTTTTTGAATTTAATAATACCTATCGTTTCAAGTCGTACCCGGAAATTGCTATACCCAATGCGCTTAATGAATCCGAGATGAAAGACATTCTGCAGTTCTGCCGCGATCATTATATTATACCGATACCGGGAATTAACACTCCCGGTCATGCTGGCTGGCTTGTGGACCG
GCACCCGGAATGGGCGGAATTAAACnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAGCGCTACTGGCCGTCAGTAATATATTTACGGAAGTAATTGAATTATTCGGCGGGAAAGAAAAATGTCCGTATTTTCATATCGGCGGCGATGAAGTACGCTGGACATTATTCGATAACAGCCATAAAAAATTACGGGATGAATGTCCGTACTGTAAGGGAATTCCCTATAACAAGTTGCTGCTTGATTATATCAATTTACGGCACGAATTTTTTAAAGCCAGAGGTATACGCATGATGATGTGGGCTGATATGTTCAGCGATTTGCACAACGGTTCGAAATTTCGGACAACAGAACTTGTCAGAACCATGCCTAAAGACATCATTCTCGTTCCCTGGAGCGGGGAACATGATTACCCGGCTATTCCCGGCTGGCTCCAGGAAGGTTTTAGCGTATTAAAAAGCTCCACGGGATATCAGCATAATGGAATTTGCGATCAGCAGATGTTTGGTTATATGCTGAATGATTTTACAACTTCCGTATGGCTTAGCTTTACCTATGGCCGAGCATCATCGCATAATTATTACTTCAACACCTCTATTCTCCGTTACGGGGACCTGGCATGGAATAATGAATCGGCGGTAAGTAACAGGGAAGAAGGAGAACTCGGACGGACTGATTATTTATTCCGCTACGGTAATGCCCTTTGTTATTATTATAATCAGGAACGATTTCCCAAACAATCATCAAAAACCAAAATATTAGATATAACCAAAGCAGCGAACTCACTCCGTAAAGACTGTTTTAATGCCGGATGGGAGTATGATTTATCTGCTTTTCATCCGGGTATAACTGATATTGCCGGTATAGAAATGCAAGTTACAGATCAGTGTATTGTGCTTGACGAAAAACGGTTAAGAGTTTCTGATATAAATTTATCCTGCCGCGCTTCATCTGTAATCTTTTTGCATACTGCATATCTGCCGGAAAAAAAAGAAGAAGCATTCCGCAACCGTATCCGTACCAACGGCAATTTTTCGGAAATGCCTCATTTTAATCCTGTTGCCTGTTATTTTGTAAAATACAGCGATGGCTCGCAGGAAAATATAATTATGCGTTATGGACTTAATGTCGGAGCCATACGCCCGCCGCTGCATCTGCGTTTCCCGTATCATATCCGGCATGTTTTACGTGCCCAGACCGGAAACTGGCCAGAAGCTCAGGATGGACGCGATGTTACTCCCGGCGCTCCTGCTTTATATCAATACGAATGGGTAAATCCTCATCCTGAAAAACTAATCGCTTCTATTGATTTTGTATCGCTGGGGACAGAAGTAATTCCCGCTCTGGCTGCAATTACCATACGCGATGTGCAGTAA
Protein sequence
Length: 1124
MKKNKTSIRLITLLLLATDMYLINPAFSADATQYNAVIENLQPKEWKIEDLTPGFLIYNKYPVVSFSEILKNSIVVCNPALKKTDAFKFSCVAKEDITIYMFLYSNPEAEWNSWDKSKAEIKWEISKTDGQKSISQYKNIPYRHFPKGNIVLDYTPPAKPATIPFFIITAKSLVMGKNDTQPSAQVSNFLKFILLPESTGKVTVDGIINSEEWAAVKKESIMLYQTDASKIRTGFILPAEKTEVFLARNSNTFFLAVKAYKNDMNTIRSEIKTNSYSTLWDDESIEIYFDSPRLANNIMHLVVNSPGYCGISKANRETNVQLMVKTSVQKDHWELEIAIPMHQIMNGNVKNDIIGFNMVRNTYLDNKISERTGFATVKSESYKNFAPLFLETRDKIYNKEVQNKLKDELTGANAPKSEMPYNSFGIYPLPKIKKDLNGYIILKDFSITDLARANNTSALLKSELLRKYAFSFPTKAVKEIMIGLIENEEVKKKLTERNLSGALKNDNPDGFILSIRDSGILIAAPNDRGIYYAVRAFLKLADVDTPAGEMPRIKCRDIIDWPDQKFRAFFTHVIGPGWMMRMAKPVAVNDIVFFKKFIFDTIAGARYNAMVFEFNNTYRFKSYPEIAIPNALNESEMKDILQFCRDHYIIPIPGINTPGHAGWLVDRHPEWAELNXXXXXXXXXXALLAVSNIFTEVIELFGGKEKCPYFHIGGDEVRWTLFDNSHKKLRDECPYCKGIPYNKLLLDYINLRHEFFKARGIRMMMWADMFSDLHNGSKFRTTELVRTMPKDIILVPWSGEHDYPAIPGWLQEGFSVLKSSTGYQHNGICDQQMFGYMLNDFTTSVWLSFTYGRASSHNYYFNTSILRYGDLAWNNESAVSNREEGELGRTDYLFRYGNALCYYYNQERFPKQSSKTKILDITKAANSLRKDCFNAGWEYDLSAFHPGITDIAGIEMQVTDQCIVLDEKRLRVSDINLSCRASSVIFLHTAYLPEKKEEAFRNRIRTNGNFSEMPHFNPVACYFVKYSDGSQENIIMRYGLNVGAIRPPLHLRFPYHIRHVLRAQTGNWPEAQDGRDVTPGAPALYQYEWVNPHPEKLIASIDFVSLGTEVIPALAAITIRDVQ*
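
The lengths reported above can be cross-checked against the Location field. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical and not part of ggKbase, and it assumes the location string uses 1-based inclusive coordinates, with `comp(...)` marking the reverse strand. Under those assumptions, the span equals the DNA length, and one third of it equals the protein length with the trailing `*` counted as one position.

```python
import re

def parse_location(loc):
    """Parse 'comp(START..END)' or 'START..END' into (start, end, strand).

    Assumes 1-based inclusive coordinates; 'comp(...)' is read as the
    reverse (complement) strand. Hypothetical helper for illustration.
    """
    m = re.fullmatch(r"comp\((\d+)\.\.(\d+)\)|(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    if m.group(1) is not None:
        return int(m.group(1)), int(m.group(2)), "-"
    return int(m.group(3)), int(m.group(4)), "+"

def span_length(start, end):
    """Number of bases covered by an inclusive coordinate range."""
    return end - start + 1

# Values copied from the record above.
start, end, strand = parse_location("comp(20091..23462)")
dna_length = span_length(start, end)   # 3372, matches the DNA "Length" field
codons, remainder = divmod(dna_length, 3)  # 1124 codons, including the stop

print(strand, dna_length, codons, remainder)
```

A nonzero remainder, or a codon count that differs from the protein length, would suggest a truncated or frame-shifted feature; here the three figures agree.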