ggKbase home page

gwc1_scaffold_238_56

Organism: GWC1_OP11_37_12

near complete RP 42 / 55 BSCG 45 / 51 ASCG 8 / 38 MC: 1
Location: 55179..58457

Top 3 Functional Annotations

Value Algorithm Source
Collagen triple helix repeat-containing protein {ECO:0000313|EMBL:KKQ24141.1}; Flags: Fragment;; TaxID=1618483 species="Bacteria; Microgenomates.;" source="Microgenomates (Roizmanbacteria) bacterium G UNIPROT
DB: UniProtKB
  • Identity: 100.0
  • Coverage: 999.99
  • Bit_score: 2166
  • Evalue 0.0
RBAM_007760; GXT repeat-containing collagen-like protein KEGG
DB: KEGG
  • Identity: 31.5
  • Coverage: 764.0
  • Bit_score: 245
  • Evalue 7.00e-62
Collagen triple helix repeat-containing protein similarity UNIREF
DB: UNIREF90
  • Identity: 0.0
  • Coverage: 0.0
  • Bit_score: 233
  • Evalue 3.00e+00

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

GWC1_OP11_37_12 → Roizmanbacteria → Microgenomates → Bacteria

Sequences

DNA sequence
Length: 3279
ATGGAAAAAGGGTATAATCTTTATAATCTTGAGCCCCAATTTAAAAAATTTTTAACTGCGGGAAACGTTTCTCCGATCACCCTGAAAAATTACCTCTCTGATTTTAGGCATTTTGCCGGTTGGCTGGATTTTTATATCAAGGCGAATCCAAATGTAGGGTCGATTCATGAATCGACCCTACAAGATTTTATTAATGAAAAAATGATTTCCGAATACCGATCCTATCTTGTCGAAAACAACCTTCCCCACAAAAGCATTAATCGCCGCCTGTCAACCTTGCGAAAGTTTTGTTCGTTTTGTATTTCCCAAGGTTGGATAAAAGAAAACCCCGCTAAGAAAATATCAAATATATCTCGTGTAGGGAACAACCCCCGTGTCGTTCCTCTTAAGGCCTGGCAAAGCCCTATTAGAGTCATTTCGAGAGTCAATAAAATAATCTCTTTTTTCTCGTCATTCTGGGGAGGAGTGAAACGACGACTCCAGAATCCTAACAACGATTCTGGACCCGCTTCGCCAAAGCTTCAGCGAGGCGAGCAAGCCAGAATGACAGAGTCACCCCTTTCTAATAATTTCGGTATTCAATACTACATCGCCTTCCTCATTATTCTCGTCTTCGTCGCCACCCTAGGAGCTGGCATTTATAATAAATTTTTCCTAAAATCGGAAAAAACTTTTGCTTACCCGACTGCGCCTACCCGGGCCGGAAGATTACTTTCTTTTCAAGGCCGATTGACTGATTCGCTGGCTAACCCAATAACTACTGCTACCAATGTCACTTTTAAGCTCTATAATGTTTCTTCAGGCGGCAGTGCTTTATACACAGCCGGGGCTTGTTCACTCACTCCTGACCAAGATGGTATTTTTAATGTTCTGATCGGCGGCTCCGGATATTCGCCGACTCCGCCTCAATCTGTCTGCGGCAGCGAAGTTGATTCGTCAATCTTTAGCGAAAATGCCAATGTCTATATGGGAATCACGGTCGCTTCTGATTCCGAGATGACTCCCCGCCAACAAATTGCAAACGTAGGATATGCAATTAATTCCGAGACACTTCAAGGCCTGCCGCCCGGATCAACAGTCTCAACTATTCCCTATATCGATGTCAACGGCAATGTGCTAATTGCCGCTGCTTCACCTGGAATAAGATCGACCTATACCTCAAATACTTTTACCGTTTCCTCTGCCCATGCGACGACCATCCAGTCTGCAAGTACAGGTGACATAGTCTTGCAGGCGACCGAGTCGGGAACTTTGAAATTCCGGACCGGAGGCGCGACTAACACCTATACAAGAATAATCGTTGATAACGCCGGTTTGGTGGGAATCGGTACGACGAGCCCGGGACAGGAGTTGGATATCGTCGGCGATCTGCAGTTTTCCGGAGCGCTGATGCCCAATTCAACGGCCGGAACCAGTGGGCAGTTTCTAATTTCATCCGGGTCTGGAGTGGCCCCGACCTGGACCTCAACTGTTGGAGCGACCAGCATATCTTTTTCCGGGATCACGTCCGGGACCAATACACAGGCGGCCATGGTGGTCGGGACCGGAGCGTCTTTAAATTATGAAAATAGCGGAACTATTAATGCCTCAAGTTTGATCGGCGGGACATGGGCATCTCCAGGGACAATTGGTTCTACCACTCCAAACTCCGGTGTTTTTACTAATTTAACGTCCAACGGAAATACAACTATTGGAGATAATATTGCTGATACGACTACCGTTAATTCCGGAGCTTGGACATTTGCCAATGATACAAATTTTACTTTAACCGGAGGAGCTAATGGGCTTTCTTTTGATACAAATACCCTATCGATCGATGCAACCTCTGATAGAGTCGGCATCGGGACGACGGCACCGAGTCAAACATTAGAAATAAATGGAACAACTCAATCATTGCAATATTATGCAATTGGAAGTTCTTCTGAAGATTATGGAAGAATTGTACTTGGAGAAACACTAAGTGCTGGACAATGGGGGGCTATTGATTGGGATACTACATCAGATCAAATTAGAATAAAGATGAGTGATGCATCCACTGTGGCTGTATTTACAGAAGCCGGCAACGTCGGCATCGGGACGACGGCACCAGCGGGAAAATTAGATGTGACCGATACTTCAAATACTGCGGCTTCTTTTAATTTGACAAATAACACTGCCACCACTATTGGGGTTGGAGTTAATACTTTAGGAGTTATGGACCTGCAATCTACTTCACTTACTACCGGTAATTTTTTAAATATTGAAACTAATGCTTTAACCAGCGGAAAGAGTTTGAACCTTTCCTCTACTTCAACCGGGTTGACCACCGGAAATCTGGCTTCTTTGGACTGGAGTCCGGGCTCGGCCACAACCGCAACCGGAGATTTGCTTTCTCTAAATATTGGAACTAATGGTAATATTGGGAATCTTTTCAACGTTAAAGACACCGGCTCTTCTTTGTTTTCTGTTTCGGAAACTGCCGTTACTTCTAATCTTCCAACTTCATTTACTTCAAGTGGAGATGTGTCAATCTCCTATGATCTAAATTTTACAAACCCAACTGCCTCATATATTAAATCTGTCGCTCCTCTTTATATCCAGGCTGGTGAAACTTTTGGAAGTTCCGATTTGACTTTGCAGACTTATAATTCGGGAGATGTTGTGATCGACTCTCCTGGTGGAGTCACTTTAGCTCAAGCTCAAGCTTGGGATTTATCTGATTCTTCAACAACTTCTCTAAATATTGAATCTGGCTTGATGAACTTTGATACGACAAATTCAAGAGTTGGAATTGGAACTACTGCGCCTACTCAAAAACTTGACATTGTTGGAAATGCTACCGCATCAGGAAATATCACGATGGGTGGACAAGCACAGTTGGGGAATTTTTCCGTTGCCCCAACCGCTGTTGGTGAAGGGGCTTTTTATTATGATTCTACCACTAAAAAAGTTTACTACTGGAATGGGACGGCTTGGAGCGAATCTCTAGGATCAACGGGGCCAACTGGAGCAACAGGTGTTACTGGTCCTACTGGTAGTACAGGCGTAACCGGACCAACTGGGGCAACAGGTGTCACCGGGCCAACAGGAGCAACAGGAGTTACAGGTCCCACGGGAGCTACAGGAGTAACCGGGCCTACTGGTAGTACGGGAGTTACTGGTCCTACAGGAGCAACCGGAGTGACAGGTCCTACGGGTGCTACAGGAGTTACTGGTCCTACGGGTGCGACGGGAGTTACAGGACCAACAGGTGCTACCGGAGTAACCGGTCCTACTGGTAGTACAGGAGTTACTGGTCCT
PROTEIN sequence
Length: 1093
MEKGYNLYNLEPQFKKFLTAGNVSPITLKNYLSDFRHFAGWLDFYIKANPNVGSIHESTLQDFINEKMISEYRSYLVENNLPHKSINRRLSTLRKFCSFCISQGWIKENPAKKISNISRVGNNPRVVPLKAWQSPIRVISRVNKIISFFSSFWGGVKRRLQNPNNDSGPASPKLQRGEQARMTESPLSNNFGIQYYIAFLIILVFVATLGAGIYNKFFLKSEKTFAYPTAPTRAGRLLSFQGRLTDSLANPITTATNVTFKLYNVSSGGSALYTAGACSLTPDQDGIFNVLIGGSGYSPTPPQSVCGSEVDSSIFSENANVYMGITVASDSEMTPRQQIANVGYAINSETLQGLPPGSTVSTIPYIDVNGNVLIAAASPGIRSTYTSNTFTVSSAHATTIQSASTGDIVLQATESGTLKFRTGGATNTYTRIIVDNAGLVGIGTTSPGQELDIVGDLQFSGALMPNSTAGTSGQFLISSGSGVAPTWTSTVGATSISFSGITSGTNTQAAMVVGTGASLNYENSGTINASSLIGGTWASPGTIGSTTPNSGVFTNLTSNGNTTIGDNIADTTTVNSGAWTFANDTNFTLTGGANGLSFDTNTLSIDATSDRVGIGTTAPSQTLEINGTTQSLQYYAIGSSSEDYGRIVLGETLSAGQWGAIDWDTTSDQIRIKMSDASTVAVFTEAGNVGIGTTAPAGKLDVTDTSNTAASFNLTNNTATTIGVGVNTLGVMDLQSTSLTTGNFLNIETNALTSGKSLNLSSTSTGLTTGNLASLDWSPGSATTATGDLLSLNIGTNGNIGNLFNVKDTGSSLFSVSETAVTSNLPTSFTSSGDVSISYDLNFTNPTASYIKSVAPLYIQAGETFGSSDLTLQTYNSGDVVIDSPGGVTLAQAQAWDLSDSSTTSLNIESGLMNFDTTNSRVGIGTTAPTQKLDIVGNATASGNITMGGQAQLGNFSVAPTAVGEGAFYYDSTTKKVYYWNGTAWSESLGSTGPTGATGVTGPTGSTGVTGPTGATGVTGPTGATGVTGPTGATGVTGPTGSTGVTGPTGATGVTGPTGATGVTGPTGATGVTGPTGATGVTGPTGSTGVTGP