ggKbase home page

GWB1_scaffold_182_4

Organism: GWB1_OD1_54_7

near complete RP 41 / 55 MC: 2 BSCG 46 / 51 MC: 1 ASCG 9 / 38 MC: 1
Location: 6225..7412

Top 3 Functional Annotations

Value Algorithm Source
Collagen triple helix repeat-containing protein {ECO:0000313|EMBL:KKW38031.1}; TaxID=1618607 species="Bacteria; Parcubacteria.;" source="Parcubacteria (Adlerbacteria) bacterium GW2011_GWB1_54_7.;" UNIPROT
DB: UniProtKB
  • Identity: 100.0
  • Coverage: 395.0
  • Bit_score: 803
  • Evalue 1.20e-229
hypothetical protein KEGG
DB: KEGG
  • Identity: 32.9
  • Coverage: 365.0
  • Bit_score: 120
  • Evalue 1.20e-24
Collagen triple helix repeat-containing protein similarity UNIREF
DB: UNIREF90
  • Identity: 0.0
  • Coverage: 0.0
  • Bit_score: 119
  • Evalue 1.00e+00

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

GWB1_OD1_54_7 → Adlerbacteria → Parcubacteria → Bacteria

Sequences

DNA sequence
Length: 1188
ATGATTACTTTCATGAAGTCCCGCTCGTTTTTTGCTCGATCACTCGCACTCTCTTTGGCGCTTCTGCTCGCGGGAGCAGTGGCGGCGTGGACTGGACCCTCCTCTGCACCGCCAAATGGCAACGTTGCCGCGCCGGTTAACGTCTCGGGCACCGCGCAGGTAAAGTCGGGCGGTTTTTGGGCCTCAAGCGTCGGCTCCGATGCGGGTTTCTGCATCGGCTCCTCCTGCATCACAAGCTGGCCGGCGGACGGTTCCGGATTTACCGGCTCGGGCTCTACCAACTATCTGACCAAATTTATCGGCGCGACCGCGCTCGGCAATTCGCTTATTTATGACAACGGCACGAACGTCGGCATCGGGACGGCGAGCCCGGGATATAAACTCGACGTGAACGGAGAGGTGCGTCTGGGGTCGTCCGCCAACGGCGGATTACGAGTCATCTCCATGAGTAGCAACAGCGTCAACCTGCGGCCGTCCATTAGTAATGGCTCCATCACGCTTACGGACGACAGCGGCGATGCCGCTAGGGGCATGACCATTGCCAACGGCGGCAACGTCGGCATCGGGGCGACATCGCCAAATGGTAAATTAACAATCGTAAATTCTGCGGGGGGCGTAGGTGTGTCGAATTATTTGTCTCTTAGATATGACGAAAGTGCAACTGCGGACTATACAGTTGGTCGAAATGGCAATACGGGATTTCTTGAATTTACCGGAAACCAGACTGGATATATAGGATATACATTCAACGGCAACGTCGGCATCGGGACGGCGAGTCCGGGGTATAAGCTTGATGTCATAGGAGACATAAACGTAACCGGATGTTTCAGAATAAACGGCACCTGCAGTACTGGACTCCAGGGGCCGCCCGGTCCCCAAGGTCAGACAGGAGCGACTGGCGCAACGGGAGCGACTGGCGCGACCGGCCCCCAAGGTCAGACAGGAGCGACAGGCGCAACGGGAGCGACTGGCGCGACCGGCCCCCAAGGTCCTGCAGGTTCATCAGGTGCCGTGACTTGGGGAGGAACCTACACCAGCGGTTGGTCACAAGGTAGTCACAATGCGGACGAGGTCGAATGGAATACATGTCTAAGTGGGTACGTTATGATCGGTATAAAGACAACCAATAGTCTCGCGTCCAATACCGAGTGGTCTCAAAACAAAATAACTTGTCAGCAAATCTATTAA
PROTEIN sequence
Length: 396
MITFMKSRSFFARSLALSLALLLAGAVAAWTGPSSAPPNGNVAAPVNVSGTAQVKSGGFWASSVGSDAGFCIGSSCITSWPADGSGFTGSGSTNYLTKFIGATALGNSLIYDNGTNVGIGTASPGYKLDVNGEVRLGSSANGGLRVISMSSNSVNLRPSISNGSITLTDDSGDAARGMTIANGGNVGIGATSPNGKLTIVNSAGGVGVSNYLSLRYDESATADYTVGRNGNTGFLEFTGNQTGYIGYTFNGNVGIGTASPGYKLDVIGDINVTGCFRINGTCSTGLQGPPGPQGQTGATGATGATGATGPQGQTGATGATGATGATGPQGPAGSSGAVTWGGTYTSGWSQGSHNADEVEWNTCLSGYVMIGIKTTNSLASNTEWSQNKITCQQIY*