ggKbase home page

SCN18_25_1_16_R2_B_scaffold_151_52

Organism: SCN18_25_1_16_R2_B_SCNPILOT_CONT_300_BF_Rhizobiales_62_47-related_62_30

near complete RP 51 / 55 MC: 2 BSCG 51 / 51 ASCG 11 / 38
Location: comp(51276..52175)

Top 3 Functional Annotations

Value Algorithm Source
Collagen triple helix repeat-containing protein n=1 Tax=Thiothrix nivea DSM 5205 RepID=I3BSJ5_9GAMM similarity UNIREF
DB: UNIREF100
  • Identity: 47.2
  • Coverage: 214.0
  • Bit_score: 176
  • Evalue 3.90e-41
Collagen triple helix repeat-containing protein {ECO:0000313|EMBL:EIJ34338.1}; TaxID=870187 species="Bacteria; Proteobacteria; Gammaproteobacteria; Thiotrichales; Thiotrichaceae; Thiothrix.;" source="Thiothrix nivea DSM 5205.;" similarity UNIPROT
DB: UniProtKB
  • Identity: 47.2
  • Coverage: 214.0
  • Bit_score: 176
  • Evalue 5.40e-41
collagen-like surface protein similarity KEGG
DB: KEGG
  • Identity: 45.6
  • Coverage: 226.0
  • Bit_score: 168
  • Evalue 2.30e-39

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

SCNpilot_P_inoc_Xanthomonadales_63_13 → SCNpilot_P_inoc_Xanthomonadales_63_13 → Xanthomonadales → Gammaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 900
ATGTCTAAAGCTCCCGTCATGCTCGCGCGTGGCGAGCCGCAGCCGCCAAAAAAGCCGGTGTCGTTGCTGGAGCGCACGATCGGTCTGGTCGAGGCGCTGGCCAACCGGATTCGTGCGCTCGAGCAAAAGCCGCTTGTGCGTGACGGTCGCGATGGTGCATCCGGCCGGGACGGCAAGGATGGCGCGCCGGGTCGCGATGGCGTCGACGGCAAAGATGGTGCCGACGGCAAGAACGGCCCGCCCGGCGTCAATGGCAAAGACGGTCAGCCTGGCCGAGACGGAGTCGACGGCAAGGATGGAGCGGACGGCAAGGACGGCCCGCCCGGTGCCGATGGCAAGGATGGTCAGCCTGGCCGCGACGGCGTCGACGGCAAGGATGGAGCCGACGGCGTCGGGGTTGCCGATCTGGCAGTGAGCGACGCTGGCGATCTTCAGGTCTCGCTAACGAATGGTCGGACATTCGAGCTCGGCCGGGTCCGGGGTCTCGATGGTCGCAACGGCTCCGATGGAAAGGACGGTCGCGACGGCCGCGATGGCATCGCCGGGCGCTCGATCGTCTCAGGAAAGATCGTCGATGGCGTCCTGACGCTCACCATGACCGACGGTTCAACCGAGCAGGTTGGCAATGTCGTCGGACCACCCGGCGCCGACGGCAAGGATGGATCGCCGGGCGCCGACGGCAAGGATGGCCAGGACGGCAGGACCGGCCCGCGTGGTGAAGCCGGTCCGCAGGGCAGGGCCGGTCGCGACGGCAAGGACGGTCAGCCGGCGCCGGTGACCGCCGTGGTCGAGTTTGGCGATGCGTTCCCGGCGCGGATCAGCGCAAAGGATCTCGACCGCCTGATGGTCCGCGAGATCACCGTCAATGGCGAAACCTTCCAGGTCCTCGCTCCGAATTGA
PROTEIN sequence
Length: 300
MSKAPVMLARGEPQPPKKPVSLLERTIGLVEALANRIRALEQKPLVRDGRDGASGRDGKDGAPGRDGVDGKDGADGKNGPPGVNGKDGQPGRDGVDGKDGADGKDGPPGADGKDGQPGRDGVDGKDGADGVGVADLAVSDAGDLQVSLTNGRTFELGRVRGLDGRNGSDGKDGRDGRDGIAGRSIVSGKIVDGVLTLTMTDGSTEQVGNVVGPPGADGKDGSPGADGKDGQDGRTGPRGEAGPQGRAGRDGKDGQPAPVTAVVEFGDAFPARISAKDLDRLMVREITVNGETFQVLAPN*