NECEvent2014_5_7_scaffold_203_39

Organism: NECEvent2014_5_7_Haemophilus_parainfluenzae-rel_39_109

near complete; RP: 52 / 55 (MC: 1); BSCG: 51 / 51; ASCG: 13 / 38 (MC: 1)
Location: 37011..40169

Top 3 Functional Annotations

Value: Cell wall surface anchor family protein n=1 Tax=Streptococcus mitis (strain B6) RepID=D3HA73_STRM6
Algorithm: similarity
Source: UNIREF
DB: UNIREF100
  • Identity: 56.2
  • Coverage: 762.0
  • Bit_score: 749
  • Evalue: 5.10e-213
  • rbh

Value: Collagen triple helix repeat (20 copies) {ECO:0000313|EMBL:KJQ68876.1}; TaxID=28037 species="Bacteria; Firmicutes; Bacilli; Lactobacillales; Streptococcaceae; Streptococcus.;" source="Streptococcus mi
Algorithm: similarity
Source: UNIPROT
DB: UniProtKB
  • Identity: 58.0
  • Coverage: 796.0
  • Bit_score: 800
  • Evalue: 2.70e-228

Value: cell wall surface anchor family protein
Algorithm: similarity
Source: KEGG
DB: KEGG
  • Identity: 56.2
  • Coverage: 762.0
  • Bit_score: 749
  • Evalue: 1.40e-213

Taxonomy

Streptococcus mitis → Streptococcus → Lactobacillales → Bacilli → Firmicutes → Bacteria

Sequences

DNA sequence
Length: 3159
ATGAACAAAATCTATAAGGTTCTTTGGAATCATGCTGCCCAAAATTGGGTGGTCACCTCTGAATTGGCTCGTGGAAACGTTAAATCTTCCACCAGCCAACTCACAAATCAACTTACCGATCAACCAACCAGCCAACTGAAAAAAAGCTTTACAATAGCCACACTAAGTGCGGTCATTATTGCGAGTTTATTCTCTACATCGGCTATGGCTTATACAATGAACCAAAATGCGTGGGATAGCGCTGACGCCAGTACCGTTGTAATTGGTGAACAAAACGATGCCTCAGGTCCTAATGGTGGAGCATCAGCTAGCACTGGTGGTTCAACTACCGATGGGAGAATAACTGGTGTTCATCACTCTATCGCTATTGGTGGTAAAGCAACAGTAACTAATACTCAAAATGCTCTAGCCATTGGTTATAATGCTAAAATTAATTCAAAAACAAAAGGAAGTGTCGCTCTAGGAGCACATTCACAAGTTAGTGGTGTACATACACAAACAGATTCTCAATCAATCACCATTGGTGGTAAAGATTACAACTTTGCAGGGAAAGCAGATACAAATACATCTGTTCTTTCTATTGGTTCAGGTACAGCTGTCAACGATTGGAACGGACGCAGAAACGATGGTCGAGGCCATATGATAGATACTTATGACCGAGGTACTAATCGCTACAACGTTCGCCAAATTCAAAACGTAGCGGCAGGTCGTGTGGCAGAAAGCTCAACCGATGGAATTAACGGCTCTCAACTTTATGCTGTTGTAGCAGCTGTTAATAAGTTGTCAGAAAATACTGCCTATGGTTGGAAAGTAAAATCTGAAGGTGGAACAGGTTCAAGTGAACAACCCGTTAACAACGGCGAGACTGTTACCTTTGCTGCTGGCGAAAACATGACAATTAAGCAAGATGGAAAAAAATTCACCTATGGTTTAAAAACCAAAAAAGTAACAGCGAAGGATGAAACTGGCGAGTATGATGATATAAAAGAAAACGAAGAAGATGGCGTTGTTACTGCGAAGGATTTATTAAAGGCACTAAAAAAAGCTGGCTGGAAGTTACAAGCTAACGGTTCTAATACGATAATTAAAGCTGGTGATGTGGTTAATTTTGTTAATGGTACGGGTACTACCGCATCAGTTAAAGGGAATTCCATTACATTTAATATTAACAAATCAGATTTAACAGCATCTGGTACTGGTACTATTTCTGCCAGTAAAGCTGGTGATCATTTTGCTACCGCAACAAGTGTTGCAAATGCAATTAATGGCGCGTTCTGGAGAGCTACAGCAGCTGGTGCTGGTGGCGGAACTCGCGTAGAACAACCAATCAAAGCTGGAGATTTAGTTACATTTAAAGGTGGCGATGGTATTAAAGTTGATCAAAATGAGAGAACTTTTACCTTTAGCCTTGATAAAGACTACATTAATAAACACCCTGAATTTAAAGGCCCTAAAGGGGATAAAGGTGCAACGGGTGTTCGTGGTGCAGCGGGTCAAAATGGTAGAGACGGCCTAACACCTACTGTAAGTACTAAAGACAATAACGATGGTACGCATACCGTCACGATTACTACTGGTAAAAATATATCCGAATTTACCGTAAAAGATGGTGCTCAAGGCGAACGCGGACTTCAAGGTGAACGCGGACTTCAAGGTGCTAAAGGCGAAAAAGGTGACCAAGGCGAACGCGGACTTCAAGGCGAACGCGGGCTTCAAGGTGCTAAAGGCGAAAAAGGCGACCAAGGCGAACGCGGGCTTCAAGGTGCTAAAGGCGAAAAAGGTGACCAAGGCGAACGCGGACTTCAAGGCGAACGCGGACTTCAAGGTGCTAAAGGCGAAAAAGGCGACCAAGGCGAACGCGGACTTCAAGGTGCTAAAGGCGAAAAAGGCGACCAAGGCGAACGCGGGCTTCAAGGTGCTAAAGGCGAAAAAGGTGACCAAGGTGAACGCGGACTTCAAGGTGAACGCGGACTTCAAGGTGAACGCGGACTTCAAGGTGA
ACGCGGACTTCAAGGTGAAAAAGGTGAGAAAGGAGATACTCCTAAAATCACAACTGCTCGTGGAGCAGATGGCCACAGTACTGACATTACATTTACACTTCCTGGAGAAGAGCCTGTTGTAGCTAACATTAAAGATGGAAAAGATGGACGTACACCAAACCTCGACTTAAATGCCTTAGCTGAAGCAGCAGTAAGACTAAACAATCAAAGAAGTGGAAGAGTAAGAAGAGCTTTAGCAGACGCCCCATCTACAGCACCAGCAGAAAAACCAAGAGAAGGTACTCTTATTACAGCGTACTTTGATAACAATGGTAATGGCAGATACGATGAAGGTATAGATGAGTTAATTGCTAAACAACCAATCTATAATGGTACAGATGGTGCCAATGGAGCAGCCGGAGCAGCCGGACGTAATGGTGCTGAACTATTAAGCGGATCAAAAGCTCCAGTAGACAAAGATGGAAAAGATGGCGACACTTACATTGATGCTACTACAGGAGATGTTTACAAAAAAGAAGGTGAAAACTGGAACCAAATTGGGAACATCAGAGGTCCTCAAGGTCTTAAAGGTGAAAAAGGACAAGATGGTGCTCAAGGTCGTGACGGACGTGATGGCCGCGACGGTAAAGATGTCTTAAACGGCAAAGTCAACCCAACAACAGAAGGTAAAGACGGCGATAAATACGTCAATACTGAAACAGGCGACGTCTTCGTTAAGAATAACGGCAACTGGGATAAAGAAGGCAACATCAAAGGACCTAAAGGGGACAAAGGTGAAGAAGGTCTTCAAGGTCGTGATGGTCAAGACGGAGCTCAAGGTTTACCAGGCCGCGACGGACGTGATGGAGCGCAAGGCCGTGACGGACGTGATGGACGTGACGGTAAAGATGTCTTAAACGGCAAAGTCAATCCAACAACAGAAGGTAAAGACGGCGATAAATACGTCAATACTGAAACAGGCGACGTCTTCGTTAAGAATAACGGCAACTGGGAAAAAGAAGGCAACATCAAAGGACCTAAAGGTGACAAAGGTGAACAAGGTCTTCAAGGTCGTGATGGTCAAGACGGAGCTCAAGGTTTACCAGGTCGTGATGGACGTGACGGCGCAGCAGGTCGTGACGGACGTGATGGCCGCGACGGTAAAGATGTCTTAAACGGC
Protein sequence
Length: 1053
MNKIYKVLWNHAAQNWVVTSELARGNVKSSTSQLTNQLTDQPTSQLKKSFTIATLSAVIIASLFSTSAMAYTMNQNAWDSADASTVVIGEQNDASGPNGGASASTGGSTTDGRITGVHHSIAIGGKATVTNTQNALAIGYNAKINSKTKGSVALGAHSQVSGVHTQTDSQSITIGGKDYNFAGKADTNTSVLSIGSGTAVNDWNGRRNDGRGHMIDTYDRGTNRYNVRQIQNVAAGRVAESSTDGINGSQLYAVVAAVNKLSENTAYGWKVKSEGGTGSSEQPVNNGETVTFAAGENMTIKQDGKKFTYGLKTKKVTAKDETGEYDDIKENEEDGVVTAKDLLKALKKAGWKLQANGSNTIIKAGDVVNFVNGTGTTASVKGNSITFNINKSDLTASGTGTISASKAGDHFATATSVANAINGAFWRATAAGAGGGTRVEQPIKAGDLVTFKGGDGIKVDQNERTFTFSLDKDYINKHPEFKGPKGDKGATGVRGAAGQNGRDGLTPTVSTKDNNDGTHTVTITTGKNISEFTVKDGAQGERGLQGERGLQGAKGEKGDQGERGLQGERGLQGAKGEKGDQGERGLQGAKGEKGDQGERGLQGERGLQGAKGEKGDQGERGLQGAKGEKGDQGERGLQGAKGEKGDQGERGLQGERGLQGERGLQGERGLQGEKGEKGDTPKITTARGADGHSTDITFTLPGEEPVVANIKDGKDGRTPNLDLNALAEAAVRLNNQRSGRVRRALADAPSTAPAEKPREGTLITAYFDNNGNGRYDEGIDELIAKQPIYNGTDGANGAAGAAGRNGAELLSGSKAPVDKDGKDGDTYIDATTGDVYKKEGENWNQIGNIRGPQGLKGEKGQDGAQGRDGRDGRDGKDVLNGKVNPTTEGKDGDKYVNTETGDVFVKNNGNWDKEGNIKGPKGDKGEEGLQGRDGQDGAQGLPGRDGRDGAQGRDGRDGRDGKDVLNGKVNPTTEGKDGDKYVNTETGDVFVKNNGNWEKEGNIKGPKGDKGEQGLQGRDGQDGAQGLPGRDGRDGAAGRDGRDGRDGKDVLNG
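
The lengths reported above can be cross-checked against the feature location. The sketch below is only an illustrative check: it assumes the coordinates 37011..40169 are 1-based and inclusive (the page does not state the convention), and the helper name `span_length` is hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


# Values copied from the record above.
start, end = 37011, 40169
reported_dna_length = 3159
reported_protein_length = 1053

dna_length = span_length(start, end)

# A coding sequence is read in codons of three nucleotides each.
codons, remainder = divmod(dna_length, 3)

print(dna_length == reported_dna_length)   # location span matches the DNA length
print(codons == reported_protein_length)   # codon count matches the protein length
print(remainder)                           # leftover nucleotides outside a full codon
```

Under that assumption the span is 3159 nt, which divides evenly into 1053 codons, the same as the reported protein length. Every codon in the listed DNA is therefore translated to a residue, so the sequence as shown does not end in a stop codon.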