ggKbase home page

scnpilot_p_inoc_scaffold_6333_1

Organism: scnpilot_dereplicated_Eukaryote_unknown_16

partial RP 39 / 55 MC: 26 BSCG 17 / 51 MC: 6 ASCG 21 / 38 MC: 8
Location: comp(2..3004)

Top 3 Functional Annotations

Value Algorithm Source
Putative trimeric autotransporter adhesin n=5 Tax=Haemophilus RepID=E7AGE2_HAEIF similarity UNIREF
DB: UNIREF100
  • Identity: 22.7
  • Coverage: 924.0
  • Bit_score: 153
  • Evalue 1.20e-33
General secretion pathway protein GspB {ECO:0000313|EMBL:KKB95025.1}; TaxID=1302 species="Bacteria; Firmicutes; Bacilli; Lactobacillales; Streptococcaceae; Streptococcus.;" source="Streptococcus gordo similarity UNIPROT
DB: UniProtKB
  • Identity: 23.2
  • Coverage: 926.0
  • Bit_score: 160
  • Evalue 1.00e-35
caaA2; trimeric autotransporter adhesin similarity KEGG
DB: KEGG
  • Identity: 22.7
  • Coverage: 924.0
  • Bit_score: 153
  • Evalue 3.30e-34

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Streptococcus gordonii → Streptococcus → Lactobacillales → Bacilli → Firmicutes → Bacteria

Sequences

DNA sequence
Length: 3003
ATGCAACAGCAACACCAACACCAACAGCAGCTGCGGCGACAGAGGCCGGAGTCGCTTTTGCAGCACTCACCGTCACGGTACGGTCTGCAGCCGCAACCTCAACACGCTTATGTTCGCGGCAACGCTGTTAGGGTCGGTGGCAGTAAACACGAGTTTGGCGTGGCCGTATGCAACGCGGCGAGTGCATGTGCGCAGCGACCGCCGCCTGTAAACAAGCTCGCTGCTGCAGCCGCTGCCGCGGCGGCCGCAGCCGCTGCTGCCGCTGCTGCTGCTACCGCTTCCGGCTCTGCCCCGAAGGCTGCGGCGACAAAGCCACATCTGTCGACATCAACTCAAATTCCAATCACGCAGTCCAATACCTTGCGCGGTGACAGCAGTGACGACGATAGTGACGCCGACTTCGAACTTGTTTTCGTTAATCCACACACAAATCTGCACAGTGCGCCAGCTAGAGTTGAGACCACAGTTAGTGCGAACGATCGGGGGCTGAAGCGCGCGGCGGCTGACACACTGAATAGCAAGAATAGCAGCAGCACAAAGAATATAGCTGTTTCTGCATCTTCTGCTCAAGCACCGCCTAATGCCGACGCTACCGCTAAACGGCCCCGCGTGGCTGCGACAAACAACGCTTCCGAGACCACATCAGAAACATCGCATAGTAAACAACCTGCAGCTACAGTTATCCCCGCTGCTGCAGCTTACAACATCATTCAAAGACCTGCTCCTCACAGTGTCATGATTATTAGACAGAGTTATCCTGGTAGAGGCGACCCATCATTTGCAGTCACCTCGACGCCCGCGCCGTCTTACACTGGGCGCACTCCTTCGAGCGCGACCAAGCTCAACTTCAACGCTCTTCCGACAGCACGGTCGGCGCAGGATGTGACGCCTAGCCGCGCTCAGAACAGCGCAGAAGTCGGCGGCGTTGCGACTGCGCGCGACTTGGTGGCCGTGAGCAGTCGCATTTCCGCAGCGCACGGGAGCAATTGCCTCGACGCGAGCGCAACGCCGACTGCGTTTACCGCGAACAACAGCAACAACAAGAATACGAACAACATCAGCAACCACAATAATAACAATGTATCTGCAGCGGCCGCTGCTGCCGCTGCCGCTGCTGCCGCTACCAAAGCCACATTAGCTGCTGCCGATAATAGCGCAACATCAAGTGTTGGTCAAATCACGTCAATGCCCGCGAACACTCGCGAGAGTAAGATTCCTGTGGCGATCAATCAGCGTAAACTCAGTCTGCTTACCAGCACTGTGAACCCAACTACTGGTGACGGAGTTAATCCGAGTAGCGTCGCTGCGAGCACAGCCATTTCGCCACCGAATCTGCTGCCCTTGCCTGTTCCCCCAGGCATCGCTGCGGCGACCGCTGATGCCGACGCGGCAAATGCGTTTGCGCCAATCACAAGCGCCAGGGTTGGAGCGGGCACCGGCTTTCTCGCCACCTTGGCGTTGCGGACCGGGCGTGCACAGATGAGGCCGCTTGCGCCGGGGTGCTTCCGTGCTCCTGCAGCCAGTAACACCAGCAACAACAACTTAACCAATGGATTCACGGCTTCTTCTTCAGATCCAACTGCAGGTGATGACGCCACGAGCGCCAGCACTCACACCACGCCGGTTTCCGCTGCTGCGCACACACAGCTGTCTCCGATGCGCGTCGCGTACGCGCACGGTTTCCACGACAACGTCAGCGGCGCCATGGGCGGCCCCGTCAGCGTGACGACGGGTGCGGCCGCGGGGTCCATCGTGGTGTCGCGCGGGGGCGGCGCGCTCGACCCGAGCTATGACAGCGCAGACTGCTTGCTGCCGGCGCCGAGTGCGCTCCTCAGCAACAGCAACATCCACATCAGCAGCAGCGGGAGCGCCAAGAACACTGCGAGTGAGCAGAGCTCTGGGGCGTCGACGGCTCCCGGCGACAGTTCTTCTGTGAGTAATAGCTCTGCGACGAATGGGAGCAGTAGCAACGATTATTCACATTTGTTGCCGCCCCCGGAGTCAGGGCGGTGGATTCAAGAAGCTTTGCGTAAGAAGCGTTCCGACGCCAGTGGAGCACTCAGCGGCGACATTTTGAGCTTTGTGTCCGCGATCGACGACCCTGCGGTCGCGGAGGCAGCTCGTGACGCATCCGCTCGTGCGCGTGCTATTGCCACTGCGGTCTCAAGCATGAGCGGTGACAACTACCTGCAGGAACCGATTCTTTCAACTGCCACAACTCCAGCATTAGCCGCAGCGGCAGCGCCGATTGTTTCTCCAAACTCGGGCAGATACATTACAGAGAGTATCACCAACGTGATCAATCTAGCCTCAAGCGACTCCGACGACGACACATGCAGCGTTAGCACCACACCAGCGGTACTGGCGTCTAGCAACAACACCAACGTCTTTGCCAGCAAGCAATCTACGCAACTACCACTGTCTGCATTCGCAAATGCGCCTGCGTCTGAGAGTGGCAACGCGACGTCTAAGAGCCCGCCACGCCGCCGTAGCAGCGGCAACAGCCACAAAGCGCCGCGCACGCCGACCAGCAGCACGTCTCTGCCAGTGCCGACGCCGCTGAGCCCCGCAGTGCGCGCGGTATTCCGCGACGACGACGACACCGCGCCTGCATTCGTCTCTGCGACGACAAAGGCTGCAGCGAGTGCCAAGGCCGCCTCTGCCACAGCGTCAAGGAGTGAGATCTCAAGCGGAGGGAGTGAAAGCGATGATGTTTTGACTAAAGCTGTGCCTGTATCAGCGTTACGCGGCGATGAATGTGATCGCGAGTCAGCGCTCGTGCTCCGAGCCGCCTTCACCAGGGCCGCCACCGTTGCCGCGACTGCTGCGGCCGCATTAGATGAAGACACTGCATCAACAAACCACGGTACTGGCAACGCACTTTTCCGGTTAACCCACTCGTTGCTGGGCAAACATTGCACTACTGGTGACGCACAGTATAACGCCAAGATGAGTTCGTTGCTTGATCGTTTCTGTGCGTCGCTGGCGCCGTCAGTCTTAAGC
PROTEIN sequence
Length: 1001
MQQQHQHQQQLRRQRPESLLQHSPSRYGLQPQPQHAYVRGNAVRVGGSKHEFGVAVCNAASACAQRPPPVNKLAAAAAAAAAAAAAAAAAATASGSAPKAAATKPHLSTSTQIPITQSNTLRGDSSDDDSDADFELVFVNPHTNLHSAPARVETTVSANDRGLKRAAADTLNSKNSSSTKNIAVSASSAQAPPNADATAKRPRVAATNNASETTSETSHSKQPAATVIPAAAAYNIIQRPAPHSVMIIRQSYPGRGDPSFAVTSTPAPSYTGRTPSSATKLNFNALPTARSAQDVTPSRAQNSAEVGGVATARDLVAVSSRISAAHGSNCLDASATPTAFTANNSNNKNTNNISNHNNNNVSAAAAAAAAAAAATKATLAAADNSATSSVGQITSMPANTRESKIPVAINQRKLSLLTSTVNPTTGDGVNPSSVAASTAISPPNLLPLPVPPGIAAATADADAANAFAPITSARVGAGTGFLATLALRTGRAQMRPLAPGCFRAPAASNTSNNNLTNGFTASSSDPTAGDDATSASTHTTPVSAAAHTQLSPMRVAYAHGFHDNVSGAMGGPVSVTTGAAAGSIVVSRGGGALDPSYDSADCLLPAPSALLSNSNIHISSSGSAKNTASEQSSGASTAPGDSSSVSNSSATNGSSSNDYSHLLPPPESGRWIQEALRKKRSDASGALSGDILSFVSAIDDPAVAEAARDASARARAIATAVSSMSGDNYLQEPILSTATTPALAAAAAPIVSPNSGRYITESITNVINLASSDSDDDTCSVSTTPAVLASSNNTNVFASKQSTQLPLSAFANAPASESGNATSKSPPRRRSSGNSHKAPRTPTSSTSLPVPTPLSPAVRAVFRDDDDTAPAFVSATTKAAASAKAASATASRSEISSGGSESDDVLTKAVPVSALRGDECDRESALVLRAAFTRAATVAATAAAALDEDTASTNHGTGNALFRLTHSLLGKHCTTGDAQYNAKMSSLLDRFCASLAPSVLS