ggKbase home page

scnpilot_p_inoc_scaffold_1550_5

Organism: scnpilot_dereplicated_Eukaryote_unknown_16

partial RP 39 / 55 MC: 26 BSCG 17 / 51 MC: 6 ASCG 21 / 38 MC: 8
Location: comp(14714..18070)

Top 3 Functional Annotations

Value Algorithm Source
Predicted protein n=1 Tax=Micromonas pusilla (strain CCMP1545) RepID=C1MLF6_MICPC similarity UNIREF
DB: UNIREF100
  • Identity: 48.6
  • Coverage: 146.0
  • Bit_score: 149
  • Evalue 2.50e-32
SprT-like family protein {ECO:0000313|EMBL:KDD74827.1}; TaxID=1291522 species="Eukaryota; Viridiplantae; Chlorophyta; Trebouxiophyceae; Chlorellales; Helicosporidium.;" source="Helicosporidium sp. ATC similarity UNIPROT
DB: UniProtKB
  • Identity: 30.3
  • Coverage: 393.0
  • Bit_score: 162
  • Evalue 3.00e-36
hypothetical protein similarity KEGG
DB: KEGG
  • Identity: 20.7
  • Coverage: 632.0
  • Bit_score: 91
  • Evalue 1.70e-15

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Helicosporidium sp. ATCC 50920 → Helicosporidium → Chlorellales → Trebouxiophyceae → Chlorophyta → Viridiplantae

Sequences

DNA sequence
Length: 3357
ATGTTAGATTGCGACATTTTCCGCCTATATTTCTTTCTTTTTTTTCTAAAAAATATTTCAACCGCTCGCTTTGTTTGCTTCTTAGAAGTCGCGATGGACCTGCAGCTGATGAACGAGCTCCGAGGTCTCCGCGACACAGAGGGTCTCGCCTCAGCGCTGTCTCTTGCTGCGGCGAGAGTACGGTCACTGCTTTCTGACCCGTCTACCACCCGGACTCTGCCTCCGTACCGGGGTCGTGGCTTTGGCGCCAAATCAAGCGGCCGCAGCAACACCAACACAGGCTCCGGATATGGCTCTGGCGCTGGCCCCTTTGGCACGAGCTCCAACGGCAACGGCGACGGTTCAAGCGGTGTTAGCACAACCACTGCCGTCTTTACACGTCCTCGCACCCCCGTCGGCGGCCGCAATGACTGGGACGACAGTACCAGCGGCGGCGGCGGTGCGTTCACGTTCGGCATTACGACTCTCAACAACAACACAAACACCGGCAAACTTAGTAAGAACAACGTCAAGAGCGCGGCGGTTAACATGAGCGACTACGACGATTACGCTGAAGAAGAGAACTACCACAACGGTCTCCAGCTCCCCACGCGGCCGAGCAACACTGGTTCGATCTCGAAGTTTAGCCGCCTCGACCAGTCTTCGTCTTCGTCGTCGTCATCATCCTCGTCAAGTTCTAGCTCGTCTGGCTCTGACACAAGCCCTGACTCAGAATCTGACACGAGCGCAAAGATTCAGCGAAAAAGCGATGCTGTCACAGTCACTGGTGGCCATACTGTTCAGAAGAAGAAGAGAGTGCTGCTTGACGAGTCATCCTCCTCAGACTCTGACTACAACTACGCTCACAACAACGCGAATAAGAACACTACCGCAAACAAGCAGGACAGCATCAGTGGCAGTGAAAGTGATAGTGATGTGTTCCACAGCCCCGTACAGAGCGAAGTTAAGCCGCGCCGGCCTGCGCAAGAGAAGCCGCCTTGCGCACCCGAAGCTAAGCCACCTAAGCCGCATAAGTGTCGGTCCCTTGACAGCAACAGTGACCGTGAGAGCGACGCGCATAACGCAACAAGCACGAGTGTCATCACTATTGGCGACAGCAGCAGCGGCAGCGGCAGCAGCTCTAGTTCAAGCTCAAGCTCAAGCTCTAGCAGTAGCAGCAGCAGCAGTGAGGGTGGCGACGAGGGCGACAGCAGCGACAACGAAGACACCAAGACTCAACTTAAGACGAAGTCAAAGTCGAAGTCTACTGATCGCACTGAGGGCAAGAGTACTTCCCTTGATCTAAGTACCATCCGCCGCGCGACACCCGCTCGCAAGTCTCGAACTAAAGCCGCTTTAGACTTAGTGACTGCAACTCAGCAACTGCTTCTCGCACGCAAAAACAACAACTCCAACAACAACAACAGCAACAACAGGAACAACAACACAAGCATGACTGACTCTGTTCACGGTCAGTTAGTGCCGCCGAGTGCGCGCAAGCCACGCTTTGTCCTCGACGACGATGATGACGATAAGTCAAGCCGCAATGAAAAGTTCGGTCGCACACCTCTCCGCCGGCGCATTCTCAACGACGATGACGACGACGATGACAACAGCACTCTGAACACGACAACGAACAAGGCAAAGTACCTTGACGTCACTACGAGTGAGAGTGACAACAGCAGCGACCGCTCCTACAGCGATAACAACGGCAACAGTGATCATAGCTTTGTTGTCACTGATGGTGATGAGAGTGGTATTTCTGGTGTTTACAGTGAGTTTGACAACAGCGACTTTAACAACACTGACAACGAGAACAGTTTCTTCAACTATGACGATGATGATGACTCTGACCATCACCACAACCATGACCACGACCACTCCCACAACGACGACGATTATGACGGCGAAAATGTCAAGCCGTCTATCTCCGAAATTAAGCCGCCTCGCAACGTCGACAAGAACAAAACCGGCACTGAAGTGAAGCCGCCTGCGCAGCCGCGTGTGAAGCCGCCAACTCAAAAGGATCAGCCGAAATCTAAGCCGCACAACACCACTGACCGCGTCTTTAAGTCTGATCCGAACCCTAACACGAACACCATCACGACCAAGACGGGCACCAATGCCGGAAATGGTCAGCCTGCTCTGAGTAAGATTTATGAGGAAGTGACTGGACTGACGTTCTCTCGTCGTCGCGAGGCTTTGACAGCGACGATGTTCGCTGAACTCAACGCTGGTGTGTTCCAAAACATGCTTCCTCATGACTTGCCGATCAAATGGTCGAAAACTCTTCTCAGGACTGCCGGCCGCGCCCGTTTTCCTGCCTACAAGCCTCCGACAAACAACACTACGGGTAACAACAGCGCGGTGACTTCCTCGTGCTCTGGCTCAAACTCAAGCTCGGCGACTGCTGCAACTGGCTCTGGCACTGGAAATAATGCTAGCGTCGGCGCTAATACAAATCGTGTCAGTATGGAAAGCATTGGGCAGCGCATTCGTCGCACGGCAAACATTGAGCTTTCTCTTAAAGTTCTCGACAGCGCCAGTCGCCTGCGCTCGGTACTCTGCCACGAGCTGTGCCACGTCGCGACATGGCTCATCGACAGGGTTCGTCCCTTTCACGACAGCTATTTCTACGCCTGGGGCGCGCGCGCGACTCGTTACGACCGGAAACTCGTCGTCACGCGCTGCCATAGCTATCAGGTCCACAAGCCGTTTCTGTTCCGTTGCGTGGCGTGCGCTCTTGAATACAGGCGCCACAGCAAATCCATCGATCTCGACAAAAAGCGCGGTGGCAAATGCTATGCGAAGCTGATGCTTGTGGGCCGCTTCAAGAATGACGGCACGCCTGTAAAGGAACGCGAGCTCACGGGTTACGCGGCGTACGTCAAGCAGCACGCGGGCCAAGTGCGCAAAGAGCTCGCGGCTGCTGCTGCGACTAGGGCGGCAATCCCAGCGACTACTGGCGTCAACGCAGATGTGAGTGCTAGCGTAGGTGTAAGTGGGAAGCCTGTGAAGGTGGCTCAGGGCGCTGTTATGAAGGCACTTGCGGAGCGGTACAAGAGGGACAAGGACGCGGGGCTGATTCCCGCCAGTACGACCAAGGCCCAGCAGCCGCCGCACAAGTGCACGAGTAAGGGCGGTGTGTACAACGATAGCGGTGTTGAAGATGATAACAGCAACGCTGAAGACTGGGTTAAGCTTGATGACGAAGGCAATGGTGGCGATGGTGAGGGGATCGAGCAGGAGAACAGGCGGCTTACGTTTGATGACGCCAGTGACAACGACGATAACAACAACAACACTGATGCTGTGGCGCAAGAGAACGAGGACGATGGAAGCGAATTGGCCAAGTTCTTGAAGTCGCTTAGCATCTAA
PROTEIN sequence
Length: 1119
MLDCDIFRLYFFLFFLKNISTARFVCFLEVAMDLQLMNELRGLRDTEGLASALSLAAARVRSLLSDPSTTRTLPPYRGRGFGAKSSGRSNTNTGSGYGSGAGPFGTSSNGNGDGSSGVSTTTAVFTRPRTPVGGRNDWDDSTSGGGGAFTFGITTLNNNTNTGKLSKNNVKSAAVNMSDYDDYAEEENYHNGLQLPTRPSNTGSISKFSRLDQSSSSSSSSSSSSSSSSGSDTSPDSESDTSAKIQRKSDAVTVTGGHTVQKKKRVLLDESSSSDSDYNYAHNNANKNTTANKQDSISGSESDSDVFHSPVQSEVKPRRPAQEKPPCAPEAKPPKPHKCRSLDSNSDRESDAHNATSTSVITIGDSSSGSGSSSSSSSSSSSSSSSSSSEGGDEGDSSDNEDTKTQLKTKSKSKSTDRTEGKSTSLDLSTIRRATPARKSRTKAALDLVTATQQLLLARKNNNSNNNNSNNRNNNTSMTDSVHGQLVPPSARKPRFVLDDDDDDKSSRNEKFGRTPLRRRILNDDDDDDDNSTLNTTTNKAKYLDVTTSESDNSSDRSYSDNNGNSDHSFVVTDGDESGISGVYSEFDNSDFNNTDNENSFFNYDDDDDSDHHHNHDHDHSHNDDDYDGENVKPSISEIKPPRNVDKNKTGTEVKPPAQPRVKPPTQKDQPKSKPHNTTDRVFKSDPNPNTNTITTKTGTNAGNGQPALSKIYEEVTGLTFSRRREALTATMFAELNAGVFQNMLPHDLPIKWSKTLLRTAGRARFPAYKPPTNNTTGNNSAVTSSCSGSNSSSATAATGSGTGNNASVGANTNRVSMESIGQRIRRTANIELSLKVLDSASRLRSVLCHELCHVATWLIDRVRPFHDSYFYAWGARATRYDRKLVVTRCHSYQVHKPFLFRCVACALEYRRHSKSIDLDKKRGGKCYAKLMLVGRFKNDGTPVKERELTGYAAYVKQHAGQVRKELAAAAATRAAIPATTGVNADVSASVGVSGKPVKVAQGAVMKALAERYKRDKDAGLIPASTTKAQQPPHKCTSKGGVYNDSGVEDDNSNAEDWVKLDDEGNGGDGEGIEQENRRLTFDDASDNDDNNNNTDAVAQENEDDGSELAKFLKSLSI*