SCNpilot_expt_1000_bf_scaffold_5249_6

Organism: SCNPILOT_CONT_300_BF_Sphingomonadales_66_39

near complete
RP: 46 / 55, MC: 2
BSCG: 47 / 51, MC: 4
ASCG: 9 / 38, MC: 1
Location: comp(4317..5798)

Top 3 Functional Annotations

Value: Large low complexity protein with proline/alanine-rich repeat n=1 Tax=Cryptosporidium parvum (strain Iowa II) RepID=Q5CRN0_CRYPI
Algorithm: similarity
Source: UNIREF
DB: UNIREF100
  • Identity: 31.4
  • Coverage: 509.0
  • Bit_score: 166
  • Evalue: 8.60e-38

Value: hypothetical protein
Algorithm: similarity
Source: KEGG
DB: KEGG
  • Identity: 31.4
  • Coverage: 509.0
  • Bit_score: 166
  • Evalue: 2.70e-38

Value: Large low complexity protein with proline/alanine-rich repeat {ECO:0000313|EMBL:EAK88053.1}; TaxID=353152 species="Eukaryota; Alveolata; Apicomplexa; Conoidasida; Coccidia; Eucoccidiorida; Eimeriorina; Cryptosporidiidae; Cryptosporidium.;" source="Cryptosporidium parvum (strain Iowa II).;"
Algorithm: similarity
Source: UNIPROT
DB: UniProtKB
  • Identity: 31.4
  • Coverage: 509.0
  • Bit_score: 166
  • Evalue: 1.20e-37

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Cryptosporidium parvum → Cryptosporidium → Eucoccidiorida → Coccidia → Apicomplexa → Eukaryota

Sequences

DNA sequence
Length: 1482
ATGCCCCTGGCTGGGCCGTCGGCCGGTTCTCCCCCGGCCGATGCCGTCGCGCATGCCGCACGGGCGCTTTCCGCACCATCGGCCGGCTCCCGGGACAATGCCTCCGATGCGCCGGCACCCGCCATGGCTCCGGCAGCCGGGGCTCGCTCCGGCCCGGTGCCCGCGCCCGTCGTGCAGCGCAAGATCGCCACCGCCATCAGCCCCCCCGCGGCGGGCGCCGCACCGATCGTAAATGCGGAGACCCGGGTCCGCGCGCAGGGGGGTGAAGCGGGTCCCCCGGCTTCCGCGACCCCGCACGCGTCCCGCAATGTCTCCGAAGCGGGCGGCGCGGCCGAAGCATCGGTTTCGTCGGCAGGGGGGCCCGGGGTCCCTTCGCCATCGGGCGAGGCCCTTGTCTCGCCGCCCACCGATGCGGCGCGTCGGGCGACCACGGGCTCGCGGCGAGGCATCGCGAGTCCGGCAGCCAGGCCGTCCGTCGCGGCACGGCAGGCGGAGGGCATGGAACCCACCGCTCCGGCCGACCGGGCATCGTTGTCCGACCCCGCTCGGCTCGATGCCGTGGCGCAGGCGCTTTCCGAGCCCGTGGCGTCGGCCGTCGCGCGCGCTCCGATGCCCGCGGCACGCGGCGCGCCTGCCGCGATGGCTGCGTACCCCGCCCCCCCCGCGCAAACGGCATCGCAGGCTGGGAACGCACCACCGGCCGCGCCCGCACCCGCCCCGGACTTGCCCGCGGCTCCGCCCGCCGGGCAGCGCGCCGCCCTATCGCAGCGCGCGGCGCAGGAAACCGCCCCGGTGCCGCCGGCACATGCCGGTGCAACCGAAGCGCCAGGGACGGGCAGGCCACGCCGTCCCGCTCGGCCCGCCAATGGCGTGTCGGCAGGCGCCCCACCGGTGGCGCCCGCTCCAACAGCGCAGCGATTTGCGAAGACGGACAATGGCATTGCGGCTGGCTCCCGGGCGCCTGCGTCCCCCGCTGCCGCGCCGGTCGTCCTCGCCTCGCCCGCTGAGCCCCAGGCGCTTCCCGCCACCGCCCCGAAGCCGGCGATGCGCGCGGCGCCGGAGTCCGCGCCGCCACCGCTTGCGGCCGCCGCGAGCCCGAGCCGGGACCGGCAGGCGCGGGATCGCGGGGCCGCCGGACCGCGCGATGCGCAGTCCGCGCCCGGGGGACGGCCCTCCGTGCTCGAACCGGACGAGGCCGGGGTTCGGCGCCCTTCCCCCGAGACACGCCGCATGCCGGATCGCACACCGGCCAGCGATTTGCCGGCGATGCCGGCGCGCGCACCTGCGACGCGATCGGTCGCGGCGGAAGTCATGCCCCCGCCCCAGCCCGCGCGCCGGCCTCCGCCTCCCGCACCGCCTTCGGGCGACATCCGCATCGACATCGGCCGTATCGCGATCGACCTGCCGCGTCCGCGCAGCGCGCCGGCGCGCCCGCAACCGCCGCCGCTAAAGGCCAAGCCGCGCGGGGGGCCAGACGCATGA
Protein sequence
Length: 494
MPLAGPSAGSPPADAVAHAARALSAPSAGSRDNASDAPAPAMAPAAGARSGPVPAPVVQRKIATAISPPAAGAAPIVNAETRVRAQGGEAGPPASATPHASRNVSEAGGAAEASVSSAGGPGVPSPSGEALVSPPTDAARRATTGSRRGIASPAARPSVAARQAEGMEPTAPADRASLSDPARLDAVAQALSEPVASAVARAPMPAARGAPAAMAAYPAPPAQTASQAGNAPPAAPAPAPDLPAAPPAGQRAALSQRAAQETAPVPPAHAGATEAPGTGRPRRPARPANGVSAGAPPVAPAPTAQRFAKTDNGIAAGSRAPASPAAAPVVLASPAEPQALPATAPKPAMRAAPESAPPPLAAAASPSRDRQARDRGAAGPRDAQSAPGGRPSVLEPDEAGVRRPSPETRRMPDRTPASDLPAMPARAPATRSVAAEVMPPPQPARRPPPPAPPSGDIRIDIGRIAIDLPRPRSAPARPQPPPLKAKPRGGPDA*