M01_scaffold_2_1202_curated_prodigal-single_126

Organism: M01_PHAGE_CU_48_59

RP: 0 / 55; BSCG: 2 / 51; ASCG: 0 / 38
Location: 156429..159746

Top 3 Functional Annotations

Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8C7X4_THAPS
  • Algorithm: similarity
  • Source: UNIREF
  • DB: UNIREF100
  • Identity: 18.6
  • Coverage: 830.0
  • Bit_score: 120
  • Evalue: 5.40e-24

Uncharacterized protein {ECO:0000313|EMBL:EED90187.1}; TaxID=35128 species="Eukaryota; Stramenopiles; Bacillariophyta; Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales; Thalassiosiraceae; Thalassiosira.;" source="Thalassiosira pseudonana (Marine diatom) (Cyclotella nana).;"
  • Algorithm: similarity
  • Source: UNIPROT
  • DB: UniProtKB
  • Identity: 18.6
  • Coverage: 830.0
  • Bit_score: 120
  • Evalue: 7.60e-24

cell surface protein
  • Algorithm: similarity
  • Source: KEGG
  • DB: KEGG
  • Identity: 20.9
  • Coverage: 503.0
  • Bit_score: 76
  • Evalue: 3.30e-11

Taxonomy

Thalassiosira pseudonana → Thalassiosira → Thalassiosirales → Coscinodiscophyceae → Bacillariophyta → Eukaryota

Sequences

DNA sequence
Length: 3318
ATGCAGGTATTCATTCACAGGTCAGGGAAGCGAGTCGGCTTTGCCTCCGGTACCGATCACGTCGTGGAGGAAATCCAGAAGTGCATCGACGACGGGGACAACCTGCGAACCGCAGATTTGGCTTACGACACTTCCACGGACACACTAAACGTCAGCGTCCTCGGATCCGATGACCAGACTACGACAGGTCTTGTCAAGATTCCGTCGAACGCGATAATCGAATCGAATTCGTACCTTGACAGCGTGTATGCCATCATCGAGCTCGTATGCAAGTCATCAATCGACTACAGCGGAATCCTTGGGCAACCGACGGGCGGTGAGATATCGCTCCTGTGCATCGCCGACGTCGAATCGTTCGTCGACTCCGACGGTGAGCCTGACGTGAACCAGTGCATCGACTTCTTCAATATCGCAACGAATCTCGCCGATGCGGCGGGAATCAAGACCGTCGAGATTGTCAATGGGACGAACCCGTTCGGTATCGTCCCCGATGACGTCGCGAACGGCGAGTCCCAGGAGTTCCAGCGCATCCAAGCAGAGATCGAACAGGAGTACACCGATGATGACGTTCCCGCACAAAACTCGAATCCACCCCTCGATCTCGATGAGCCGACCCAGCCTGTGCCGGAAATCCGACAGGAAATCCCGCATGTTCCGGAGGTAAACCGAGACTTCGGTCAGCTTGATGACGGTATCGACGTTGATAACGATGACTTCGACGACATGTTCGATGATATCGATGACGAATTCGCCGTCGATGATACCATCAACGGGAATGGTAAAACCGTCGCCACCGTACCGATTAACGGCAATGCCGCCGTGAATGCCCAGACACCGCAACCTGACGTCCCGCACGTAATCTGGTCACCTGGCGACAAGCCTAGCGACACGTCCGCCCGCGAGACGGCACAGACGGTCAGCAGCCTCGGTAACGCCATCGACATCGACGATTCCCTTGACGATCTCGGCATTCTCAAGGAGAAAATCGATAAGTACCAGACGCAGTTCGCCCAGATGAAGCAGCAGCTCGATGACGTAAAAATCAAGAATCCCGGTGACTTGAGCGTCCCGCGCGACGAGACGGGTAACGTCGACGAGTTCGGTGGGAAGCTCGCGCAACTTGACAACCTGCTGCAAAGTATGAAGACCGAGATGAACGCAATCGAGGCGAACCGTCCGACATACACAAAGCGTAAGCTTCCGACAGATAGGAAACCGAGTCGCGAGTCCCTTGAAGCCGAGGCTACCCGCATGCTCGATGAGCTTACCGATCTGAGCAAGTCGTCCGACAACCGCACGCAGGTTCAGAACAATCTCGTGCGTAAGTTTCTCGTATCAGACCAGCCCGAGCATCAGAAGGTCGTGGCACTCAGCGGCATGATCGCCACCCTTAACGAACGCAATATCAACTACAAGCGTAGCCTATTCATCATGAAGCAGCGCTATGAGGAGATGAAGGCTACCGTCGCAAACAACCAGAAGACCATCAAGGTCATGAAGGCGAACTACGACGACTTGGCTCAGGAATCTCGATTCAGCAAGGAAGCGCAACGTAAGGCTGAGAAGTTCGTCGCCGATGCCCGCGCCGACGTCGCGAAGGTCCTGAAGGAATCGCAGGAGCAAATCGACAAGATGCAGGAGTCCGTCGATGCGGCGGCTCGTATCGTGTCCGACGAGCGCGAGCGCCGAATCGCTGCCGAGCGTGCACGTCAGGAGACCATCGACAACTACCGTTCACGCGTCGCCGAGCTCGATTCGAAGCACGCGCAGCTCGACGATATCATCGCCGACGGGACGAAGCGCCGCGACGAGGCGCAGAGAGTCATCGACGATGCAAAGGCGCAGCTAAAGCAGATTCAGGACAACGCATACGCCGAGATTTCCAGAGAGAAGCAGAAAGCAGCCGATATCCGCAAGGCACTCGAGGACACGAGCAGCCAGCTTGACGCACAGAACGCCCGAGTCCGCCATGCTGAGGAGCAGGTCAACATCCTAAAGCG
GCAGTCGGAGCAGCGTCGACAGGAGTACGAAGCTCGCGTCGACGCCGTGAAAACGGAACGTGATCAGACTATCGCTTCCATCCGCGAATCCTGTGCGACCGAGATTGGAGCAGTCAAGGATTCATGCGAATCGCAAATCAAGCGAGCGAAGGAAGCCGCTAAGTCCCAGATGGACGAGATGCAGCGAAACCACGACGATGCGGTGACACAGCTGAAGGCGAACTTCGATGCCCAGACCGATACGGTCATGAGGGCATTCGCCGCGAAGTCCAGCGAGATGAGCGTGAAGTCTCAGACCCTGTCATCGCAGGTTGATTCGCTTACCGAGGAGCGCGACGAGCTGGTCAATCGAGTCAACGCTCTCCGCAAGACCGTGCAGGAGAACGGGATGAAGGCCGACCGAGAGAAGCGCTCACTCAATGAACTCATCGCCGACCGCGATTCCCGTATCGCCGAGCTTACCGCAAGCGTCAATGCGCTGACTAGGGACAAGAATCGAGCCCTGCAGCAGCTCGCAGTCGAGCATGACGGATGCAAGGATTTGCTCGACCGCATCGACAAGGCGACAAGCGGCAGCGGAATGTTCATGTCGAAGAAGGACATTCGCAGCATTACCTCGCGTCTGCACGATGTGCTCGACTATTCGCATATGGTCGGACGAACCGTCAAAGACGTCGACTTGGACACAGCAAAGGCGTTGCACCTCAGCGCACCGACGACTGATGATGCAGCAAGCGATTTGAGGGCTAGAATCGCGAATATTCCCGGATTCGACGAGCCGGCGGTAGCTGCACCATCCCATCTTGCCGAGGACGATTCCGATATCGATGAGCTCGATACGTTCTTTGATGATGAGGAGGAAGACGACGATGGGGATATCTTCAATGTCCCCGATGAGGCTCCGGCAAGCGATACGCTCATTGCCGACGTCAGCGCTTCGCTCGACGATACCGGCGCTGGGAACGATGCGATTGACGATATCGATATCGCAACCATTCCCGAGCCTGACGAGTCGAAGCAGGAGTCGTTTACCGGAACGTTCATGGCAAGCGATACCATCGCAAAGCTGAAACAGGCCGGTGTCAATCTCGACGCCATCTTTGAGGCGAGCAATGCCGATAACAATGAGAATAAGGAACGCGACGATAAGTTTTCCGATGGCGATTCCGATTTCATCTCCGGTAATGTCGATTCGCCTAGCAACGAGGAAGACGGGGAGACGGACGGGAAGAGGAAAGATCCTGCTACCAGTAAGAATGCCGATATCTCAAATGTTTCCGACGAGGACGTTAATAGGATTATGAACGATATCAAGTAG
Protein sequence
Length: 1106
MQVFIHRSGKRVGFASGTDHVVEEIQKCIDDGDNLRTADLAYDTSTDTLNVSVLGSDDQTTTGLVKIPSNAIIESNSYLDSVYAIIELVCKSSIDYSGILGQPTGGEISLLCIADVESFVDSDGEPDVNQCIDFFNIATNLADAAGIKTVEIVNGTNPFGIVPDDVANGESQEFQRIQAEIEQEYTDDDVPAQNSNPPLDLDEPTQPVPEIRQEIPHVPEVNRDFGQLDDGIDVDNDDFDDMFDDIDDEFAVDDTINGNGKTVATVPINGNAAVNAQTPQPDVPHVIWSPGDKPSDTSARETAQTVSSLGNAIDIDDSLDDLGILKEKIDKYQTQFAQMKQQLDDVKIKNPGDLSVPRDETGNVDEFGGKLAQLDNLLQSMKTEMNAIEANRPTYTKRKLPTDRKPSRESLEAEATRMLDELTDLSKSSDNRTQVQNNLVRKFLVSDQPEHQKVVALSGMIATLNERNINYKRSLFIMKQRYEEMKATVANNQKTIKVMKANYDDLAQESRFSKEAQRKAEKFVADARADVAKVLKESQEQIDKMQESVDAAARIVSDERERRIAAERARQETIDNYRSRVAELDSKHAQLDDIIADGTKRRDEAQRVIDDAKAQLKQIQDNAYAEISREKQKAADIRKALEDTSSQLDAQNARVRHAEEQVNILKRQSEQRRQEYEARVDAVKTERDQTIASIRESCATEIGAVKDSCESQIKRAKEAAKSQMDEMQRNHDDAVTQLKANFDAQTDTVMRAFAAKSSEMSVKSQTLSSQVDSLTEERDELVNRVNALRKTVQENGMKADREKRSLNELIADRDSRIAELTASVNALTRDKNRALQQLAVEHDGCKDLLDRIDKATSGSGMFMSKKDIRSITSRLHDVLDYSHMVGRTVKDVDLDTAKALHLSAPTTDDAASDLRARIANIPGFDEPAVAAPSHLAEDDSDIDELDTFFDDEEEDDDGDIFNVPDEAPASDTLIADVSASLDDTGAGNDAIDDIDIATIPEPDESKQESFTGTFMASDTIAKLKQAGVNLDAIFEASNADNNENKERDDKFSDGDSDFISGNVDSPSNEEDGETDGKRKDPATSKNADISNVSDEDVNRIMNDIK*
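
Consistency check

The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The sketch below assumes that the Location range is 1-based and inclusive, and that the listed protein length counts the trailing stop symbol (*). Neither convention is stated on the page. The helper names span_length and codon_count are hypothetical and used only for illustration.

```python
def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


def codon_count(dna_length: int) -> int:
    """Number of whole codons in a coding sequence of the given length."""
    if dna_length % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return dna_length // 3


# Values copied from the record above.
start, end = 156429, 159746
listed_dna_length = 3318
listed_protein_length = 1106  # assumed to include the trailing '*'

dna_length = span_length(start, end)
codons = codon_count(dna_length)

print(dna_length == listed_dna_length)   # coordinates agree with DNA length
print(codons == listed_protein_length)   # codons (incl. stop) agree with protein length
```

Under these assumptions the range 156429..159746 spans 3318 bases, which corresponds to 1106 codons: 1105 amino acids plus the terminal stop codon (TAG).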