NECEvent2014_9_1_scaffold_22_52

Organism: NECEvent2014_9_1_Candida_parapsilosis_39_137

megabin RP 26 / 55 MC: 5 BSCG 29 / 51 MC: 8 ASCG 31 / 38 MC: 10
Location: comp(99944..103072)

Top 3 Functional Annotations

Value Algorithm Source
Histone-lysine N-methyltransferase, H3 lysine-4 specific {ECO:0000256|PIRNR:PIRNR037104}; EC=2.1.1.43 {ECO:0000256|PIRNR:PIRNR037104};; TaxID=578454 species="Eukaryota; Fungi; Dikarya; Ascomycota; Sac similarity UNIPROT
DB: UniProtKB
  • Identity: 99.3
  • Coverage: 999.99
  • Bit_score: 2060
  • Evalue 0.0
Histone-lysine N-methyltransferase, H3 lysine-4 specific n=1 Tax=Candida parapsilosis (strain CDC 317 / ATCC MYA-4646) RepID=G8B631_CANPC similarity UNIREF
DB: UNIREF100
  • Identity: 99.3
  • Coverage: 999.99
  • Bit_score: 2060
  • Evalue 0.0
  • rbh
nuclear protein SET similarity KEGG
DB: KEGG
  • Identity: 39.3
  • Coverage: 150.0
  • Bit_score: 103
  • Evalue 4.10e-19

Taxonomy

Candida parapsilosis → Candida → Saccharomycetales → Saccharomycetes → Ascomycota → Fungi

Sequences

DNA sequence
Length: 3129
ATGTCTTACGGTGGTGGCTATTACAGCCATAGGTACGGGCAGCGAAGGGAACGGTATGAATCATATAGGCCGAATCGCTCGGTAGCCCCTTCTCAAGGTGGGCCTCGAGACTCCAACCCGCTGCCTGTACATTCACTACATATAGATGAGGGAAAGCCCGATCACCATCGCTATCATCATCATCGCGATAGGACAGAGTACACAAAAGGTTATCAGGGGCGACTGTCGCGATGGGATTCTGCCGAAAATTCAGGGCGAAATACTCCTTTATCTGCCAGTACCAATGGTTCACAATCTGAGGCTTATTCTAATACCAACACGAGAACCGGAGTTAATCAAAGGGACTTGGAATACAATAAACTAGCTCACCATCGTGATTGCACAAGGAATTTAGAGTCTTCAACAAAACTAGAGGGTGGCAAAAACTATAAGGTGATGTACGATCCCGAACTAGATCCTAGTCTATCAAAGAGTGAATCAAAATTGAAGTCGAAAAAAGTACGATTCAATGGAGAAGGGGTTGACAAGCCTCAAGATCCCCGACTACCGAGCTTGGCCAACTACTTACAAAAGCCAAACAAAAAGTCGTCCAAGTTCCCTTTCAAACAACTACCCCAAGCGAAATTCATATATGATCTGGATTCCATAGGTGAAGCACCGCTCACTACATTGGTAATCTGGGATCTTCCAATAAGTATCAATGAAACCTTTTTGAAAAACTTTCTCGCAAGCTTTGAATCCGGTATAGAAAACCTCAAGACATTTGATGATCCGAAATACGGAGTTCCTTTAGGCGTTGCCACTTTCAAATACCAAGGAAGCACTGAAAAGTCAAACTCATTGGCAGAGAAGTTCATTCAAACCGTTAAACGAGATTCCATAAAAATTGATGGGAATCTACTAAAAATTGCTTTGAATGATAATCAAGATGATTTTTTGAATCTGAAAATAAATTTAGCAAGAGGCAAAATCATTCAACAACAAAAAAAGAGAGAAGAGGAGGAAGAAAAGCGGCGATTGGAGAAACTTGAGGAGCAGAAAAAACTCGAACAACAGAGAATCGCAGAGGAGCAGAAAAAACTTGAAGAGGAGAGGAAGAAGAAGGAGGAGACTGAAAAATTGCGGGAAAAGGAGCTCGAAATAGCTGTGATTTCTGATGGTGGGACCACTCATTATCAGCGTGATACCACCGTTTTGTCCAAGAGGCACGGCAACAAAGTTGTTCAAGGTAATTTTTTGCCCGACGATTTGAAAAAGTATGTCAAGAATCGGCCCTACCTACTTATTCATGACAAGTACGTCTCAACAAAGAAGGTATCATCGCAAGAAATAAAACGAGCTTTTAAAAAATATGACTGGACTAGAGTTTTGTCTGACAAAACAGGCTTCTATATTGTGTTCAACTCCTTAAAAGAATGTGAGAGGTGTTTCCAAAACGAGGATTGTCGCCATTTTTATGAGTTCAGATTGGTAATGGAAATGGCTATCCCTGAGCACTATCAGGAGGTTGAGAAGGAGAATGACTTTAGCAATAGTGTCATCAATGAAGCCACAAACATTCTAATCAAGGAATTTGAAACTTTCTTGGTAAAGGACATTCGAGAGCGAATAATAGCGCCGCAAATTTTGAGTCTTTTGGATCACGATCGTTACCCAGCATTGGTCGAAGAACTCAAGGCAAAAGAGAAAGAACAAGCATTGAAGTCCAAGCTGGTAATGTCAAATATGGATTTGAAACAAAATGCCATGTCCATTCTAGAGAAGCAGAAGAGGGAGTTGCAGGAAAAGATTCCGTATTTTAAAAGAGCTGAGGAAAGACTTAAAGAAAGTAGAAGGGGCAATAAACGCCCGATTATTCCTATGCTGCATGCCTTGAACCTTGACGAGGATGAGGAGGAGATTGATAATGATGACCTTTATGAATCGGTGAGTGGGTCCATGACGCCCTTGGCTGAGCCACTAAAAAGAGTTCGTAGCTCAACTGCAACTAGCGTTGCTGA
GGACTCTGAAATTGATGAGCCTGCTGCTAAACGACAAAAATCGAAGCTACAGGTATCATATGATATTATTTCTTCTGGGGATGAACAAATGGAAGATTTGGAAGAGCCAGAAGAAATTGCAGATGAAGAGAATGAACCTGAGTCACAGACTGATATTGTGGATTCAAAGTATGGCCCTACTGAGGGTAAGCCATCAACTGTGTATCCAGTCTCTCTGACATCCATAATCACTGACTTGAAGGGTCTTCAAGAAAACATAGTCGATGGTGAGGACTTGGAGTTAGCAAATACGGTTTTAGCTGATGTTGAGGGGGTTAATTTGTCACATATAGACTATTGGGCGTGGAAACAGAATGCCAACACTACAGGTTCACTTGAGGTTGATGAACATGAAGATATATTGGAAGAATTGCCTTCTAGACTTGACTCAATCACCGGCTCATTCAAGAGTGATGGTTTCAAAAAGATTTCTGAAGTGGACAAAGTTGAGTATTTGCCACATCGTAGGAAGGCAAATAAGCCTATCAAGACTGTTCAGTATGAAGAAGATGATGACGAAAAACCCGCTGACAGCAATAATGTGTTGCAAAGCTCCAGGGTCAATAGGGCAAACAACAGAAGGTTTGCGGCAGATATTACCGCACAAATCGGTTCTGAATCGGAAGTGATGTCCTTGAATGCGCTCACCAAGCGTAAAAAGCCTGTCACTTTTGCTAGGTCAACAATCCATAACTGGGGTCTTTATGCTATGGAGCCTATTGCGGCTAAGGAGATGATTATTGAGTATGTTGGAGAAAGAATTCGACAACAAGTGGCTGAGCATCGAGAGAAAAGCTATTTGCGAACAGGTATCGGTTCCTCTTATCTATTTCGAATTGATGAAAATACAGTTATTGATGCAACAAAGAAGGGAGGCATTGCTCGATTTATTAATCATTGTTGTAACCCGAGTTGTACAGCTAAAATCATTAAAGTTGAAGGTAAGAAGCGAATTGTCATTTATGCTCTTAGAGACATTGAAGCGAATGAAGAGTTGACCTATGATTATAAATTTGAAAGGGAAACCAATGATGAAGAACGTATTCGATGTTTGTGTGGTGCTCCTGGCTGCAAAGGCTATCTCAATTGA
PROTEIN sequence
Length: 1043
MSYGGGYYSHRYGQRRERYESYRPNRSVAPSQGGPRDSNPLPVHSLHIDEGKPDHHRYHHHRDRTEYTKGYQGRLSRWDSAENSGRNTPLSASTNGSQSEAYSNTNTRTGVNQRDLEYNKLAHHRDCTRNLESSTKLEGGKNYKVMYDPELDPSLSKSESKLKSKKVRFNGEGVDKPQDPRLPSLANYLQKPNKKSSKFPFKQLPQAKFIYDLDSIGEAPLTTLVIWDLPISINETFLKNFLASFESGIENLKTFDDPKYGVPLGVATFKYQGSTEKSNSLAEKFIQTVKRDSIKIDGNLLKIALNDNQDDFLNLKINLARGKIIQQQKKREEEEEKRRLEKLEEQKKLEQQRIAEEQKKLEEERKKKEETEKLREKELEIAVISDGGTTHYQRDTTVLSKRHGNKVVQGNFLPDDLKKYVKNRPYLLIHDKYVSTKKVSSQEIKRAFKKYDWTRVLSDKTGFYIVFNSLKECERCFQNEDCRHFYEFRLVMEMAIPEHYQEVEKENDFSNSVINEATNILIKEFETFLVKDIRERIIAPQILSLLDHDRYPALVEELKAKEKEQALKSKLVMSNMDLKQNAMSILEKQKRELQEKIPYFKRAEERLKESRRGNKRPIIPMLHALNLDEDEEEIDNDDLYESVSGSMTPLAEPLKRVRSSTATSVAEDSEIDEPAAKRQKSKLQVSYDIISSGDEQMEDLEEPEEIADEENEPESQTDIVDSKYGPTEGKPSTVYPVSLTSIITDLKGLQENIVDGEDLELANTVLADVEGVNLSHIDYWAWKQNANTTGSLEVDEHEDILEELPSRLDSITGSFKSDGFKKISEVDKVEYLPHRRKANKPIKTVQYEEDDDEKPADSNNVLQSSRVNRANNRRFAADITAQIGSESEVMSLNALTKRKKPVTFARSTIHNWGLYAMEPIAAKEMIIEYVGERIRQQVAEHREKSYLRTGIGSSYLFRIDENTVIDATKKGGIARFINHCCNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERETNDEERIRCLCGAPGCKGYLN*
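
Consistency check

The Location field and the two Length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive and that `comp(...)` marks a feature on the complementary strand, neither of which is stated on this page. The helper names `parse_location` and `feature_length` are hypothetical and not part of any ggKbase tooling.

```python
import re

# Matches either "comp(START..END)" or a plain "START..END" (assumed syntax).
_LOCATION = re.compile(r"^(?:comp\((\d+)\.\.(\d+)\)|(\d+)\.\.(\d+))$")


def parse_location(text):
    """Return (start, end, strand) for a location string like 'comp(99944..103072)'."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    if match.group(1) is not None:
        # "comp(...)" form: assumed to mean the complementary (minus) strand.
        return int(match.group(1)), int(match.group(2)), "-"
    return int(match.group(3)), int(match.group(4)), "+"


def feature_length(start, end):
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


start, end, strand = parse_location("comp(99944..103072)")
length = feature_length(start, end)
codons, remainder = divmod(length, 3)
print(strand, length, codons, remainder)  # - 3129 1043 0
```

Under those assumptions the span works out to 3129 bases, matching the DNA Length field, and divides evenly into 1043 codons, which equals the reported protein Length.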