ggKbase home page

13_1_20cm_2_scaffold_11711_4

Organism: 13_1_20CM_2_Thaumarchaeota_39_11

partial RP 2 / 55 BSCG 1 / 51 ASCG 15 / 38 MC: 3
Location: 2082..5519

Top 3 Functional Annotations

Value Algorithm Source
Very large, secreted (Periplasmic) protein n=1 Tax=Candidatus Nitrosoarchaeum koreensis MY1 RepID=F9CVS0_9ARCH similarity UNIREF
DB: UNIREF100
  • Identity: 34.8
  • Coverage: 999.99
  • Bit_score: 668
  • Evalue 9.40e-189
  • rbh
hypothetical protein Tax=CSP1_1_Thaumarchaeota UNIPROT
DB: UniProtKB
  • Identity: 34.2
  • Coverage: 999.99
  • Bit_score: 672
  • Evalue 9.30e-190
hypothetical protein similarity KEGG
DB: KEGG
  • Identity: 33.3
  • Coverage: 999.99
  • Bit_score: 633
  • Evalue 9.60e-179

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

CSP1_1_Thaumarchaeota → Nitrosopumilales → Thaumarchaeota → Archaea

Sequences

DNA sequence
Length: 3438
ATGAGATACTGGAAATCATACTTTATTGCAAGCTTTTTGGTGCTAATTCTAATTGGGCAAAGCCATACGGTTATGGGACAGGCAACCCCTATAGCAAATCGCATAGTCATAAACGAGGTCGATACAAACCCGGCAGGTGATGATTCAAAGCAGATAATTCAATGGGTCGAACTGTATAACCCTACAAATAGTCCAGTAAACATTGGTGGTTGGTCAATTGGCGCAACTACCGGACTTAGAGACCATTACACAATTCCAAATGGTATAATGATTCCAAGTCAACAGTTTCTTGTTTATACATATGGGCCGCTCTGGTTCCCCCACGTTGGTGCAGTAGTACAACTAAAGGATTCAAACGGAACAGTGGTTGATCAAACTCCTCCTTTAAGCGATCAAACAGATGATACTAATTCTTGGCAGAGAGCATATGATGGATACGATACAGGTTCACAAAGTGATTGGGTTTTCAGGCCTGGAAACCCTGGTTCGTCAAATGGAAAACTAGTAAGCACAACTGCTTCAAGTCAGATAACAGTCTCTCTAGCTACAGATAAAACAAGTTACAACTTTGGTGATACAGTTAGTATAAGTGGGCAGGTTTCAAAACTTGCATATTCAATTGATGGCTATCCTCAACAAGTAAAGCTATTGGTATCAGGATCAAAAGGATTTCAAAAGACATTCACTTTGTACCCTGAAAATAATCTAGAGTTTGCAACCAGCATGAAGCTTGATGAGATCTTGGGATTTCAGGACGGTAACTATACAATATCAGCATCATATGGAGATGCGCAAACATCAACTGTCTTCTCGCTTGGTGCTGCAACCTTTGTTCCTCCACCGCAGGCTGCTCCGGCTGCATTATCTATTTTCACGGACAAGCCAACTTATACAATTTCGCAGCCAATTACGCTGTCAGGAAATGTCTCAAAGGTGATTCCACTTACTGCTGTTACTTACAAGGTATATGATCCACACAATAGCTTGGTCTCTCAAGGTACTATCTTTCCTGATTCTCAGGGAAAATTTTCTTCTTTCAATCCTTACCAACAACACTTGACCAATTCAGGAATAACAATTAACAGCGTTAATCCTATTTACGGCACATATGACATAATCGCAGCCTACGGTGGCGCAACCGCCACTACTTCATTCACACTTGTTCCAGAATCAGTTCAAAATAGTGCAATATCGCTGTCTACTGATAAGCAAGTATATGGACTTGGAGATACTGTTACAATTACCGGCCGAAGTAACAAGGCCTGGGTTCCATCAATCAATTTAGAGTTGGTTCAATCATATTCTCTTGGGGTAGTGCCGCAAACACTTGATATTAAAACACAAGTGAGTGTGGCAGGCGACAGTACCTTTAGCTACCAGTATACCATTCCAGGAAAGCCTGATAGACTAGGAACATATCGTGTTATAGCTTCAGCACCGTTTGCAACAACAGAATCAGATTTTGTAGTTGTAGAAAATCCTGGTACTTACCAAGCTCCACCTTCTTCTCCACTTAGTATAACAACTGACCAGTCCTCATATGGCATAGGAGATGCAATTTTAATCTCGGGCAAAGTTACAAAATCAATCACAACTCCAACCATCACTGGCGTAAGCGTGAAGATTCTGGTGATAAATTCTAATGGTTCAGCCATAATATCTTCTCCAAATGTTGGGCCAAACTTATCAGGAGCTGGCCAGGGAGTTCAAGCTACACCACTAACATTTTATGCATATCCTGATAGTACTGGCAATTTCCAAATAAAAGAAACTTTAGCGAGGAGCGTTTTTGAAAAGGGTAATTACACTCTCAAAGCAATTTATGGCACATTATCTGCAACAACATCGTTCAATGTTTATGATCCACTTGACACTGGCTCACAGGGATCCATAATTGCTAGTACAAATAAGAAAGTTTACGGTATCGGAGAGACAGTCTTACTGGACGGCAAGATCTCAAGTTTGACCGGCACATCCACATACACGCTGACTCTATTGAAGCCAGATGGTACCATAATTAGTACGCCTCTACAGATAAACAATGGGTTGTTTTCTTGGAGTTGGACTATACCAAGCACTGCTACAGCTGGTAGCTCTCAAATAATTGGCACTGATAGAAAATCTGTTGTTTCAGTAAACCCGTCAGAAAATCTCTACGGTATCTATAGAATAACAATAAGTTCAGAGTATGCTAGAGGTGAATTCTTCTTCCAGGTTTCCAAGAGTCCACAAAACGCAACTGAGATATCGCCAATAGCTATCGAGACCGATAATACAGAATATGTGACCACAGACGTAATCAAAGTCTCAGGTCAAGTTGTACCTGAAATAAATGCCGCAGCAAAGGAGGCAAATGCTATGGTACGAATCATAGTCTTCTCTGACAAAGGCCAGCAAGCATATAGAGCAGATGCAAACGTTAACGCCGGCGGGCAATTCCATATCTCTATTCCGTTGCAGCCAGGCGTCTGGAGAAGTGGAACCTACAAGCTATATGCTCAATATCTTACTGCAAACACTAGAACTGATTTCAAAGTAGCGGATCCGTTTACAACTAGCTCGGGAAAACTTCAGGTTTTCATGACAACAGACCATGACAAGTATCTTCCAGGCCAGACTGTTCTGATAACAGGAAGGACAAGCTATATCATATCAATAAATACTGTGGATATTGCCATAGGAAAATCAGATGATGTTATCATTTCTGAAGGACAGATAATGTCAAAGAAAGGCAACGTGCTACCGCATGCGACCGCGTCTTTTGATCAGACAGGATCATTTAGCTACGATTATACAATTCCAACGACTGCCTCTATTGGCAACTATACAGTAGTGGCTCAGGTGCCTTTTGGAGCTTACGAGGCTCACTTTGAGATAGTTAGCCAACTGCCAGCAGAGAATGTTTTGCCGCAAGAAAATGCCACCCAGAGCACTAACGAAACGCAGAATGCTCAACCATTGACAACACTTCCAGATAGTATAGGACCTGTCCAAAAGCCAATGAGCCCGAATATGATAACAGAAAAGACTGGTAGAATATCTACACCTCTTATTCCTATTACGCTTGCCGCAAAATCAATTGGAAACAAGACATACTTTCCAATAGAACTTGATGGACTTCTCAGAGCAAATCCTGGAGACGAGAACAATGTGAATCTCAAGGTTACTCTAGAAAATGGTGCTTGTATAGTAGGGCAAGATTCTAACTGCATGGTGAGTGCATCAACAATCAAAGGCAATGTATTGTACGAAGTTGTAAAGATAGGAAATCAAAACTTTTTCATAGGATATTCAGGTGCTGGAGCAAGGCTAGAGCAGTTTAGTATTATCCCTACAAACGCAAATGACGTGATTCCCGATGGACAGTGGAATGTTGAGATAATCAAAAAAGACCAAACTTCAAGATTCTATTATCAAGTTGCCTACGCAGCAAAGTAA
PROTEIN sequence
Length: 1146
MRYWKSYFIASFLVLILIGQSHTVMGQATPIANRIVINEVDTNPAGDDSKQIIQWVELYNPTNSPVNIGGWSIGATTGLRDHYTIPNGIMIPSQQFLVYTYGPLWFPHVGAVVQLKDSNGTVVDQTPPLSDQTDDTNSWQRAYDGYDTGSQSDWVFRPGNPGSSNGKLVSTTASSQITVSLATDKTSYNFGDTVSISGQVSKLAYSIDGYPQQVKLLVSGSKGFQKTFTLYPENNLEFATSMKLDEILGFQDGNYTISASYGDAQTSTVFSLGAATFVPPPQAAPAALSIFTDKPTYTISQPITLSGNVSKVIPLTAVTYKVYDPHNSLVSQGTIFPDSQGKFSSFNPYQQHLTNSGITINSVNPIYGTYDIIAAYGGATATTSFTLVPESVQNSAISLSTDKQVYGLGDTVTITGRSNKAWVPSINLELVQSYSLGVVPQTLDIKTQVSVAGDSTFSYQYTIPGKPDRLGTYRVIASAPFATTESDFVVVENPGTYQAPPSSPLSITTDQSSYGIGDAILISGKVTKSITTPTITGVSVKILVINSNGSAIISSPNVGPNLSGAGQGVQATPLTFYAYPDSTGNFQIKETLARSVFEKGNYTLKAIYGTLSATTSFNVYDPLDTGSQGSIIASTNKKVYGIGETVLLDGKISSLTGTSTYTLTLLKPDGTIISTPLQINNGLFSWSWTIPSTATAGSSQIIGTDRKSVVSVNPSENLYGIYRITISSEYARGEFFFQVSKSPQNATEISPIAIETDNTEYVTTDVIKVSGQVVPEINAAAKEANAMVRIIVFSDKGQQAYRADANVNAGGQFHISIPLQPGVWRSGTYKLYAQYLTANTRTDFKVADPFTTSSGKLQVFMTTDHDKYLPGQTVLITGRTSYIISINTVDIAIGKSDDVIISEGQIMSKKGNVLPHATASFDQTGSFSYDYTIPTTASIGNYTVVAQVPFGAYEAHFEIVSQLPAENVLPQENATQSTNETQNAQPLTTLPDSIGPVQKPMSPNMITEKTGRISTPLIPITLAAKSIGNKTYFPIELDGLLRANPGDENNVNLKVTLENGACIVGQDSNCMVSASTIKGNVLYEVVKIGNQNFFIGYSGAGARLEQFSIIPTNANDVIPDGQWNVEIIKKDQTSRFYYQVAYAAK*