ggKbase home page

13_1_40cm_2_scaffold_10551_8

Organism: 13_1_40CM_2_Thaumarchaeota_39_13_2

partial RP 7 / 55 BSCG 4 / 51 ASCG 18 / 38 MC: 1
Location: 5000..8347

Top 3 Functional Annotations

Value Algorithm Source
DNA-directed RNA polymerase {ECO:0000256|RuleBase:RU000431}; EC=2.7.7.6 {ECO:0000256|RuleBase:RU000431};; TaxID=1001994 species="Archaea; Thaumarchaeota; Nitrosopumilales; Nitrosopumilaceae; Candidatu similarity UNIPROT
DB: UniProtKB
  • Identity: 87.8
  • Coverage: 999.99
  • Bit_score: 2000
  • Evalue 0.0
rpoB; DNA-directed RNA polymerase subunit B (EC:2.7.7.6) similarity KEGG
DB: KEGG
  • Identity: 86.7
  • Coverage: 999.99
  • Bit_score: 1989
  • Evalue 0.0
DNA-directed RNA polymerase n=1 Tax=Candidatus Nitrosoarchaeum koreensis MY1 RepID=F9CUN4_9ARCH similarity UNIREF
DB: UNIREF100
  • Identity: 87.8
  • Coverage: 999.99
  • Bit_score: 2000
  • Evalue 0.0

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Candidatus Nitrosoarchaeum koreensis → Candidatus Nitrosoarchaeum → Nitrosopumilales → Thaumarchaeota → Archaea

Sequences

DNA sequence
Length: 3348
ATGGCAAACATTTCAAATAAACGCTGGCCTATCATTCAAGATATTTTGAAGAGGGAGGGTATAGCTAGGCAACATCTTAACTCATACGATGAATTTCTTGAAAGAGGATTACAAAGCATCATCAACGAAGTCGGACAGATTGAAATAGAAAGTGCAGAATATCCGTACAAGATTCAGCTGGGCAAAGTAAAACTCCAACAACCACGTATGATGGAATTGGATGGTTCAATAACTCACATAGCTCCAATGGAAGCACGATTAAGAAACGTAACTTACGCTTCTCCAATAATGCTAGAGGCGAGCGTAGTAGAGGACGGTAAAATTCTAGAATCCAGATACATTCACATTGGAGACATGCCAGTAATGGTAAGATCGAATGCATGTATATTGCATAACCTTTCTGATCAAAAACTCGTAGAATACGGCGAGGATCCAAACGATCCCGGTGGATACTTTATCATAAATGGTTCAGAACGTGTGATAGTTGGATTAGAGGATCTTTCTTACAACAAAATCATTGTAGATAAAGAAATGGTCGGTGGAAATATTGTGTTTAAAGCAAAGGTGTATTCTTCGATAGTTGGATATCGTGCAAAGTTAGAACTTGTAATGAAAAATGATGGTTTGATTGTAGCCAAGATTCCTGGTTCTCCAGTTGATATTCCTGTAGTTACCTTAATGCGCGCGCTTGGACTTGAGTCTGATAGAGAAATTGCTGCAGCTGTTTCGTTAGTAGACGACATTCAGGATGAACTAGAAGCCTCATTTGAAAAATCAGGAGACGTCCCAACTGCAAAGGATGCTATTGTTTACATCAGTAAGAGAATTGCACCAGGAATGTTAGAAGAATTTCAAATTAAAAGAGCTGAAACTCTACTTGATTGGGGTCTCTTACCACATCTTGGAAAACATCCGGATAACAGAAAGGAAAAGTCGCTATTTCTTGGAGAGGCAACTTGTAAATTAATTGAGCTAAAATTGGGATGGATATCTCCTGACGACAAAGATCACTACGGAAATAAGGTCATAAAATTTGCAGGTCAAATGCTGGCAGACCTTTTCAGGACTGCTTTCAGAAACTTGGTTCGAGACATGAAGTATCAACTAGAAAGGTCAGGTCAGAAAAGAGGAATAAATGCGGTTGCGGCAGCTGTGAGACCAGGAATAGTGACAGATAAACTAAACAATGCTATTGCAACTGGAAATTGGGGCAGAGGAAGAGTAGGCGTTACCCAACTGTTAGACAGAACTAATTATCTTTCAACAATAAGTCATCTAAGAAGAATCCAATCCCCACTTAGTAGAAGTCAACCTAATTTTGAAGCAAGAGATCTTCATGCTACACATTTTGGAAGAATATGTCCAAGCGAAACTCCAGAAGGTTCTAACTGTGGATTGGTCAAGAACTTGGCCCTGTCAGCAATAATATCAGTGAATGTTCCCTCTGAAGATATCGTAGAAAAGCTCTACGATCTCGGTGTGACTTATGTCTCTGATGCGAAAGAAGAATTGAAGAAAGAAGGTGCTAGAGTTTTCGTAGATGGCAGGTTAATTGGATATTACAAAGATGGACAAAAGCTCGCAGACTCTTTGAAAGAACTTCGAAGAAACTTCAAGATTCATCCACACGTCGGTATCTTCTTGTACCAATCTGATTTTGAGGGATCAACTAAGAGACTCTATGTTAACTGTAATGCGGGTAGAGTATTGCGTCCGTTAATTGTAATTAAAGATAGCAAGCCTCTCCTAACACAAGAACTAATTGATAAGGTAAGCAAGAAATTTCTCTCATGGACTGATCTGTTGCATATGGGAGTCATTGAGTTAGTTGATGCTAACGAAGAAGAAAACTGCTATACTGCAATAGATGAGAATGATGTGAAAAAGCACACCCATATGGAAGTATTTCCATCAGCAATTCTTGGCGCAGGCGCATCGATAATTCCATATCCAGAACACAACCAGTCTCCAAGAAACACATACGAGTCAGCGATGGCAAAACAAAGCCTTGGTTTTTCAACTCCTCTGATGAATGCAAGCACCTACGTAAGACAACATCTCATGCTTTATCCTCAAACCCCTATTGTAACCACCAAAGCGATGGGCCTTCTGGGATTAGAAGAAAGACCAGCAGGCCAGAACTGTGTAGTCGCTGTACTTCCCTTTGATGGTTATAACATTGAAGATGCAATAGTACTGAGCAAGTCATCTGTTGAACGAGGACTGGGCAGAACATTTTTTTACAGAATTTACGAAGCAGAAGCCAAACAATATCCAGGAGGAATGCGCGACAATTTCGAAATCCCAACGGCAGAAGGAAATATTCGTGGATTCCGTGGAGATAAAGCATACAGATTGTTGGAAGAAGACGGAGTAATTGCAACCGAAGCCACAGTTCAGGGCGGTGATATACTAATCGGGAAAACTAGTCCTCCTAGATTCATGGAAGAGTACAGAGAATTCGAAGTAAAAGGACCTTATAGGAGAGATACCTCTGTTGGAGTAAGACCATCAGAGAGTGGGGTTGTTGATACTGTAGTTATGACTCAATCTCACGACGGAGGAAGAATGTACAAGATTAGAGTAAGAGATCTTAGAATTCCTGAAATTGGTGATAAATTTGCATCAAGGCATGGACAGAAAGGAGTAGTAGGATTACTAGTAAACCACGAAGATCTTCCTTATACCGAAGAAGGAATTGTACCGGATGTTCTAATTAATCCACACGCATTCCCATCAAGAATGACGGTAGGCATGTTTCTTGAATCAGTTACAGGAAAAGCTGCAGCATTACGCGGTAGTAAAATGGACGGTTCCGCATTTGTTGGCGAAAAGTTAGAAGATGTTAAAGGAGTTTTGGAAGCTGCAGGTTTCAAGTATTCAGGCAAAGAGACAATGTATGATGGGAGAACAGGTAAAGCATTTCCAGTTGATGTTTTCATTGGAGTAGTATATTACCAAAAACTACATCACATGGTAGCTGATAAGATTCATGCTAGAGCTAGGGGCCAAGTTCAGATGTTAACAAAACAGCCAACAGAAGGACGCGCCAGAGGTGGTGGTCTGAGATTTGGAGAAATGGAAAGAGACTGTCTTATAGCTTATGGAGCCTCTATGATGTTAAAAGACAGATTACTAGATGAATCTGACAAAGCTGACATATACGTTTGTGAGAGATGTGGCTTAGTTTCGTATTATGATATAAAACAGAGAAGATTTGTTTGCAGAGTTTGTGGTGACAAAGCGAAAGTCACTTCAGTCTCCGTTGCATATGCGTTTAAGCTACTACTTCAGGAAATGATGAGCCTTGATGTGGCACCACGACTTTTGATAAAGGAGAGAGTGTAA
PROTEIN sequence
Length: 1116
MANISNKRWPIIQDILKREGIARQHLNSYDEFLERGLQSIINEVGQIEIESAEYPYKIQLGKVKLQQPRMMELDGSITHIAPMEARLRNVTYASPIMLEASVVEDGKILESRYIHIGDMPVMVRSNACILHNLSDQKLVEYGEDPNDPGGYFIINGSERVIVGLEDLSYNKIIVDKEMVGGNIVFKAKVYSSIVGYRAKLELVMKNDGLIVAKIPGSPVDIPVVTLMRALGLESDREIAAAVSLVDDIQDELEASFEKSGDVPTAKDAIVYISKRIAPGMLEEFQIKRAETLLDWGLLPHLGKHPDNRKEKSLFLGEATCKLIELKLGWISPDDKDHYGNKVIKFAGQMLADLFRTAFRNLVRDMKYQLERSGQKRGINAVAAAVRPGIVTDKLNNAIATGNWGRGRVGVTQLLDRTNYLSTISHLRRIQSPLSRSQPNFEARDLHATHFGRICPSETPEGSNCGLVKNLALSAIISVNVPSEDIVEKLYDLGVTYVSDAKEELKKEGARVFVDGRLIGYYKDGQKLADSLKELRRNFKIHPHVGIFLYQSDFEGSTKRLYVNCNAGRVLRPLIVIKDSKPLLTQELIDKVSKKFLSWTDLLHMGVIELVDANEEENCYTAIDENDVKKHTHMEVFPSAILGAGASIIPYPEHNQSPRNTYESAMAKQSLGFSTPLMNASTYVRQHLMLYPQTPIVTTKAMGLLGLEERPAGQNCVVAVLPFDGYNIEDAIVLSKSSVERGLGRTFFYRIYEAEAKQYPGGMRDNFEIPTAEGNIRGFRGDKAYRLLEEDGVIATEATVQGGDILIGKTSPPRFMEEYREFEVKGPYRRDTSVGVRPSESGVVDTVVMTQSHDGGRMYKIRVRDLRIPEIGDKFASRHGQKGVVGLLVNHEDLPYTEEGIVPDVLINPHAFPSRMTVGMFLESVTGKAAALRGSKMDGSAFVGEKLEDVKGVLEAAGFKYSGKETMYDGRTGKAFPVDVFIGVVYYQKLHHMVADKIHARARGQVQMLTKQPTEGRARGGGLRFGEMERDCLIAYGASMMLKDRLLDESDKADIYVCERCGLVSYYDIKQRRFVCRVCGDKAKVTSVSVAYAFKLLLQEMMSLDVAPRLLIKERV*