ggKbase home page

13_1_40cm_4_scaffold_239_27

Organism: 13_1_40CM_4_Thaumarchaeota_48_7

partial RP 25 / 55 BSCG 16 / 51 MC: 1 ASCG 28 / 38
Location: 19370..22720

Top 3 Functional Annotations

Value Algorithm Source
DNA-directed RNA polymerase {ECO:0000256|RuleBase:RU000431}; EC=2.7.7.6 {ECO:0000256|RuleBase:RU000431};; TaxID=1237085 species="Archaea; Thaumarchaeota; Nitrososphaeria; Nitrososphaerales; Nitrososph similarity UNIPROT
DB: UniProtKB
  • Identity: 98.1
  • Coverage: 999.99
  • Bit_score: 2187
  • Evalue 0.0
rpoB; DNA-directed RNA polymerase subunit B (EC:2.7.7.6) similarity KEGG
DB: KEGG
  • Identity: 98.1
  • Coverage: 999.99
  • Bit_score: 2187
  • Evalue 0.0
DNA-directed RNA polymerase n=2 Tax=Candidatus Nitrososphaera gargensis RepID=K0ILI7_NITGG similarity UNIREF
DB: UNIREF100
  • Identity: 98.1
  • Coverage: 999.99
  • Bit_score: 2187
  • Evalue 0.0

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Candidatus Nitrososphaera gargensis → Nitrososphaera → Nitrososphaerales → Nitrososphaeria → Thaumarchaeota → Archaea

Sequences

DNA sequence
Length: 3351
ATGATAGAGAATTCTATTGAGATGTGGCCAATCATCAATGACATTTTGCGAAGAGAAGGTGTTGCAAGGCAGCACTTGAACTCTTATAACGAATTCATGGAACGCGGATTACAGAGCATAATTGACGAAGTAGGCGAGATCGAAATTGAAACTGCGGAATACCCGTACAAGATCAAACTGGGCAAGATAAAGCTCCAGCAGCCAAGGATTATGGAGCTTGACGGTTCTATTACGCATGTCGCGCCGATGGAAGCCCGTCTCCGCAACCTTACCTACGCTTCACCGGTCATGCTAGAGTGCAGCGTCGTAGAGGACGGTAAGATCCTTGAATCGCGGTTTATCCACATCGGCGACATGCCAGTTATGGTCAAGTCCAACACCTGCATATTGCACAATCTGCCGGAGAGCAAGCTTGTCGAGTTCGGAGAGGACCCGCGGGACCCCGGCGGATACTTTATAATCAACGGCTCCGAGCGCGTCATCGTCGGACTTGAAGATCTTTCATACAACAAGATAATCGTGGATGCTGAGGAAACCACCGGAACCTTGCTTTACAAGGCAAAAGTATATTCTTCAATAGTCGGCTACAGGGCGAAACTTGAACTGGTAATGCGCCCGGATGGCTCTATAGTGACTAAGATCCCCGGCTCGCCAGTTGATATTCCATTGATCACGCTGATAAGAGCGCTCGGCCTCGAGTCTGATAAGGACATTGCAGATTCTGTCTCACTCAATGAAACAATCCAGGACGAGCTCGAGCCTTCGTTTGAAAAGGCTGGCGATGTCAACACGAGCAGGGATGCTATCGTCTATATCAGCAAGAGAATTGCGCCCGGCATGCTGGAAGAATTCCAGATAAAGAGAGCTGAAACATTGCTTGACTGGGGTCTTTTACCGCACATTGGCAAGAACCCCGACAACAGACACGAAAAGGCATTGTTCCTGGGCGAAGCAGCGAGCAAACTCATCGAGCTCAAGATGGGCTGGATAGACTCCGATGATAAAGATCATTATGGCAACAAGGTGATAAAGTTCGCCGGCCAGATGCTTGCAGACCTATTCAGGACTGCTTTTCGCAACTTGATACGCGACATGAAATACCAGCTCGAGCGCTCGGGCCAAAAGCGCGGTATCAACGCAGTCGCTGCAGCAGTCCGACCAGGCATTGTTTCTGACAAGCTGAACAACGCCATTGCAACAGGCAACTGGGGCAGGGGTAGGGTTGGAGTCACGCAATTGCTTGACAGGACAAACTACATGTCCACCATTTCGCACTTGCGCAGGATCCAGTCGCCGCTCAGCAGGAGCCAGCCAAACTTTGAGGCAAGGGACCTGCACGCAACGCACTTTGGTAGGATCTGCCCGGCGGAGACTCCTGAGGGCTCTAACTGCGGACTAGTCAAGAACCTCGCGCTGTCGGCAATAATTTCGGTCAATGTTCAGAGCGCCGAGGTCACTGAAAAGCTGTACGAGCTTGGAGTGCAAAACGTGGAAGAGGCCGATGAGGACCTGAGGGAATCTGGCACGAGGGTGTTTGTCGATGGCAGGCTGATAGGTTACGTCGAAAAGGGGGAGCACTTGGCCGACACGCTTAGGTCGATGAGGAGGTCTGGCAAGATCCATCCACACGTCGGAATCTATCTTTACAGCTCGCAGAACGATAGTGCCACAAAGCGACTCTACATCAGCTGCAATGCAGGCAGGGTGCTGCGACCGCTCGTGGTAATAAGGGACAACAAGCCGCTCGTAACGTACGAGGTTATTGAAAAGGTATCGAAAAAGTTCCTTTCATGGAACGACCTCCTATACATGGGAGTGATTGAGCTTACAGACGCAAACGAGGAGGAGAACTGCTACGTCGCAATCGATCCTAAAAAGCTGGAGCCAAAACACACACATTTGGAGATATTCCCATCAGCAATTCTGGGCGTGGGCGCTTCCATAATCCCGTACCCAGAGCACAACCAGTCGCCTAGGAACACATACGAAAGTGCGATGGCAAAGCAGAGCCTTGGATTTTCAACGCCGCTGATGAACGCAAGCACGTACGTTAGACAGCATTTCATGCTGTACCCGCAGACGCCGGTAGTAAGCACAAAGGCCATAAACCTGCTTGGGCTCGAGGATCGTCCAACGGGCCAGAACGCTGTCGTTGCCGTGCTTCCGTTTGAAGGCTATAATATCGAAGATGCCGTAGTGTTCAACAAGTCGTCAGTCGACAGGGGACTTGGCAGGACGTTCTTCTACAGGATCTACGAAGCTGAAGCAAAACAGTACCCTGGCGGAATGAAGGACAACTTTGAGCTTCCACAGGCGGACGCCAACATCCGCGGCTACAGGGGTGAAAAGGCATACCGCCTGCTAGAGCAGGACGGCGCGATAATGCACGAAGCTGTTGTCAATGGAGGCGACATTTTGATAGGCAGGACGTCGCCGCCAAGATTCATGGAAGAGTACAAGGAATTTGAGGTTAAAGGCCCGTACAGGCGCGACACTTCAGTCGGTGTCAGGCCGTCAGAAAACGGAGTAGTCGATACGGTCATTGTCACGCAGTCAGTCGAAGGCGGCAAGATGTACAAGATTCGCGTCCGTGACATGCGCATTCCGGAGATCGGCGACAAGTTTGCCTCAAGGCACGGACAGAAAGGGGTCATCGGCATGCTGGTTAATCAGGAAGATGTGCCATACACTGAGGACGGTGTCGTGCCTGACATTATGATCAACCCACACGCGTTTCCATCAAGGATGACGGTGGGCCAGTTCATGGAGTCGCTTGGCGGTAAGGCTGCTTCGCTCCGGGGCAGGATTGTGGACGGCTCGGCATTTCTCGGCGAGAAAGGCGATGACATCAAGAGCGCGATGGAAGAGTATGGCTTCAAGTACACTGGCAAGGAAGTGATGTACGACGGCAGGACGGGCCGAAAGTTCCCGGCAGACGTCTACGTCGGCGTCGTATACTACCAGAAGCTGCACCACATGGTTGCCGACAAGATTCACAGCCGTGCAAGAGGCCAGGTCCAGATGCTCACGAAGCAGCCGACCGAGGGCCGTGCCCGCGGCGGTGGTTTACGGTTTGGAGAAATGGAGCGCGACTGCCTTATCGCATACGGCGCATCGATGATGTTGAAGGACAGGCTCTTGGAGGAATCAGACAAGGCAGAGGTCAATGTCTGCGAGCGCTGTGGCCTGCTTGCATACTATGACGTCAAACAGCGCCGTTACGTCTGCCGCGTCTGTGGTGAAAAGGCCAAGATATCGTCAGTTGTGATAGCCTATGCGTTTAAGCTGTTATTGCAGGAAATGATGAGCCTAAATGTTGCTCCTAGGATGCTTGTGAAGGAGAAGGTCTAA
PROTEIN sequence
Length: 1117
MIENSIEMWPIINDILRREGVARQHLNSYNEFMERGLQSIIDEVGEIEIETAEYPYKIKLGKIKLQQPRIMELDGSITHVAPMEARLRNLTYASPVMLECSVVEDGKILESRFIHIGDMPVMVKSNTCILHNLPESKLVEFGEDPRDPGGYFIINGSERVIVGLEDLSYNKIIVDAEETTGTLLYKAKVYSSIVGYRAKLELVMRPDGSIVTKIPGSPVDIPLITLIRALGLESDKDIADSVSLNETIQDELEPSFEKAGDVNTSRDAIVYISKRIAPGMLEEFQIKRAETLLDWGLLPHIGKNPDNRHEKALFLGEAASKLIELKMGWIDSDDKDHYGNKVIKFAGQMLADLFRTAFRNLIRDMKYQLERSGQKRGINAVAAAVRPGIVSDKLNNAIATGNWGRGRVGVTQLLDRTNYMSTISHLRRIQSPLSRSQPNFEARDLHATHFGRICPAETPEGSNCGLVKNLALSAIISVNVQSAEVTEKLYELGVQNVEEADEDLRESGTRVFVDGRLIGYVEKGEHLADTLRSMRRSGKIHPHVGIYLYSSQNDSATKRLYISCNAGRVLRPLVVIRDNKPLVTYEVIEKVSKKFLSWNDLLYMGVIELTDANEEENCYVAIDPKKLEPKHTHLEIFPSAILGVGASIIPYPEHNQSPRNTYESAMAKQSLGFSTPLMNASTYVRQHFMLYPQTPVVSTKAINLLGLEDRPTGQNAVVAVLPFEGYNIEDAVVFNKSSVDRGLGRTFFYRIYEAEAKQYPGGMKDNFELPQADANIRGYRGEKAYRLLEQDGAIMHEAVVNGGDILIGRTSPPRFMEEYKEFEVKGPYRRDTSVGVRPSENGVVDTVIVTQSVEGGKMYKIRVRDMRIPEIGDKFASRHGQKGVIGMLVNQEDVPYTEDGVVPDIMINPHAFPSRMTVGQFMESLGGKAASLRGRIVDGSAFLGEKGDDIKSAMEEYGFKYTGKEVMYDGRTGRKFPADVYVGVVYYQKLHHMVADKIHSRARGQVQMLTKQPTEGRARGGGLRFGEMERDCLIAYGASMMLKDRLLEESDKAEVNVCERCGLLAYYDVKQRRYVCRVCGEKAKISSVVIAYAFKLLLQEMMSLNVAPRMLVKEKV*