ggKbase home page

13_1_40cm_3_scaffold_811_1

Organism: 13_1_40CM_3_Thaumarchaeota_50_5

near complete RP 30 / 55 MC: 2 BSCG 17 / 51 MC: 1 ASCG 35 / 38 MC: 3
Location: comp(1..3045)

Top 3 Functional Annotations

Value Algorithm Source
DNA-directed RNA polymerase {ECO:0000256|RuleBase:RU004279}; EC=2.7.7.6 {ECO:0000256|RuleBase:RU004279};; Flags: Fragment;; TaxID=497727 species="Archaea; Thaumarchaeota; Nitrososphaeria; Nitrososphae similarity UNIPROT
DB: UniProtKB
  • Identity: 97.1
  • Coverage: 999.99
  • Bit_score: 1971
  • Evalue 0.0
rpoA; DNA-directed RNA polymerase subunit A (EC:2.7.7.6) similarity KEGG
DB: KEGG
  • Identity: 97.1
  • Coverage: 999.99
  • Bit_score: 1971
  • Evalue 0.0
DNA-directed RNA polymerase n=2 Tax=Candidatus Nitrososphaera gargensis RepID=K0IC33_NITGG similarity UNIREF
DB: UNIREF100
  • Identity: 97.1
  • Coverage: 999.99
  • Bit_score: 1971
  • Evalue 0.0
  • rbh

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Candidatus Nitrososphaera gargensis → Nitrososphaera → Nitrososphaerales → Nitrososphaeria → Thaumarchaeota → Archaea

Sequences

DNA sequence
Length: 3045
ATGATGGAAGAATCTGCAAAAATATTGGGCGGAATAAAGTTCAGCGTCTGGTCGCCAACCGAAGTGCGCAAGTTCTCTGTAGCCGAGATCACGGCGCCCGAGACGTACGACGAAGACGGTATGCCGGTGCAGGGCGGCTTGATGGATAACAGGCTCGGGACTCTCGAGCCGGGCCAAAAGTGTGCCACATGTGGAAATACTTCTGCAAAATGCCCCGGACACTTTGGTCACATCGAGCTTGCTGAACCGGTGCTTCACATCGCTTTCGTTGACGATATACACAAGCTTTTGTTGATATCATGCAGGTCGTGCAACAGGATAAAGCTCGAACCGGAAGAGCTCGCTCACTACAAGTCAATCCGCGACGCGAAGGCCGCTTATGCGGTCATTACCCTTGAAAACATCAAGGACGAAATAATCGAGCGGTCGAAGAAGGTAAAGCTCTGCCCGCACTGCGGCAAAGACCAGTATGACTTGATATTCACAAAGCCGACGATATTTGTAGAAAAGACGGACGCCGGAGAAAACCGGCTATTGCCAATTACGATACGCGAAAGGCTCAGCCACATCCCAGACGACGACCTAACTCTTCTGGGTTACGATTACAAGACGGCAAGGCCAGAATGGTTCGTGCTGCAGGTGCTGCCCGTGCCGCCGGTGACAGTCAGACCGTCAATCATACTTGAAACAGGAATTAGGTCGGAGGACGACCTGACACACAAGCTGGTGGACATCATCAGGGTCAACCAACGCCTCAAGGAGAGCAAGGAGGCAGGAACCCCGCCGCTCATCGTGCAGGACCTAGTCGACCTGCTACAGTACCATGTCACGACGTACTTTGATAACGAAGTGTCAGGCATTCCCCAGGCCCACCACAGATCGGGCAGGCCGCTCAAGACCCTGACGCAGCGGCTCAAGGGTAAGGAGGGAAGGTTCAGAGGCTCACTTTCGGGCAAGCGCGTCGACTTTTCCAGCAGGACAGTAATTTCGCCGGACCCCAACCTTACGATTGCAGATGTTGGCGTACCAACCGACGTTGCCAAGAAACTTACAATCCCAGAAACCGTGTCGCAGTGGAACCTAGAGCGGCTCAAGGAGCTGGTGATGAACGGTCCGAACATGTACCCTGGCGCAAACTACATCATCAGGCCAGACGGCGTCAAGATAAGGCTTGACTACGTGACAGACAGGAAGGCAATAGCCGATTCGCTTGCTTCAGCCTATATCGTCGAACGCCACTTGGCAGACGGCGACATCGTAATTTTCAACAGACAGCCGTCGCTCCATAGGATGTCAATCATGGCGCACAGCGTTAGGGTGCTTCCGTACAGGACGTTCCGCTTGCACCCGGCAGTATGCCCGCCCTACAACGCTGACTTTGACGGGGACGAGATGAATCTCCACGTGCCGCAGAGCGAAGAAGCAAGGGCAGAAGCTACCCTTCTTATGAGGGTGCAGGACCAGCTGATATCGCCAAGGTACGGGGGACCTATCATAGGCGGCATCAGGGACTTTATCACAGGTGCCTTCATGTTGACACGCGATGGAACCACGCTTACAAAGGATGAGTTTGCAAACCTTGCAATGACGGGCGGCTACGAAGGTCCGCTGCCAGAGCCGGCGGTAACGAAGGACGGCCAAAAGCTGTACGCAGGCAGGCAACTATTCTCGCTCTTCCTTCCAAAAGATTTTAACTTTATTATCACGTCGAAATGGAACAAAGCCGCAAAAGGTGAGGGTAAGGACGTCGTTATAAAGAACGGCGAGCTAATGAGTGGAGTAATAGATAAAGCGTCGATTGGCGCTGAAGAGCCGGACAGTGTGCTCCACAGGATAGCCAAGGACTACGGCACGGACGAAGCAAGAAAGTTCCTCGACTCGATCCTCACAACGCTTAAGACATATATCACTCACAGGGGATTCACCTACGGCTACTCTGACCTATGGCTCTCGCCGGAAACAAGACAGGAAATAAGCGACATCATCCAAAAGACCTACGAAAAAGTCTACGAGCTAATACAGCAGTACAATGACGGCACGCTTCCGCTGACAAGGGGTCTCGCTGCAGAAGAAGCACTTGAATTGTATGTAGTCAATGAATTGTCGCGTGCCCGTGACAGGGCCGGCAGGACAGCCGACAGGGCGTTCCCGGACGAGAACTCAGGGGTGATAATGGCATCGACGGGCGCAAGAGGTTCGACCCTCAACATCGGCCAAATGACTGCGGCTCTTGGTCAACAGTCGATAAGGGGAAAGAGGATCCAAAAGGGCTATCACAACAGGGCGCTATCGCACTTTAAGCCAAAAGACGCGAACCCGGACGCTAAAGGTTTTGTCAAGTCAAACTATCGAGACGGCCTCTCGCCGCTTGAATTCTTCTTCCACGCAATGGGCGGAAGGGAAGGCCTTGTTGACACTGCAGTCAGGACGCAGCAGTCTGGCTATATGCAGAGGAGGCTTATCAACGCGCTGGAGCACCTGAAGATCGAGTACGACCAGACGGTGCGCGACCCTCACGGAAACATCGTGCAGTATCTCTACGGCGAGGATGGCATTGATACTGCCAAGAGCGACCACGGAGAAGCAGTCAACATTTCGAGGCTCATTGAAGCTGAATCTGTAGTCGACGAGGGCAGGAAGGCAACAGAGGACGTTATCAAGGGAATCATTGGCAAGTACGCCGAGAACCTCAACCCAAGGATGAAGACCAACCTGGAAAAGACACTTCTTGAAAACAGGCTCAGCAAGGAAGGCGTTGAAAAAGTCATGAAGAAAGTGCTCGACCTGATTGATAGGGCACTGGCAGAGCCGGGAGAAGCAGTCGGCGTCGTCACCGCGCAATCGATTGGGGAGCCAGGAACTCAGATGACCCTCAGGACGTTCCACTTTGCAGGCGTAAAAGAGAGGAACGTGACGCTCGGCCTACCGAGGCTCATCGAGCTTGTGGACGCGCGCAAAAAGCCAGTGACGCCCACGATGGACATCTATCTCGACGAGGAACACAAGGTGTCGAGAGAAAAAGCGTTGGAAGTCGCAAGGGAGATA
PROTEIN sequence
Length: 1015
MMEESAKILGGIKFSVWSPTEVRKFSVAEITAPETYDEDGMPVQGGLMDNRLGTLEPGQKCATCGNTSAKCPGHFGHIELAEPVLHIAFVDDIHKLLLISCRSCNRIKLEPEELAHYKSIRDAKAAYAVITLENIKDEIIERSKKVKLCPHCGKDQYDLIFTKPTIFVEKTDAGENRLLPITIRERLSHIPDDDLTLLGYDYKTARPEWFVLQVLPVPPVTVRPSIILETGIRSEDDLTHKLVDIIRVNQRLKESKEAGTPPLIVQDLVDLLQYHVTTYFDNEVSGIPQAHHRSGRPLKTLTQRLKGKEGRFRGSLSGKRVDFSSRTVISPDPNLTIADVGVPTDVAKKLTIPETVSQWNLERLKELVMNGPNMYPGANYIIRPDGVKIRLDYVTDRKAIADSLASAYIVERHLADGDIVIFNRQPSLHRMSIMAHSVRVLPYRTFRLHPAVCPPYNADFDGDEMNLHVPQSEEARAEATLLMRVQDQLISPRYGGPIIGGIRDFITGAFMLTRDGTTLTKDEFANLAMTGGYEGPLPEPAVTKDGQKLYAGRQLFSLFLPKDFNFIITSKWNKAAKGEGKDVVIKNGELMSGVIDKASIGAEEPDSVLHRIAKDYGTDEARKFLDSILTTLKTYITHRGFTYGYSDLWLSPETRQEISDIIQKTYEKVYELIQQYNDGTLPLTRGLAAEEALELYVVNELSRARDRAGRTADRAFPDENSGVIMASTGARGSTLNIGQMTAALGQQSIRGKRIQKGYHNRALSHFKPKDANPDAKGFVKSNYRDGLSPLEFFFHAMGGREGLVDTAVRTQQSGYMQRRLINALEHLKIEYDQTVRDPHGNIVQYLYGEDGIDTAKSDHGEAVNISRLIEAESVVDEGRKATEDVIKGIIGKYAENLNPRMKTNLEKTLLENRLSKEGVEKVMKKVLDLIDRALAEPGEAVGVVTAQSIGEPGTQMTLRTFHFAGVKERNVTLGLPRLIELVDARKKPVTPTMDIYLDEEHKVSREKALEVAREI