qh_2_scaffold_869_3

Organism: QH_2_Halobacteriales_65_23

near complete
  • RP: 33 / 55 (MC: 3)
  • BSCG: 29 / 51 (MC: 1)
  • ASCG: 38 / 38 (MC: 1)
Location: 2243..5332

Top 3 Functional Annotations

DNA mismatch repair protein MutS n=1 Tax=Halosimplex carlsbadense 2-9-1 RepID=M0CKT0_9EURY
  • Algorithm: similarity
  • Source: UNIREF
  • DB: UNIREF100
  • Identity: 53.1
  • Coverage: 964.0
  • Bit_score: 906
  • Evalue: 1.90e-260

DNA mismatch repair protein MutS {ECO:0000256|HAMAP-Rule:MF_00096}; TaxID=797114 species="Archaea; Euryarchaeota; Halobacteria; Halobacteriales; Halobacteriaceae; Halosimplex.;" source="Halosimplex ca
  • Algorithm: similarity
  • Source: UNIPROT
  • DB: UniProtKB
  • Identity: 53.1
  • Coverage: 964.0
  • Bit_score: 906
  • Evalue: 2.60e-260

DNA mismatch repair protein MutS
  • Algorithm: similarity
  • Source: KEGG
  • DB: KEGG
  • Identity: 52.1
  • Coverage: 967.0
  • Bit_score: 880
  • Evalue: 5.30e-253

Taxonomy

Halosimplex carlsbadense → Halosimplex → Halobacteriales → Halobacteria → Euryarchaeota → Archaea

Sequences

DNA sequence
Length: 3090
ATGGAGGAGGGGGCGATCGGACTGCCGCCGGGGATAGCCGAGAAACGCGAGGCGTTGACGCCGATGCTCGCCCAGTACGCCGACCTCTGTGCGGCCTACGACGGGGTACTCGTGCTCTTTCAAGTAGGCGACTTCTACGAGGCCTTCTGTGAGGCCGCCGACGCCATCGCCCGGGAACTGGAGATCACGCTGACGAAACGCGAGGATTCCACAGGTGAGTATCCGATGGCTGGTATCCCGATCGACAGCGCCGCCTCCTACATCGAACGGTTGTTGGACGCCGGCTACCGGATCGCGATCGCCGATCAGGTCCAAGACCCCGCCGACTCGACCGGCCCGGTCGACCGGGCGGTCACACAGGTCGTGACTCCCGGAACGGTGATCGACGACGAACTCCTGGAACCCGGATCGAGCAACTACGTCGCCTGCCTCGCGAGCGACGGGGCCGATCCCGGCGTCGTGGCGATCGGCAATGGCAACGGCGACGGTGGCGGCGACGAACCCGCCACGAGCGAGACGTCGTCAGGGCCCGGATCCGGATCGGAGCCCGAAATCGGGACGTACGCGGTCGCGTTCGCGGACGTCTCGACCGGCGAGTTCCTCGTGACGAGCGGGCCCCGAGCCACCGTCGCCGACGAACTCGATCGCTTCGACCCCGCCGAGGTACTCCTCGCCCCCGAGATCAGTTCCGAAGCGTTCGAACTGCGAGGAACTGTGACGCCGCCGCGCGAGGGGACCTTCGAGCACGAGGCCGCGACCGAGCGGATCGAGCAGTCGGGACGGTCGGCCAACGACCTCGAACGCGTGTCCGAAATCCGTGCCTGTGGGGCCCTGCTGGCGTACGTCGAGTACACGGAGGGCGACGCCGACAGCGAGGACGCATACGACGCGGGCGGCGACGCCGACGACACGAACACAGGACGAAACGAGAACGACGGGGACAAAAGCAACGACGAGAGCGGACGTGGAAACGAAAGCAGGCAGGGTGATGACAGCGAGAGTGCCGACCGGAGTCGCTTCGACGCGATCTCACGGTACGAACCCACCGAACACCTGCGCCTCGACGCGACGGCGCTTCGAAGCCTCGAAGTGTTCGCCAGTCACGACGACGGCGACGACAGCGACGATCACACGCTCTTTTCGGTACTCGACGACGCCTCCAGCGCGCTCGGTCGTCGGCGACTGGCCGGGTGGCTGCGCCGCCCGCTGTACGATGCCGAAGCGATCGAGGCACGCCACGACGCCGTCTCGGAACTAGTGGACAGTTCGCTCGTCCGTGAGGAACTGCGAGAGCTCCTGCGAGACGTCTACGACCTCTCGCGGCTGTCGACGCGCGTTTCCCGAAAGCGGGCGAACGCACGGGACCTGCGCTCGCTCGCGGACTCCTTGGAGGTCGTTCCCCTCGTCCGCGAGGCGCTCTCGGGGGTCGAATCCGAACGGCTCGTTGCGCTACGAGAACGGCTCGACGATTGTGGCGACATCCGGGGGCTGATCGAGCGAGCGATCCGGCCGGACCCCCCGATCGAGCTCACCGAGGGCGGACTCATCGAACCCGACTTCGACGAGGGACTGTGCTCGCTCCGGGAGACCGAACAGGGCGGGAAGGAGTGGGTCGCGGAGCTCGAAGCCCGCGAGCGCCGAAAGACGGGCATCGACTCGCTGTCAGTAGGGTTCAATCAGGTTCACGGGTATTACATCGAGGTCACGAACCCGAACCTCGATAGCGTGCCCGACGAATACACCCGCCGACAGACGCTCAAGAACGCGGAACGGTTCTACACGCCCGACCTCAAGCGCCGCGAGGACGAGATCCTGAGCGCCGGCGAGCGTGCCGATGAGCGCGAGTACGAACTGTTCCGCGAGGTCCGGAGCCGAGTCGGCGGGGAAGCCGACCGTATCGATCGGGTCGGGGATGCCCTCGCCGATCTCGACGCCCTGTGTACCCTCGCGGCCGTCGCCGTAGAACGGGACTACGTCCGGCCCGCGGTCGGTACCGAGGA
CTTGGAGATCCGGGGCGGCCGACACCCGGTCGTCGAAGAAGCACAGGACTCGTTCGTCCCGAACGACCTTTCGTTACACGCCGGCGAGGTGGCGGTGATCACGGGACCGAACATGTCGGGCAAATCCACGTACATGCGTCAGGCCGCGCTGTGTGTTCTGCTCGCACAGATCGGGAGCTTCGTCCCCGCACGGGAGGCTCGATTGCCGGTCGTCGACCGCGTGTTCACCCGTGTCGGAGCATCCGACGACATCGCCGGCGGCCAATCGACGTTCATGCGCGAGATGAGCGAACTGGCGACGATCCTGCGCGATGCCAGCGACCGCTCGCTCGTGGTTCTCGACGAGGTGGGGCGGGGAACCAGTACGGCCGACGGGCGCGCAATTGCCCGTGCGGCGGTCGAATTCCTCCACGACGCGATCGGTGCGCGTACCCTCTTTGCCACTCACTACCACGAGCTGACCGACCTCGCCGCGGGGCACGAACGGGTGTCTAACTACCACTTCGCCGCCGATCGGGAGGGCAAGGACGTGACCTTCCTCCACAGCGTCGGCGAGGGAGCCGCGGCGGCGTCGTACGGCGTCGACGTCGCTCGGTTGGCCGGTGTGCCCGAGGACGTGGTCGCCCGCTCGCGCTCCCTCGTCGGTGGGAGCGACGGGGACGGCGGGAGCGACGGAACCGATGGATCGGCGAACGCCGCCGGGAGGAGAGCCGATCCGGGAGCGAACGGTCACCGACCGAATAGGAGGCGATCGCCGATCGAAGGGACGACAACTGCCGAACTGACTGACGGGAACCCGACCGAACGGAAAGAGCGGCTGGATCGGAACGAACCGTCAGCGGACGAAACCGCCATACCGGCGCGGAACGGAACCACGCACGACCCATCCGAGATACGGGGGGCACACAACGGCAGGAAGAAGCGAGGAGGAACATCGAACGGTGGGGACGGAAGAACGGAGCGGGGAATCGAAACCACCCTCCGAGACCTCGACATCGCGAACACGACGCCGGTCGAGGCGCTCTGTACGCTCAACGAACTCAAAGGCCGACTCGATGACGAACCCGACGAGCGACACCGAACCGACTGA
Protein sequence
Length: 1030
MEEGAIGLPPGIAEKREALTPMLAQYADLCAAYDGVLVLFQVGDFYEAFCEAADAIARELEITLTKREDSTGEYPMAGIPIDSAASYIERLLDAGYRIAIADQVQDPADSTGPVDRAVTQVVTPGTVIDDELLEPGSSNYVACLASDGADPGVVAIGNGNGDGGGDEPATSETSSGPGSGSEPEIGTYAVAFADVSTGEFLVTSGPRATVADELDRFDPAEVLLAPEISSEAFELRGTVTPPREGTFEHEAATERIEQSGRSANDLERVSEIRACGALLAYVEYTEGDADSEDAYDAGGDADDTNTGRNENDGDKSNDESGRGNESRQGDDSESADRSRFDAISRYEPTEHLRLDATALRSLEVFASHDDGDDSDDHTLFSVLDDASSALGRRRLAGWLRRPLYDAEAIEARHDAVSELVDSSLVREELRELLRDVYDLSRLSTRVSRKRANARDLRSLADSLEVVPLVREALSGVESERLVALRERLDDCGDIRGLIERAIRPDPPIELTEGGLIEPDFDEGLCSLRETEQGGKEWVAELEARERRKTGIDSLSVGFNQVHGYYIEVTNPNLDSVPDEYTRRQTLKNAERFYTPDLKRREDEILSAGERADEREYELFREVRSRVGGEADRIDRVGDALADLDALCTLAAVAVERDYVRPAVGTEDLEIRGGRHPVVEEAQDSFVPNDLSLHAGEVAVITGPNMSGKSTYMRQAALCVLLAQIGSFVPAREARLPVVDRVFTRVGASDDIAGGQSTFMREMSELATILRDASDRSLVVLDEVGRGTSTADGRAIARAAVEFLHDAIGARTLFATHYHELTDLAAGHERVSNYHFAADREGKDVTFLHSVGEGAAAASYGVDVARLAGVPEDVVARSRSLVGGSDGDGGSDGTDGSANAAGRRADPGANGHRPNRRRSPIEGTTTAELTDGNPTERKERLDRNEPSADETAIPARNGTTHDPSEIRGAHNGRKKRGGTSNGGDGRTERGIETTLRDLDIANTTPVEALCTLNELKGRLDDEPDERHRTD*
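
The coordinates and lengths in this record can be cross-checked with a short script. The following is a minimal sketch and not part of the ggKbase record. It assumes the Location coordinates are 1-based and inclusive, and that the protein was translated with the standard codon assignments (the same assignments used by NCBI translation table 11), with `*` marking the stop codon. The names `span_length` and `translate` are hypothetical helpers introduced here for illustration.

```python
# Standard codon table, built from the conventional TCAG-ordered layout.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    # Remove whitespace so a sequence wrapped over several lines still works.
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


if __name__ == "__main__":
    # Location 2243..5332 from this record.
    print(span_length(2243, 5332))                 # 3090
    # First eight codons of the DNA sequence above.
    print(translate("ATGGAGGAGGGGGCGATCGGACTG"))   # MEEGAIGL
    # Last four codons, ending in the TGA stop codon.
    print(translate("CGAACCGACTGA"))               # RTD*
```

Under these assumptions, the span 2243..5332 covers 3090 bases, which is 1030 codons. That matches the listed protein length of 1030 if the terminal `*` is counted as a position.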