qh_7_scaffold_565_8

Organism: QH_7_Halobacteriales_64_30

near complete | RP 34 / 55 MC: 4 | BSCG 19 / 51 | ASCG 38 / 38 MC: 1
Location: comp(6659..9766)

Top 3 Functional Annotations

Value | Algorithm | Source

DNA mismatch repair protein MutS n=1 Tax=Halosimplex carlsbadense 2-9-1 RepID=M0CKT0_9EURY | similarity | UNIREF
DB: UNIREF100
  • Identity: 52.8
  • Coverage: 964.0
  • Bit_score: 908
  • Evalue: 8.40e-261

DNA mismatch repair protein MutS {ECO:0000256|HAMAP-Rule:MF_00096}; TaxID=797114 species="Archaea; Euryarchaeota; Halobacteria; Halobacteriales; Halobacteriaceae; Halosimplex.;" source="Halosimplex ca | similarity | UNIPROT
DB: UniProtKB
  • Identity: 52.8
  • Coverage: 964.0
  • Bit_score: 908
  • Evalue: 1.20e-260

DNA mismatch repair protein mutS | similarity | KEGG
DB: KEGG
  • Identity: 52.3
  • Coverage: 984.0
  • Bit_score: 894
  • Evalue: 3.60e-257

Taxonomy

Halosimplex carlsbadense → Halosimplex → Halobacteriales → Halobacteria → Euryarchaeota → Archaea

Sequences

DNA sequence
Length: 3108
ATGGAGGAGGGGGCGATCGGACTGCCGCCGGGGATAGCCGAGAAACGCGAGGCATTGACGCCGATGCTCGCCCAGTACGCCGACCTCTGTGCGGCCTACGACGGGGTACTCGTGCTCTTTCAAGTAGGCGACTTCTACGAGGCCTTCTGTGAGGCCGCCGACGCCATCGCCCGGGAACTGGAGATCACGCTGACGAAACGCGAGGATTCCACAGGTGAGTATCCGATGGCTGGTATCCCGATCGACAGCGCCGCCTCCTACATCGAACGGTTGTTGGACGCCGGCTACCGGATCGCGATCGCCGATCAGGTCCAAGACCCCGCCGACTCGACCGGCCCGGTCGACCGGGCGGTCACACAGGTCGTGACTCCCGGAACGGTGATCGACGACGAACTCCTGGAACCCGGATCGAGCAACTACGTCGCCTGCCTCGCGAGCGACGGGGCCGATCCCGGCGTCGTGGCGATCGGCAATGGCAACGGCGACGGTGGCGGCGACGAACCCGCCACGAGCGAGACGTCGTCAGGGCCCGGATCCGGATCGGAGCCCGAAATCGGGACGTACGCGGTCGCGTTCGCGGACGTCTCGACCGGCGAGTTCCTCGTGACGAGCGGGCCCCGAGCCACCGTCGCCGACGAACTCGATCGCTTCGACCCCGCCGAGGTACTCCTCGCCCCCGAGATCAGTTCCGAAGCGTTCGAACTGCGAGGAACTGTGACGCCGCCGCGCGAGGGGACCTTCGAGCACGAGGCCGCGACCGAGCGGATCGGGCAGTCGGGACGGTCGGCCAACGACCTCGAACGCGTGTCCGAAATCCGTGCCTGTGGGGCCCTGCTGGCGTACGTCGAGTACACGGAGGGCGACGCCGACAGCGAGGACGCATACGACGCGGGCGGCGACGCCGACGACACGAACACAGGACGAAACGAGAACGACGGGGACAAAAGCAACGACGAGAGCGGACATGGAAACGAAAGCAGGCAGGGTGATGACAGCGAGAGTGCCGACCGGAGTCGCTTCGACGCGATCTCACGGTACGAACCCACCGAACACCTGCGCCTCGACGCGACGGCGCTTCGAAGCCTCGAAGTGTTCGCCAGTCACGACGACGGCGACGACAGCGACGATCACACGCTCTTTTCGGTACTCGACGACGCCTCCAGCGCGCTCGGTCGTCGGCGACTGGCCGGGTGGCTGCGCCGCCCGCTGTACGATACCGAAGCGATCGAGGCACGCCACGACGCCGTCTCGGAACTAGTGGACAGTTCGCTCGTCCGTGAGGAACTGCGAGAGCTCCTGCGAGACGTCTACGACCTCTCGCGGCTGTCGACGCGCGTTTCCCGAAAGCGGGCGAACGCACGGGACCTGCGCTCGCTCGCGGACTCCTTGGAGGTCGTTCCCCTCGTCCGCGAGGCGCTCTCGGGGGTCGAATCCGAACGGCTCGTTGCGCTACGAGAACGGCTCGACGATTGTGGCGACATCCGGGGGCTGATCGAGCGAGCGATCCGGCCGGACCCCCCGATCGAGCTCACCGAGGGCGGACTCATCGAACCCGACTTCGACGAGGGACTGTGCTCGCTCCGGGAGACCGAACAGGGCGGGAAGGAGTGGGTCGCGGAGCTCGAAGCCCGCGAGCGCCGAAAGACGGGCATCGACTCGCTGTCAGTAGGGTTCAACCAGGTTCACGGGTATTACATCGAGGTCACGAACCCGAACCTCGATAGCGTGCCCGACGAATACACCCGCCGACAGACGCTCAAGAACACGGAACGGTTCTACACGCCCGACCTCAAGCGCCGCGAGGACGAGATCCTGAGCGCCGGCGAGCGTGCCGACGAGCGCGAGTACGAACTGTTCCGCGAGGTCCGGAGTCAGGTCGGGGCCGAAGCCGACCGTATCGATCGGGTCGGGGATGCCCTCGCCGATCTCGACGCCCTGTGTACCCTCGCGGCCGTCGCCGTAGAACGGGACTACGTCCGGCCCGCGGTCGGTACCGAGGA
CTTGGAGATCCGGGGCGGCCGACACCCGGTCGTCGAAGAAGCACAGGACTCGTTCGTCCCGAACGACCTTTCGTTACACGCCGGCGAGGTGGCGGTGATCACGGGACCGAACATGTCGGGCAAATCTACATACATGCGTCAGGCCGCGCTGTGTGTTCTGCTCGCACAGATCGGGAGCTTCGTGCCCGCACGGGAGGCTCGATTGCCGGTCGTCGACCGCGTGTTCACCCGTGTCGGAGCATCCGACGACATCGCCGGCGGCCAATCGACGTTCATGCGCGAGATGAGCGAACTGGCGACGATCCTGCGCGATGCCAGCGACCGCTCGCTCGTGGTTCTCGACGAGGTGGGGCGGGGAACCAGTACGGCCGACGGGCGCGCAATTGCCCGTGCGGCGGTCGAATTCCTCCACGACGCGATCGGTGCGCGTACCCTCTTTGCCACTCACTACCACGAGCTGACCGACCTCGCCGCGGGGCACGAACGGGTGTCTAACTACCACTTCGCCGCCGATCGGGAGGGCAGGGACGTGACCTTCCTCCACAGCGTCGACGAGGGAGCCGCGGCGGCGTCGTACGGCGTCGACGTCGCTCGGTTGGCCGGTGTGCCCGAGGACGTGGTCGCCCGCTCGCGCGCCCTCGTCGGTGGGAACGGCGGGAACGACGGGAACGACGGGAACGGCGGGAGCGACGGAACCGATGGATCGGCGAACGCCGCCGGGAGGAGAGCCGATCCGGGAGCGAACGGTCACCGACCGAATAGGAGGCGATCGCCGATCGAAGGGACGACAACTGCCGAACTGACTGACGGGGACCCGACCGAACGGAAAGAGCGGCTGGATCGGAACGAACCGTCAGCGGACGAAACCGCCATACCGGCGCGGAACGGAACCACGTACGACGCATCCGAGATACAGGGGACACACAACGGCAGGAAGAAGCGAGGAGGAACATCGAACGGTGGGGACGGAAGAACGGAGCGGGGAATCGAAACCACCCTCCGAGACCTCGACATCGCGAACACGACGCCGGTCGAGGCGCTCTGTACGCTCAACGAACTCAAAGGCCGACTCGATGACGAACCCGACGAGCGACACCGAACCGACTGA
Protein sequence
Length: 1036
MEEGAIGLPPGIAEKREALTPMLAQYADLCAAYDGVLVLFQVGDFYEAFCEAADAIARELEITLTKREDSTGEYPMAGIPIDSAASYIERLLDAGYRIAIADQVQDPADSTGPVDRAVTQVVTPGTVIDDELLEPGSSNYVACLASDGADPGVVAIGNGNGDGGGDEPATSETSSGPGSGSEPEIGTYAVAFADVSTGEFLVTSGPRATVADELDRFDPAEVLLAPEISSEAFELRGTVTPPREGTFEHEAATERIGQSGRSANDLERVSEIRACGALLAYVEYTEGDADSEDAYDAGGDADDTNTGRNENDGDKSNDESGHGNESRQGDDSESADRSRFDAISRYEPTEHLRLDATALRSLEVFASHDDGDDSDDHTLFSVLDDASSALGRRRLAGWLRRPLYDTEAIEARHDAVSELVDSSLVREELRELLRDVYDLSRLSTRVSRKRANARDLRSLADSLEVVPLVREALSGVESERLVALRERLDDCGDIRGLIERAIRPDPPIELTEGGLIEPDFDEGLCSLRETEQGGKEWVAELEARERRKTGIDSLSVGFNQVHGYYIEVTNPNLDSVPDEYTRRQTLKNTERFYTPDLKRREDEILSAGERADEREYELFREVRSQVGAEADRIDRVGDALADLDALCTLAAVAVERDYVRPAVGTEDLEIRGGRHPVVEEAQDSFVPNDLSLHAGEVAVITGPNMSGKSTYMRQAALCVLLAQIGSFVPAREARLPVVDRVFTRVGASDDIAGGQSTFMREMSELATILRDASDRSLVVLDEVGRGTSTADGRAIARAAVEFLHDAIGARTLFATHYHELTDLAAGHERVSNYHFAADREGRDVTFLHSVDEGAAAASYGVDVARLAGVPEDVVARSRALVGGNGGNDGNDGNGGSDGTDGSANAAGRRADPGANGHRPNRRRSPIEGTTTAELTDGDPTERKERLDRNEPSADETAIPARNGTTYDASEIQGTHNGRKKRGGTSNGGDGRTERGIETTLRDLDIANTTPVEALCTLNELKGRLDDEPDERHRTD*