ggKbase home page

sw_6_scaffold_1049_2

Organism: SW_6_Halobacteriales_65_23

near complete RP 33 / 55 MC: 3 BSCG 27 / 51 MC: 3 ASCG 38 / 38 MC: 2
Location: 649..3747

Top 3 Functional Annotations

Value Algorithm Source
DNA mismatch repair protein MutS n=1 Tax=Halosimplex carlsbadense 2-9-1 RepID=M0CKT0_9EURY similarity UNIREF
DB: UNIREF100
  • Identity: 52.8
  • Coverage: 974.0
  • Bit_score: 903
  • Evalue 1.60e-259
DNA mismatch repair protein MutS {ECO:0000256|HAMAP-Rule:MF_00096}; TaxID=797114 species="Archaea; Euryarchaeota; Halobacteria; Halobacteriales; Halobacteriaceae; Halosimplex.;" source="Halosimplex ca similarity UNIPROT
DB: UniProtKB
  • Identity: 52.8
  • Coverage: 974.0
  • Bit_score: 903
  • Evalue 2.20e-259
DNA mismatch repair protein MutS similarity KEGG
DB: KEGG
  • Identity: 51.6
  • Coverage: 967.0
  • Bit_score: 877
  • Evalue 3.40e-252

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Halosimplex carlsbadense → Halosimplex → Halobacteriales → Halobacteria → Euryarchaeota → Archaea

Sequences

DNA sequence
Length: 3099
ATGGAGGAGGGGGCGATCGGACTGCCGCCGGGGATAGCCGAGAAACGCGAGGCATTGACGCCGATGCTCGCCCAGTACGCCGACCTCTGTGCGGCCTACGACGGGGTACTCGTGCTCTTTCAAGTAGGCGACTTCTACGAGGCCTTCTGTGAGGCCGCCGACGCCATCGCCCGGGAACTGGAGATCACGCTGACGAAACGCGAGGATTCCACAGGTGAGTATCCGATGGCTGGTATCCCGATCGACAGCGCCGCCTCCTACATCGAACGGTTGTTGGACGCCGGCTACCGGATCGCGATCGCCGATCAGGTCCAAGACCCCGCCGACTCGACCGGCCCGGTCGACCGGGCGGTCACACAGGTCGTGACTCCCGGAACGGTGATCGACGACGAACTCCTGGAACCCGGATCGAGCAACTACGTCGCCTGCCTCGCGAGCGACGGGGCCGATCCCGGCGTCGTGGCGATCGGCAATGGCAACGGCGACGGTGGCGGCGACGAACCCGCCACGAGCGAGACGTCGTCAGGGCCCGGATCCGGATCGGAGCCCGAAATCGGGACGTACGCGGTCGCGTTCGCGGACGTCTCGACCGGCGAGTTCCTCGTGACGAGCGGCCCCCGAGCCACCGTCGCCGACGAACTCGATCGCTTCGACCCCGCCGAGGTACTCCTCGCCCCCGAGATCAGTTCCGAAGCGTTCGAACTGCGAGGAACTGTGACGCCGCCGCGCGAGGGGACCTTCGAGCACGAGGCCGCGACCGAGCGGATCGGGCAGTCGGGACGGTCGGCCAACGACCTCGAACGCGTGTCCGAAATCCGTGCCTGTGGGGCCCTGCTGGCGTACGTCGAGTACACGGAGGGCGACGCCGACAGCGAGGACGCATACGACGCGGGCGGCGACGCCGACGACACGAACACAGGACGAAACGAGAACGACGGGGACAAAAGCAACGACGAGAGCGGACGTGGAAACGAAAGCAGGCAGGGTGATGACAGCGAGAGTGCCGACCGGAGTCGCTTCGACGCGATCTCACGGTACGAACCCACCGAACACCTGCGCCTCGACGCGACGGCGCTTCGAAGCCTCGAAGTGTTCGCCAGTCACGACGACGGCGACGACAGCGACGATCACACGCTCTTTTCGGTACTCGACGACGCCTCCAGCGCGCTCGGTCGTCGGCGACTGGCCGGGTGGCTGCGCCGCCCGCTGTACGATACCGAAGCGATCGAGGCACGCCACGACGCCGTCTCGGAACTAGTGGACAGTTCGCTCGTCCGTGAGGAACTGCGAGAGCTCCTGCGAGACGTCTACGACCTCTCGCGGCTGTCGACGCGCGTTTCCCGAAAGCGGGCGAACGCACGGGACCTGCGCTCGCTCGCGGACTCCTTGGAGGTCGTTCCCCTCGTCCGCGAGGCGCTTTCGGGGGTCGAATCCGAACGGCTCGTTGCGCTACGAGAACGGCTCGACGATTGTGGCGACATCCGGGGGCTGATCGAGCGAGCGATCCGGCCGGACCCCCCGATCGAGCTCACCGAGGGCGGACTCATCGAACCCGACTTCGACGAGGGACTGTGCTCGCTCCGGGAGACCGAACAGGGCGGGAAGGAGTGGGTCGCGGAGCTCGAAGCCCGCGAGCGCCGAAAGACGGGCATCGACTCGCTGTCAGTAGGGTTCAACCAGGTTCACGGGTATTACATCGAGGTCACGAACCCGAACCTCGATAGCGTGCCCGACGAATACACCCGCCGACAGACGCTCAAGAACGCGGAACGGTTCTACACGCCCGACCTCAAGCGCCGCGAGGACGAGATCCTGAGCGCCGGCGAGCGCGCCGACGAGCGCGAGTACGAACTGTTCCGCGAGGTCCGGAGCCGAGTCGGTGGGGAAGCCGACCGTATCGATCGGGTCGGGGATGCCCTCGCCGATCTCGACGCCCTGTGTACCCTCGCGGCCGTCGCCGTAGAACGGGACTACGTCCGGCCCGCGGTCGGTACCGAGGACTTGGAGATCCGGGGCGGCCGACACCCGGTCGTCGAAGAAGCACAGGACTCGTTCGTCCCGAACGACCTTTCGTTACACGCCGGCGAGGTGGCGGTGATCACGGGACCGAACATGTCGGGCAAATCTACATACATGCGTCAGGCCGCGCTGTGTGTTCTGCTCGCACAGATCGGGAGCTTCGTCCCCGCACGGGAGGCTCAATTGCCGGTCGTCGACCGCGTGTTCACCCGTGTCGGAGCATCCGACGACATCGCCGGCGGCCAATCGACGTTCATGCGCGAGATGAGCGAACTGGCGACGATCCTGCGCGATGCCAGCGACCGCTCGCTCGTGGTTCTCGACGAGGTGGGGCGGGGAACCAGTACGGCCGACGGGCGCGCAATTGCCCGTGCGGCGGTCGAATTCCTCCACGACGCGATCGGTGCGCGTACCCTCTTTGCTACTCACTACCACGAGTTGACCGACCTCGCCGCGGGGCACGAACGGGTGTCTAACTACCACTTCGCCGCCGATCGGGAGGGCAAGGACGTGACCTTCCTCCACAGCGTCGGCGAGGGAGCCGCAGCGGCGTCGTATGGCGTCGACGTCGCTCGGTTGGCCGGTGTGCCCGAGGACGTGGTCGCCCGCTCGCGCGCCCTCGTCGGTGGGAACGGCGGGAACGACGGGAACGGCGGGAGCGACGGAACCGAGGGATCGGCGAACGCCGCCGGGAGGAGAGCCGATCCGGGCGCGAACGGTCACCGACCGAATAGGAGGCGATCGCCGATCGAAGGGACGACAACTGCCGAACTGACTGACGGGGACCCGACCGAACGGAAAGAGCGGCTGGATCGGAACGAACCGTCAGCGGACGAAACCGCCATACCGGCGCGGAACGGAACCACGCACGACCCATCCGAGATACGGGGGGCACACAACGGCAGGAAGAAGCGAGGAGGAACATCGAACGGTGGGGACGGAAGAACGAAGCGGGGAATCGAAACCACCCTACGAGACCTCGACATCGCGAACACGACGCCGGTCGAGGCGCTCTGTACGCTCAACGAACTCAAAGGCCGACTCGATGACGAACCCGACGAGCGACACCGAACCGACTGA
PROTEIN sequence
Length: 1033
MEEGAIGLPPGIAEKREALTPMLAQYADLCAAYDGVLVLFQVGDFYEAFCEAADAIARELEITLTKREDSTGEYPMAGIPIDSAASYIERLLDAGYRIAIADQVQDPADSTGPVDRAVTQVVTPGTVIDDELLEPGSSNYVACLASDGADPGVVAIGNGNGDGGGDEPATSETSSGPGSGSEPEIGTYAVAFADVSTGEFLVTSGPRATVADELDRFDPAEVLLAPEISSEAFELRGTVTPPREGTFEHEAATERIGQSGRSANDLERVSEIRACGALLAYVEYTEGDADSEDAYDAGGDADDTNTGRNENDGDKSNDESGRGNESRQGDDSESADRSRFDAISRYEPTEHLRLDATALRSLEVFASHDDGDDSDDHTLFSVLDDASSALGRRRLAGWLRRPLYDTEAIEARHDAVSELVDSSLVREELRELLRDVYDLSRLSTRVSRKRANARDLRSLADSLEVVPLVREALSGVESERLVALRERLDDCGDIRGLIERAIRPDPPIELTEGGLIEPDFDEGLCSLRETEQGGKEWVAELEARERRKTGIDSLSVGFNQVHGYYIEVTNPNLDSVPDEYTRRQTLKNAERFYTPDLKRREDEILSAGERADEREYELFREVRSRVGGEADRIDRVGDALADLDALCTLAAVAVERDYVRPAVGTEDLEIRGGRHPVVEEAQDSFVPNDLSLHAGEVAVITGPNMSGKSTYMRQAALCVLLAQIGSFVPAREAQLPVVDRVFTRVGASDDIAGGQSTFMREMSELATILRDASDRSLVVLDEVGRGTSTADGRAIARAAVEFLHDAIGARTLFATHYHELTDLAAGHERVSNYHFAADREGKDVTFLHSVGEGAAAASYGVDVARLAGVPEDVVARSRALVGGNGGNDGNGGSDGTEGSANAAGRRADPGANGHRPNRRRSPIEGTTTAELTDGDPTERKERLDRNEPSADETAIPARNGTTHDPSEIRGAHNGRKKRGGTSNGGDGRTKRGIETTLRDLDIANTTPVEALCTLNELKGRLDDEPDERHRTD*