ggKbase home page

qs_6_scaffold_1032_5

Organism: QS_6_UNK

megabin RP 52 / 55 MC: 48 BSCG 49 / 51 MC: 46 ASCG 38 / 38 MC: 38
Location: comp(2449..5481)

Top 3 Functional Annotations

Value Algorithm Source
Putative type IV restriction endonuclease n=1 Tax=Haloarcula amylolytica JCM 13557 RepID=M0K8L0_9EURY similarity UNIREF
DB: UNIREF100
  • Identity: 82.6
  • Coverage: 999.99
  • Bit_score: 1743
  • Evalue 0.0
Putative type IV restriction endonuclease {ECO:0000313|EMBL:EMA17158.1}; TaxID=1227452 species="Archaea; Euryarchaeota; Halobacteria; Halobacteriales; Halobacteriaceae; Haloarcula.;" source="Haloarcul similarity UNIPROT
DB: UniProtKB
  • Identity: 82.6
  • Coverage: 999.99
  • Bit_score: 1743
  • Evalue 0.0
type IV restriction endonuclease similarity KEGG
DB: KEGG
  • Identity: 31.8
  • Coverage: 787.0
  • Bit_score: 339
  • Evalue 4.30e-90

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Haloarcula amylolytica → Haloarcula → Halobacteriales → Halobacteria → Euryarchaeota → Archaea

Sequences

DNA sequence
Length: 3033
ATGAGCGTTCAAGAAACCGTAGATATGGATAAGGACGCCATTCAGGACTTGGTAGATTCCTACCACTCTCATTCGCCCCGCGAGCGGAAGCAGATGAAAGAAGCTGCCGTCCGCCAGCAATTCATCAATCCTCTTCTTCGAGCGCTTGGATGGGATACGACGACCGATCAAGTGAAGCCCGAACAGCGGACGCTTGTCGGGGACGCAGACTACGCATTGAGCCTGAATGGCCGCGAACAGTTTTTCATCGAAGCCAAGGCATTTTCTAAAGATCTTGGCGGAAGCCGTCGGGTCAGTAATGACGAAACGCAATCCTACATCGAACAAGCAATTGACTATGCGTGGCATCAAGGATGCGATTGGGCCGTCCTGACTAATTTCGAGGAACTTCGACTTTATTTCACGCACGTCAGCAGGGACAACCTCGAAAATGGACTGGTCTTCACGCTCACAGTAGACGAATATGCATCTGAAGATGGATTCGAACAGCTGGCGAATCTCTCGAAAGCCGCCGTTGCCGACGGGTCACTCGAACGGCTGGAACGAGCGCGTGAGCGCGATACCGTCACTGAAGAGATTCTGAATGTCCTATCTGAAGCGCGGCGTCGATTGACCCAGGACGTTCACGACTCTCACCCTGACCTGTCGATGGACGACCTTCGAGACGGTGTACAGCGTATTTTAGACCGGGTAGTGGTCATGCGGGTCGCGGAAGACCGTGGCGTCATTCCGGCAGATACGTTGCTGAACATGGCCGAGTCGTGGGAACAGACCACGATTAATCCGGACGTGCGGACACTGGTTCGTGATCTAAAAAACGCGTTCCGTGACTTTGATTCGGTCTATAATTCGGAGTTGTTCGCGGAACACCCGTGCGAGGACTACGAAATATCGAACGACGTACTGCTGGACATTATCGACTCGCTGTATGATTACAACTTCTCGTATATCGACGCCGACGTACTGGGGAATATCTACGAGGACTACCTCGGCCACGCTATTGAGGACAAATCCGAGGACTTGGAACTGGTCGAACACCCGGACGAGCGACGTGAAGAGGGAATCTACTATACGCCGGTCCCGGTGGTCGAATACATCGTTGAGAGCGTATTAGGCGACCGGATTGACGCTATCATGGCGAACGTGCGGAAAGAGTTAGAAGGCGATGAGCCGGATTTTGAAGCCGCTCGTGCTGAGTTCAACGCTATTGAGGACATAGCGGTCTTGGACGTGTCGTGTGGGAGCGGCAGTTTCCTGATTAAATCGTTCGACCTGCTGGTGGACGCCTGTGAGGAAGTTCGATCATTGGTTCGGTCAGGGAACGGCGATATTGGTATTAACGAGTATTCAACCGTCCAAATCGTCCCATCGGACTACAAGCGCCACATCCTTCGGAACAACATCTTCGGAGTGGACTTGGACTATCAGGCGACCGAGATTGCGACGGTTAATCTGCTTCTGAAGGCACTCAAGAAGAACGAGAAGCTGCCAGCGATACTCGAAGACAATATTCGTTCAGGAAACAGTCTACTGAATGGTTCGCCCGAAGAGGTGGCCGATGTCCTCGATATTTCGGTAGAAGAAGCCGAAGAAATGGGTACGTTCGAGTGGGAAGAAGAGTTTGACCACATCTTCGAAGAACGGGGTGGCTTCGACGTGATTGTTGGCAATCCACCGTGGGGCGCGGAAATAGATGAATATGATGCGTGGTTAGAAAACGAGAAAGGCTACGAACTGGCCGAGGGCCAATACGATTCGTATGAACTGTTCTTGGAATTGGCTGAAGACCTTCTGAGAGAAGGAGGGACGCTTGGTTTTATTATCCCGGACAGTATCTTCAATGAAGATTCAGTACCCCTACGCCGTTGGTTAGTTGACAGTCATCAATTAGACCGAGTTCATAAGTTAGATGAAGGAATTTTTGATAATGTCTTTGCGGCTACTGCTATTGTTCAGTATACGAACATCAAACCACGCAAAGAGCATCAAGTAGAGGTGAGTTTACTTCAAAAGGCAGACCGGAAAAAGATGCTTGGTGCGGGTGGTGAGGCATTAGCGAGTATTATTGAAGATAAGAAGCACATTACAGAACAACGCCGATTTGCTGAAGATGAAGACTATGTGTTTGATATTTGGGCCGGTGAGAAAGACCACGAAGTTCTTGACGCGATGGAAGCTGACACAGTAGATTGGTCACAGGTTATTGATAATGGGCGTGGGGATGAGACAGGCAGGGAAGGAGAGATTCTACAATGTCCCTATTGCACGGAATGGAGTTCGTTCCCGCGAAAACGGGCGGAATCTAAAGGCGGTGGATACTACTCAAAGACGTGCGACCGTTGCGGAGAGGAATTTGAGTTTGAGGATGCCATCTCAACCCGGCATATAGTGAAGCATGAGCCAACTGAAAAATGTGAGACGCCCATCTACTTCGGGGAACACGTCAATCGTTACCGTATCAGCGGAAACGCATACATTGACACGGATGTACCGGGGGTCGGACTGAAAGATACGTGGCGATTTGAACCGCCTAAGTTACTCATTCGGGAAGCGGGTGTTGGCTTCTATGCGACTATTGACTACACCGAAGCCCGATGTTTAAAATCAGTAATGTCGTTCCGACCGGCTGAGGAACGGGAAGAGCCATTTGACAAGTACGATCTTGAGTATTTCCTGGGCTTTCTTAATTCGCGGGCGATGCTCTACTATTACTCCAAGATCAAGGGAATTGTTGAGTGGCAGTCGTTCCCCCGTCACCCGCAAAGTTTCATTATGTCGCTCCCTGTCCCAGCAATTGACTTTGATGACCCTGATGAAAAGGACGCATATAATGAGTTTGTGGGGTTAGTGAAGCAGGCGACAGACGGCGATGAACAGATAGACGAGGACTTAGATTGGGAAATCGAGCGTGCGGCGTTAGACCTATACGGGATTCCCAACAAAAAGCGGCCCCGAATCTGGAACGAACTCAAGAAACTTCAGCGGCTTCGGATTGTTCGAGAACTGTTCCCCGACGCTGGCGAAGACGACTGA
PROTEIN sequence
Length: 1011
MSVQETVDMDKDAIQDLVDSYHSHSPRERKQMKEAAVRQQFINPLLRALGWDTTTDQVKPEQRTLVGDADYALSLNGREQFFIEAKAFSKDLGGSRRVSNDETQSYIEQAIDYAWHQGCDWAVLTNFEELRLYFTHVSRDNLENGLVFTLTVDEYASEDGFEQLANLSKAAVADGSLERLERARERDTVTEEILNVLSEARRRLTQDVHDSHPDLSMDDLRDGVQRILDRVVVMRVAEDRGVIPADTLLNMAESWEQTTINPDVRTLVRDLKNAFRDFDSVYNSELFAEHPCEDYEISNDVLLDIIDSLYDYNFSYIDADVLGNIYEDYLGHAIEDKSEDLELVEHPDERREEGIYYTPVPVVEYIVESVLGDRIDAIMANVRKELEGDEPDFEAARAEFNAIEDIAVLDVSCGSGSFLIKSFDLLVDACEEVRSLVRSGNGDIGINEYSTVQIVPSDYKRHILRNNIFGVDLDYQATEIATVNLLLKALKKNEKLPAILEDNIRSGNSLLNGSPEEVADVLDISVEEAEEMGTFEWEEEFDHIFEERGGFDVIVGNPPWGAEIDEYDAWLENEKGYELAEGQYDSYELFLELAEDLLREGGTLGFIIPDSIFNEDSVPLRRWLVDSHQLDRVHKLDEGIFDNVFAATAIVQYTNIKPRKEHQVEVSLLQKADRKKMLGAGGEALASIIEDKKHITEQRRFAEDEDYVFDIWAGEKDHEVLDAMEADTVDWSQVIDNGRGDETGREGEILQCPYCTEWSSFPRKRAESKGGGYYSKTCDRCGEEFEFEDAISTRHIVKHEPTEKCETPIYFGEHVNRYRISGNAYIDTDVPGVGLKDTWRFEPPKLLIREAGVGFYATIDYTEARCLKSVMSFRPAEEREEPFDKYDLEYFLGFLNSRAMLYYYSKIKGIVEWQSFPRHPQSFIMSLPVPAIDFDDPDEKDAYNEFVGLVKQATDGDEQIDEDLDWEIERAALDLYGIPNKKRPRIWNELKKLQRLRIVRELFPDAGEDD*