ggKbase home page

ar4r2_scaffold_5384_10

Organism: ALUMROCK_MS4_Thiotrichales-related_46_269_curated

near complete RP 47 / 55 MC: 3 BSCG 49 / 51 MC: 5 ASCG 11 / 38 MC: 2
Location: 7840..11073

Top 3 Functional Annotations

Value Algorithm Source
hypothetical protein n=1 Tax=Thiomicrospira halophila RepID=UPI0003810F54 similarity UNIREF
DB: UNIREF100
  • Identity: 56.8
  • Coverage: 1104.0
  • Bit_score: 1214
  • Evalue 0.0
  • rbh
Type I restriction endonuclease subunit M {ECO:0000313|EMBL:KEP68470.1}; TaxID=1185766 species="Bacteria; Proteobacteria; Alphaproteobacteria; Rhodobacterales; Rhodobacteraceae; Thioclava.;" source="Thioclava dalianensis.;" similarity UNIPROT
DB: UniProtKB
  • Identity: 47.8
  • Coverage: 1093.0
  • Bit_score: 950
  • Evalue 1.70e-273
type I restriction-modification system methyltransferase subunit similarity KEGG
DB: KEGG
  • Identity: 56.0
  • Coverage: 861.0
  • Bit_score: 948
  • Evalue 1.30e-273

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Thioclava dalianensis → Thioclava → Rhodobacterales → Alphaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 3234
ATGTCGCTATTCTTGCCTAAAATCCTTGCCAAACACCTTGTTAACACGCCCATTTCCGAAGCTCACCTAACCGCCATCGCACACTGGCAAAGCTCGATTAACGATGGCAGTTTAAAAAAGCTAGGTGAAAAATCAGCGCATGGCGCATTTATCCAAACTTTTTTAGTGACCTTGCTCGACTACACCACGGTAGCAACCCATAGCCAATACACAGCCAGCTACGAAATGGGCATTAAAAAAGGTGGTATTGTCGATGTTGCTTTAGGGCGTTTTGGTAACGATATCGAAGCGCAAATTATCGCGCCCTTCGAGCTAAAAGGCTTAGACACACCCAACCTCGATGCCATTATGTCGGGAAGACACAAAACGCCTGTGCAGCAAGCATGGGAGTACGCCAATGCCATTAAAGGCGCGAAATGGGTATTGGTTAGCAACTATCGCGAAGTTCGCTTGTATGCCGTCGGACATGGCATGCAAAGTTATGAAAGCTGGGATGTGTTAACGCTAAACCAGCCAGCTGAATATGCTCGCTTTCGCTTGTTGTTATCACAAAACAGTCTGTTAGGCGACACGACCAGCGCATTGTTACAAGCCACCGAACAAGCCGACAAAGACATCACCGCCCAACTCTATGCCGACTACAAGTCGGTGCGTGAAAGCCTAATCGCCCACTTAATCAGTGATAACCCCACTAAAGTACCGACAGAACTCATTGCCCCCGCGCAAAAGCTGCTCGACCGCGTGCTGTTCGTTGCCTTTGCCGAAGACCGTGGGCTGATTCCCGACAACACCATCAAAAACGCTTATTTGCACGCTGACCCTTACAACCCACGCCCGATATACGACAACTTCAAAGGCTTGTTCACTGCCATTGATAAAGGCAATGCGCGCTTAAAAATACCCGCCTACAATGGCGGACTATTTGCCCCCGATGCTGAATTAGACAGTCTGACGATTAGCGATGCCCTCTGCGAAGCCTTTAAAAATATTGCCGAATACGATTTTGCTTCCGATGTGTCGGTTACTGTTTTGGGGCATATTTTTGAACAATCCATTGCCGACTTGGAAGAAATCAGCGAAAGCCTAGCGACAGGGCAAAGCACACTCAGCAAAACCGCCAAAGCAACCGCCGTATCGGGCAAACGCAAACTGCACGGCGTAGTTTACACGCCCGACCACATTACCGCCTTTATTGTCGAGCATACGCTTGGTGCCTATTTGCGCAGTCAGTTTTTAAGGTTGCTGACAGATTATGGCACAGCTAACGACGACGGTAGCATCAAATGGAAGCAAGGCAAACAAACCGATTTACGCTTTTGGTACGCTTGGCAAGAACGCCTGAAACAAATTAAAGTGGTTGACCCGGCCTGCGGTTCGGGCGCGTTTTTAGTTGCCGCTTTCGATTACTTGCATGGTGAATATCGCCGAGCCAACGAAGCCATTGCCAGCATTACAGGGCAAGCGGGCGTATTCGACCTGAACAAAGAAGTGCTCAACAACAACCTATTTGGCGTGGATATTAACCCCGAATCGATAGAAATCACCAAACTATCACTGTGGCTGAAAACTGCCGAATACGGCAAGCCACTCACCAGCTTAGACAGCAACCTAAAAGCAGGCAATAGCCTTGGACTGAGCGAAGCCGTGGCAGGCGACACCTTCTGTTGGCATAACGCATTTGCCGATATTTTCGCCACAGGCGGCTTCGATGTGGTACTGGGTAATCCGCCCTACGTGCGTCAAGAGCGATTTAGTCACCTCAAGCCGTGGTTAGAAGCCCAATATGCTGTGTATCACGGTGTTGCCGATTTGTACGCCTATTTCTTTGAATTGGGCGCACGCTTGCTCAAGCCTAATGGCATGATGGGCTATATTTCCTCGTCGACCTTTTTTAAGACAGGTAGTGGCGAACCCTTGCGCCAGTTTTTACGTGCGCAAACAGCGATTCAAACCATCGTTGATTTTGGTGATTTGCAGATTTTTGAAGGCGTTACCACCTATCCCGCCATTGTTGTTCTGCAACAGACACCGCCTAAGGACACGCATCAGCTTAGCATGTTGGTGCTCAAAGAGAGCTTGCCCGACAACCTCAATCAAGCCTTTAGCCAACAACAAGCCACCATGCCACAAGCACGCCTTACACAAGACTCTTGGCAACTAGAAAGCGACCAACTTGCTGCCTTGCGTGCCAAGCTAACGACTGGGCATAAAACACTCAAAGAAGTTTATGGTTCGCCCTTGTATGGCATTAAAACAGGATTCAATGAGGCCTTTGTCATTGATCGTGTCACACGTGATGCGCTCATTGCCAGCGACGCGAAAAGTAGCGAGCTAATTAAACCGTTTTTAGAAGGTAAAGACCTGAAAAAGTGGCACGCTCAGCCACGGGATTTGTACCTCATTGCCATTCCTAAGTTCTGGACACGCTCGCAAATGGGCAAAACGGATGCTCCCAACGAAGCAGAAGCAACAGAATGGTTTAGCCAACACTATCCCGCTTTATTCGCTTATTTACAACCCTTTGAAGCACCTGCCAAAAAACGCACCGACAAAGGTGAGTTTTGGTGGGAGTTACGGGCTTGTGCTTACTATGACAAGTTTGAAGAAGTTAAAATTATCTATCCTGAAATGTCTGATAAGTCAGCATTTTTTATAGATAAAATAGGTTTCTACACACAAAAAACATGTTTCATCTTGCCAAAACTAGATTGGTTCTTATTAGGCTTGCTAAACTCAAGTGCTGTATGGTTTTACTTATTAGGCGAGTGTAATTCTGTTAGGGGCGGTTGGTTGAATTTACAAGGTATTTTTATAAATACAATTCCCATCCCCACAGCTACAGAAACACAAAAAACCAGCATCGGACAACTGGCAGAAAGCTGTCAAAGCCTGACTGAACAGCGTTACGCCATCGAGCAAAAAGTGGCACACAGATTGGTCAGCGACCTTTGCCCAACGGACAACACCGCCAAACTTACGCAAAAAGCACAAGTTTGGTGGACACTCGATTTTAGCAGCCTACAAAACGAGCTAAAAAAGAGCTTTGGACTAAAAGCAGGCGACAAACTCATCCCCATCAGCGAACGCGACGATTGGGAGGAATACCTGAACAGCAACCGCCAAAAAATCGAGCAGCTTAACCAGCAAATTAAGGAAAAAGAACAGGCGTTGAATGTGGCGGTTTATGCGTTGTTTGGGTTGACAGAGGAAGAGCAGGGGTTGGTGGAGCGGTAA
PROTEIN sequence
Length: 1078
MSLFLPKILAKHLVNTPISEAHLTAIAHWQSSINDGSLKKLGEKSAHGAFIQTFLVTLLDYTTVATHSQYTASYEMGIKKGGIVDVALGRFGNDIEAQIIAPFELKGLDTPNLDAIMSGRHKTPVQQAWEYANAIKGAKWVLVSNYREVRLYAVGHGMQSYESWDVLTLNQPAEYARFRLLLSQNSLLGDTTSALLQATEQADKDITAQLYADYKSVRESLIAHLISDNPTKVPTELIAPAQKLLDRVLFVAFAEDRGLIPDNTIKNAYLHADPYNPRPIYDNFKGLFTAIDKGNARLKIPAYNGGLFAPDAELDSLTISDALCEAFKNIAEYDFASDVSVTVLGHIFEQSIADLEEISESLATGQSTLSKTAKATAVSGKRKLHGVVYTPDHITAFIVEHTLGAYLRSQFLRLLTDYGTANDDGSIKWKQGKQTDLRFWYAWQERLKQIKVVDPACGSGAFLVAAFDYLHGEYRRANEAIASITGQAGVFDLNKEVLNNNLFGVDINPESIEITKLSLWLKTAEYGKPLTSLDSNLKAGNSLGLSEAVAGDTFCWHNAFADIFATGGFDVVLGNPPYVRQERFSHLKPWLEAQYAVYHGVADLYAYFFELGARLLKPNGMMGYISSSTFFKTGSGEPLRQFLRAQTAIQTIVDFGDLQIFEGVTTYPAIVVLQQTPPKDTHQLSMLVLKESLPDNLNQAFSQQQATMPQARLTQDSWQLESDQLAALRAKLTTGHKTLKEVYGSPLYGIKTGFNEAFVIDRVTRDALIASDAKSSELIKPFLEGKDLKKWHAQPRDLYLIAIPKFWTRSQMGKTDAPNEAEATEWFSQHYPALFAYLQPFEAPAKKRTDKGEFWWELRACAYYDKFEEVKIIYPEMSDKSAFFIDKIGFYTQKTCFILPKLDWFLLGLLNSSAVWFYLLGECNSVRGGWLNLQGIFINTIPIPTATETQKTSIGQLAESCQSLTEQRYAIEQKVAHRLVSDLCPTDNTAKLTQKAQVWWTLDFSSLQNELKKSFGLKAGDKLIPISERDDWEEYLNSNRQKIEQLNQQIKEKEQALNVAVYALFGLTEEEQGLVER*