ggKbase home page

ar4r2_scaffold_28746_1

Organism: ALUMROCK_MS4_Gammaproteobacteria_45_49_curated

near complete RP 49 / 55 MC: 11 BSCG 49 / 51 MC: 10 ASCG 14 / 38 MC: 3
Location: comp(2..913)

Top 3 Functional Annotations

Value Algorithm Source
Type I restriction-modification system, specificity subunit S {ECO:0000313|EMBL:KFC91883.1}; EC=3.1.21.3 {ECO:0000313|EMBL:KFC91883.1};; TaxID=911008 species="Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales; Enterobacteriaceae; Leclercia.;" source="Leclercia adecarboxylata ATCC 23216 = NBRC 102595.;" similarity UNIPROT
DB: UniProtKB
  • Identity: 51.6
  • Coverage: 252.0
  • Bit_score: 241
  • Evalue 1.40e-60
Uncharacterized protein n=1 Tax=Campylobacter coli 1909 RepID=H7TSU1_CAMCO similarity UNIREF
DB: UNIREF100
  • Identity: 45.3
  • Coverage: 285.0
  • Bit_score: 230
  • Evalue 1.80e-57
hypothetical protein similarity KEGG
DB: KEGG
  • Identity: 44.1
  • Coverage: 299.0
  • Bit_score: 230
  • Evalue 5.00e-58

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Leclercia adecarboxylata → Leclercia → Enterobacteriales → Gammaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 912
ATGAGTCAGGACTTATATGACCTACCAGCGGGGTGGGAGTGGATTGAATTTGACGCTTTAGTGTATGTCAAGGGTGGGAAAAGATTACCAAAAGATAGTCAATTACAAGATAGTAAAACACCATTTCCCTATATTAGAGTTTCTGATTTCAATGATTTTGGTTCAGTAGATACAGATAATATAAAATACATAACATTAGATATTCATAATAAAATAAAAAATTATGTTATTTCTTCGGAAGATCTGTATATTAGTATTGCTGGAACAATCGGCAAAACAGGAATTATTCCAGCATCTTTAAATGGTGCTAATTTGACTGAAAATGCAGCTAAATTAGTCATCAAAAACAAAGATCAATTAACACTAAGATATTTATATCTATTTACTTTATCTAATACTTTCTCAGAGCAAGCTGGATTGGCAACAAAGCAAGTAGCTCAGCCAAAGCTAGCCTTATCCCGTCTGTCGGCAATTAGCATCCCCCTCCCTCCCCTAGCCGAACAAACCCGCATTGTCGAAAAACTCGATGCCGTGCTTTCTCGCATCGACACAGCCATCAATGAATTACAGCAAAGCCTTGCACTGGTGGATGCGATGTTTAAGAGTGGGTTGGATCAGGTATTTAACCCGCTAGGCTCGCCTAGTAATGAGGGTGGTTTGTATGACTTGCCCGAAAGGTGGGGGTGGAAGCAACTGCATACTATTGCTGATGTTGTGTCTGGTTACGCTTTTAAAAGTGAAGATTTTTCGTCTGATATTGGTGTTCCTAGTGTGAAAATTACCAATGTTGGTTTAGGTGCTTTCATTGAGTCACAAGATGATTACTTGCCAAGTGATTTTTCAACCAAATATAATAAGTTTGCTGTTAAACAAGGCGATATTGTTATTGCCTTAACTCGTCCAGTAATAAAT
PROTEIN sequence
Length: 304
MSQDLYDLPAGWEWIEFDALVYVKGGKRLPKDSQLQDSKTPFPYIRVSDFNDFGSVDTDNIKYITLDIHNKIKNYVISSEDLYISIAGTIGKTGIIPASLNGANLTENAAKLVIKNKDQLTLRYLYLFTLSNTFSEQAGLATKQVAQPKLALSRLSAISIPLPPLAEQTRIVEKLDAVLSRIDTAINELQQSLALVDAMFKSGLDQVFNPLGSPSNEGGLYDLPERWGWKQLHTIADVVSGYAFKSEDFSSDIGVPSVKITNVGLGAFIESQDDYLPSDFSTKYNKFAVKQGDIVIALTRPVIN