ggKbase home page

scnpilot_p_inoc_scaffold_6587_4

Organism: SCNpilot_P_inoc_Rhizobiales_67_4_fragment_2

partial RP 38 / 55 MC: 1 BSCG 35 / 51 MC: 1 ASCG 4 / 38
Location: comp(1875..4853)

Top 3 Functional Annotations

Value Algorithm Source
Type I site-specific deoxyribonuclease, HsdR family {ECO:0000313|EMBL:EDQ31786.1}; EC=3.1.21.3 {ECO:0000313|EMBL:EDQ31786.1};; TaxID=411684 species="Bacteria; Proteobacteria; Alphaproteobacteria; Rhiz similarity UNIPROT
DB: UniProtKB
  • Identity: 81.5
  • Coverage: 926.0
  • Bit_score: 1515
  • Evalue 0.0
type I site-specific deoxyribonuclease HsdR-like protein (EC:3.1.21.3) similarity KEGG
DB: KEGG
  • Identity: 77.5
  • Coverage: 984.0
  • Bit_score: 1506
  • Evalue 0.0
Type I restriction-modification system, R subunit n=1 Tax=Hoeflea phototrophica DFL-43 RepID=A9DFK8_9RHIZ similarity UNIREF
DB: UNIREF100
  • Identity: 81.5
  • Coverage: 926.0
  • Bit_score: 1515
  • Evalue 0.0
  • rbh

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Hoeflea phototrophica → Hoeflea → Rhizobiales → Alphaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 2979
ATGAGCCAAGGTCCCGAGTGGCATTTCGCCGAGCGCCCGGCCATCGAGCATCTCGAGGCCATGGGCTACAGCTTCGTGCCACCGGTGGAGCATGCCGCGCTGCGCGACGGCGACAACCAGGTGCTGTTCCGCCCGCATCTCGTCGAGGCGCTCATGCGCATCAACGGCATCGACACGGCCTCGGCCGAGGCCGCCTACGGCGAGCTCGCCGGGATCAGCGACAACGAGGCCTGGCTGAAGGTGCTGCGCGGCGATTATTCGCGCAAGGTTTCCGGTCATGATACGCGGCTGACGCTGCGCGTGATCGATTTCCTCGAGCCGGGCAACAACCGCTTTACCGTTACCAGCCAGCTGCGCGTCAAGGCGGAGGCGACGCGGCGGCCGGATCTCGTCATCCATGTCAACGGCATCCCGCTGGTGGTGATCGAGGCGAAGAGCCCACTCAACACCAAGGACAAGACCGGCGAGGCCTTCGAGCAGATCCGGCAATATGAGCGCGACATCCCGCGCCTCTTCGCCTCCAACGCCTTCAACATCGTCACCGACGGGGCGCTGATGCTCTATGGCGCGACTGGCTCGCCCTCGAAGCACTACGCCGAATGGCGCGATCCCTGGCCGAAGGCGGCGGCGGATTTTCCCGATCGCCTCGCGTTGGGGCTCTGGTCGCTGCTGGAGCCGGCGCGGCTGCTCGACCTCCTGGCGCATTTCATCGTCTTCGAGAAGACCGAGGACGGCACGATCAAGAAGATCGCCCGCTACCAGCAATTCCGCGCCGTCAACAAGATCGTCGGGCGCATCATCGAAGGCAAGCACCGGCAAGGGCTGATCTGGCACACGCAGGGCAGCGGCAAATCGCTGACCATGGTGTTCGCGGCGCTGAAGCTGAAGACGCACCGCACGATCGCCTCCGACGCGCTCACCAACCCCAACATCATGGTACTGACCGACCGCGTCGATCTCGACAGCCAGATCAGCGGCACCTTCGCCGCCTGCGGCCTTGCCAATCCGACCCCGATCGAGAGCATCAGGGACCTGCACGCGCTGATCGGCAGCGGCAAGGATGGCCACACCGCGCTGTCGACCATCTTCAAGTTCCAGGGCTCGACGGCGCCGATCGCCAATTCGTCCAACTGGATCGTCATGGTCGACGAAGCGCATCGGACGCAGGAGAAGGACCTCGGCGCCTCGCTCCGCGCCACCTTGCCCGACGCACGCTTCTTCGGCTTCACCGGCACGCCGGTGAAGAAGGACGACAAGGACACCTATGCCAATTTCGGCGTGGTGGGCGAAGGCTATCTCGACAAATACGGGATCGACGATGCCGTGGCCGACGGCGCCACCGTGCCGATCTATTATACCGGCCGCAAGACCGACTGGCACATCGACGAAGCCCGGATCGACATCCTGTTCGACACCTGGTTTGCCGAACTCGACGACGACAGGCTCAACGAGATCAAGAAGCGCGGCGTCGCGATCGAGGATCTCGTCAAGCATCCGCGCCGGATCGAGCTGATCGCCTATGACATCTGGACCCATTTCAAGGCCTATGCCCGGCCCGACGGCTTCAAGGCGCAGATCGTTACCATCGATCGCGAGGCGGTGATCCTCTACAAGCAGGCGCTGACGCGGGTGATCGCCGATGATCTCGCGGCGGAGGGGCTGGCGCCGGACGAGGCCTTGGCGCGCGCCGAGGCCATGTCGGCCTGCGTCTATTCGGTCAACCAGGAAGACGGCAAGCCGAGCGAGGATCCGCACAAGGAGGCCGTTCGCCTGGCGCTCAAGGCCAACTATCTCGATGCGGAGGCCGAGAAGCTGGCGAAGAAGGCCTTCGGCCGGCGCGGCGAGGATCCGCAATTCCTCATCGTCTGCGACAAGCTGTTGACCGGTTTCGATTGTCCGGTCGAGAGCGTGATGTATCTCGACAAGCCGCTCAAGGAGCACGGCCTGCTGCAGGCGATCGCCCGCACCAACCGTGTTTCGGACGCTCGCAAGCGCAACGGGCTGATCGTCGACTATATTGGCGTCTCCGCCAATCTCGAGGCGGCGCTGGCCAGCTACCGCGCCGACGACGTGAAGAACGCCATGCGCGATCTCGACGATCTGCGCAGCCAGCTCCGCGCGGCGCATGCCGCGGTCGCAGCCATGATGAAGGGGGTCAAGCGCGGCGCCACCGGCAAGGATGGGCTGAAGAAGGAGTTCGACGCCTTCATCGCCGTGCTCGGCACAGAAGACCAGTGGTTCGTCCTCAAGGGACACGCCCGCAGCTTCATCGCGCTCTACGAGACGCTCTCGCCCGATCCAAGCGTGCTCGAATTCACTGCCGACCTGAAATGGGTCGCGACCTTCCTGCTCTACGGCACGCAGCACTTCGAGAAGCGCCAGGCCTTCGACCAGCTGGCCTACAGCCAGAAGATCCGCCAGATGCTCGAGACCCATCTCGAGGCGACGGGTCTGAGCGTCACGGTGAAGCTGCGCCACATCACCGACCCCGATTTCTGGGAGGATTTCGACGCCGAGGGCAAGACCGACGAAGACCTGATGACTGCGGCGATCCGGAAGACCACCGAGTTGCGCCGGACCGTCAGCGAGCGGATCGACGACAGCCCGCATCAATATGGCAAGACGGTGAAACCCGGCTGCCTTCGTGAGATGCACTGGCATCCCAATGGCAGCGAGTGGCAATACTGGATCAAAGGGAAGGGTCGCATGACCGTATTCCCAGGTGAGGAAAAAGCGCGAACCATCGATTTCAATGCCAACGACGTTGGCTTCGTCTCCAACATGGCCGGTCATTACATCGAGAACACTGGCACCGAGGATCTGGTGTTCCTTGAGATGTTCGTGGCCCCGGAGTTCCAGGAAATATCGCTCAACGGGTGGCTGCGCGCTCTGCCGGAACAGGCTGTGACAGCGCATACCAACCTCACTGCCGAGGATATTCGCAAGATTCCCATCGGTCACAATCCCCTGCTGCGGTAA
PROTEIN sequence
Length: 993
MSQGPEWHFAERPAIEHLEAMGYSFVPPVEHAALRDGDNQVLFRPHLVEALMRINGIDTASAEAAYGELAGISDNEAWLKVLRGDYSRKVSGHDTRLTLRVIDFLEPGNNRFTVTSQLRVKAEATRRPDLVIHVNGIPLVVIEAKSPLNTKDKTGEAFEQIRQYERDIPRLFASNAFNIVTDGALMLYGATGSPSKHYAEWRDPWPKAAADFPDRLALGLWSLLEPARLLDLLAHFIVFEKTEDGTIKKIARYQQFRAVNKIVGRIIEGKHRQGLIWHTQGSGKSLTMVFAALKLKTHRTIASDALTNPNIMVLTDRVDLDSQISGTFAACGLANPTPIESIRDLHALIGSGKDGHTALSTIFKFQGSTAPIANSSNWIVMVDEAHRTQEKDLGASLRATLPDARFFGFTGTPVKKDDKDTYANFGVVGEGYLDKYGIDDAVADGATVPIYYTGRKTDWHIDEARIDILFDTWFAELDDDRLNEIKKRGVAIEDLVKHPRRIELIAYDIWTHFKAYARPDGFKAQIVTIDREAVILYKQALTRVIADDLAAEGLAPDEALARAEAMSACVYSVNQEDGKPSEDPHKEAVRLALKANYLDAEAEKLAKKAFGRRGEDPQFLIVCDKLLTGFDCPVESVMYLDKPLKEHGLLQAIARTNRVSDARKRNGLIVDYIGVSANLEAALASYRADDVKNAMRDLDDLRSQLRAAHAAVAAMMKGVKRGATGKDGLKKEFDAFIAVLGTEDQWFVLKGHARSFIALYETLSPDPSVLEFTADLKWVATFLLYGTQHFEKRQAFDQLAYSQKIRQMLETHLEATGLSVTVKLRHITDPDFWEDFDAEGKTDEDLMTAAIRKTTELRRTVSERIDDSPHQYGKTVKPGCLREMHWHPNGSEWQYWIKGKGRMTVFPGEEKARTIDFNANDVGFVSNMAGHYIENTGTEDLVFLEMFVAPEFQEISLNGWLRALPEQAVTAHTNLTAEDIRKIPIGHNPLLR*