SCNpilot_bf_inoc_scaffold_464_curated_35

Organism: scnpilot_dereplicated_Bacteroidales_1

near complete; RP 52 / 55 (MC: 4); BSCG 51 / 51 (MC: 2); ASCG 13 / 38
Location: 33638..37105

Top 3 Functional Annotations

Value: Eco57I restriction endonuclease n=1 Tax=Anaerophaga thermohalophila RepID=UPI000237B951
Algorithm: similarity
Source: UNIREF
DB: UNIREF100
  • Identity: 47.6
  • Coverage: 999.99
  • Bit_score: 927
  • Evalue: 1.10e-266

Value: Putative type IIS restriction /modification enzyme, N-terminal half {ECO:0000313|EMBL:GAO28333.1}; TaxID=1236989 species="Bacteria; Bacteroidetes; Bacteroidia; Bacteroidales; Marinilabiliaceae; Geofilum.;" source="Geofilum rubicundum JCM 15548.;"
Algorithm: similarity
Source: UNIPROT
DB: UniProtKB
  • Identity: 65.8
  • Coverage: 695.0
  • Bit_score: 893
  • Evalue: 3.40e-256

Value: Eco57I restriction endonuclease
Algorithm: similarity
Source: KEGG
DB: KEGG
  • Identity: 43.2
  • Coverage: 999.99
  • Bit_score: 879
  • Evalue: 1.50e-252

Taxonomy

Geofilum rubicundum → Geofilum → Bacteroidales → Bacteroidia → Bacteroidetes → Bacteria

Sequences

DNA sequence
Length: 3468
ATGGACAAGAGAACAGCACAAGAACTCATAGAAAAAACTTTCAATAATGCTTTCAACGAAGAGCAGTTTACTGTTTTTGCCAAAAACCTGCTGAACGACTTTGAAGACAAAGACAACCGATACAGTGGAAACCTGATTTGGGACGACTACAAGGAGCACATCAATACCTACAAACGCATCGGCAAATACATTGACCCTGATGGCGAAGCGTTGGACGTGCTGATGGTAGAAGTAAAGAGCGTGAACAAATTGGAGCGGGCAAGAACGGCACTCCGAAATTTCGTCATCAAACACCTGAGCAGGTTTGAAAAAGATTATGCTTTGGTGGCTTTTTACAGCAAGGAAGATGAAGGTGCAGATTGGCGATTTTCATTTATTAAACTAGAATACCGTTCTGAACTGGACGAAGAAAAGGGCAAAGTAAAAACCAAAAAAGAATTTACCCCTGCCAAACGCTACTCGTTTTTGGTGGGCAAATACGAAAAAGCCCATACTGCCAAAAATCAGTTGTTGCCTTTGCTGCAAAACATTTCCAACAACCCCACCATTGAAGAATTAGAAGCTGCCTTTAGCATTGAAAAAGTTACAGATGAATTCTTCAACCAGTACAAAGACCTTTACATCAAACTTTACGAGCATTTTGAAAATGACCGCAAAGTAAAATCTGCCATTGACCAGGCAGGAATAGACAATGCCCGCTTTACCAAAAAGCTATTGGGGCAAATTGTGTTCTTGTATTTCTTACAGAAAAAAGGCTGGCTGGGTGTGGCTCAGAATGCCCGTTGGGGAACCGGCAAAAAACGATTTGTGCAGGAACTGTTTGACCAAGCCCAAAAAGAAAAGGTCAACTTCTTTAAAGACAAACTGCAATACTTGTTTTATGAAGCCCTTGCCAAAGAGCGGGACAATGTAAATTCATATTACAAGCGTTTTGATTGCCGCATTCCGTTTTTGAATGGTGGTTTGTTTGAAGCGGATTACAACTGGCAGGAAGCCAATATCACCACTCCTGAAAACCTATTCCGCAATGATGAAAAGAACAAAGCAGGTGATGTAGGCACGGGAATTCTGGATGTATTCGACCGCTACAACTTTACCATCAAAGAAGATGAACCTTTAGACAAGGAAGTAGCCGTGGACCCTGAAATGTTAGGGAAGGTTTTTGAAAATATGCTCGACATCACCGAGCGTAAAAGCAAAGGGGCTTTTTACACCCCAAGGGAAATTGTGCATTATATGTGTCAGGAAAGCCTGATTCATTATTTAGACAATGCCCTGAACAGCGGAACAAGCAGCTATCAGGAATTGGGTTCAGACCAAACCAAACTGTTTGGCGGCAGCACCGACAAAAAAGGAAATCTGAAAATAGAACTGGAGCATACCGACAATATCCGTGTTCCGAAAAAAGATATCGAAACCTTTATCCGTGAAGGGCATTTTGCTTTGGAAAATGACGAACGGGTAGCTACCAAAGGCGAAACCAAAACCTACCAATACCAACTGCCCGAAAGCATTCGCAAAAATGCTGACTTGATTGACCAAAAACTTAGCAATATTAAAATTTGTGACCCTGCCATTGGTTCAGGGGCATTTCCTGTGGGTTTGTTGCACGAGTTGGTCAATGCAATGTTGGTTTTGAAACCGCATTTGAGTTACGACTATTTGACCGAAAAACTGAAAGGCTTTGGTTTTGCCCAGCGTGAAAGCATCAGCGATAGCCGCTACATCTACCGACTGAAACGCCACATCATACAAGAAAGCATTTATGGGGTGGATATTGACAGCTCTGCCATTGACATTGCCCGTTTGCGTTTGTGGCTGAGCTTGGTGGTGGACGAAGACGACTTGGACCCGATAGAAACGCTTCCTAACTTGGATTATAAAATAGTAGCTGGTAATTCTTTGATTGGTCTGCCTGATGGTGCTATGCGAAACTTGGTAGTTGAAGCCGAACTGGAACAGCTTAAAGAAAAATTCTACGACATCACAGACGAGAAAGA
AAAGAAAGCCCTACGCCAACAGATAAACACCAAAATACGCGAACTCTTAGATTCTGCTGAACAGTTTGCTGGTTACAAAATTGATTTTGATTTTAAACTCTTTTTCTCGGAAGTATGGCGGGAAAAAGGGGGCTTTGATGTGGTGATTGGGAATCCGCCTTATGATGTTTATGAAGGAAAGAAATCTGATGAAATACCAACAATTAAGAAAATCAATATTTATGACATTGCCAAGTCGGGTAAACTCAATGCATACAAACTCTTCCTGGCTAAATCAATAACGATTTTAAACGATGGCGGGATTTTTAATCAAATTTTTCAAAATTCATTTTTAGGTGACAATTCAGCAAAACTTCTTCGTAAACATTTTTTAACTGAACAAAAAATCATCAGAATTGACTCGTTTCCTGAACGAGACGATTTAAACAAAAGGGTCTTCGTTTCTGCAAAAATGAGTGTTTGCATTCTATTCTCACAAAACAAAAAAAGTACTAAGTATGACTTTCCATTATTTGTTTGGAGTGAAAGATGGATGGAGAATTCATATTCTTCAATATTTAGCAATAGAGAATTACTTGCTTTTGATAAAGAATCGTACGTTATTCCTTCAGTTTCACAAGCTGAAAAAGACATATTAAACAAAGTGTATGAAGTTAAAAGATTTGGCAGCACTGTAAACTGCTATCAAGGTGAAATAAATCTCTCAACAAACAAGTCAATAATTGTTCAAAAGGCTAATTCAAATACAATGCCTTTAATAAAAGGGGCAGGGGTTCAAAAATGGTATTTACCTGAAAAAATGAGCCAAGGTGTAGTTGAGTATTTAATTCATAGTGAATATTTAAACCAAAACAAAGGAGAGAAAAGCACACATTTTAATTTTTCAAGAATTGTTATGCAAGGAATCACGGGAGTTGATGAAAAGCATAGGATAAAATCAACAATTTTGGAAAAGGGATTCTTTTGTGGACACTCAATCAATTACATCTCACTAAAAAATGTTTCAGATTTATTAGCAAAATATTATTTGTCAATTCTAAATTCAGAATTTTCAAATTGGTTTTTCAAGAAATTCAGCACGAACAGCAATGTAAATAGCTATGAAATTCATAATCTTCCTTTGCCGAATTACTCTGATAGGTTCTTACCTCTCTCAATTGTCGCATCATATTTATTGAACAAAAGAAATCAGATTAAAGATGTGACATTTAATTTTTACGAGCACCTTATTAATTCTATTGTCTATGAATTGGTGTTTCCCGAAGAAATCAAATCGGCAGGGAAAGAAATCCTAAAGCATTTGGGAGATTTAAAGCCCATCACAGAAGATATGAGCGAAGAAAAAAAGCTTGCTATCATCCAAAGTGAGTTTGAGCGTTTGTATGACCCGAATCATCCTGTTCGCTTTGCCATAGAGACCTTGGATAGTGTGGAAGAAGTAAGGATTATTAAAGAAGCACTAAAATGA
PROTEIN sequence
Length: 1156
MDKRTAQELIEKTFNNAFNEEQFTVFAKNLLNDFEDKDNRYSGNLIWDDYKEHINTYKRIGKYIDPDGEALDVLMVEVKSVNKLERARTALRNFVIKHLSRFEKDYALVAFYSKEDEGADWRFSFIKLEYRSELDEEKGKVKTKKEFTPAKRYSFLVGKYEKAHTAKNQLLPLLQNISNNPTIEELEAAFSIEKVTDEFFNQYKDLYIKLYEHFENDRKVKSAIDQAGIDNARFTKKLLGQIVFLYFLQKKGWLGVAQNARWGTGKKRFVQELFDQAQKEKVNFFKDKLQYLFYEALAKERDNVNSYYKRFDCRIPFLNGGLFEADYNWQEANITTPENLFRNDEKNKAGDVGTGILDVFDRYNFTIKEDEPLDKEVAVDPEMLGKVFENMLDITERKSKGAFYTPREIVHYMCQESLIHYLDNALNSGTSSYQELGSDQTKLFGGSTDKKGNLKIELEHTDNIRVPKKDIETFIREGHFALENDERVATKGETKTYQYQLPESIRKNADLIDQKLSNIKICDPAIGSGAFPVGLLHELVNAMLVLKPHLSYDYLTEKLKGFGFAQRESISDSRYIYRLKRHIIQESIYGVDIDSSAIDIARLRLWLSLVVDEDDLDPIETLPNLDYKIVAGNSLIGLPDGAMRNLVVEAELEQLKEKFYDITDEKEKKALRQQINTKIRELLDSAEQFAGYKIDFDFKLFFSEVWREKGGFDVVIGNPPYDVYEGKKSDEIPTIKKINIYDIAKSGKLNAYKLFLAKSITILNDGGIFNQIFQNSFLGDNSAKLLRKHFLTEQKIIRIDSFPERDDLNKRVFVSAKMSVCILFSQNKKSTKYDFPLFVWSERWMENSYSSIFSNRELLAFDKESYVIPSVSQAEKDILNKVYEVKRFGSTVNCYQGEINLSTNKSIIVQKANSNTMPLIKGAGVQKWYLPEKMSQGVVEYLIHSEYLNQNKGEKSTHFNFSRIVMQGITGVDEKHRIKSTILEKGFFCGHSINYISLKNVSDLLAKYYLSILNSEFSNWFFKKFSTNSNVNSYEIHNLPLPNYSDRFLPLSIVASYLLNKRNQIKDVTFNFYEHLINSIVYELVFPEEIKSAGKEILKHLGDLKPITEDMSEEKKLAIIQSEFERLYDPNHPVRFAIETLDSVEEVRIIKEALK*
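
The coordinates and lengths listed above can be cross-checked with a few lines of code. The following is a minimal sketch, not part of the ggKbase record: it assumes the standard genetic code (the record does not state which translation table was used), and the helper names span_length and translate are hypothetical. It shows that an inclusive span of 33638..37105 covers 3468 bases, that 3468 bases correspond to 1156 codons (the final one being the stop, shown as "*" in the protein sequence), and that the first and last codons of the DNA sequence translate to the start and end of the listed protein.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
# Assumption: the record does not say which translation table was used.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span such as 33638..37105."""
    return end - start + 1


def translate(dna: str) -> str:
    """Translate DNA codon by codon; unknown codons become 'X', stops '*'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


# Location 33638..37105 should match the reported DNA length.
print(span_length(33638, 37105))

# 3468 bases divide evenly into codons, the last of which is the stop codon.
print(3468 // 3, 3468 % 3)

# First eight codons and last four codons of the DNA sequence above.
print(translate("ATGGACAAGAGAACAGCACAAGAA"))
print(translate("GCACTAAAATGA"))
```

To check the full record, paste the complete DNA sequence (joined into a single string with no line breaks) into translate and compare the result with the protein sequence.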