ggKbase home page

SCNpilot_expt_1000_bf_scaffold_478_curated_17

Organism: scnpilot_dereplicated_Thiobacillus_5

near complete RP 51 / 55 BSCG 51 / 51 ASCG 12 / 38 MC: 1
Location: comp(13936..15408)

Top 3 Functional Annotations

Value Algorithm Source
insulinase family protein; K01422 [EC:3.4.99.-] similarity KEGG
DB: KEGG
  • Identity: 83.3
  • Coverage: 444.0
  • Bit_score: 735
  • Evalue 1.10e-209
peptidase M16 n=1 Tax=Thiobacillus denitrificans RepID=UPI000379CF02 similarity UNIREF
DB: UNIREF100
  • Identity: 85.0
  • Coverage: 448.0
  • Bit_score: 762
  • Evalue 2.70e-217
Tax=GWE1_Thiobacillus_62_9_curated similarity UNIPROT
DB: UniProtKB
  • Identity: 82.4
  • Coverage: 448.0
  • Bit_score: 738
  • Evalue 7.70e-210

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

GWE1_Thiobacillus_62_9_curated → Hydrogenophilales → Betaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 1473
ATGTCGAGCCATCGCGTGAAACACCGCGCAGCGATTCGTTTTCAGTCCCCGCCACCCGCACGCGGCGCAAAGCCCCGGCGGGTATTGCCGTTGCGTGTCCTGGCGCGACGCCTCACCATGACCGCCGGCGTGCTCGCATCGGCACTCGCGCTCGGCAGCGCGCACGCGACCCTCACCGACGTCACGCTCGACAACGGCCTGCGCGTCATCGTCAAGGAGGATCACCGCGCGCCGGTGATGGTGTCGCAGGTGTGGTACCGCGCCGGTGCCATGGACGAATTCAACGGCACCACGGGTGTCGCGCACGTGCTCGAACACATGATGTTCAAGGGTACGCCCAAGGTCCCGGCGGGGGAGTTCTCGAAACGTATCGCCGCAGCGGGCGGGCGCGAGAACGCCTTCACCAGCCGCGACCACACCGCGTATTTCCAACAGATGCAGAAGGACCGCCTGCCGCTGGCGCTGGAACTGGAGGCCGACCGCATGGCCAATCTCGTCATCAGCGACGAGCTGTTCGCCAAGGAACTCCAGGTGGTGATGGAGGAGCGGCGGCTGCGCACCGACGACCAGGCGCAATCGGTGGTGTACGAGCGCCTCATGGCGGCCGCGTACCAGGCGCATCCCTACCGCCGGCCCATCATCGGCTGGATGGACGACCTCGTGAACATGAGCGCGCAGGATGCACGCGATTGGTACGCGCGCTGGTACGCACCGAACAACGCCACGCTGGTGGTGGCGGGCGACGTACAGGCGAGCGATGTCGTCGAACTGGCGAAGAAACACTTCGGCGCGCTGCCTGCACGCGCCCTGCCGGCGCGCAAGCCACAGGCCGAGCCGGAGCAGGTCGGCGAAAAGCGCATGGTGGTGAAGGCTCCCGCCAAGCTGCCGTATCTGCTCATGGCCTGGCATGCGCCCACGCTCAAGGACTGGCAGCAGGACACCGTGCCCTATGCGTTGCAGATTCTCGCCGGCGTGCTGTCCGGCAACGATTCGGCGCGCCTGCAGAAATCGCTGGTGAAGACGCGGCAGATCGCCGTCAACGCCAGCGCCGGTTACGACGCTGTGGCGCGCGGACCGGGGATGTTCATGATCGACGCCACGCCCGCAGAGGGGCAATCGGTGGCGGCACTGGAGAAGGCCATCCGCGAGGAGATCGCGCGCATCCAGCGGGACGGCATCGACGCGGACGAACTCGCGCGCGTGAAGGCGCAGGTCATCGCGGGCGAGGTCTACCAGCGCGACTCGTTGTTCTACCAGGCGATGCAACTGGGCGACTACGTCACCGCCGGCCAGCCGCCGGAGGCGCTCGCCGGGCGCGTTGACAAGCTGCGTGCCGTCACCGCGGAGCAGGTACGCGCGGCCGCGCGCGAATGGCTGCGCGACGACCGCCTGAGCCTGGCCGAGCTCGATCCCCAACCGCTGGACGCGCAACCGCGCCGTGCCGCTGTGCCGGGAGTGCGCCATGTCAATTAA
PROTEIN sequence
Length: 491
MSSHRVKHRAAIRFQSPPPARGAKPRRVLPLRVLARRLTMTAGVLASALALGSAHATLTDVTLDNGLRVIVKEDHRAPVMVSQVWYRAGAMDEFNGTTGVAHVLEHMMFKGTPKVPAGEFSKRIAAAGGRENAFTSRDHTAYFQQMQKDRLPLALELEADRMANLVISDELFAKELQVVMEERRLRTDDQAQSVVYERLMAAAYQAHPYRRPIIGWMDDLVNMSAQDARDWYARWYAPNNATLVVAGDVQASDVVELAKKHFGALPARALPARKPQAEPEQVGEKRMVVKAPAKLPYLLMAWHAPTLKDWQQDTVPYALQILAGVLSGNDSARLQKSLVKTRQIAVNASAGYDAVARGPGMFMIDATPAEGQSVAALEKAIREEIARIQRDGIDADELARVKAQVIAGEVYQRDSLFYQAMQLGDYVTAGQPPEALAGRVDKLRAVTAEQVRAAAREWLRDDRLSLAELDPQPLDAQPRRAAVPGVRHVN*