ggKbase home page

SCNpilot_expt_1000_bf_scaffold_478_curated_16

Organism: scnpilot_dereplicated_Thiobacillus_5

near complete RP 51 / 55 BSCG 51 / 51 ASCG 12 / 38 MC: 1
Location: comp(12639..13946)

Top 3 Functional Annotations

Value Algorithm Source
insulinase family protein; K01422 [EC:3.4.99.-] similarity KEGG
DB: KEGG
  • Identity: 70.3
  • Coverage: 435.0
  • Bit_score: 614
  • Evalue 2.50e-173
zinc protease n=1 Tax=Thiobacillus denitrificans RepID=UPI00036BB06E similarity UNIREF
DB: UNIREF100
  • Identity: 75.1
  • Coverage: 433.0
  • Bit_score: 654
  • Evalue 9.20e-185
Tax=RIFOXYA1_FULL_Hydrogenophilales_63_33_curated similarity UNIPROT
DB: UniProtKB
  • Identity: 72.7
  • Coverage: 436.0
  • Bit_score: 631
  • Evalue 6.90e-178

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

RIFOXYA1_FULL_Hydrogenophilales_63_33_curated → Hydrogenophilales → Betaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 1308
ATGTCAATTAAGCGATTTCTTGCCGTTCTGCTGTGCATGCTGCCGCTGACTGTTCAGGCCGCGGTCACCATCCAGCACTGGCGCACGCCCGAAGGCGCGCGCGTCTATTTCGTCGAAAGCCGCGAACTGCCGATGCTCGACGTCGCGGTGAGCTTTCCCGCCGGCAGCGCGCGCGATCCTGCCGCAAAATCCGGTCTCGCGCAGTTGACCCATACTGTGCTCGACCAGGGGGCGGGCGGCCTGTCGGAGACCGCGATCGCGCATGGGCTTGCCGACGTTGGTGCGGTGCTTTCCGGTAGTTTCGATCGCGATCGCGCGGCAGTGAGCCTGCGCACCCTGTCGTCGGCGCGCGAGAAAACCCAGGCGCTCGATCTGCTGATGCGCGTGCTGCAGCGCCCCGAGTTTCCCGCCGGCGTGGTGAAGCGTGAAAAACAGCGTCTCATCGCGGCGATCCGCGAGGCCGAAGCCGACCCCGGCACCGTGGTCGACAAGGCCTTCTATCGCGCCCTGTACGGCGCACATCCCTATGCGCGCGACGAGGCGGGCGAACCCGACGCCATCGCGCGGCTCACCCGCGCCGATTTGCAGGCGTTCCACCGCACCCATTACACCGCGGCCAACGCCGTGATCGCCTTGATGGGCGACGTCGATCGCGCGGAAGCCGAAGCCATCGCCGCGCGCCTGGCGGCCGGATTGCCGCGCGGCCCCGTGCTCGCGCCGCTGACCGCGCCGGTCGCGCCCGCCGCAGGCGAGACGCGCATCGCGCATCCTTCCGCGCAGAGTCACGTGCGCATCGGCGCGCTCGGCACGACGCGCGACGATCCCGACTTCTTCGCGCTGTTCGTCGGCAACTACGTGCTCGGCGGCGGCGGCTTCGATTCGCGGCTGCTGAAGGAAGTGCGCGACAAGCGTGGCTACGCCTACAGCGCGTACAGCTATTTCCTGCCGATGGCGGTGGCCGGCCCGTTCCAGATCGGGTTGCAGACCCAGGGCGCGCAGACCGCCGACGCGCTCGCTGTGGCGCGCGACACGCTGCGCCGCTTCGTCGCCGAGGGGCCGTCCGCGGACGAACTGGCGCAGGCCAGGGCCAATCTCACCGGCGGCTTCCCGCTGCGCATCGACAGCAACAAAAAGATTCTCGAATACCTGTCGATGATCGGCTTCTACGGGCTGCCGCTGGACTATCTCGACACCTGGGTGGACCGCGTCAACGCGGTGGATGTCGCCGCGGTGAAGGCCGCGTTCGCGCGCCGTATCGATCCTGCGCGCATGGTCACGGTCATCGTCGGGGGCGAGGGTGCGCGCTAA
PROTEIN sequence
Length: 436
MSIKRFLAVLLCMLPLTVQAAVTIQHWRTPEGARVYFVESRELPMLDVAVSFPAGSARDPAAKSGLAQLTHTVLDQGAGGLSETAIAHGLADVGAVLSGSFDRDRAAVSLRTLSSAREKTQALDLLMRVLQRPEFPAGVVKREKQRLIAAIREAEADPGTVVDKAFYRALYGAHPYARDEAGEPDAIARLTRADLQAFHRTHYTAANAVIALMGDVDRAEAEAIAARLAAGLPRGPVLAPLTAPVAPAAGETRIAHPSAQSHVRIGALGTTRDDPDFFALFVGNYVLGGGGFDSRLLKEVRDKRGYAYSAYSYFLPMAVAGPFQIGLQTQGAQTADALAVARDTLRRFVAEGPSADELAQARANLTGGFPLRIDSNKKILEYLSMIGFYGLPLDYLDTWVDRVNAVDVAAVKAAFARRIDPARMVTVIVGGEGAR*