ggKbase home page

SCNpilot_expt_1000_bf_scaffold_2577_3

Organism: SCNPILOT_EXPT_1000_BF_Rhizobiales_64_6

near complete RP 42 / 55 MC: 2 BSCG 41 / 51 MC: 4 ASCG 7 / 38
Location: 1181..2692

Top 3 Functional Annotations

Value Algorithm Source
sulfatase; K01130 arylsulfatase [EC:3.1.6.1] similarity KEGG
DB: KEGG
  • Identity: 79.2
  • Coverage: 505.0
  • Bit_score: 849
  • Evalue 4.20e-244
  • rbh
Arylsulfatase A family protein n=1 Tax=Novosphingobium sp. AP12 RepID=J3AJY8_9SPHN similarity UNIREF
DB: UNIREF100
  • Identity: 78.9
  • Coverage: 498.0
  • Bit_score: 853
  • Evalue 1.20e-244
  • rbh
Arylsulfatase A family protein {ECO:0000313|EMBL:EJL32971.1}; TaxID=1144305 species="Bacteria; Proteobacteria; Alphaproteobacteria; Sphingomonadales; Sphingomonadaceae; Novosphingobium.;" source="Novosphingobium sp. AP12.;" similarity UNIPROT
DB: UniProtKB
  • Identity: 78.9
  • Coverage: 498.0
  • Bit_score: 853
  • Evalue 1.70e-244

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Novosphingobium sp. AP12 → Novosphingobium → Sphingomonadales → Alphaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 1512
ATGGCCAAGGCGAGCAAGCCGAACATCCTTTTCATCATGAGCGACGACATCGGTTGGTTCAACATTAGCTGCAACAATAACGGCGTTATGGGCTATCGCACGCCCAACATCGACCGCATCGCCAAAGAGGGCGCGAACTTTACGGACTTCTACGGGCAGCAGAGCTGCACGGCCGGCCGCGCAGCCTTCATCACCGGCCAATCGCCGATCCGCACGGGCCTGACGAAGGTCGGCATGCCGGGAGCGACGCTCGGACTGAAGGCAGAGGATCCGACCGTCGCGCAGTTCCTCAAGAACTTCGGCTATGCAACGGGCCAGTTCGGCAAGAACCATCTCGGCGACCGCAACGAGCATCTGCCGACCGTGCACGGTTTCGATGAGTTCTTCGGCAATCTCTATCATCTCAATGCGGAAGAGGAGCCGGAGTATCCCGACTATCCGAAGGATCCGAATTTCCGCAAGAAGTTCGGGCCGCGCGGCGTCCTCAAATGCAAGGCGACCGACAAGGACGACACGACGGTCGATCCGGTCTTCGGCAAGGTCGGCAAGCAGACGATCGAGAATACCGGTCCGCTGACGCGCAAGCGCATGGAGACGGTGGACGAGGAGTTCATCGCCGCGGCGCTCGACTTCATGGAGCGCAAGACCAAGGAGGGCGGACCCTGGTTCTGCTACGTCAACACGACGCGCATGCACGTCTTCACGCACCTGAAGCCGTCATCCGTCGGCAAGACGGGGCACGGTCTCTATCCCGATGGCATGGTCGAGCTGGATGGCTATGTCGGCCAGCTTCTCAAGAAGCTCGATGATCTCGGGGTCGCCGACGACACCGTCGTCGTGTTCACGACGGACAACGGCGCGGAGGTCATGTCGTGGCCGGACGGCGGTACGACCCCGTTCCGCGGCGAGAAGGACACCAACTGGGAGGGCGGTTGGCGGGTGCCTTGCGTGATGCGCTGGCCGGGCGTGATCGAGCCGGGCCGCGTGATCAACGACATCTGCTCATTGCAGGACTTCATCCCGACGTTCGCTGCGGCAGCCGGCGAGCCGAACCTCGTTGAAAAGGCGAAGAAGGGCTACAAGGCCGACGGCGCGACGTTCAAGGTTCACCTGGACGGATACAACCTCATGCCCTTCCTCTCCGGCAAGGAGAAGAAGTGTCCGCGCGAGGGCTTCCTCTACTGGAGCGATGACGGCGACCTGATGGCGCTGCGCGCGCATCAATACAAGATCGTGTTCGCCGAGCAGCGCGCAACGGGCATCGATGTCTGGCGCGAGGAGCTGTCGCGCTTGCGCATTCCGAAGATCTTCGATCTTCGCGCCGATCCGTTCGAGCGTGGCGAGGAGGGCTTCAAGTACAATGACTGGTTCGTCGAGCACATTCCATTTCAGTACGCCGCGCAGGCGATCGTCCACGAGTGGCTCGAGAGCTTCAAGGAGTTTCCCCCACGCCAGAAGGCCGCGAGCTTCACCATCGACCAGATCGTCGAGAAGCTCATGCCGAAGGATTAG
PROTEIN sequence
Length: 504
MAKASKPNILFIMSDDIGWFNISCNNNGVMGYRTPNIDRIAKEGANFTDFYGQQSCTAGRAAFITGQSPIRTGLTKVGMPGATLGLKAEDPTVAQFLKNFGYATGQFGKNHLGDRNEHLPTVHGFDEFFGNLYHLNAEEEPEYPDYPKDPNFRKKFGPRGVLKCKATDKDDTTVDPVFGKVGKQTIENTGPLTRKRMETVDEEFIAAALDFMERKTKEGGPWFCYVNTTRMHVFTHLKPSSVGKTGHGLYPDGMVELDGYVGQLLKKLDDLGVADDTVVVFTTDNGAEVMSWPDGGTTPFRGEKDTNWEGGWRVPCVMRWPGVIEPGRVINDICSLQDFIPTFAAAAGEPNLVEKAKKGYKADGATFKVHLDGYNLMPFLSGKEKKCPREGFLYWSDDGDLMALRAHQYKIVFAEQRATGIDVWREELSRLRIPKIFDLRADPFERGEEGFKYNDWFVEHIPFQYAAQAIVHEWLESFKEFPPRQKAASFTIDQIVEKLMPKD*