ggKbase home page

scnpilot_solids2_trim150_scaffold_112_curated_227

Organism: solids_Rhizobiales_1

near complete RP 51 / 55 MC: 2 BSCG 51 / 51 ASCG 11 / 38
Location: 248330..249703

Top 3 Functional Annotations

Value Algorithm Source
hypothetical protein n=1 Tax=Candidatus Poribacteria sp. WGA-4E RepID=UPI0003604C03 similarity UNIREF
DB: UNIREF100
  • Identity: 57.0
  • Coverage: 446.0
  • Bit_score: 544
  • Evalue 8.30e-152
Arylsulfatase {ECO:0000313|EMBL:KIL38786.1}; TaxID=1590651 species="Bacteria; Firmicutes; Bacilli; Bacillales; Paenibacillaceae; Paenibacillus.;" source="Paenibacillus sp. VKM B-2647.;" similarity UNIPROT
DB: UniProtKB
  • Identity: 59.2
  • Coverage: 439.0
  • Bit_score: 559
  • Evalue 4.60e-156
Arylsulfatase A and related enzymes similarity KEGG
DB: KEGG
  • Identity: 58.7
  • Coverage: 436.0
  • Bit_score: 538
  • Evalue 2.40e-150

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Paenibacillus sp. VKM B-2647 → Paenibacillus → Bacillales → Bacilli → Firmicutes → Bacteria

Sequences

DNA sequence
Length: 1374
ATGAAAAAACCGAACGTCATTATTTTCTTCACTGATCAGCAGCGCCACGACACGACGGGCGTTCACGGCAATCCGATGGGGCTGACGCCTAATTTCGACAGGCTCGCAAGAGCCGGAACCCATCTCTACAATTCCTTCACCAGCCAGCCGCTGTGCGGCCCGGCGCGGGCGTCCATGCAGACAGGACGCTTCGCCACCCAGACAGGTGTCTTCCGCAACGAGATTCCGCTGCCGGACAATGCGGAGACGATGGCCAAGATCTTTGCCCACAACGGCTACGAGACCGCCTATATCGGAAAATGGCACCTGGCCGGAAAGGACCCCGTGCCAAAGCACCTGCGTGGCGGGTACCAGAACTGGCTGGCGGCCAACCATCTCGAATTCGTCTCCGACGCCTACGATGCCGTCGTCTTCGATGACGATAACAACAAGCGCAAGCTACCCGGCTACCGCGTCGATGCGCTGACGGATGCCGCCATCCGCTTCTGCGATACGCATCAGGACGACCCTTTCTTTCTTTTCGTTTCCTTTCTGGAGCCGCATCATCAGAATCATGTGGACGACTATCCGCCTCCCGATGGCTATCGCGAGGCTTTGACGGCGACCCAATGGACGCCGCCGGATCTCGCGACGCTCAAGGGTACCAGCACATGGCACCTCGGCGGCTATTACGGGATGGTCAAACGTCTCGACGAGGCATTCGGCCGGCTGATGGATGCCTTGAAGAGCCTGGACATGATCGACAACACGATCGTCATGTTCACCTCCGACCATGCGTGCCATTTCAAGACCCGCAACGACGAGTACAAGCGCTCTTGCCACGAAAGTGCGGTACGCGTGCCGAGCATGATCACGGGACCCGGCTTCAACGGCGGCGGGCAAGTACGGGCCATGTTCAGCACTATTGATGTCGCACCGACCCTGCTCGATGCATCGGGACTCGACGTGCCCGAAAGCATGGTCGGCAAGTCCATCATGCCCGTCATTCGAGATTCCCGCACGCCATGGCGGCAGGATCTGTTCTTCCAGATCAGCGAAACCGAGACCGGCCGGGCGTTGCGGACGCACCGCTGGAAATACGGCGTCACGTCGGAATATCACGAGGATGCGCCACGTTCCGACGTGTATCGGGAATGCTATCTCTACGATCTGGACAGCGACCCTTACGAGATGGTCAACCTGATAGGCATGGGCGTCTTCCGGAGCCTCTGCGACGATCTGAAGAAGAGGCTACTGGGATGGATTGCCGAGATCGAGGACGGGCACCGCCCGCAGGTTCTCGACGCCCCCGAGCGGGAGTCTCGGCAGTATCGACATCACCCGGCCGATCTCCTCGAACTGCAGAAGAGCAGGGAGAATGAAAAACAGGCCTGA
PROTEIN sequence
Length: 458
MKKPNVIIFFTDQQRHDTTGVHGNPMGLTPNFDRLARAGTHLYNSFTSQPLCGPARASMQTGRFATQTGVFRNEIPLPDNAETMAKIFAHNGYETAYIGKWHLAGKDPVPKHLRGGYQNWLAANHLEFVSDAYDAVVFDDDNNKRKLPGYRVDALTDAAIRFCDTHQDDPFFLFVSFLEPHHQNHVDDYPPPDGYREALTATQWTPPDLATLKGTSTWHLGGYYGMVKRLDEAFGRLMDALKSLDMIDNTIVMFTSDHACHFKTRNDEYKRSCHESAVRVPSMITGPGFNGGGQVRAMFSTIDVAPTLLDASGLDVPESMVGKSIMPVIRDSRTPWRQDLFFQISETETGRALRTHRWKYGVTSEYHEDAPRSDVYRECYLYDLDSDPYEMVNLIGMGVFRSLCDDLKKRLLGWIAEIEDGHRPQVLDAPERESRQYRHHPADLLELQKSRENEKQA*