L3_105_000G1_scaffold_1094_23

Organism: dasL3_105_000G1_metabat_metabat_7_fa_fa

near complete; RP 49 / 55; BSCG 51 / 51, MC: 2; ASCG 13 / 38
Location: comp(25504..26973)
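
The location string can be cross-checked against the sequence lengths listed under Sequences. The following is a minimal sketch, assuming that `comp(...)` marks the complementary strand and that coordinates are 1-based and inclusive (a common convention for feature locations, not stated on this page). `parse_location` and `span_length` are hypothetical helpers, not part of ggKbase.

```python
import re

def parse_location(text):
    """Parse 'comp(start..end)' or 'start..end' into (strand, start, end)."""
    text = text.strip()
    match = re.fullmatch(r"comp\((\d+)\.\.(\d+)\)", text)
    if match:
        # Assumption: comp(...) means the feature is on the reverse strand.
        return "-", int(match.group(1)), int(match.group(2))
    match = re.fullmatch(r"(\d+)\.\.(\d+)", text)
    if match:
        return "+", int(match.group(1)), int(match.group(2))
    raise ValueError(f"unrecognized location: {text!r}")

def span_length(start, end):
    """Length in nucleotides, assuming 1-based inclusive coordinates."""
    return end - start + 1

strand, start, end = parse_location("comp(25504..26973)")
length_nt = span_length(start, end)
codons = length_nt // 3  # includes the stop codon if one is present
print(strand, length_nt, codons)
```

Under these assumptions the span is 1470 nt, or 490 codons counting the stop codon, which agrees with the DNA and protein lengths reported under Sequences.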

Top 3 Functional Annotations

Value: Arylsulfatase {ECO:0000313|EMBL:EEP44204.1}; EC=3.1.6.- {ECO:0000313|EMBL:EEP44204.1}; TaxID=521003; species="Bacteria; Actinobacteria; Coriobacteriia; Coriobacteriales; Coriobacteriaceae; Collinsella"; source="Collinsella intestinalis DSM 13280"
Algorithm: similarity
Source: UNIPROT
DB: UniProtKB
  • Identity: 95.9
  • Coverage: 489.0
  • Bit_score: 993
  • Evalue: 5.90e-287

Value: Arylsulfatase n=1 Tax=Collinsella intestinalis DSM 13280 RepID=C4FAC7_9ACTN
Algorithm: similarity
Source: UNIREF
DB: UNIREF100
  • Identity: 95.9
  • Coverage: 489.0
  • Bit_score: 993
  • Evalue: 4.20e-287

Value: sulfatase
Algorithm: similarity
Source: KEGG
DB: KEGG
  • Identity: 56.9
  • Coverage: 485.0
  • Bit_score: 563
  • Evalue: 5.20e-158

Taxonomy

Collinsella intestinalis → Collinsella → Coriobacteriales → Coriobacteriia → Actinobacteria → Bacteria

Sequences

DNA sequence
Length: 1470
ATGGCCAATCGACCCAACGTGCTCCTCATCATGTGCGATCAGATGAGGGGCGATTGTCTGGGTATAGACGGGCACCCGGATGTGAAGACGCCCTACCTCGACACACTCGCCGCCGACGGCATGCTGTTCGAGAACGCCTACTCGGCGTGTCCGAGTTGCATTCCCGCCCGCGCCGCCCTCTTCACGGGTAGGACGCCTGCCGGCACGGGCCGCGTGGGATACAAGGACGGTGTCGCGTGGGAGTACGACCATATGCTCGCGCAGGAGATGCGCGACGGCGGCTACCAGACGGCCGTGGTCGGCAAGATGCACGTGCACCCGCCGCGCCTCGGCTGCGGCTTCGAGCACGTGCGCCTGCACGACGGCTACATCGGGCATTACCGTAAGGCGAACCTGCCCTACTGGATGCACCAGAACGTCTCCGACGACTATGTGCGCTTCCTCAAGGACGAGCTGGGTGAATTCGCCGACGTGAACGGAACCGGCGTGGAGAACAACTCGTGGATCACGCACCCGTGGGCATACGGGGAGCGCCTGCATCCCACCAACTGGGTGGTCGACGAGTCTATCCGCTTCCTCGAGACGCGCGATCGCACGCGCCCGTTCTTCCTCATGACGAGCTTCGTGCGGCCCCATCCGCCCTTCGACGCGCCGCAGACCTACTTCGACCTGTATCGCGATATGGAGTTGCGGGCCCCCGCCGTCGGCGACTGGGACGACGCGGGCGTCACCGAGCGCGACGGCATGATTCTGGACAGCGTGCATGGCTGTCGCGACGCCGAACTGCGTCGCGAGGCCATGGCGGGTTACTATGCCTGCATCACGCATATGGACCACCAGATCGGCCGCCTCATCACCGCGTTGGAGAACGATGAGACCTACCACGACACCGTGGTCGTGTTCTGCTCCGATCATGGCGAGATGCTGTTCGACCACAGCCTGTTCCGCAAGGTTTTACCCTATGAGGGCTCCACGCACATCCCCCTCATCGTCCATGTGGGCAAAAACGTCGAGCTGGCGCGCGGCGAGCGCGTCCGCGGCGTGAGTGAGAGCACTGTCGAGCTCATGGACCTCATGCCCACGATTCTCGAGGCGTGCGGGCTGCCCGTGCCCGAGGGCGTCGAGGGCTCATCGCTTTTGGGGGAGCTCACCGGGGCCGCACCGCTCAACCGTGCTTACCTGCACGGCGAGCACAGCGGCAGCCATGAGCAATCCAACCAGTGGATCGTGACCCCGCACGACAAGTACATCTGGTTCACCCAGACAGGCGTGGAGCAGTACTTCGACTTGGATGCCGATCCGCGCGAGTGCGTGAACCTCATCGACCGTCCGGAGTGCGCCGAGCGCATCGCCGAGCTCCGCGCGCTTTTGATCGAGGAACTCGCCGGCCGCGAAGAGGGGTATGTTAAGGACGGTGAGTTGAGGGTCGGCTGCAAGCCCGCCGTCAATCTCGAGCACCCTCGTCGCTAG
Protein sequence
Length: 490
MANRPNVLLIMCDQMRGDCLGIDGHPDVKTPYLDTLAADGMLFENAYSACPSCIPARAALFTGRTPAGTGRVGYKDGVAWEYDHMLAQEMRDGGYQTAVVGKMHVHPPRLGCGFEHVRLHDGYIGHYRKANLPYWMHQNVSDDYVRFLKDELGEFADVNGTGVENNSWITHPWAYGERLHPTNWVVDESIRFLETRDRTRPFFLMTSFVRPHPPFDAPQTYFDLYRDMELRAPAVGDWDDAGVTERDGMILDSVHGCRDAELRREAMAGYYACITHMDHQIGRLITALENDETYHDTVVVFCSDHGEMLFDHSLFRKVLPYEGSTHIPLIVHVGKNVELARGERVRGVSESTVELMDLMPTILEACGLPVPEGVEGSSLLGELTGAAPLNRAYLHGEHSGSHEQSNQWIVTPHDKYIWFTQTGVEQYFDLDADPRECVNLIDRPECAERIAELRALLIEELAGREEGYVKDGELRVGCKPAVNLEHPRR*
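
The relationship between the DNA and protein sequences above can be spot-checked by translating in frame. The following is a minimal sketch, assuming the standard codon assignments (the page does not say which translation table was used; the bacterial table 11 shares these assignments and differs only in permitted start codons). `translate` is a hypothetical helper, and only the first 24 and last 15 nucleotides of the DNA sequence are used.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard genetic code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the codon -> amino acid lookup from the two strings above.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(dna):
    """Translate an in-frame DNA string; stop codons are written as '*'."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))

# First 24 nt and last 15 nt of the DNA sequence on this page.
prefix = translate("ATGGCCAATCGACCCAACGTGCTC")
suffix = translate("CACCCTCGTCGCTAG")
print(prefix, suffix)
```

The two fragments translate to `MANRPNVL` and `HPRR*`, matching the start and end of the listed protein sequence, with the trailing `*` corresponding to the TAG stop codon.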