SCNpilot_expt_1000_bf_scaffold_1068_curated_7

Organism: scnpilot_dereplicated_Rhizobiales_12

near complete
  • RP: 44 / 55 (MC: 2)
  • BSCG: 46 / 51 (MC: 5)
  • ASCG: 10 / 38 (MC: 2)
Location: comp(6733..8262)
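
The location string can be checked against the sequence lengths reported further down the page. The sketch below assumes that `comp(a..b)` denotes a feature on the complementary strand with 1-based, inclusive coordinates, as in GenBank-style notation; `feature_length` is a hypothetical helper and not part of ggKbase.

```python
import re

def feature_length(location):
    """Return (nucleotide_length, strand) for 'a..b' or 'comp(a..b)'.

    Assumes 1-based, inclusive coordinates (an assumption about the notation).
    """
    text = location.strip()
    match = re.fullmatch(r"comp\((\d+)\.\.(\d+)\)", text)
    strand = "-"
    if match is None:
        # Fall back to a plain forward-strand range.
        match = re.fullmatch(r"(\d+)\.\.(\d+)", text)
        strand = "+"
    if match is None:
        raise ValueError(f"unsupported location: {location!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1, strand

length, strand = feature_length("comp(6733..8262)")
print(length, strand, length // 3)  # prints: 1530 - 510
```

Under these assumptions the span is 1530 nucleotides, or 510 codons, which agrees with the DNA length of 1530 and the protein length of 510 listed under Sequences (the protein length counts the terminal stop `*`).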

Top 3 Functional Annotations

Value: Choline sulfatase {ECO:0000313|EMBL:CCE95139.1}; EC=3.1.6.6 {ECO:0000313|EMBL:CCE95139.1}; TaxID=1117943; species="Bacteria; Proteobacteria; Alphaproteobacteria; Rhizobiales; Rhizobiaceae; Sinorhizobium/Ensifer group; Sinorhizobium"; source="Rhizobium fredii (strain HH103) (Sinorhizobium fredii)"
Algorithm: similarity
Source: UNIPROT
DB: UniProtKB
  • Identity: 87.8
  • Coverage: 509.0
  • Bit_score: 950
  • Evalue: 1.00e-273

Value: Choline sulfatase n=1 Tax=Rhizobium fredii (strain HH103) RepID=G9A249_RHIFH
Algorithm: similarity
Source: UNIREF
DB: UNIREF100
  • Identity: 87.8
  • Coverage: 509.0
  • Bit_score: 950
  • Evalue: 7.30e-274

Value: betC; choline sulfatase
Algorithm: similarity
Source: KEGG
DB: KEGG
  • Identity: 87.8
  • Coverage: 509.0
  • Bit_score: 950
  • Evalue: 2.30e-274

Taxonomy

Sinorhizobium fredii → Sinorhizobium → Rhizobiales → Alphaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 1530
TTGAGCCACGTAAAACCCAATATCCTGATCGTCATGGTCGACCAGTTCAACGGCACGTTCTTCCCGGACGGGCCGGCGGATTTTCTGCATGCCCCGCATCTGAAAGCGCTGGCGGCCCGCTCGGCACGGTTTGCCAACAATTATACCTCGTCGCCGCTCTGCGCGCCGGCGCGCGCCTCGTTCATGGCCGGCCAGTTGCCCAGCCGCACCCAGGTCTATGACAACGCCGCCGAATATGTCTCCTCGATCCCCACCTATGCGCATCACCTGCGCCGCGCCGGCTATTACACGGCGCTGTCGGGCAAGATGCATTTCGTCGGGCCGGACCAGTTGCACGGTTTCGAGGAACGGCTGACCACCGACATTTATCCGGCCGATTTCGGCTGGACGCCGGATTATCGCAAGCCCGGCGAACGCATCGACTGGTGGTATCACAATCTGGGTTCGGTCACCGGCGCCGGTGTCGCCGAAATCACCAACCAGATGGAATATGACGACGAGGTCGCCTTCCTCGCCAACCAGAAGCTCTACCATCTCTCGCGCGAGAACGACGATCAGGGCCGCCGCCCCTGGTGCCTGACCGTCTCCTTCACCCATCCGCACGACCCCTATGTCGCGCGGCGCAAATACTGGGATCTTTACGAAAACTGCGAGCATCTGCTGCCGGAAGTCGGCGCGATGCCGCTGGAACAGCAGGATCCGCATTCGCAGCGCATCATCTTCTCCTGCGACTACAAGAATTTCGACGTCACCGAAGAGGATATCCGCCGTTCGCGCCGGGCCTATTTCGCCAACATCTCCTATCTCGACGACAAGGTCGGCGAACTCGTCGACACGCTGACGCGCACGCGCATGCTCGACAACACCTATATCCTGTTCTGCTCCGACCATGGCGACATGCTCGGCGAGCGCGGATTGTGGTTCAAGATGAATTTTTTCGAGGGCTCGGCGCGCGTGCCGCTGATGGTCGCCGGCCCCGGAATACCCCCCGGCCTGCACACGACGCCGACGTCCAACCTCGACGTGACGCCGACGCTGGCCGATCTCGCCGGCATTTCCATGGACGAGGTCAAGCCGTGGACGGACGGCATCAGCCTCGTGCCGATGATCGACGGCGTCGAGCGCACCGAGCCGGTGCTCATGGAATATGCGGCGGAGGCCTCCTATGCGCCGCTGGTCGGCATTCGCGAGGGCAAGTGGAAATATATCCATTGCGAACTCGATCCCGAGCAACTTTACGACCTCGACGCCGATCCGAAGGAATTGACCAACCTCGCGACCGACCCGGCTCATGCCGAAACGCTCGGGCGTTTCCGCGCCAAGCGCGAAGCGCGCTGGGACATGAAAGCCTTCGATTCCGCCGTGCGCGAAAGCCAGGCGCGGCGCTGGGTGGTCTATGAGGCGCTGCGCAACGGCGCCTATTACCCCTGGGACCACCAGCCGCTGGCCCGCGCCTCCGAGCGCTACATGCGCAACCACATGAACCTCGACAATCTCGAAGAATCCAAACGCTATCCGCGAGGAGAATAA
Protein sequence
Length: 510
LSHVKPNILIVMVDQFNGTFFPDGPADFLHAPHLKALAARSARFANNYTSSPLCAPARASFMAGQLPSRTQVYDNAAEYVSSIPTYAHHLRRAGYYTALSGKMHFVGPDQLHGFEERLTTDIYPADFGWTPDYRKPGERIDWWYHNLGSVTGAGVAEITNQMEYDDEVAFLANQKLYHLSRENDDQGRRPWCLTVSFTHPHDPYVARRKYWDLYENCEHLLPEVGAMPLEQQDPHSQRIIFSCDYKNFDVTEEDIRRSRRAYFANISYLDDKVGELVDTLTRTRMLDNTYILFCSDHGDMLGERGLWFKMNFFEGSARVPLMVAGPGIPPGLHTTPTSNLDVTPTLADLAGISMDEVKPWTDGISLVPMIDGVERTEPVLMEYAAEASYAPLVGIREGKWKYIHCELDPEQLYDLDADPKELTNLATDPAHAETLGRFRAKREARWDMKAFDSAVRESQARRWVVYEALRNGAYYPWDHQPLARASERYMRNHMNLDNLEESKRYPRGE*
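
The protein sequence above begins with L, not M, because the page shows a literal codon-by-codon translation of the DNA, whose first codon is TTG. In bacteria, TTG can serve as an alternative start codon, and at the initiation position it is decoded as methionine. The sketch below illustrates the literal translation on the first and last codons of the DNA sequence, assuming the standard genetic code; `translate` is a hypothetical helper written for this example.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate DNA codon by codon, with no special handling of the start codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))

# First ten codons and last four codons of the DNA sequence on this page.
print(translate("TTGAGCCACGTAAAACCCAATATCCTGATC"))  # prints: LSHVKPNILI
print(translate("CGAGGAGAATAA"))                    # prints: RGE*
```

Both fragments reproduce the corresponding ends of the listed protein sequence, including the terminal `*` for the TAA stop codon.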