ggKbase home page

SCNpilot_expt_1000_bf_scaffold_677_curated_19

Organism: scnpilot_dereplicated_Thiobacillus_5

near complete RP 51 / 55 BSCG 51 / 51 ASCG 12 / 38 MC: 1
Location: comp(20047..20982)

Top 3 Functional Annotations

Value Algorithm Source
Putative Sulfotransferase-like protein n=1 Tax=Candidatus Microthrix parvicella RN1 RepID=R4YYG2_9ACTN similarity UNIREF
DB: UNIREF100
  • Identity: 28.7
  • Coverage: 331.0
  • Bit_score: 88
  • Evalue 1.10e-14
Uncharacterized protein {ECO:0000313|EMBL:KHD06383.1}; TaxID=1003181 species="Bacteria; Proteobacteria; Gammaproteobacteria; Thiotrichales; Thiotrichaceae; Thiomargarita.;" source="Candidatus Thiomargarita nelsonii.;" similarity UNIPROT
DB: UniProtKB
  • Identity: 37.6
  • Coverage: 311.0
  • Bit_score: 200
  • Evalue 4.80e-48
sulfotransferase similarity KEGG
DB: KEGG
  • Identity: 32.5
  • Coverage: 154.0
  • Bit_score: 69
  • Evalue 2.90e-09

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Candidatus Thiomargarita nelsonii → Thiomargarita → Thiotrichales → Gammaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 936
ATGACGACATGCCTCTACATCTGCGGCGCCGGACATTCCGGTTCGACCCTGCTCGACATGCTGCTCGGCAGCCATTCCCGCATCGCTTCGCTGGGCGAGATCATCAACCTGCCGATGGACTGGGCCACCAACAACCGTTGCACCTGCGGGGCGAATGTGCGCGACTGCAGCGTCTGGTCGAACGTGGTGCACGAGTTCGAGCGCCGGGACGGTATCGACGTGACGGCCAACCCGTATTCGCTCAACCTTGGGCCGATTTATGCGGTTGTGGGCGACAGCAAGGTGACAGGGCGGGCGTACCGCCTCCGCCGTCGGCTGGGAAGCGGTCTGCATTACCTCGAACTGAAGGCCGGTGCCTTTGGCGCCTTGCGGCCGCTGTTGCCCACCATCTACCGCGGCCTGGAAAACAATCTCTATCTGTACGACCTCGTCGGCAAGCAACTCCGCGTGGATTTCGTGGTGGACTCGTCGAAGGTGTACGCAAAGGCCGTCGGCCTTTACAAGATGGCGCCAGACCGGGTCAAGATCATCCTGCTGGTCCGTGATGGCCGAGCGGTGTATTACTCCATGCGCAAACGGAATTTCGACCGCACGTTGAGCCTCAACTCCTGGTACACACATTTCCGGCGCGCCTTGCCCTTGATCGAGAAGCACGTCGATGCCAAGGATGTTTTGACGGTCAAGTATGAAGACCTCGCCAGCGATCCCGCGAAGGAAATGCAGCGCATCTGTGCATTCGCGGGCATCGGCTACGAACCGGGAATGCTCGACTTCAAGTCCAAAGTACATCACAACGTCAACGGCAATGACATGCGATTCTCCACCGCTTCTGAAATCCGCTTGGACACGTCCTGGGTGACCAAGCTCGCCGATGAGGAAAAGAAATTCTTTCTGCAACGTGCAGGGTGGCTGAACGAGAAGCTGGGCTATCGCTGA
PROTEIN sequence
Length: 312
MTTCLYICGAGHSGSTLLDMLLGSHSRIASLGEIINLPMDWATNNRCTCGANVRDCSVWSNVVHEFERRDGIDVTANPYSLNLGPIYAVVGDSKVTGRAYRLRRRLGSGLHYLELKAGAFGALRPLLPTIYRGLENNLYLYDLVGKQLRVDFVVDSSKVYAKAVGLYKMAPDRVKIILLVRDGRAVYYSMRKRNFDRTLSLNSWYTHFRRALPLIEKHVDAKDVLTVKYEDLASDPAKEMQRICAFAGIGYEPGMLDFKSKVHHNVNGNDMRFSTASEIRLDTSWVTKLADEEKKFFLQRAGWLNEKLGYR*