ggKbase home page

SCNpilot_expt_1000_bf_scaffold_14544_2

Organism: SCNPILOT_CONT_300_BF_Rhizobiales_59_19

partial RP 30 / 55 MC: 3 BSCG 29 / 51 MC: 2 ASCG 5 / 38
Location: comp(752..2011)

Top 3 Functional Annotations

Value Algorithm Source
hypothetical protein n=1 Tax=Rhizobium leguminosarum RepID=UPI00037C1BBF similarity UNIREF
DB: UNIREF100
  • Identity: 64.3
  • Coverage: 415.0
  • Bit_score: 530
  • Evalue 1.90e-147
  • rbh
extracellular solute-binding protein; K02027 multiple sugar transport system substrate-binding protein similarity KEGG
DB: KEGG
  • Identity: 26.5
  • Coverage: 441.0
  • Bit_score: 138
  • Evalue 3.90e-30
Carbohydrate ABC transporter substrate-binding protein, CUT1 family {ECO:0000313|EMBL:ABC19794.1}; Flags: Precursor;; TaxID=264732 species="Bacteria; Firmicutes; Clostridia; Thermoanaerobacterales; Thermoanaerobacteraceae; Moorella group; Moorella.;" source="Moorella thermoacetica (strain ATCC 39073).;" similarity UNIPROT
DB: UniProtKB
  • Identity: 26.5
  • Coverage: 441.0
  • Bit_score: 138
  • Evalue 1.80e-29

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Moorella thermoacetica → Moorella → Thermoanaerobacterales → Clostridia → Firmicutes → Bacteria

Sequences

DNA sequence
Length: 1260
ATGACCCGTCTGATTTCGGCCACTGCGGTCGCCACCATTTCACTCGCTCTCCTTCCTGCATTTGCAGCGGCAGCCGGAGACCTCCGTTTCGTCTCCGGTCAGGAAAAGCAGAACGGCGCCGTGCTCAAGCAGATCTTCGATACCTATAACGGCACGAAGCCTGCATCTTCTGTGAAGCTTGAGCTCGACAACAAATCCGACCTCGATACGACGCAGAAGGTGCTCGCCGATATCGTGGCGGGCACGACGCCGGACGCCGTGCGCGTAACGGGTGCGGTTCTTCGCCCTTACGTCGATTCTGGCCGCGCACAGCCGCTTGACGATTGCCTGGCATCAGCGCCGGAGCTTGCGGCACAGTTGGACAAGGGTCTGCTCGACGACTTCCGCGTCAACGGCAAGCTCTATGCCATGCCCTGGTATGTCACTTTGCCGGCGCTATTCATCAATACCGACGCCTTCAAGGCAGCCGGCCTCGACCCGGCAAATCCTCCGAAGAATTGGACGGAACTGGAGGCCGCTGCCGCCAAACTCAGCGACAAGGCCAACAACAAGTTCGGCGTGCTGATGTATATGCCCAACACCTACATGTTTGAGGGGCAGCTCGCCTCTGCAGGTGGTGCCATGGTCGGTGCGGACGGCAAGTCCGGTGTCGGCAATGCGGCAGGCGTCGAGGTGATGAGCTATATGCGTGGCCTGGTCGAAAAGGGCTACATGCCAGCCGTTTCGCCGGGCACCTTCTGGGGCGAGGCCGGCCGGATGTTCCAGGCCGGCGAGGTGGCGATGCTCCTGAGTTCGTCCTCTGGCTATACCAGCCTTGTGCCGAAGGCTTCCTTCAAAGTGGCGCTGGCGCCGATGCCGGCCAAGGACGGCGCAACGCCGGTGACCATGGCATCGGCCAACGGCTTTGTCATGTTGGCGACGGACCCTGCCCGCAAGGAAGCAACCTGCAAGGCGCTGCTATCGCTCGTCACTCCCGAAAGCGTCGCCTTGACGGTGAAGGCGACGGCCTCTTCGCCGGTCAACGTGACGACAGTCGCAAAACCGGAGCTGCTTGGCGATTTCTACGCCCAGAACCCCGAGCTCAAGGTCCTCAACACGCAGAAAAGCCAGAATTGGTACACCCTGCCCGGCAAGGCCAACAACGAGTTCCAATCCAATTTCGGCGATACCCAGTACGAAATCCTGACGGGTGCGACCTCGGCTCAGGATGGCATGAAGCGCCTCGCCGGCATCATGGACGATCTGAACGGCGCAAAGTGA
PROTEIN sequence
Length: 420
MTRLISATAVATISLALLPAFAAAAGDLRFVSGQEKQNGAVLKQIFDTYNGTKPASSVKLELDNKSDLDTTQKVLADIVAGTTPDAVRVTGAVLRPYVDSGRAQPLDDCLASAPELAAQLDKGLLDDFRVNGKLYAMPWYVTLPALFINTDAFKAAGLDPANPPKNWTELEAAAAKLSDKANNKFGVLMYMPNTYMFEGQLASAGGAMVGADGKSGVGNAAGVEVMSYMRGLVEKGYMPAVSPGTFWGEAGRMFQAGEVAMLLSSSSGYTSLVPKASFKVALAPMPAKDGATPVTMASANGFVMLATDPARKEATCKALLSLVTPESVALTVKATASSPVNVTTVAKPELLGDFYAQNPELKVLNTQKSQNWYTLPGKANNEFQSNFGDTQYEILTGATSAQDGMKRLAGIMDDLNGAK*