GD18-4_B1_scaffold_162600_12

Organism: GD2018-4_B1_QB3_180703_Deltaproteobacteria_71_13

near complete; RP: 44 / 55 (MC: 2); BSCG: 49 / 51 (MC: 3); ASCG: 12 / 38
Location: comp(10811..13777)
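The location can be cross-checked against the sequence lengths reported further down in this record. The sketch below assumes that `comp(...)` marks a feature on the complementary strand and that the coordinates are 1-based and inclusive; the helper names `parse_location` and `feature_length` are hypothetical and not part of ggKbase.

```python
import re


def parse_location(location: str) -> tuple[int, int, str]:
    """Parse 'start..end' or 'comp(start..end)' into (start, end, strand)."""
    match = re.fullmatch(r"(comp\()?(\d+)\.\.(\d+)(\))?", location.strip())
    # Reject strings that do not match or that have unbalanced parentheses.
    if match is None or bool(match.group(1)) != bool(match.group(4)):
        raise ValueError(f"unrecognized location: {location!r}")
    start, end = int(match.group(2)), int(match.group(3))
    strand = "-" if match.group(1) else "+"  # assumption: comp = minus strand
    return start, end, strand


def feature_length(location: str) -> int:
    """Length in nucleotides, assuming 1-based inclusive coordinates."""
    start, end, _ = parse_location(location)
    return end - start + 1


length_nt = feature_length("comp(10811..13777)")
codons = length_nt // 3
print(length_nt, length_nt % 3, codons)  # prints: 2967 0 989
```

Under these assumptions the span is 2967 nucleotides, which matches the reported DNA length, and it divides evenly into 989 codons. That agrees with the reported protein length of 989 if the terminal `*` (stop) is counted.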

Top 3 Functional Annotations

Value: Uncharacterized protein n=1 Tax=Sorangium cellulosum So0157-2 RepID=S4XUK2_SORCE
Algorithm: similarity
Source: UNIREF
DB: UNIREF100
  • Identity: 32.4
  • Coverage: 895.0
  • Bit_score: 357
  • Evalue: 2.40e-95

Value: General secretion pathway protein E {ECO:0000313|EMBL:AKF09042.1}; TaxID=927083 species="Bacteria; Proteobacteria; Deltaproteobacteria; Myxococcales; Sorangiineae; Sandaracinaceae; Sandaracinus.;" source="Sandaracinus amylolyticus.;"
Algorithm: similarity
Source: UNIPROT
DB: UniProtKB
  • Identity: 30.7
  • Coverage: 1078.0
  • Bit_score: 380
  • Evalue: 4.80e-102

Value: hypothetical protein
Algorithm: similarity
Source: KEGG
DB: KEGG
  • Identity: 32.4
  • Coverage: 895.0
  • Bit_score: 357
  • Evalue: 6.70e-96

Taxonomy

Sandaracinus amylolyticus → Sandaracinus → Myxococcales → Deltaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 2967
ATGGCCACGGTCAGCTCTCTCGTCGTTCGACGCCGCCTGGCCACGATGCGTCAGGTCGAGGAGGCGCTCGCCCGCCAGACGCTCTACGGCGGCGACGTCGTCACGAACCTGCTCGACGTCGCGCCCCCGCCCGAGACCGACGAGGCCGCGCTCACGGCGGCGCTCGCCGAGGCGATGGGCCTGCCGGCGGCGCCGCTCGGCGCGCTGCCCGAGCCCGACAGCGCCGTGGTCGCGCGCGTTGGCCGCGGCCTGGCCGAGCAGCACGGGTTCGTCCCCATCGCCATCCGCGGCGGCGCCATCGTCATCGCGCTCGCCGAGCCGCTCGCCAAGAGCGCGGTCGAGGAGATCGAGAGCGCCCTCGGCGCGTCGATCGTGCAGCAGGTGGCGCCGATGGTCCGGCTGCAGCAGGCGCTCGAGCACATCTACGGCGTGCCGATGGATCGACGCGTGCGCCGGCTGGCCGCGATCCTCGACGGCGAGAGCCCGGCCAACACGTCGGTGCCGCCGCCGCGCCGCTCTTTGATCCCCGACGTCTCCGACGCGCCTGCGCACGCGCCGCCGCGGGAGCCCGCGAAGGAGGCCGAGCGGCCGCCGGAGCCGCCGGCGCAGCAGCCGGCCGACGCGCCGGCCGTGCAGGAGGCGGCCGGAGCGCCGGCCGAACCGACGCCGACGCCCACTCCGAGCGTCAGCGCCTCGCGAGGTCCGCCGCCGACCACCATCCGCTCGGCGCGCGCGGGCACCGACGCGCTCCGCTCGTTCGTGAAGGACGCGCGCGGAGAGCGCTCGTCCGGCGCGAGCAAGCGCCGTCGCGGCCCGTTCACGCGCGCCGACGCCGAGAAGGTGTTCGCCGATCCGCAGTCGACCGACGCGGTCCTCGGCGCCGCGCTGGAGTTCGCGCAGCAGTGGTTCGCCTACGTGGCGATGTTCCTCGTCCACGACGATCTCGCCGAAGGATGGGACGCCGGCGGCGGCGGCGCGTCCGGCGATCGCCTGCGGAAGATGGGTGTTCCGCTCGATCTGCCGAGCCTCTTCGCCGAGGCGCGCGAGACGCGCGCGTCGATCGTCCGCGCGCGCCCGCACGACGGCCTCGACGCCGTGATCGCCGCCGACCTCGAGCGGCCGCTCGACGGCGACGTCGTCGTCGCGCCGGTGGTCGTCGGCAAGCGCGTGGTGGCGCTGCTCTACGCCGACGACGCCGGCGAGCCGATCTCTCCCGTCGACGTCGCCGAGGTGTTCGCGGTCGTCGCGCAGGCCGGTGCCGCGCTGGCGCGACGCATCCTCCGCAAGAAGGGCGTCCCGAACGTGCCGTTGCCCAAGCGCGAGTCGGCGCCGCCGCGCGAGATCGATCCCGACGTCGTCGCGCAGCGCGCGGGCGTGCTCGCGAAGGCGCTCGCCGGCGCGCGTGCGTCGAGCCGCCCGCCGCCCGAGCGAGCGCCCGCGCCCGAGCCGCGTGTGCGCGCCGCGTCGCTCGAGCCGATCGTGCGGCCGACGACGCCCACCGGCATCATGCGCGACGAGCCGACGCCGACCCCTCTCGGCGCGTTGCCCGCGGCCGCCGCGACGGGGGCGCCGGCCGCGCCCGCCAAGCGCTCCGAGCCCCCGCGGTCCGCGCCCGACGCGCCGACCGGGATCGCCCTCGAGCGCACGCCGTCGAGCGACGACATCCCCGGCCTCCGGCTCCTCGAGTCGTCCGAACCCCAGCCGCTCGCGCTGCCGACCGGCTCGGGCGAGCTCGCCGCGCCCGCGACCCCGCCGACCCCGCCGACGCCTGTCGCGCGGCCGATGCCCGCCGCGCCCGTCGAGCTCGTCACGCGTCAGGCGCCGCCGCCCAACCTCACCGGTCGAAAGCAGCTCGGGCCGCCGATCCCGCGCGAGGAGCCCGAAGAGCGTAGGAGGGCGGGGCAGACCAGCGAGCCGGAGCTGATCGAGAGCGCCGAGGTCAGCGACGACGAGGTCGAAGAGCTGCTCGAGCTCGATCGCGGACGAAAGCCCCTGCGCCC
GAGCGAGCGCTTCGAGGTGTACACCGCGCGCGAGCCGCCGCGGCCCACGCGCCGCTCGGCCGAGCACGAGCTCCCGAAGGTGATCGTCGCGATCGAGTCCGAGCACGTCGCGCTGGTCGGCAAGGCGATCCGGGGCAGCGCCGCCGCCGAAGAAGCGGCCGAGAAGCTGCGAGCGCTCGGCGTCGCCGCGCTCCCGGCGATCATGGACCGGTTCCCCGGGCCCACGCGCGTCGACCGCACCACGCCGATCACGCAGGTCCCGCGGCCCTCCGACGCAGGACCGTTGCTGTCGCTGCTCTGCGCCCTCGGCCGCCTCGGGCTGCGCGACGTCCTCGCGCGCACCGGCGATCCGCTGCCCGAGACGCGCTTCTGGGCGACTTGGCTGCTCACGGAGGTGGTCGACGCCGAGTCGGCGCCGCTCCTGGTTCCGCGCGTCGTCGACGACGATCTCGCGGTGCGGCGCGCCGCGTGGGCGGCGTCGCGCGCGCTGCTCGAGGCGGAGCCGGCGACCGCCGACATCCTCATCGAGCCGCTGGTCGGCGTCATCCTCGATCCCGGCGGCGGCATGTCGCTGCGCATCCGCTCGGCGAACGCGCTCGGCGAGCTGCGCGATCGACGCGCCGTCGAGGGCCTGGTCTTCGGCGTCGACGTGCGCGAGGCCGAGCTGGCGAGCGCCTGCCACGAGGCGCTCGTCACGATCACCCGCGCCGATCCGCCGTCGCGCGGGCAGACCTGGGCGAGCTGGCTCGCGCAGCACGGCGCCGAGTCGCGCATCGAGTGGCTGATCGACGCGCTCCTGTCCGAGAGCGCCACCCTGCGCGAGGCCGCGGGCGCCGAGCTGAAAGCGACGACCAAGGTTTACGTCGGCTACTACGCGAACCTCCCGCGGGCCGAGCGCGAGGAGGCGTGGCGCCGCTACCGCGCGTGGTGGAAGGAAGAGGGCTCCCGGAAGTTCGCGGGTCGCTGA
Protein sequence
Length: 989
MATVSSLVVRRRLATMRQVEEALARQTLYGGDVVTNLLDVAPPPETDEAALTAALAEAMGLPAAPLGALPEPDSAVVARVGRGLAEQHGFVPIAIRGGAIVIALAEPLAKSAVEEIESALGASIVQQVAPMVRLQQALEHIYGVPMDRRVRRLAAILDGESPANTSVPPPRRSLIPDVSDAPAHAPPREPAKEAERPPEPPAQQPADAPAVQEAAGAPAEPTPTPTPSVSASRGPPPTTIRSARAGTDALRSFVKDARGERSSGASKRRRGPFTRADAEKVFADPQSTDAVLGAALEFAQQWFAYVAMFLVHDDLAEGWDAGGGGASGDRLRKMGVPLDLPSLFAEARETRASIVRARPHDGLDAVIAADLERPLDGDVVVAPVVVGKRVVALLYADDAGEPISPVDVAEVFAVVAQAGAALARRILRKKGVPNVPLPKRESAPPREIDPDVVAQRAGVLAKALAGARASSRPPPERAPAPEPRVRAASLEPIVRPTTPTGIMRDEPTPTPLGALPAAAATGAPAAPAKRSEPPRSAPDAPTGIALERTPSSDDIPGLRLLESSEPQPLALPTGSGELAAPATPPTPPTPVARPMPAAPVELVTRQAPPPNLTGRKQLGPPIPREEPEERRRAGQTSEPELIESAEVSDDEVEELLELDRGRKPLRPSERFEVYTAREPPRPTRRSAEHELPKVIVAIESEHVALVGKAIRGSAAAEEAAEKLRALGVAALPAIMDRFPGPTRVDRTTPITQVPRPSDAGPLLSLLCALGRLGLRDVLARTGDPLPETRFWATWLLTEVVDAESAPLLVPRVVDDDLAVRRAAWAASRALLEAEPATADILIEPLVGVILDPGGGMSLRIRSANALGELRDRRAVEGLVFGVDVREAELASACHEALVTITRADPPSRGQTWASWLAQHGAESRIEWLIDALLSESATLREAAGAELKATTKVYVGYYANLPRAEREEAWRRYRAWWKEEGSRKFAGR*