ggKbase home page

M01_scaffold_2_1202_curated_prodigal-single_16

Organism: M01_PHAGE_CU_48_59

RP 0 / 55 BSCG 2 / 51 ASCG 0 / 38
Location: 18304..21510

Top 3 Functional Annotations

Value Algorithm Source
Papain family cysteine protease n=1 Tax=Firmicutes bacterium CAG:313 RepID=R6XR49_9FIRM similarity UNIREF
DB: UNIREF100
  • Identity: 27.0
  • Coverage: 610.0
  • Bit_score: 206
  • Evalue 7.30e-50
Papain family cysteine protease {ECO:0000313|EMBL:CDD21529.1}; TaxID=1263017 species="Bacteria; Firmicutes; environmental samples.;" source="Firmicutes bacterium CAG:313.;" similarity UNIPROT
DB: UniProtKB
  • Identity: 27.0
  • Coverage: 610.0
  • Bit_score: 206
  • Evalue 1.00e-49
citrate transporter similarity KEGG
DB: KEGG
  • Identity: 23.9
  • Coverage: 775.0
  • Bit_score: 134
  • Evalue 1.30e-28

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Firmicutes bacterium CAG:313 → Tenericutes → Bacteria

Sequences

DNA sequence
Length: 3207
TTGAGAAAAGCCATTAGAAAGACCGCAGCAGGTAAGATTGCCATGGTAGCTATTGCCGCTGCAATATCATCATCGGCGTTCCTTATCGCTTCACCGTCCATGTCAATGGGTGATACGAGCAGCGATAAGGTCTATTGGTCGAATACGCAGGCTCCGTTCATGTACGGGACGGATGAAGCAACGATACACGCGGGAGAGAAATTCGATATCAAGGACTCAAGATACCGAGTGCAGGCAAGGGACTTCGAGGACGGGGACGTCACATGCGATATCAAGATGGTATCGAACAACGTTCAATCAGATGCACCTGGTGATTATAAGGTGTCGTATTCCGTCACAGATTCTGACGGCAACACTGCAACGATGGACACGGACGTACATGTTATAGCGACAGATTCATCCGATAATGATTGGTACGTCAAGCGCGTATATCAGAATCCAAACACATGGAACACGCGAGAACTGCTACGAATTAATCGCGGGGACTATATGGATCGGCAAATGCTCGGCGTCCATCTCCCAGCAAGCAAATCGGTCGACGTATCGTTCCTTGATTCATCGTCTAACTTGACTGTGACGATTACGGCACCTACAAACGATACGGCGACTGACGGGACGAGGACATCAATATCATCAGGAAACTCTGCGACGATCTCCGCGTCTGATTCTATCGACACGATACCGCTGATTAAGACGCCTATGCTTCCGCGCGGTGCTGGTACACAACAGCAGGATATTCGATTTAAGGTATCGTACTCGATGGACGATGGCGTAGGTCCGGCACATTTTTGGACAGATGGCGATGATGAGAATGTGTTCATCGACTCATGGTCGGCAGATGAGACGCACCCGTTCGCATACATCGAGACGAAAGCGTTCGAGGGGCTTCCAACGTGGAACGATCTCTCGACGATCAAAATTCTCAGCACGAAGCGCGGATGGGTTCCATCGTTAACGGCATGGTCTGAATATTGGGTCGGTGTTATGGATAAATACGATTCGATGCTCGGACTGTCGATCAGAACGAACAATCCGCTCGATCAGCGCGTCAGGACGAAATACTTCTGCCGAGCGAACGTTCACGGAGCAGGCGGTGCGTATTACAACGGCGGGGACCACGTCGGCATCCACAGCGCCAACATGGTATCGATATTCGAATATAATTGGGGCGGGCTGCATGAGGTCGGCCACGGATACCAAGGCTTCATGAACAGCGGCCCGATGTATCTCGCCGAAGTATCCAACAACATATACGGATACCAAGTACAGCATGACAAATCAATCTACAAAGGCAACGGCGAATGGATGAATTTCGACAGTCAGGAGACGAAGCAGAATGCGAAACGACTCGCAGGAGACTTCAAAAAATTCGATGACGTCGACTCTGCAGGTAAGCTTTACGTCATCATGAACGTCTTCAAATCCATCACACCGGATAACCTAGAAACGGCCCATGCAGAGTTTTTCACGTGGGCGCGCAAGAAATCACAGGTAAGCGGAACCACACAGAACGTCGATATGCTATGCCGTTGGCTCGCCGAGGAGCACGGGATTGACGCCTCCCCATTCTTCCAAGCATGGGGCGTCACACTTCCCGACACCACTATATCGTATCTCGATTCGAACGACGCTCTCGATAGGGGACTTATCGCAGGAGATGCTATAACATCAGACGCAGCACGCACCGCCTATCGCGCAGCAAACCATAACGCACCAATCTATTCACTCGCCTCGTCGTCGAACGTCGGGAGATACGCTAGGGGAAACGCGGAGATAACCGTCGAGGGAATCGATGAGGTGAAGAGCCGACTCGAGGGACGGCTCGCCGCACTCATCGACGGTAACGGCAAGACGGAATACGTTGTCGCAACGGCTCCGATACATGACGGAATCGCCGATTTCACCGATATTCCCGCAGGAAACTGGAAAGTCGTTATGCCGGATTTATCAGGTGACGGATTCGAGACAGCAGTATCGACTATCGACGGAAGAATCCATATCGCAGGAGAGAACTTAAAAGCATCATATATCGCCATCTACAACACGAACGGAATCGTCTCCGGATCATCATTCTCGATTGACGGTATCTTCGGGACGTCTGGATTCATGGCAACCGTCGACCCGACTGTATCCAAGCTAGTCGTATCGCTCGGCGGGGCGCGCCTACTCAACTGGGGCGAATCGGCTGACAAGGTCGTCGCTTCCGTCACGCTGAAAGGTAAAGATGGAAGCGTCAAGAAGGAATGGATCGTCAAGGGCGACGGCTATTTCAACGACGCTGGGAACGGAAACGAGAGCGGCGCATTCGACATCGAGGTCGGCGACGCGATAGACGTCTACTTCTCGCAGTACTCACGTGTCAAGTTCTACGGCGCTGACGGGAAGAAGCAGGATGACAACGTCATGACTACTGCGAACCAGACGTACAAGGTTACCTACGGCGGTCTCATAAGGGCTGACAGGGTCGATGAAAAGCCGTACAACGCATACGCCGCAGCGAGGATTGCCGAGATACGTAAGCAGCTTGGAGACGAGGTGCTTGGGAATAGGTTCCTCAAGAGCGTGCTGAAGAACGAGGTAAGCAGACTCTACTCGCTCATGAACTCAGGCGATACCGATATCGACGCTTGGATCAAGAAAGTTCGAAAGGGCGGCATCCCGACATGCGTGACGAGGAGCGTATCAATCGACATCACACCAGAGAACGGAATCGATACTGCAACCATTACCGGCAAAATCGGGCAGCTGAGCGATGCGGAGGACGGGAAGTTCACCGCGACGGCTGATAATGCGACCGTATATATCGACGGAAAGCTGATATCCGATATCGACTGGAAGGCATATGCAGGACGTAAAGTCGCGGGGGTAGCATACGTCACCGACACGGACGGAAACGCCGCCGAGATTCCCGTCAAGGCGAACGTCGTAAAAGCGAAGAACGGCGATTCGACGTCAACCGAATCCAACGGGAAGAGCGATAACGGCAATACGGATGGGAAGCGTAAGTCCGATACGCTTAGCGATTGGAGCAGGACTTCCGGCGACAACGACCTTAACCCGTCAAAATGGGCGATGCCAAGGAACAAGCGTAAGCGTAAGGCTCTTATTCAGACAGGCGTCGCATACGGTGCCGAAATACTCACTATGGTTGCCGTCGGCACGGCGGTTATCGCGGGAAGAGCGGTTGGAAGGAAGAATAATAGATAA
PROTEIN sequence
Length: 1069
LRKAIRKTAAGKIAMVAIAAAISSSAFLIASPSMSMGDTSSDKVYWSNTQAPFMYGTDEATIHAGEKFDIKDSRYRVQARDFEDGDVTCDIKMVSNNVQSDAPGDYKVSYSVTDSDGNTATMDTDVHVIATDSSDNDWYVKRVYQNPNTWNTRELLRINRGDYMDRQMLGVHLPASKSVDVSFLDSSSNLTVTITAPTNDTATDGTRTSISSGNSATISASDSIDTIPLIKTPMLPRGAGTQQQDIRFKVSYSMDDGVGPAHFWTDGDDENVFIDSWSADETHPFAYIETKAFEGLPTWNDLSTIKILSTKRGWVPSLTAWSEYWVGVMDKYDSMLGLSIRTNNPLDQRVRTKYFCRANVHGAGGAYYNGGDHVGIHSANMVSIFEYNWGGLHEVGHGYQGFMNSGPMYLAEVSNNIYGYQVQHDKSIYKGNGEWMNFDSQETKQNAKRLAGDFKKFDDVDSAGKLYVIMNVFKSITPDNLETAHAEFFTWARKKSQVSGTTQNVDMLCRWLAEEHGIDASPFFQAWGVTLPDTTISYLDSNDALDRGLIAGDAITSDAARTAYRAANHNAPIYSLASSSNVGRYARGNAEITVEGIDEVKSRLEGRLAALIDGNGKTEYVVATAPIHDGIADFTDIPAGNWKVVMPDLSGDGFETAVSTIDGRIHIAGENLKASYIAIYNTNGIVSGSSFSIDGIFGTSGFMATVDPTVSKLVVSLGGARLLNWGESADKVVASVTLKGKDGSVKKEWIVKGDGYFNDAGNGNESGAFDIEVGDAIDVYFSQYSRVKFYGADGKKQDDNVMTTANQTYKVTYGGLIRADRVDEKPYNAYAAARIAEIRKQLGDEVLGNRFLKSVLKNEVSRLYSLMNSGDTDIDAWIKKVRKGGIPTCVTRSVSIDITPENGIDTATITGKIGQLSDAEDGKFTATADNATVYIDGKLISDIDWKAYAGRKVAGVAYVTDTDGNAAEIPVKANVVKAKNGDSTSTESNGKSDNGNTDGKRKSDTLSDWSRTSGDNDLNPSKWAMPRNKRKRKALIQTGVAYGAEILTMVAVGTAVIAGRAVGRKNNR*