ggKbase home page

SCNpilot_BF_INOC_scaffold_4821_4

Organism: SCNpilot_BF_INOC_Clostridium_42_5_partial

partial RP 46 / 55 MC: 1 BSCG 40 / 51 MC: 2 ASCG 13 / 38 MC: 3
Location: 1294..2313

Top 3 Functional Annotations

Value Algorithm Source
CRISPR-associated endonuclease Cas1 {ECO:0000256|HAMAP-Rule:MF_01470}; EC=3.1.-.- {ECO:0000256|HAMAP-Rule:MF_01470};; TaxID=1235809 species="Bacteria; Bacteroidetes; Bacteroidia; Bacteroidales; Bacter similarity UNIPROT
DB: UniProtKB
  • Identity: 82.9
  • Coverage: 339.0
  • Bit_score: 597
  • Evalue 1.50e-167
CRISPR-associated protein Cas1 n=1 Tax=Proteiniphilum acetatigenes RepID=UPI00037CBBB2 similarity UNIREF
DB: UNIREF100
  • Identity: 86.4
  • Coverage: 339.0
  • Bit_score: 620
  • Evalue 1.10e-174
cas1; CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype similarity KEGG
DB: KEGG
  • Identity: 75.5
  • Coverage: 339.0
  • Bit_score: 550
  • Evalue 2.40e-154

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Bacteroides pyogenes → Bacteroides → Bacteroidales → Bacteroidia → Bacteroidetes → Bacteria

Sequences

DNA sequence
Length: 1020
ATGAAAAAAACCTATTATTTATTCAATCCCGGCACATTAGAGCGAAAGGATAACACGCTTAAGTTTACTCCGTTTGAGGAAGATGGTAATGGAGAAATGCTGAAATCCGGCCAACCGCGTTATCTTCCGGTGGAAGATATCATGGAGTTTTATGTTTTCGGCTCGCTTAGAGCAAATAGTTCGTTATATAATTTTCTGGGGCAAAAAGGTATAGCCGTTCATTTTTTTGATTATTACGAAAACTATACCGGATCGTTTATGCCACGCGACAGTTTGCTCTCGGGGCGAATGATTTTGGCGCAAACTTCTGCCTATCAAAATAAAAAGAAACGCATTGAGCTGGCTCGCAAATTTGTGGAAGGAGCTTCGTTTAATATGACTAAAAACCTGCGGTATTACAACACACGTGGCAAAGATCTCGATGGTTTAATAGAAAAAATTGAAGAATATACTTCACAACTTCCGTATTTAAATGCGGTGGATGCACTCATGGGTATTGAAGGCAATGTCAGACAGATATATTACAAAGGCTTTGATTTGATACTGAATGATTTCAGTATGGATGGACGAAGCAAAATGCCGCCTCGTAACGAGGTAAACGCGCTTATATCGTTTGGAAACATGATGTGCTATAGTCAATGTTTGCGTGCCATTCATCAAACTCAACTTAATCCTACGATAAGCTATCTGCATACACCTGGAGAAAGACGTTATTCACTTTCGATGGATATTTCGGAAATTTTCAAACCAATTTTGGTCGATAGAGTCATTTTCAGACTGCTGAATAAAAGGGAATTGCAGGAAAAGCATTTCGATAATAAACTCAATCGCTGCTTACTCAACCCGACAGGCAAAAAGATTTTCGTAAAAGCTTTTGATGAGCGCTTATCTGAGACTATCCAGCACCGTTCGTTAAAACGAAAAGTGAGTTACAAACATCTCGTCAAGTTGGAATGCTATAAACTGAGTAAGCATTTGTTGGGAATGGAAGAATACAAGCCATTTAAGATGTGGTGGTAA
PROTEIN sequence
Length: 340
MKKTYYLFNPGTLERKDNTLKFTPFEEDGNGEMLKSGQPRYLPVEDIMEFYVFGSLRANSSLYNFLGQKGIAVHFFDYYENYTGSFMPRDSLLSGRMILAQTSAYQNKKKRIELARKFVEGASFNMTKNLRYYNTRGKDLDGLIEKIEEYTSQLPYLNAVDALMGIEGNVRQIYYKGFDLILNDFSMDGRSKMPPRNEVNALISFGNMMCYSQCLRAIHQTQLNPTISYLHTPGERRYSLSMDISEIFKPILVDRVIFRLLNKRELQEKHFDNKLNRCLLNPTGKKIFVKAFDERLSETIQHRSLKRKVSYKHLVKLECYKLSKHLLGMEEYKPFKMWW*