SCNpilot_expt_500_bf_scaffold_1885_18

Organism: SCNPILOT_EXPT_300_BF_Thiobacillus_SCN1_62_76

near complete RP 52 / 55 BSCG 51 / 51 MC: 1 ASCG 12 / 38 MC: 1
Location: 19271..20191

Top 3 Functional Annotations

Value Algorithm Source
CRISPR-associated endonuclease Cas1 {ECO:0000256|HAMAP-Rule:MF_01470}; EC=3.1.-.- {ECO:0000256|HAMAP-Rule:MF_01470};; TaxID=1458425 species="Bacteria; Proteobacteria; Betaproteobacteria; Burkholderiales; Comamonadaceae.;" source="Comamonadaceae bacterium A1.;" similarity UNIPROT
DB: UniProtKB
  • Identity: 95.1
  • Coverage: 306.0
  • Bit_score: 589
  • Evalue: 2.80e-165
cas1; CRISPR-associated protein Cas1; K15342 CRISP-associated protein Cas1 id=12496790 bin=THIO_MID species=Candidatus Nitrospira defluvii genus=Nitrospira taxon_order=Nitrospirales taxon_class=Nitrospira phylum=Nitrospirae tax=THIO_MID organism_group=Betaproteobacteria similarity UNIREF
DB: UNIREF100
  • Identity: 94.1
  • Coverage: 307.0
  • Bit_score: 584
  • Evalue: 6.30e-164
cas1; CRISPR-associated protein Cas1; K15342 CRISP-associated protein Cas1 similarity KEGG
DB: KEGG
  • Identity: 86.6
  • Coverage: 306.0
  • Bit_score: 544
  • Evalue: 3.00e-152
  • rbh

Taxonomy

Comamonadaceae bacterium A1 → Burkholderiales → Betaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 921
ATGGCCGATCTGCTACCGCCCCTCAAACCCATCCCCATCAAGGATCGCCTGTCCATCCTCTATATCGAGTACGGCCATCTGGACGTGCTTGACGGCGCGTTCGTTGTTGTCGATAAAACCGGCGTGCGCACCCACATTCCGGTCGGCGGCGTGGCCTGCCTGATGCTGGAGCCCGGCACGCGCGTTTCGCATGCCGCGTGCGCGCTGGCCGCGCGGGTCGGCACTTTGCTGGTGTGGATCGGCGAAGCCGGCGTGCGCCTGTACTCCGCGGGACAGCCGGGCGGCGCGCGTTCGGACAAATTGCTGTATCAGGCGCGTCTGGCGCTGGAAGACGGCCTGCGGCTGAAGGTAGTGCGCAAGATGTACGCGCTGCGCTTTGGCGAGGAGCCGCCGCAACGCCGTTCGGTCGAGCAGTTGCGCGGCATCGAGGGCGCGCGCGTGCGCGAAACCTACAAACGCATCGCCGCCAAGTACGGCGTGGAATGGAAAGCGCGTAATTACGACACGAGCGACTGGGACAAGGGCGATCTGCCGAATCGTTGTCTATCCTCCGCCACTGCCTGTCTGTATGGGGTAACCGAAGCCGCCGTGCTCGCTGCGGGCTATGCCCCTGCCATCGGCTTCATCCACACGGGCAAGCCCCTGTCCTTCGTGTACGACGTGGCCGATGTATACAAATTCGATACTGTCGTGCCACTGGCCTTTCGAATTGCGGCACGCCATCCAGCCAACCCCGAGCAGCAAGTTCGACTGGCCTGCCGCGACGTCTTCCGTGAATCGCGTCTCCTGGAACGCATCATCCCGGGCATTGAAGACATGCTCGCCGCCGGCGAAATCGAGCCGCCCGAACGCTTCGACGAACAAGTCGGCCCCGCCATCCCCAACCCGGAATCCATCGGCGATGCTGGTCATCGTTCTTGA
PROTEIN sequence
Length: 307
MADLLPPLKPIPIKDRLSILYIEYGHLDVLDGAFVVVDKTGVRTHIPVGGVACLMLEPGTRVSHAACALAARVGTLLVWIGEAGVRLYSAGQPGGARSDKLLYQARLALEDGLRLKVVRKMYALRFGEEPPQRRSVEQLRGIEGARVRETYKRIAAKYGVEWKARNYDTSDWDKGDLPNRCLSSATACLYGVTEAAVLAAGYAPAIGFIHTGKPLSFVYDVADVYKFDTVVPLAFRIAARHPANPEQQVRLACRDVFRESRLLERIIPGIEDMLAAGEIEPPERFDEQVGPAIPNPESIGDAGHRS*
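
Consistency check

The location and the two lengths reported above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical and not part of ggKbase, the coordinates are assumed to be 1-based and inclusive, the reported protein length is assumed to count the terminal stop symbol (*), and the codon table is a partial excerpt of the standard genetic code covering only the codons it checks.

```python
def feature_length(start: int, end: int) -> int:
    """Nucleotide length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def codon_count(nt_length: int) -> int:
    """Number of codons in a coding sequence whose length is a multiple of 3."""
    if nt_length % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return nt_length // 3


# Partial excerpt of the standard genetic code: only the codons used below.
CODONS = {
    "ATG": "M", "GCC": "A", "GAT": "D", "CTG": "L",
    "GGT": "G", "CAT": "H", "CGT": "R", "TCT": "S",
    "TGA": "*",  # stop codon
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA fragment codon by codon."""
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Location 19271..20191 from the record above.
dna_length = feature_length(19271, 20191)
n_codons = codon_count(dna_length)

# First 12 and last 15 nucleotides of the DNA sequence above.
head = translate("ATGGCCGATCTG")
tail = translate("GGTCATCGTTCTTGA")

print(dna_length, n_codons, head, tail)
```

Under these assumptions the coordinate span matches the DNA length (921), the codon count matches the protein length (307, including the stop), and the translated fragments match the first four and last five characters of the protein sequence.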