ggKbase home page

SCNpilot_expt_500_bf_scaffold_461_24

Organism: SCNPILOT_EXPT_750_P_Alphaproteobacteria_novel_39_47

near complete RP 46 / 55 MC: 1 BSCG 47 / 51 MC: 4 ASCG 10 / 38 MC: 2
Location: comp(18780..21974)

Top 3 Functional Annotations

Value Algorithm Source
peptidase S41; K08676 tricorn protease [EC:3.4.21.-] similarity KEGG
DB: KEGG
  • Identity: 49.2
  • Coverage: 999.99
  • Bit_score: 1061
  • Evalue 0.0
  • rbh
Peptidase S41 n=1 Tax=uncultured candidate division OP1 bacterium RepID=H5SS53_9BACT similarity UNIREF
DB: UNIREF100
  • Identity: 53.6
  • Coverage: 999.99
  • Bit_score: 1210
  • Evalue 0.0
  • rbh
Uncharacterized protein {ECO:0000313|EMBL:AIL13041.1}; TaxID=244581 species="Bacteria; Proteobacteria; Alphaproteobacteria; Rickettsiales; Caedibacter.;" source="Candidatus Caedibacter acanthamoebae.;" similarity UNIPROT
DB: UniProtKB
  • Identity: 94.0
  • Coverage: 999.99
  • Bit_score: 2101
  • Evalue 0.0

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Candidatus Caedibacter acanthamoebae → Caedibacter → Rickettsiales → Alphaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 3195
ATGAGTGTAGAAGGCTATTATCGTTACCCAGCAATTTTTAAAGATCAGGTTGTCTTTGTTTCTGAGGATGATCTTTGGGTGGCCTCAACTGAGGGAGGTGTTGCACGGCGGCTTACTTCTGGCCTTGGTTCCATCACTACGCCTGTTTTTTCGCCTGATGGTTCTCAATTGGCATTTGCTGGTACAGAAGAAGGATATCCTGAAGTGTATATCATGCCAAGCAAGGGCGGGCCTATCAAGCGGCTTACCTTTTTGGGTGAAGAAGTAAATGTAATCACATGGACAGAAGAGGGAATTATTTTTGCAAGTTCCTCAGGGCAACCTTTTAGCCGTTGGAATGCCTTATGGTGTGTGCCGTCTCATGGAGGAGAACCTCAACGTCTTCCAATAGGGCCCGCTAATTTTATTTCTTTTAAGGAAAAAGGAAGTAAAGTTGCTGTTATCCAAAGGCATGGCTACCGGGAGTATGGGTTCTGGAAGCGTTATCGTGGGGGAACGGCGGGTCAACTTTGGATTGATCAGTCTGGTAAAGGTGATTTTAAGAGCCTTTTAGAGTTAAGAAGCGATTTTGCCCGTCCTTTATGGGTTCAAGATCGTATTTATTTTAGCTCTGATCATGAAGGCGTTGGAAACTTATATTCTTGTCTTATTGATGGATCGGATATAAAGCAGCATACAAATCATTCAGATTACTATGTTCGTAACCAATCAACGGATGGTTATCGGATCGTCTACCATGCTGGTGGCGATATTTATCTTTTTGATCCTCAAGAAAATACCACAAAAAAAGTCACATTCGATTATCACAGTACTAAATTTCAACGTAATCGCAAATTTATTGCGCCTGGGCGTTATTTGGAAGATTTCACAATCCATCCCAAAGGACATCATTTAGCCATTGCAACGCGTGGCAAGGCATTTGCATTTGGAAATTGGGAAGGAGCCGTTTTTCAACTTGGTGCACAACAAGGTGTTCGTTTTCGTATCCCCAGGTGGTTGCATGATGGTGAAAGAGTTCTTTTAATTCATGATCGTGATTGTGAAGAAACCCTTGAAATCTATCATGGGGGGACGTCTGAGTGTTTAAGTTCATCAGGCGAGCTATCTTTCGGGCGTGCAGAAGACATTTACCCGAATCCTTGTAAGGATGAGGCAATTCTCGTGAATCATCGAAATGAGATATTTCATATTGATCTAAGTTCTTGGAAGCTGACAAAAATTGATCGAAGTGAGTATTCTAATATTGATGATGTGGCTTGGTCTCCAGATGGGGAGTGGGTTGCTTATAGCTGCTCATTTACACGTCGTACGATAGGATTAAAGCTCTATCATGTAAAAACGAAAAAACTTACGCCAATTACACAGCCATTAATGCGTGATTTATCACCTGCGTTTGATCCGGAAGGAAAGTATTTATATTTTCTATCCTATCGCCATTTTAACCCCAGCTGGGATGCACTCCATTTTGAGTTAGGTTTTCCTCGAGGGATGAAGCCTTATGCAATTGCCCTTCAAAAGGATACAACTTCTCCATTTATTCCGAAAGCTCTTGATCTCTCAACTAAAGAGGAAGAAGAAAAGGGCAAAAAGAAAGATGACGATAAGAAAGATAAAATTGAGAAAATTAACATTGATCTAGAAGGCATCGAGAATAGGTTAATTGCTTTTCCAATTGAAGAAGGGCTTTATAGTGATTTAGTCGCCCTTAAAGGAAAGATAGCTTATCTATCTTGGCATGTTGAAGGAACATTGCATGATACGGACGATTCATCTGACTATGAAGGTGGCACACTTGAAGTCTTTGATTTTGAATCACAGAAGGTAGATGATTTAATCCATAACGTTTCTGTTCTTCATTTTTCTTTAGATCATCAATGGATTTGTTATAAAACGGGTCAAAAGCTGCGTGTTTTTAAAGCTGATGAAAAACCAGATGATCGCGATATGGATAAGCCAAACCGTAAGAATGGTTGGATAGACTTAAGTCGTCTTCGGGTTGCTGTAAATCCTGTGTTTGAATGGGAGCAAATGTATAAAGAAGCTTGGCGTCTTCAGCGAGATCATTTCTGGAGTGAAGACATGTCCAAGATTGATTGGCAACAGATCTATAAAAGATACTATAACTTACTTCCTCGTCTTGGAAGTCGCGGTGAATTAAGTGATTTACTTTGGGAAATGCAAGGAGAATTAGGAACTTCACACGCGTATGTTTATGGTGGAGATATGCGAATGGCTCCTCGTTATACTGTGGGGCAGTTGGCCGCTGATTTTATCTTTGATCCTGAGCAAAGAGCCTACCGTTTTCTAAAAATTGCACGTGGTGATCATTGGTTGCCAACAAAGGGATCTCCTTTATTACAGCCTGGCTTGGGTATCCAAGAAGGTGATCTTTTATGGGCTATCAATCACCAAGAGTTAGATGAAACAACATCACCCAGTTCTTTGCTTGTTTACCAAGCGAACATTGAGGTTGCTTTAACTGTCAGTGATAAGAATGGGGCGAATAAAAGAGATGTGATTGTTAAAACAACACGCTCTCAAACAAATATTCGTTATCGTGATTGGGTAGAGACAAATCGTGCTTATATTCATGAAAAATCACAAGGGAGAATTGGTTATATTCATATTCCTGATATGGGGCCTCATGGATTTGCTGAGTTCCATCGCTCATTTCTTGCCGAATGTGATCGGGAAGGACTCATCGTAGACGTGCGCTTTAATGGCGGTGGGAATGTTTCTGCTCTCTTGCTTGAAAAGCTTGCTCGTCGCCGTTTAGGTTATGATGCTTCTCGTCATCATGGTTTAATTCCTTATCCAGAAGACTCTCCGGCGGGGCCAATGGTTGCCATTACAAATGAGTATGCAGGGTCTGATGGTGATATGTTTTCTCATGCCTTTAAATTAATGAAATTAGGTCCTTTGATTGGAAAAAGAACTTGGGGAGGTGTTATCGGTATTGCGCCAAGATATCCGCTTGTGGATGGGGGGATGACAACACAGCCAGAATTTTCTTTTTGGTTTAAAGATGTTGGCTTAAAGCTTGAAAATTATGGTGTAGATCCCGACATTGAGGTCGACATTACACCCCAAGATTATGCGGGTGGGAAAGATCCGCAACTGGAGAAAGCTCTCGAAGAAGTATCCGAAATTATGAAGAACTATTCTTATGTTTTGCCTGAGTTTGGTAAACGATAG
PROTEIN sequence
Length: 1065
MSVEGYYRYPAIFKDQVVFVSEDDLWVASTEGGVARRLTSGLGSITTPVFSPDGSQLAFAGTEEGYPEVYIMPSKGGPIKRLTFLGEEVNVITWTEEGIIFASSSGQPFSRWNALWCVPSHGGEPQRLPIGPANFISFKEKGSKVAVIQRHGYREYGFWKRYRGGTAGQLWIDQSGKGDFKSLLELRSDFARPLWVQDRIYFSSDHEGVGNLYSCLIDGSDIKQHTNHSDYYVRNQSTDGYRIVYHAGGDIYLFDPQENTTKKVTFDYHSTKFQRNRKFIAPGRYLEDFTIHPKGHHLAIATRGKAFAFGNWEGAVFQLGAQQGVRFRIPRWLHDGERVLLIHDRDCEETLEIYHGGTSECLSSSGELSFGRAEDIYPNPCKDEAILVNHRNEIFHIDLSSWKLTKIDRSEYSNIDDVAWSPDGEWVAYSCSFTRRTIGLKLYHVKTKKLTPITQPLMRDLSPAFDPEGKYLYFLSYRHFNPSWDALHFELGFPRGMKPYAIALQKDTTSPFIPKALDLSTKEEEEKGKKKDDDKKDKIEKINIDLEGIENRLIAFPIEEGLYSDLVALKGKIAYLSWHVEGTLHDTDDSSDYEGGTLEVFDFESQKVDDLIHNVSVLHFSLDHQWICYKTGQKLRVFKADEKPDDRDMDKPNRKNGWIDLSRLRVAVNPVFEWEQMYKEAWRLQRDHFWSEDMSKIDWQQIYKRYYNLLPRLGSRGELSDLLWEMQGELGTSHAYVYGGDMRMAPRYTVGQLAADFIFDPEQRAYRFLKIARGDHWLPTKGSPLLQPGLGIQEGDLLWAINHQELDETTSPSSLLVYQANIEVALTVSDKNGANKRDVIVKTTRSQTNIRYRDWVETNRAYIHEKSQGRIGYIHIPDMGPHGFAEFHRSFLAECDREGLIVDVRFNGGGNVSALLLEKLARRRLGYDASRHHGLIPYPEDSPAGPMVAITNEYAGSDGDMFSHAFKLMKLGPLIGKRTWGGVIGIAPRYPLVDGGMTTQPEFSFWFKDVGLKLENYGVDPDIEVDITPQDYAGGKDPQLEKALEEVSEIMKNYSYVLPEFGKR*