ggKbase home page

scnpilot_p_inoc_scaffold_28_curated_151

Organism: scnpilot_dereplicated_Paludibacter_1

near complete RP 51 / 55 BSCG 51 / 51 ASCG 13 / 38
Location: 190521..193898

Top 3 Functional Annotations

Value Algorithm Source
Uncharacterized protein {ECO:0000313|EMBL:KIF34141.1}; TaxID=1304833 species="Bacteria; Cyanobacteria; Nostocales; Microchaetaceae; Hassallia.;" source="Hassallia byssoidea VB512170.;" similarity UNIPROT
DB: UniProtKB
  • Identity: 34.7
  • Coverage: 999.99
  • Bit_score: 686
  • Evalue 6.10e-194
Spore coat protein CotH id=4408774 bin=GWF2_Bacteroidetes_49_14 species=RAAC39 genus=RAAC39 taxon_order=RAAC39 taxon_class=Ignavibacteria phylum=Ignavibacteriae tax=GWF2_Bacteroidetes_49_14 organism_group=Bacteroidetes organism_desc=Good (not ab) similarity UNIREF
DB: UNIREF100
  • Identity: 33.4
  • Coverage: 999.99
  • Bit_score: 618
  • Evalue 1.90e-173
hypothetical protein similarity KEGG
DB: KEGG
  • Identity: 33.3
  • Coverage: 624.0
  • Bit_score: 336
  • Evalue 3.50e-89

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Hassallia byssoidea → Hassallia → Nostocales → Cyanobacteria → Bacteria

Sequences

DNA sequence
Length: 3378
ATGAAAAAAATACTGGCAATTGTTTTTATAGTAGTAAGCTGTAATATTCAGGCGCAATTAAAAATTAATGAAATAATGGCCACCAATGTCTCGGCAGTATGGGACAATTGGTACAATTTTACAAATTGGATTGAACTCTACAATCCGACCGGCATGCCGGTATTTCAAAACGATTATGCCCTTACCGACGATATGAACGAGCCGCGCAAATGGCAGTTGCATTATAAAAGTATACCGGCAAAGGGATTTGCTTTATTGTGGATGGAACGACCCGAAAGAGAGTTTCACTCCCCCTTTAAACTGAAACCCGGCGGGGGCAGCCTCTACCTTGTCGACTGGAACAACGAGATTATCGATCAGTTTTCGTATCCTGCTCAGTTTCGCAATACCTCGTTCGGACGAAAATCCGATGGAGCCGACGAACTGGTCTTTTTTGAAGACCACAGTCCGGGATCGAGTAACAATGGCCGGAACTGGTCCAATACCCGTTGTATCGAACCGGTACCGACCGTAGCAGGAGGATTGTATCCGTCTGAACTGGCTGTTGCATTTGAAGCCCCCCAACCCGGCGACACCATTATCTATACCTTGAATGGCGACGAACCGACCCGGAACAATGCCATAAGGTATCAACCAGGGTCGACCCTGCAAATCACCAACAACACGGTTATCCGGGCCATCACCATCGGCAAAGGAAAACTCGCCAGCGATATTACGACCGCTTCGTACCTGATCGGACAACGCGAATTCAACCTTCCGGTAGTTTCGCTGGTTACTTCGCCAAAATTCCTGTTCGACAATACCATAGGAATCTATGTTGAAGGCACCAACGGTATTGAAGGTTGTGGCGATCCATTACCCCATAACTGGAACCAGGACTGGGACAGGCCGGCCAATTTTGAGCTGTTCGATCGCTCGAAACTATCGCGGTTGAACCAGGAAATCGATATTCAAACAGCAGGTTGTGGTTCGAGAGCCAACAACCGGCAGAAATCCCTTCATATAAAACCTAAAAACAAGTTCGGCGATAATACCTTGAACTATCCCATTTTCGGATCGCGCCCCAACAAAAAATACAAGGACATTGCACTCCGCAATTCGGGCAACGACCACAAGTATTCGATGATGCGCGATGGGATGATGCAATCGCTTATTATTGGGCGTATGGATCTGGAATACCTGGCTTACGAACCTGCCGTTTTATTCATCAACGGCGAATATTACGGGATTCAAAATCTTCGCGAACGGTCAAATGCCGATTTATTATATGCAACACATGGGTACGATGAAGAAGACATCATTAAAATCGATACGTACGACATCGTGAATCATCCTTTGTACCAAGAACTCATCGGATTCGTCAGCAACAACGATATTACACAGAATGCAGTGTATGAACAAGTAAAACAACAAATGGATGTCGAAAACTACATTCAAAATATCATCACCCATATTTTTGTTGCCAACTACGACTGGCCGCACAACAATGTAAAGATGTGGAAACCCATTGAAAACGGGAAATGGCGCTGGATTCTGTACGATACGGATTTCGGCTTTAACCTGTTCATTGACAATCTGCACGATTTTAATTCGTTGACCTATGCACTGGGAGAAAATAATGGATTTGAGACACAGCCCTGGGCTACCGAACTTTTCCGCCGGTTGATGCAAAACCCGACCTTCCGAAACGACTTTATCGATCGCTTTACGGTACATCTTTCGTCGACCTTTAAATCGGAACGGGTAATTCATATCATCGATTCGATTGCGGCGCGAATCCGTCCCGAAATCGGCTATCATAAACAACGCTGGGACTCGGAACGCGATTTCGAAACCGATATCAACCTGATGAAAACATTCGCCAACGCACGTCCCGGGAATATGTACCGGTTTATCGGCGATCGTTTTCTGCCCGGCACAGCTTTGCACACCATCCGTATTTCGTCTAACATACCCTCTGCAACCTTTACCTACAATACGGTCCATATTCCCGACCCTTCCATCGAATTGCAAAGCTTCAAGGGCCGCAGTTATACCTTAAAAGCAAATGAGGTAAGTGGTTATGTTTTCAAACGCTGGGAAGTCACCGGAGTGAACCATAGCACACTGCCGTGGGATAGTGAATGGAAATATTGGGATTCGTCGACGGTCCCTGCCGCCAACTGGTACGAACCGGGTTATTCGGATGCAACATGGAAAAAAGGCCCGGCCCAATTTGGCTATGGAAACAAAGGAGAAACAACAGTGGTCGATTACGGGCCAAATGCCGCCGATAAATTCACGACCAGTTACTATCGGAAGAACTTCTCTATTTCCAATCCTGCAAACTTGTCGAAAGCTACTATCCGAATGCTGGTCGACGACGGCGCCGTGATATACCTCAACGGCGTTGAGCTGGCCCGTTACAACATGCCCGAAGGAACGATCAATTTCCAAACCTATGCCCTCACGGCCAACAATGGCGACTATGTCGATATTGAAGTTCCCTTTTCAATGTTCGCGAAAGGAACCAATATTATCGCCGTCGAAGTCCACCAGGCCAATGCTTCCAGTTCCGATTTGATCTTTAATCTCGAATTGCTGACCGAAAATAATGCCTCCACCGGAGGAGATCTGACCGAAAACGAAATTTCGGCCACTCTCACCAACGATCAGCAACTGGTTGCAATCTACGAGGAAGACGATTCGATCGATCCCCTTGACCAGCTCCGGGTGACGATCAACGAAATACTCTCGTCGAATTCGGTGATTCGTGATGAATTCGGCGATAAGGACGATTACATCGAACTGTATAATGCAGGCGATCACGATGTGAATATATCGGGTTGGTATGTATCCGACAAGAAAGGTATTCCTGATTACTGGCAAATCCCGACCGACGCAGCCGCAGTCATTCCTTCGGGAGCTTATCTGCTGCTGTGGGCCGACGAACATCCATTCCAGGGGGCATTGCACATGAATTTTAAGCTAAGCGCATCGGGCGAGTTTCTGAGTTTGTATGCCCGTAATAAATTCGGAACCCTGGTAGGGATCGATTCGATCAGCTTCCCGGCACTCCCTGCCAATCAATCGTATTCGCGAATGCCCGATGGGAGCGAAAACTGGACCATCAAAGCGCCGACACCGTCAGCCTCCAACCTGCTTTCGGCCACCGGATCAGCAACAGCGCCTCACTATAAAGTATATCCAACAAGGATCACCGATGTATTACATATTGAACAGGCTCACGGTCAGCTTATTCAACTGTACACCTTAACCGGCAAAAAGGTGTTTCAAAACGTCAATCACGATCCGCACGTCACCATAGCTACCAGTCACCTTCCTGCGGGCATCTACCTGCTTAAAGTAGGTGAAAAATCGTTCAAACTTATTAAATAA
PROTEIN sequence
Length: 1126
MKKILAIVFIVVSCNIQAQLKINEIMATNVSAVWDNWYNFTNWIELYNPTGMPVFQNDYALTDDMNEPRKWQLHYKSIPAKGFALLWMERPEREFHSPFKLKPGGGSLYLVDWNNEIIDQFSYPAQFRNTSFGRKSDGADELVFFEDHSPGSSNNGRNWSNTRCIEPVPTVAGGLYPSELAVAFEAPQPGDTIIYTLNGDEPTRNNAIRYQPGSTLQITNNTVIRAITIGKGKLASDITTASYLIGQREFNLPVVSLVTSPKFLFDNTIGIYVEGTNGIEGCGDPLPHNWNQDWDRPANFELFDRSKLSRLNQEIDIQTAGCGSRANNRQKSLHIKPKNKFGDNTLNYPIFGSRPNKKYKDIALRNSGNDHKYSMMRDGMMQSLIIGRMDLEYLAYEPAVLFINGEYYGIQNLRERSNADLLYATHGYDEEDIIKIDTYDIVNHPLYQELIGFVSNNDITQNAVYEQVKQQMDVENYIQNIITHIFVANYDWPHNNVKMWKPIENGKWRWILYDTDFGFNLFIDNLHDFNSLTYALGENNGFETQPWATELFRRLMQNPTFRNDFIDRFTVHLSSTFKSERVIHIIDSIAARIRPEIGYHKQRWDSERDFETDINLMKTFANARPGNMYRFIGDRFLPGTALHTIRISSNIPSATFTYNTVHIPDPSIELQSFKGRSYTLKANEVSGYVFKRWEVTGVNHSTLPWDSEWKYWDSSTVPAANWYEPGYSDATWKKGPAQFGYGNKGETTVVDYGPNAADKFTTSYYRKNFSISNPANLSKATIRMLVDDGAVIYLNGVELARYNMPEGTINFQTYALTANNGDYVDIEVPFSMFAKGTNIIAVEVHQANASSSDLIFNLELLTENNASTGGDLTENEISATLTNDQQLVAIYEEDDSIDPLDQLRVTINEILSSNSVIRDEFGDKDDYIELYNAGDHDVNISGWYVSDKKGIPDYWQIPTDAAAVIPSGAYLLLWADEHPFQGALHMNFKLSASGEFLSLYARNKFGTLVGIDSISFPALPANQSYSRMPDGSENWTIKAPTPSASNLLSATGSATAPHYKVYPTRITDVLHIEQAHGQLIQLYTLTGKKVFQNVNHDPHVTIATSHLPAGIYLLKVGEKSFKLIK*