ggKbase home page

SCNpilot_bf_inoc_scaffold_359_curated_18

Organism: scnpilot_dereplicated_Bacteroidales_1

near complete RP 52 / 55 MC: 4 BSCG 51 / 51 MC: 2 ASCG 13 / 38
Location: 24100..27501

Top 3 Functional Annotations

Value Algorithm Source
Uncharacterized protein {ECO:0000313|EMBL:KIF34141.1}; TaxID=1304833 species="Bacteria; Cyanobacteria; Nostocales; Microchaetaceae; Hassallia.;" source="Hassallia byssoidea VB512170.;" similarity UNIPROT
DB: UniProtKB
  • Identity: 34.6
  • Coverage: 999.99
  • Bit_score: 667
  • Evalue 3.80e-188
Spore coat protein CotH id=4408774 bin=GWF2_Bacteroidetes_49_14 species=RAAC39 genus=RAAC39 taxon_order=RAAC39 taxon_class=Ignavibacteria phylum=Ignavibacteriae tax=GWF2_Bacteroidetes_49_14 organism_group=Bacteroidetes organism_desc=Good (not ab) similarity UNIREF
DB: UNIREF100
  • Identity: 33.0
  • Coverage: 999.99
  • Bit_score: 615
  • Evalue 9.40e-173
hypothetical protein similarity KEGG
DB: KEGG
  • Identity: 32.2
  • Coverage: 633.0
  • Bit_score: 335
  • Evalue 7.80e-89

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Hassallia byssoidea → Hassallia → Nostocales → Cyanobacteria → Bacteria

Sequences

DNA sequence
Length: 3402
ATGACATATAAAAATATAAGTTTTCATTTACTATTATTATTAGCACTATTATCGGTTAAAAACATTAACGGACAAATAAAAATCAACGAAATTATGTCCAACAATGTGTCGGCTGTCATAGATAATTCTTATAATTATAGTATGTGGGTTGAAGTTTTCAATTCAAGTGCATCAGAAACAATTAATATTAGAGATTATTATTTTACTGATGATTTATCAGATATTAATAAGTGGAGACCTCCCATCGTTAGAATAGCTCCCGGAGGTTTTGCTGTTTTATGGTTTGAACGTGAAGACAAGATTAATCATGCAAACTTCAAATTAAATCCTGAAGGTGGTAAATTATATCTTATAAACAAATCCGGCGAAGTTGCCGATATGGTTGAGTATCCTGCTCAACATAGAAACACTTCCTTTGGTAGGATTCAAGACGGTGCAAATGAATGGGTTTATTTTTCCGAATATAGTAACGGTGCATCGAATAATGGTAAGAAATGGTCAGCAACACAATGTTCAACCCCTAAATTTATCACTCCAGGCGGTTTTTATAAATCAGGGTTTACTTGTAAATTTGAAACACCGGCAACCGGAGAAACTATATATTTTACAACAGATGGACGCGAACCAAATACAACATCTTCAAAATACAATGCTCAAACTGGTATTAATATAACAAAAACATCTATTGTAAGAGCAATATGTCTATCCAATCAAAAGCTTCCCGGACAAATTGCTTCTGCAACATATTTTATAAATGAGAGAACTTTCAATCTTCCTGTGTCTTCAATTATCACTGATAATAAAAACCTAACAGACAATACGATAGGAATTTATGTAAAAGGTACTAATGGAATTCCGGGAAATGGTACAAATGAAGCGGTAAACTGGAATCAAGATTGGAGTCGACCTGCAAATTATGAACTTTTTGATGTTAATGGAAATTCGTGTCTGAATCAAGAATTAGATATTGCAATATCAGGAGGATGGTCACGTACAATTAATCCTCAGAAATCATTAAAGATATCACCAAGAAAAAAATTCGGCGATAACAGACTAAGATATGATATTTTTTGGGCATCAAAACCAAACAGGAAATATAAAGATATTCAGATTCGTAATTCAGGAAATGATTTTGCAAATACTATGATGCGTGATGGATTTATGCAAACATTGGTTGCAAATAGAATGAATATCGATTATTTAGCTTATCAGCCTGCAGTATGCTTTATGAATGGACAATATTATGGCATTCAGAATCTGCGTGAACGTTCTAATAAAGATTATTTGTATTCTAATTATGATTTAGATGAAGAGGATTTCTATTTGTTGGATCATACGACAGTATCAAAAGAACCTTTCACATCTTTAGTAAATTATGTTAGAAATAACGACATTACAAATACTGAGATATACAATAGTGCATTATCAATGGTTGATGTCGAGAATCTGATTGACTACTATATCGCTCAAATGTTCTTCAACAATACTGATTGGCCTCACAACAATCTAAAAACTTGGAAGAAAAAAGATAATGGTCAGTGGAGATGGATATTATATGATACTGATTTCGGTTTTAATTTACATGGAGAAAATCAGCATAATGACAATACGGTTACTCATGTGCAAAACGCATCAGATCCATCTTCAGTTGTTTTCAAAAGATTAATGTTAAATCCAATTTTTAAAAGTAAATTCTTAGATAGGGTTTGTATTCATATATCATCTACTTTTGAGACTAACAGGGTTAATAACATAATGGACAGTTTAGCTAATGCTATACGAAAGGAATTTGTTTATCACAAACAACGATGGGGCGGTAATAATAATTTTGAATTTGAAATTAACAAGATGAAGCAATTTTCTCTACACAGACCCAATAGTTTTTTTACTTTTATTACGAATCAATATAACAGTGGAACACCTTATCAATCGGTTGATATAAGCTCCAATGCAGACAAGAACACATATTTAATTAATTCAGAAGAGTTTTTACAAAACAAGATTAATTTAAAATACTTCAAAAATCGTCAATTATCGATAGAAGCTATTGAGCATCCCGGACTTAAATTTAAATATTGGGAAGTTTATTCATCTGTAAAAGAGAATATTCTAATTCCTCTTAAAGCTACTTGGGATTATTGGGATAGAAACGGTAAACCTTCTGAGAATTGGTATAAAGCTGAATATTCATCAAGTTCGTGGTCAAACGCAAGAGCACCATTTGGATATGGGGCTAATTTCCCATCAGTTACAACCACTATAAGCTATGGAGGAAATACCGGCAATAAGTATATCACAAGTTATTTTCGCAAAACTTTTAATGTAAGTGACCCTTCAGAATTAGACAATATTCAAATTGCAGTTTATGTTGATGATGGAGTTGCAGTTTATATAAATGATGTGGAAATAGGTAGATATAATCTCCCATCGGGAAAGTTGGAATTTAATACATTAACCAACACATACAATAATGGTGAATGGGTGTATTTTGATATACCTCAGAGCTTTCTGAAAAAAGGAAATAACGTAATTGCTGCGGAAGTACATCAAGTCAATGCAACCAGTTCGGATATGGTATTTGAATTAACACTGACATCAAAGAAAACTGATAATCAATTTACCACTCAAACTAACCCTAAATTTTTTATTTCTTTAAATAATGATATTAAATTAAAAGCAATCTTTGAAGAAGTAGAATTAACAGACCCATTCGAAAACTCACAGATAGTCTTGAATGAAATAGTTGCGAGTAATTCTTTTGTTCAGGATGAATATAACGAATATGATGATTATATCGAAATATATAATAAAGGCGAAGAAAGCGTGAACATAGCAGGTTGGTATTTGAGTGATAACCCTTCAAATTTAACACTATCTCGTATCCCCGATACAGAGCCTTCAAAAACCACTATACCCGCTAAAGGGCGAATTATTATTTGGGCAGACGAGCAGGTTGAACAAGGTGTGTTACATGCTAATTTTAAAATCAGCAAAGATGGAGAAACCATTACCATATCAAGGAAAAATCCATATGAAGAGATAGTTATAGTAGATGAAATTACTATTCCTGAGTTAGCAAAAGACATGAGTTACTCTCGCTATCCTGATGGTGGTGATGAATGGGTTGTACAAGTACCTACATTCAATCGTTCTAATGCCGATTTTTCATCAACAGAATTAATCGAGAAGAAGAACATTATTTTCCCAACATTAGTAAGCAGTCATTTCAATGTGTCTGACTCTGAAGGGGAAATGATTAGGATAATTGATATTTCAGGAAAAATTATATTAGAACAAAAATGCAGCTCCGATTACGAAACTATTTACATTGATAATCTTCAAAAGGGAGTATATTTTGTTAATGTAGGGAATAGAACTATTAAGATTATTAAGACGTTGTAA
PROTEIN sequence
Length: 1134
MTYKNISFHLLLLLALLSVKNINGQIKINEIMSNNVSAVIDNSYNYSMWVEVFNSSASETINIRDYYFTDDLSDINKWRPPIVRIAPGGFAVLWFEREDKINHANFKLNPEGGKLYLINKSGEVADMVEYPAQHRNTSFGRIQDGANEWVYFSEYSNGASNNGKKWSATQCSTPKFITPGGFYKSGFTCKFETPATGETIYFTTDGREPNTTSSKYNAQTGINITKTSIVRAICLSNQKLPGQIASATYFINERTFNLPVSSIITDNKNLTDNTIGIYVKGTNGIPGNGTNEAVNWNQDWSRPANYELFDVNGNSCLNQELDIAISGGWSRTINPQKSLKISPRKKFGDNRLRYDIFWASKPNRKYKDIQIRNSGNDFANTMMRDGFMQTLVANRMNIDYLAYQPAVCFMNGQYYGIQNLRERSNKDYLYSNYDLDEEDFYLLDHTTVSKEPFTSLVNYVRNNDITNTEIYNSALSMVDVENLIDYYIAQMFFNNTDWPHNNLKTWKKKDNGQWRWILYDTDFGFNLHGENQHNDNTVTHVQNASDPSSVVFKRLMLNPIFKSKFLDRVCIHISSTFETNRVNNIMDSLANAIRKEFVYHKQRWGGNNNFEFEINKMKQFSLHRPNSFFTFITNQYNSGTPYQSVDISSNADKNTYLINSEEFLQNKINLKYFKNRQLSIEAIEHPGLKFKYWEVYSSVKENILIPLKATWDYWDRNGKPSENWYKAEYSSSSWSNARAPFGYGANFPSVTTTISYGGNTGNKYITSYFRKTFNVSDPSELDNIQIAVYVDDGVAVYINDVEIGRYNLPSGKLEFNTLTNTYNNGEWVYFDIPQSFLKKGNNVIAAEVHQVNATSSDMVFELTLTSKKTDNQFTTQTNPKFFISLNNDIKLKAIFEEVELTDPFENSQIVLNEIVASNSFVQDEYNEYDDYIEIYNKGEESVNIAGWYLSDNPSNLTLSRIPDTEPSKTTIPAKGRIIIWADEQVEQGVLHANFKISKDGETITISRKNPYEEIVIVDEITIPELAKDMSYSRYPDGGDEWVVQVPTFNRSNADFSSTELIEKKNIIFPTLVSSHFNVSDSEGEMIRIIDISGKIILEQKCSSDYETIYIDNLQKGVYFVNVGNRTIKIIKTL*