ggKbase home page

sw_4_scaffold_328_1

Organism: SW_4_UNK

megabin RP 51 / 55 MC: 43 BSCG 44 / 51 MC: 35 ASCG 38 / 38 MC: 38
Location: comp(3..3125)

Top 3 Functional Annotations

Value Algorithm Source
Tail protein n=1 Tax=Geobacillus sp. MAS1 RepID=V6VAS5_9BACI similarity UNIREF
DB: UNIREF100
  • Identity: 23.8
  • Coverage: 804.0
  • Bit_score: 223
  • Evalue 7.30e-55
Tail protein {ECO:0000313|EMBL:ESU71111.1}; TaxID=1408282 species="Bacteria; Firmicutes; Bacilli; Bacillales; Bacillaceae; Geobacillus.;" source="Geobacillus sp. MAS1.;" similarity UNIPROT
DB: UniProtKB
  • Identity: 23.8
  • Coverage: 804.0
  • Bit_score: 223
  • Evalue 1.00e-54
phage tail tape measure protein, TP901 family similarity KEGG
DB: KEGG
  • Identity: 22.3
  • Coverage: 755.0
  • Bit_score: 193
  • Evalue 3.90e-46

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Geobacillus sp. MAS1 → Geobacillus → Bacillales → Bacilli → Firmicutes → Bacteria

Sequences

DNA sequence
Length: 3123
ATGACTGATATTGGTAAGCTCACTTACTCTCTGTCCCTAGACACATCTCGCTTCAGCAATCAGCTCAAAAGCTTTAGCCGGAGAGCTGAGAAGCAAGGCAGTAAAGCCGGCAGAGAAATGGGTGAGGAGTTCACTCAGGGAGCCAAAAAAGAGATGAGCAATCTCGGCTCCATCATGCAAGGCGTGCTGCAAGGCGTAGGACAGGGAGCATTCCAAGCTGTGACTTCTGCTCTTCGAGGGGTGGGCAGTGCTATTGGTGGTACTGTCTCTCAAGCCTACAGCTTGGAAAAGTCGATGAAGACAATTCAGGGTAAATCTGGGGCTACCGAGAAAGAGATCAATAAGCTTAAAGATAGCTTTATTGAGCTGTCTGGCAACAGTACTAAGACTGGTGATGAGATAGCCAAAGCCGGTGAGCGAATGGCTATGCTGGGCTATACTACCAAGCAAACAGAAGGCTTAATGAAGAGTGTAGTCCAAGTCAGTGAAGCAACTGGCGAGACAATGGACACCGTCACTCAGTCCATTGGCGCTACAATCAACTCATTCGACCAATACAACCAGACTCAAGAGGATGCTGTTGAAGTTGGTGACAAGCTCACGGCAGTAGTTAACAACGCCAATGTCACTATAGACGCACTACGAACCGGAACGTCCAAATATGGCGCTGTTGCCAACATGACTAACCAGTCTGTCACAGATATGGCAGCAGCTTTCGGCGTTCTGAAAGATGCAGGGATGAGCACCGAAGAAGCTGGAACATCCCTGAAAACTGCTCTGATGAGGCTGGAGCCTTCCTCTAAGAAAGGGAAGGAAGCCATCAAGGAACTTGGCGTTGAGGTCAGAAACGCCGAGGGCAAGATGAAATCCATGCCCAAAATCTTGCAAGAGTTTCAAGAGGGCTTGGAAGGCATGGGAGAGAAGAAGAAGGCTGCTCTCCTCAAGCAGATCTTCGGCGCTCAAGGCATTAAGGGCTTTACAACCTTACTTGAGTCTGCTGGCGGTAAGTACGAAGAATTACGGCAGACTATTGAGGATTCCCAAGGTACAGCGGCAGAGACTGCTGAAAAGATGCGGACAGAGTGGGAGAAGTTCACAGCTTCCGTTGATGCGATGAAGCATCGTATTGGAGCTGCTGTGCTCCCTGGTTTGAATGCCGCATTGAAAGCTATCAATGATATCTATAAGGGAATAAAGAAAGTTGATATTAGTTTCAAGCCCATCACTAAGGCAGTAGATGCCCTCGGATTAGGCTTCGAGTACTCTGAGAAAGCAGCTAAGAACCTCGGCAAGTCGATTGGTAAAAACATTAACTCAGGGATCCAGACAGCCGCTGACCTAATCAAGCGAGTGCAACAGTTCTGGGAAAACAATAAGTCCACTATCAAGGAAGTGGGTAGGTTTATTGAGAATGGTGTTGAGAATGCCATCGACAATGTTCTTAATGTAGTCTCCAGACTCCAGGAATTCTGGGAAAACAATAAAGAGACTATCCAGGAAATAAGAGACCAGATCTATAACCGAGTGAAGACTGCTATCACTGCTGTTGTGGACACAGTAAGCGGCTGGATTGACCGCATACAGAAAGTCAGGGAGAACCTAATCAACAGTGAAGGGCTGCTTAACTCAATATGGAATCTCATTCAGTCAGGCTATCAAGCCTGGAATGCGGTTTGGGATGTGGTCAAGCTTGTCGGTACGGCTATTAAGGTAGTTGTCGAGAACGTCGGCAGATTTATCAACTTCATCGCTCAAGTAATCACCGGAACTAACAACGCTGCTGATGCTTGGAGTCAAATTGGTGAATGGCTTCAGAAGCTAATTGACTGGATTGCTGACTTCATCTCGGATACAAGCAAGCTAGTCAGTCTGGTTGCAAACGGACTTACCAAGGCATGGAAAACTGCTGGAGACTGGCTCGAAACGGCTGCGGAATGGATGGATAAGCAGCTTGAATCTGGCGGTGAGCTAGTTAGTACGGTTGGGGATAATCTTGCCGGAGCCTGGAAAACCGTTAAGGGCTGGGTTGATGGCATAACTGATGCTGTCGGCAATATCTGGGACATGGTCAATGGGCTGATCGACAAGGTCGGTAATAAACTCACTGGCGCATTCAACAAACTTAAAGACGCTGCTAGCAATATCCCCTTAATCGGGGGAGGCTACGGAATTCAAGGAACCGCCCAAGGGCTTGATGGCGCTAATAAGCATCTTAAGCCTCTTCTCGGAATTGCCCAGAAGCACGGTGTTCAGGTAACTTCGGGCTTGCGTCCTGGCTCCACCACTTCAACCGGCAACACTTCCATGCATGCTACAGGGAATGCTCTCGACTTCGCTGGCAGTCATGAGCAGATGAAATCCTTTGCTCAGGAAGTTGCTAGGCGCTTTGGAGACCGCATCCATAGCCTCATCTATTCGGGAGCACCGGGGGCACAACGCGCTAGAGGACAGCCTCATCAGTTTGGACAGCCCATTAAGAAAAACCACTGGGATCACGTCCATGTGGCTGCGAAGAAAGGGCAATTATTGCCTCTCACCGATAGGCAAGGCAGTGGATACGGTATCGGCAGTGGTGTTGGTACTGGTCAGCAAGCGGTCTCGGATCCTCGCAGTAATCAAGACTTTCAGAATACTTACAAAGCCGCTACTGGTAAGGAATACAGCGACCACGAAGAAAGCCCTGTCACCGAAGCTAAGCGCAAATTGAAGCAAGAGAGACAAAGTGACCTTAAGAATAAGGCGGATGATATCTCCCGCAGGACTGAAGAGATTGAAGCCCGAATCGAAAGGGTTAACAAGCAGATCAATGACTTAAAGGCTGCTCGCTCTACTAAGAAGCAACGCGGCGGAGACTATCAAGAAGATAAGGACTTCACTGCCCGCATTCAAGAGCTGCAAGAGTATCGGGACGAGCAGAAGAAACTACTTGAGGTCGAGAGAAAACTGGCTAACACCAAACTGGGTGATGCCATCAAGGGCGCTCTTGAAGACGGAACTCAATTTGTCCGGAGCTTTAGCGACCAGTTGAATCAACTCAACCGCGACTTCTCCGATTTGAGCTTCTCTGAGAAGTATCAGAACAAGCTCAAAGATATGCAGGATCGCTTTGATGAGCTGAAAGAC
PROTEIN sequence
Length: 1041
MTDIGKLTYSLSLDTSRFSNQLKSFSRRAEKQGSKAGREMGEEFTQGAKKEMSNLGSIMQGVLQGVGQGAFQAVTSALRGVGSAIGGTVSQAYSLEKSMKTIQGKSGATEKEINKLKDSFIELSGNSTKTGDEIAKAGERMAMLGYTTKQTEGLMKSVVQVSEATGETMDTVTQSIGATINSFDQYNQTQEDAVEVGDKLTAVVNNANVTIDALRTGTSKYGAVANMTNQSVTDMAAAFGVLKDAGMSTEEAGTSLKTALMRLEPSSKKGKEAIKELGVEVRNAEGKMKSMPKILQEFQEGLEGMGEKKKAALLKQIFGAQGIKGFTTLLESAGGKYEELRQTIEDSQGTAAETAEKMRTEWEKFTASVDAMKHRIGAAVLPGLNAALKAINDIYKGIKKVDISFKPITKAVDALGLGFEYSEKAAKNLGKSIGKNINSGIQTAADLIKRVQQFWENNKSTIKEVGRFIENGVENAIDNVLNVVSRLQEFWENNKETIQEIRDQIYNRVKTAITAVVDTVSGWIDRIQKVRENLINSEGLLNSIWNLIQSGYQAWNAVWDVVKLVGTAIKVVVENVGRFINFIAQVITGTNNAADAWSQIGEWLQKLIDWIADFISDTSKLVSLVANGLTKAWKTAGDWLETAAEWMDKQLESGGELVSTVGDNLAGAWKTVKGWVDGITDAVGNIWDMVNGLIDKVGNKLTGAFNKLKDAASNIPLIGGGYGIQGTAQGLDGANKHLKPLLGIAQKHGVQVTSGLRPGSTTSTGNTSMHATGNALDFAGSHEQMKSFAQEVARRFGDRIHSLIYSGAPGAQRARGQPHQFGQPIKKNHWDHVHVAAKKGQLLPLTDRQGSGYGIGSGVGTGQQAVSDPRSNQDFQNTYKAATGKEYSDHEESPVTEAKRKLKQERQSDLKNKADDISRRTEEIEARIERVNKQINDLKAARSTKKQRGGDYQEDKDFTARIQELQEYRDEQKKLLEVERKLANTKLGDAIKGALEDGTQFVRSFSDQLNQLNRDFSDLSFSEKYQNKLKDMQDRFDELKD