ggKbase home page

ar4r2_scaffold_4286_4

Organism: ALUMROCK_MS4_BD1-5_24_44_curated

partial RP 47 / 55 MC: 2 BSCG 34 / 51 MC: 1 ASCG 0 / 38
Location: comp(3219..6518)

Top 3 Functional Annotations

Value Algorithm Source
Uncharacterized protein n=1 Tax=Bacillus sp. CAG:988 RepID=R7F3G2_9BACI similarity UNIREF
DB: UNIREF100
  • Identity: 29.0
  • Coverage: 1004.0
  • Bit_score: 311
  • Evalue 2.80e-81
Uncharacterized protein {ECO:0000313|EMBL:CDE09098.1}; TaxID=1262708 species="Bacteria; Firmicutes; Bacilli; Bacillales; Bacillaceae; Bacillus; environmental samples.;" source="Bacillus sp. CAG:988.;" similarity UNIPROT
DB: UniProtKB
  • Identity: 29.0
  • Coverage: 1004.0
  • Bit_score: 311
  • Evalue 4.00e-81
internalin A similarity KEGG
DB: KEGG
  • Identity: 35.6
  • Coverage: 551.0
  • Bit_score: 288
  • Evalue 4.30e-75

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Bacillus sp. CAG:988 → Bacillus → Bacillales → Bacilli → Firmicutes → Bacteria

Sequences

DNA sequence
Length: 3300
ACTACTGTGGTTTATAATTCTTGTACTTTAGACTGACAAACAATAAATCACAATCAAACTATTACTGCTTATAGTGAAAATAGTATATTATACTGAGCAAGTTATGAGTGTAGTGATAGAAGTCAAGAAAGAACTTGTAATAACGGAGTACTAACTTGAGATGATAGTTACCAATATAAAACTTGTGTAAAATGAACACCTAGTAATTGTGAAGCTAGTTGAAGTTATTTATATAATTCTCATACTTATTCGGTTCCTTCTGTAAATCATGGTGAGACTGCTACAAATATAAATTCCCAAGTAGTGACAATTCCAAACTGAACACAAGTATATAAATTAACAAGTATCTGATGTAATGACTGAGTACTAGTAAATGAAATAGAAGAAGCAAATCCAACAGTAACATGTGACAGTTGATATGTACAAAGTTGAAATAGTTGTGAAATTGCAACATATATAGTAAATTGAGATTTTGGAATAAATGCAAGTGGGGCAACAATAAATGTCTGTGGAACAAACGAAATAGCAGATGCAAACTGACGATTCACAACAACAAGAAATTATGGAAGTGTATGTGATACAATAACAGCAACAAGAACAAATTATGCATGTAGTACAACAACACAATGACCAGCAAGTTTAACTTCTAATATATCTAACATTGCTTGAAGTTGTAGTGCAAATAGTTATACAGTAACATTTGATTGAAACTGATGAACATGACATACTCCAACAACAATAAGTGTAGAATATAATACTGCAATCTGAACACTACCAACAAACCCAACAATGACTTGATACACATTTAATTGATGGTATACACAAGCAAATTGATGAAGTCAAGTAACAACATCAACTGTAGTAGCTTGAAATGCAACAGTATATGCACAATGGTGAATAAATAACTATACAGTGACATTTGACTGAAATGGATGAACTGGACATACTCCAAGTACAATGAGTGTAACATATAATACAGCAATATGAACATTACCAACTAACCCAACAAGAGAATGATATACATTTGCTTGATGGTTCACATCTACTACATGATGAAGTCAAGTAACAACAGCAACTATAGTAACTTGAAACGCAACAGTATATGCTCAGTGGTGAATAAATAACTATACAATAAATTTTGATTGAAACTGATGAACTGGACATGCTCCAACAAGTAAATCAGTAACATATGATACTGCTATATGAACATTACCAACTAACCCAACAATGACTTGATATACATTCAATGGTTGGTATACACAAGTAAGTTGATGAACACAAGTAACAGAATCAACAACAGTATTATGAGATGCAACAGTATATGCACAATGGTGAATAAATAACTATACAGTGATATTTGATGGAAATGGTTGAACATGACATACTCCAACAACAATGAGTATAATATACAATACAGTAATATGAAGTTTACCAACTGAACCTACAATGATATGATATACATTTGCTTGATGGTTTACTGCAACTACATGATGAACACAAATAGATACAAATACTTTAGTATTATGAGATGCAACAGTATATGCACAGTGGACTCTAAACACATATACAGTAAGCTGAGACTTTGGAATAAATGCAAACTGAGCAACGATAAATGTATGTGGAACAAATGTTATGGCAACATCAACATGAACATTTAGTACAACAAGAAACTATGGAGATACATGTAATACAATATCTGCAACAAGAACTTGATATACTTGTAGTACGACAATACAATGACCAGCGAGTTTAACAACAAGTGTATCAAACATAGCTTGATCATTCAAATCCATCTTTTACTACCATCAATGTTTAACAACAAGTGTATCAAACACAGCTTGATCATGTGCCGCAAATACACAAACATTTACATGTTGAGCAAAACCAGCAAATACAGTATGGAATACAGTATCAAGTTATTTACAGACATGGAATTGAACAGCATGGACTCCAATAAATAGTACAACAGTTTATAACACAACACCTAGTACTATTGATTGTAGATATAAATGTTCAAATGGATATTTATATGATTGATCTAGTTGTAGATTACCAACAACATGTGATGAAGTACAGACATATTCTGTAACAAGTGCATATTGAACAGATGTATATGATATAAATACTTGAATTTCTTGAATAACTAAATTAATATGTAATTCATGATGGACATTAGTTTCTAGTCAAGATTTGGATCCTGCAGTAACTTGATGATGATTATTTGCGTCATTAACAGAAGCTAAATACTATAGAGAAAATGATCCTAATCATGCTAAATACTCTATATTAAATCAACTTGAAAATTTTAGAAGCTCAAATTGAGCATTAACATTTAAATTAGAATGGCCTGATTTATGAAATAAATATAATTCTTGGAGTCAAACATCTAATCCATTAATTTCATGAAAGGTAACGTGATTTAACCCTATTTATATAGATTGATATTGAAATGGTTTTGGTTGACTAGAATATTCTGCCTGATCTGCTTTATTAGATTGAACAGTAAATCATTGAAACTGGTTTTATGCTGTATGACAAAATACTGCACGGTTAAATTGAGCAATTCCTGCAGATAGTTTACTAGATCCTTCTTCTCTAATAAATAAGTCTTACAATTTGTGGATAAAGAAAAGTAATCCTGTTTATAGAAAATCATGTAAGGAAATTAAAAATAATTCTAATGTTTATACTTTAAATGCTTATCCACAATGAGAATGATGAATTTATCGTATAGATCCTGATAATAATTGAATATGATTTGACGTAGTATGTGATATGTATACTGATTGAGGTTGATGGACAATGGTTCATAAAACAACAAGCAATACAAGTGATTTAAGCTGAGATTTAACAACAAATGAATGAACAGCAGATCGGTTTGATGACAATGAGTATAGATTATCCATAAATTATTGGAAAAATTTATCTACCGATAAAGCAATGGCAAAAAATATAAATGCAGCTTGACTTTTATGGAATGATATTGAGCCTTGAGTAATTAATTCTATATCAACAACATCAGTTAGTTTTTCTCAAACAGATACTTACAGAATTTTTAATGGCTGAACTGTAGGATGAACTCAAAATAATTGTACATCATGAACAAATTATTGGAATATTTCTTGTTGTTCTAGATGTGTAAATTATGATAATACTACTGCATATTGAACACCAGATAATTCACCAATGGTTAATTATTGATATACATCTTGTCCATATGCAGCAACTGCTGATTTAGCATGATGAACAAATGATGATACTCGGCATAGATTAAGTAAAATGTGATTATTTTTAAGATAA
PROTEIN sequence
Length: 1100
TTVVYNSCTLDGQTINHNQTITAYSENSILYGASYECSDRSQERTCNNGVLTGDDSYQYKTCVKGTPSNCEASGSYLYNSHTYSVPSVNHGETATNINSQVVTIPNGTQVYKLTSIGCNDGVLVNEIEEANPTVTCDSGYVQSGNSCEIATYIVNGDFGINASGATINVCGTNEIADANGRFTTTRNYGSVCDTITATRTNYACSTTTQGPASLTSNISNIAGSCSANSYTVTFDGNGGTGHTPTTISVEYNTAIGTLPTNPTMTGYTFNGWYTQANGGSQVTTSTVVAGNATVYAQWGINNYTVTFDGNGGTGHTPSTMSVTYNTAIGTLPTNPTREGYTFAGWFTSTTGGSQVTTATIVTGNATVYAQWGINNYTINFDGNGGTGHAPTSKSVTYDTAIGTLPTNPTMTGYTFNGWYTQVSGGTQVTESTTVLGDATVYAQWGINNYTVIFDGNGGTGHTPTTMSIIYNTVIGSLPTEPTMIGYTFAGWFTATTGGTQIDTNTLVLGDATVYAQWTLNTYTVSGDFGINANGATINVCGTNVMATSTGTFSTTRNYGDTCNTISATRTGYTCSTTIQGPASLTTSVSNIAGSFKSIFYYHQCLTTSVSNTAGSCAANTQTFTCGAKPANTVWNTVSSYLQTWNGTAWTPINSTTVYNTTPSTIDCRYKCSNGYLYDGSSCRLPTTCDEVQTYSVTSAYGTDVYDINTGISGITKLICNSGWTLVSSQDLDPAVTGGGLFASLTEAKYYRENDPNHAKYSILNQLENFRSSNGALTFKLEWPDLGNKYNSWSQTSNPLISGKVTGFNPIYIDGYGNGFGGLEYSAGSALLDGTVNHGNWFYAVGQNTARLNGAIPADSLLDPSSLINKSYNLWIKKSNPVYRKSCKEIKNNSNVYTLNAYPQGEGGIYRIDPDNNGIGFDVVCDMYTDGGGWTMVHKTTSNTSDLSGDLTTNEGTADRFDDNEYRLSINYWKNLSTDKAMAKNINAAGLLWNDIEPGVINSISTTSVSFSQTDTYRIFNGGTVGGTQNNCTSGTNYWNISCCSRCVNYDNTTAYGTPDNSPMVNYGYTSCPYAATADLAGGTNDDTRHRLSKMGLFLR*