ar4r2_scaffold_8809_1

Organism: ALUMROCK_MS4_Thiotrichales-related_46_269

near complete; RP: 41 / 55, MC: 1; BSCG: 43 / 51, MC: 4; ASCG: 11 / 38, MC: 1
Location: 2..1030

Top 3 Functional Annotations

Value: Carboxysome shell protein CsoS2 n=1 Tax=Nitrosomonas eutropha (strain C91) RepID=Q0AHV9_NITEC
Algorithm: similarity
Source: UNIREF
DB: UNIREF100
  • Identity: 31.3
  • Coverage: 316.0
  • Bit_score: 123
  • Evalue: 3.40e-25

Value: Carboxysome structural peptide CsoS2 {ECO:0000313|EMBL:BAP88541.1}; TaxID=1469502 species="Bacteria; Proteobacteria; Betaproteobacteria; Burkholderiales.;" source="Burkholderiales bacterium GJ-E10.;"
Algorithm: similarity
Source: UNIPROT
DB: UniProtKB
  • Identity: 32.9
  • Coverage: 298.0
  • Bit_score: 124
  • Evalue: 2.80e-25

Value: carboxysome shell protein
Algorithm: similarity
Source: KEGG
DB: KEGG
  • Identity: 31.3
  • Coverage: 316.0
  • Bit_score: 123
  • Evalue: 9.60e-26

Taxonomy

Burkholderiales bacterium GJ-E10 → Burkholderiales → Betaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 1029
GGTACCGATGTAGCACAGGGTTCACGTGTTACTGGCGGTGATACAGGTATGAGTCGTTCTATTACGGGTTCTTGCTACATGGATCAACGTCAGGCAGCAAAAGTGCCACTGAGCAAAGTAGAGCAATCTCATACCGCTCGTGGTTCTAGTGTAACTGGCTCATTAGCAACGCGTTCGTCAACCATGACTGGCGGTGAAGCAGGCGCATGCCGTGCGGTTACTGGTAGTGAGTATTTGACGAGTGAAGACTTCAAGTCTTGCGGAATCAACAAGTTTGTTGGTAATGCAGATAAAGTGGTTGCTGGTAGCAGTGCGAAAGGTATGCCAGTCACCGGGCAGTTGCTCGATGACACAGCAGGCCGCGTCACTGGTAATGAAAATCTCAAGAATGAACGTGTCACCGGTAATCAATACCACGACCGTCAATTGGCGAGTTTCAGTGCCCGTAATGGGTCTTCTATCAAGCGATTGAATGCAAGCTTGATTAAAGGTGTGGATGCACAGCAGGAGTCTTCTGGCTTCCCAAAGGCTGAAGGTGATGGACAGAACAGCATGCTGTCTGTCACCGGTCATTCAACAGGCGCTACACCAGAAGTAACGGATGTGATGACGGCTTACTGCCAATCAAACCCGCTACCCTTGGTGAGTAGCAAGCCAGTATCTGGCGACCTGCCATTGAATAATCAACGTATGACAGGCAACGATCGCGGTATTTGCGATAGTGGTGTTGTTACGGGTGGTTATCGGGCAGATACCCTAGTCGATTGTGCACGCCCGAGTGTGCAATCAGAGCAAGCAGCGGTGCCTTATGCATCTTTGGAGCGCAATAACCGCTTAACGGGTAATGCCTCTTGGGATGAGGGTATCGTAACGGGAACACGAGGCTTTGACCGTCGTTCTGCTCCCGCACGTCAATTTATGAATGTTTCTATGGCGCAAACCATGAAGCAGGTTAAAACGGTTGTGCAAGCCCCTCAGGTGGAAGCAGAAGTGATGCCAGTAGCGCCCGTAAAATCAACAAGGGGTTAA
Protein sequence
Length: 343
GTDVAQGSRVTGGDTGMSRSITGSCYMDQRQAAKVPLSKVEQSHTARGSSVTGSLATRSSTMTGGEAGACRAVTGSEYLTSEDFKSCGINKFVGNADKVVAGSSAKGMPVTGQLLDDTAGRVTGNENLKNERVTGNQYHDRQLASFSARNGSSIKRLNASLIKGVDAQQESSGFPKAEGDGQNSMLSVTGHSTGATPEVTDVMTAYCQSNPLPLVSSKPVSGDLPLNNQRMTGNDRGICDSGVVTGGYRADTLVDCARPSVQSEQAAVPYASLERNNRLTGNASWDEGIVTGTRGFDRRSAPARQFMNVSMAQTMKQVKTVVQAPQVEAEVMPVAPVKSTRG*
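
The lengths above are consistent with one another: an inclusive location of 2..1030 spans 1029 nucleotides, which is 343 codons, matching the protein length when the terminal stop (*) is counted. A minimal sketch of that check follows, assuming the standard genetic code was used for translation (the record does not state which table was applied). The helper names feature_length and translate are hypothetical and are not part of ggKbase.

```python
# Sketch only: assumes the standard genetic code (NCBI translation table 1).
# The record itself does not say which translation table was used.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the codon -> amino acid map; codons are enumerated in T, C, A, G order.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def feature_length(start, end):
    """Length in nucleotides of an inclusive, 1-based location start..end."""
    return end - start + 1


def translate(dna):
    """Translate DNA in frame 1; unknown codons become 'X', stops become '*'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


if __name__ == "__main__":
    n_nt = feature_length(2, 1030)
    print(n_nt, n_nt // 3)                  # nucleotides, codons
    print(translate("GGTACCGATGTAGCACAG"))  # first six codons of the DNA above
    print(translate("AGGGGTTAA"))           # last three codons, including the stop
```

Passing the full DNA sequence to translate should reproduce the protein sequence listed above if the standard-code assumption holds for this record.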