ggKbase home page

ar4r2_scaffold_2822_3

Organism: ALUMROCK_MS4_Beggiotoa_37_524_curated

near complete RP 52 / 55 MC: 1 BSCG 51 / 51 MC: 2 ASCG 14 / 38 MC: 1
Location: comp(1218..2291)

Top 3 Functional Annotations

Value Algorithm Source
A/G-specific adenine glycosylase n=1 Tax=Beggiatoa alba B18LD RepID=I3CFP8_9GAMM similarity UNIREF
DB: UNIREF100
  • Identity: 59.5
  • Coverage: 346.0
  • Bit_score: 424
  • Evalue 7.40e-116
  • rbh
DNA glycosylase {ECO:0000313|EMBL:KHD05670.1}; TaxID=1003181 species="Bacteria; Proteobacteria; Gammaproteobacteria; Thiotrichales; Thiotrichaceae; Thiomargarita.;" source="Candidatus Thiomargarita nelsonii.;" similarity UNIPROT
DB: UniProtKB
  • Identity: 62.7
  • Coverage: 357.0
  • Bit_score: 446
  • Evalue 2.00e-122
A/G-specific adenine glycosylase similarity KEGG
DB: KEGG
  • Identity: 58.3
  • Coverage: 357.0
  • Bit_score: 430
  • Evalue 3.80e-118

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Candidatus Thiomargarita nelsonii → Thiomargarita → Thiotrichales → Gammaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 1074
ATGTCACATCATCAATTTAGCCAAGATGTATTAGCTTGGTTTGAACAATATGGACGTAAAGACTTACCTTGGCAGCAAAATCCTACGCCTTATCGGGTATGGATATCAGAAATCATGTTGCAACAAACCCAAGTCAACACTGTCATTCCTTACTACCAACGCTTTATGCAACGGTTTCCAAATGTACAACAACTTGCTATTGCGCTCCAAGATGAAGTTTTACATTATTGGACTGGGTTAGGTTATTATGCTAGAGCACGTCATTTACATCAAACAGCGCAACAAATTATAAATCAATACAACGGGATATTTCCAACAAAATTAGATGAATTAATCGCTTTACCCGGTATTGGACGCTCAACTGCTGGTGCTATTTTAGCCTTAGCGTATCAATTACCTTTTCCTATTTTAGATGGCAATGTAAAACGTATCTTGTGCCGTTATTATGCTATTGAACAATGGTCAGGAGAATCTGCCGTTACTCAACGATTATGGGCATTAGCAGAACAACATACGCCTCAAACCCAAGTAGCTGCTTATACTCAAGCCGTGATGGATTTAGGAGCAACTGTGTGTATGCGTAGTAAACCTCGTTGTACATTGTGTCCTTTACAAAAAAATTGTTTAGCTTATCAACAAAATAAAACCGCAATTTATCCCGTTGCCAAACCGCGTAAGACGTTACCTACTAAACAAGTTTACTTCCTAATGTTGCAAAATGCACAAGGACAGATTTTATTAGAAAAAAGGCAAAATTCTGGAATTTGGGGTGGATTATGGAGTTTTCCTGAATATTCTACTCTACAAGAAATCGAACAATGGTGTAAAACTCATTTGTCTTCAATTGATTATACTTTAAATTATTGGGCAACTATGTCACATACTTTTACTCATTTTCATTTATCGATTACTCCAGTCCATATTTTTCTAAGAGCTGCGTGGCAGCAAAATATGCTAAAATCTACGCAGTTGTGGTATGACACGACTCAACCCATTCGTTGTGGTTTAGCCGCACCGGTAACCCGTTTGCTGGCACAACTTGCTTTCCCAAAAATAGGAGAACGATTGTTATGA
PROTEIN sequence
Length: 358
MSHHQFSQDVLAWFEQYGRKDLPWQQNPTPYRVWISEIMLQQTQVNTVIPYYQRFMQRFPNVQQLAIALQDEVLHYWTGLGYYARARHLHQTAQQIINQYNGIFPTKLDELIALPGIGRSTAGAILALAYQLPFPILDGNVKRILCRYYAIEQWSGESAVTQRLWALAEQHTPQTQVAAYTQAVMDLGATVCMRSKPRCTLCPLQKNCLAYQQNKTAIYPVAKPRKTLPTKQVYFLMLQNAQGQILLEKRQNSGIWGGLWSFPEYSTLQEIEQWCKTHLSSIDYTLNYWATMSHTFTHFHLSITPVHIFLRAAWQQNMLKSTQLWYDTTQPIRCGLAAPVTRLLAQLAFPKIGERLL*