ar4r2_scaffold_2520_1

Organism: ALUMROCK_MS4_SR1_33_49

near complete
  • RP 44 / 55, MC: 3
  • BSCG 48 / 51, MC: 2
  • ASCG 8 / 38, MC: 1
Location: 168..3530

Top 3 Functional Annotations

Value: hypothetical protein n=1 Tax=candidate division OP11 bacterium UW 659-4-B07 RepID=UPI0003797DC9
Algorithm: similarity
Source: UNIREF
DB: UNIREF100
  • Identity: 36.9
  • Coverage: 222.0
  • Bit_score: 116
  • Evalue: 2.30e-22

Value: hypothetical protein
Source: KEGG
DB: KEGG
  • Identity: 39.3
  • Coverage: 117.0
  • Bit_score: 73
  • Evalue: 4.80e-10

Value: seg 575..590 Tax=ACD80
Algorithm: similarity
Source: UNIPROT
DB: UniProtKB
  • Identity: 44.2
  • Coverage: 120.0
  • Bit_score: 83
  • Evalue: 2.30e-12

Taxonomy

ACD80 → SR1 → Bacteria

Sequences

DNA sequence
Length: 3363
ATGGGAAAGACGCAACATTTTATCCATTCATTATCTTTTATTATGCAAAAACATTATCACAAGACAAAAACTCATCTTAAAAAATTTCACAAACATTATCTTTTGTGATTTTTTTGATCATTTGCTATGATTAAAATGATTATTCTCTTTTTGTGATTTTTTTCAAGCATTCAACATATCCAACACAATGCAAGTGCATCATGACCTATTTTTATTACCAATGGATTCTGTAGTACGGTTACAGATATAAGCTATGACGAATGTATCTGAATTATAAGTTTATACAACAATACCAATGGTGAATATCGAACAAAAAACAATTGACGATGAGTATCATATGAAGCATGCTCTCGATATGGTGTTTCTTGTAATTGAGGAAATATTTATAGCCTCACCCTCGACAATAACAACCTTGTATGAAATATACCAACTGATTTTTCGAAACTTCTTTATCTTAAATCATTAAATATCTCTCACAACACCATTAATACTATTGCACCAGGATTTAAATGATTAAAATCACTTACGGCATTTGATGCAAGTTACAATATAATCTGAGATACATTAGGAATCGATTCTTTAATTGGTCTTCCCTCATTGCGCTATCTTAATCTTTCTCACAACAACATCTCCATTATCCCAAACACCTTTGATAGCTTTAGCAAAACACTTACAACACTCTACCTTTGATATAATAACATCAGTAACATTTCAATACTTGCGTCTTTGGGATCACTCCAAACACTTTGATTGGAACACAATATTATTGACACATTACCTATGCTTAAATGAATTAAATTAAGCTATCTCAATCTCTCCAAAAACCTCCTTGACAACGCTACCCCAGCCACTATTTGATGATCAATTTCACAGACATTAGAACGACTTGACCTATCTCACAATCGTATTGATACTTTAAATGAATGATTTGCTCAACTAGGTAAAGACAAGATAATTACCTATCTTAATCTTTCTTATAACCTACTCACCAATGCGGATATCCTAAAATGATTTACTAGTCTTCAAACACTCAAAATTGCAAATAATATGCTTACAAACATGATTGATTTTGAAAAATGATCTGCACTTGTTTTTTTAGATTATAGCTTCAATCCACTTAAATCATTTCCATTGGACCAATTATTAACGATTTCAGAAACATTAGAATATCTATTTCTCAGACACAACACCCTTACGTGACCTATTCCAAAAGAACTTATTAACCTCGAATTTCTCGTAGATAAATGAAGCCAAATTGACTACAATTTTCTGGAGGAATATACAACAACCGATCCAGATCTTATTGCTTTTTTGGATAAAAAATTTGTAGCGAATGAAAACTATACTGTATATGATAGAACAAATCGAAGAGATCAATATAGAGACATTGATAACAAAATCACTCTAGCACTTAGCGATCCAAACAAAATATATTATCCTTGAGATGAAATAGAGATCAATATGACATACAATAATCATTGAGTCTCTGCAACAAAAGAACTTAGTTTTTCTTTATACAAAGATACAGAAAATTTTATTCCCAAAGATGATATTCCTTTTGAAAACATGATATACGATGCAACTTTTGATGATACTGATCCATGCTTGGAACAACTAAACATAACAACAAGCTGACCATACCTCAAAGAGTTAAATATTCGAGCACAAAACAACAATCGACCAGATTTCTATAATTATTTACAATCTTCTTATCGTGATAGTGATTTTTATCCTATGGGTTCAGCAAATACATTGTATTCTCATACACGAGGAGAGTCTTTCACTGATTGGATAACAAAATACAACATCTATGATGCAATAATACCTAATTCTTTTGTAGAGGGACTTTCTAGTCGATTATTTACCATCAAAGCTAAGACCATAGCTAGTCAATGATGTGGAATATGATGAACGCCTGTATATACATATACACTCAATCCACTTAACTATAATGACGTACAAACATTTACCTTTACTATCATTACTGATCCAGATACACC
TATAGGAACCCTTTCTCTTAGATGAGTCTTAGAAAGTGAAAATCCTTTGTATATACAAAATCTTGCAGATTCTAGCCTTGATATTGAGATAAAAGAAAATCCAATATTTTGTGGTAATGATATTATCGATACATGAGAGGTCTGTGATTGAAATGATTTATTATGAAAAGACTGTACTGATTTTTGATTATTAAATCCTGAAAGAGAGAAACCATTGGGAGATGTTTCTGTAAGTATATGAGGAGATGTTTCTGTAAGTATATGAGAAGATCTATCAGGTAAAGCAAAAATAGTTACATACCTTTCTTGTTCTTCTGATTGTCTCTCTTTTGACACTTGATCATGTATTGCTCCTACCAATTGTACATGAACATCCATCAATGCCAACTGCGAAACCATTTCTTTAGCAAATACAACCACACCAGCTACTACCTGCAGTAGCTATTATAGCACGACTCAAAATACCCAATGTATATATCAAATCAATACTTCTACTAGCAAAGAAAAACCAATTTCCACACAACCTATCAAATCAGTCAACACAACCAATAATTGATCATGATCATGCGTAAAGGGAAATATTTGTATAGTTCCTAGTGTTTCTGCCAAAATACAACCTGCATGATGAGGAGGTTGATGAGGAGTATCAATTTCTCTTAAAAGAGATATTTGTCTTGATTGAGATTTTTCATCAAGTTATTATGATGAAACATGTTGAACTAAACCAGTAGTTCAAGCTCACACACAGGAATCAATAATATGACAAACAAGCACCCAAACGCATAAAGAATGAAACAATGACAGACCAATTACTAGACTAGAACTTGCCCAACTTGTGGTACCATTTGCATCTACTATTTTACATATTACTGAAGATTCAAATAAAGTATGTAGCTATCCTGACATACATAATCTATCAAAAAACGATCAAAATACGATAATCCTTTCATGTCAACTTTACCTTATGGGACTTGAGCCTGATGGTAAAACAAAAAAGGATGTTTTTATCCCAAACACACTCGTCCAATTTAATGAATTTGTAACTGTATACTCTAGGCTCATATATGATAACCTTTACAATGTTCCCTTATCATCAAATAAAAAGTGGTATGAAAATCATCTTGCAGCATTAAAAGATGGTGGAATTGTGGGAAATCCTGTAATTGTCACACAATCCTACGCACAATCTATGCTTGCAAACATACAGCAGAACCCATTACTTGTACAAAGACATGATGCAAATATATGACATGCGGCTGCTGAATCTATTACTGACTTTTCTATTCTCAAAAAAACACCATTTATTAAGAATACACTAAAGGCTCTCTTATGAATATTTAATGCATTTACCATTAATAAATAA
Protein sequence
Length: 1121
MGKTQHFIHSLSFIMQKHYHKTKTHLKKFHKHYLL*FF*SFAMIKMIILFL*FFSSIQHIQHNASAS*PIFITNGFCSTVTDISYDECI*IISLYNNTNGEYRTKNN*R*VSYEACSRYGVSCN*GNIYSLTLDNNNLV*NIPTDFSKLLYLKSLNISHNTINTIAPGFK*LKSLTAFDASYNII*DTLGIDSLIGLPSLRYLNLSHNNISIIPNTFDSFSKTLTTLYL*YNNISNISILASLGSLQTL*LEHNIIDTLPMLK*IKLSYLNLSKNLLDNATPATI**SISQTLERLDLSHNRIDTLNE*FAQLGKDKIITYLNLSYNLLTNADILK*FTSLQTLKIANNMLTNMIDFEK*SALVFLDYSFNPLKSFPLDQLLTISETLEYLFLRHNTLT*PIPKELINLEFLVDK*SQIDYNFLEEYTTTDPDLIAFLDKKFVANENYTVYDRTNRRDQYRDIDNKITLALSDPNKIYYP*DEIEINMTYNNH*VSATKELSFSLYKDTENFIPKDDIPFENMIYDATFDDTDPCLEQLNITTS*PYLKELNIRAQNNNRPDFYNYLQSSYRDSDFYPMGSANTLYSHTRGESFTDWITKYNIYDAIIPNSFVEGLSSRLFTIKAKTIASQ*CGI**TPVYTYTLNPLNYNDVQTFTFTIITDPDTPIGTLSLR*VLESENPLYIQNLADSSLDIEIKENPIFCGNDIIDT*EVCD*NDLL*KDCTDF*LLNPEREKPLGDVSVSI*GDVSVSI*EDLSGKAKIVTYLSCSSDCLSFDT*SCIAPTNCT*TSINANCETISLANTTTPATTCSSYYSTTQNTQCIYQINTSTSKEKPISTQPIKSVNTTNN*S*SCVKGNICIVPSVSAKIQPA**GG**GVSISLKRDICLD*DFSSSYYDETC*TKPVVQAHTQESII*QTSTQTHKE*NNDRPITRLELAQLVVPFASTILHITEDSNKVCSYPDIHNLSKNDQNTIILSCQLYLMGLEPDGKTKKDVFIPNTLVQFNEFVTVYSRLIYDNLYNVPLSSNKKWYENHLAALKDGGIVGNPVIVTQSYAQSMLANIQQNPLLVQRHDANI*HAAAESITDFSILKKTPFIKNTLKALL*IFNAFTINK*
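
The coordinates and lengths in this record can be cross-checked with a short script. The sketch below is illustrative only: the helper names `span_length` and `translate` are hypothetical, and it assumes the displayed protein is a plain translation under the standard genetic code (NCBI table 1), in which TGA is written as `*`. Under that assumption, the first codons of the DNA sequence reproduce the start of the protein sequence shown above.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the record's protein display uses this table, so TGA prints as "*".
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def span_length(location):
    """Length in bases of an inclusive 'start..end' location string."""
    start, end = (int(x) for x in location.split(".."))
    return end - start + 1


def translate(dna):
    """Translate DNA codon by codon; unknown codons become 'X'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


if __name__ == "__main__":
    n_bases = span_length("168..3530")
    print(n_bases)       # 3363, the DNA length reported in the record
    print(n_bases // 3)  # 1121, the protein length (stop symbols included)
    # First 12 codons of the DNA sequence in this record.
    print(translate("ATGGGAAAGACGCAACATTTTATCCATTCATTATCT"))  # MGKTQHFIHSLS
```

The protein length of 1121 counts every codon, including those rendered as `*`, which is why it equals the DNA length divided by three.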