ggKbase home page

ar4r2_scaffold_2520_2

Organism: ALUMROCK_MS4_SR1_33_49_curated

near complete RP 41 / 55 MC: 2 BSCG 48 / 51 MC: 2 ASCG 6 / 38 MC: 1
Location: 285..3647

Top 3 Functional Annotations

Value Algorithm Source
hypothetical protein n=1 Tax=candidate division OP11 bacterium UW 659-4-B07 RepID=UPI0003797DC9 similarity UNIREF
DB: UNIREF100
  • Identity: 36.9
  • Coverage: 222.0
  • Bit_score: 116
  • Evalue 1.40e-22
hypothetical protein similarity KEGG
DB: KEGG
  • Identity: 31.3
  • Coverage: 377.0
  • Bit_score: 116
  • Evalue 3.90e-23
Uncharacterized protein {ECO:0000313|EMBL:BAP55221.1}; TaxID=40754 species="Bacteria; Proteobacteria; Gammaproteobacteria; Thiotrichales; Thiotrichaceae; Thioploca.;" source="Thioploca ingrica.;" similarity UNIPROT
DB: UniProtKB
  • Identity: 31.3
  • Coverage: 377.0
  • Bit_score: 116
  • Evalue 1.90e-22

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Thioploca ingrica → Thioploca → Thiotrichales → Gammaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 3363
ATGGGAAAGACGCAACATTTTATCCATTCATTATCTTTTATTATGCAAAAACATTATCACAAGACAAAAACTCATCTTAAAAAATTTCACAAACATTATCTTTTGTGATTTTTTTGATCATTTGCTATGATTAAAATGATTATTCTCTTTTTGTGATTTTTTTCAAGCATTCAACATATCCAACACAATGCAAGTGCATCATGACCTATTTTTATTACCAATGGATTCTGTAGTACGGTTACAGATATAAGCTATGACGAATGTATCTGAATTATAAGTTTATACAACAATACCAATGGTGAATATCGAACAAAAAACAATTGACGATGAGTATCATATGAAGCATGCTCTCGATATGGTGTTTCTTGTAATTGAGGAAATATTTATAGCCTCACCCTCGACAATAACAACCTTGTATGAAATATACCAACTGATTTTTCGAAACTTCTTTATCTTAAATCATTAAATATCTCTCACAACACCATTAATACTATTGCACCAGGATTTAAATGATTAAAATCACTTACGGCATTTGATGCAAGTTACAATATAATCTGAGATACATTAGGAATCGATTCTTTAATTGGTCTTCCCTCATTGCGCTATCTTAATCTTTCTCACAACAACATCTCCATTATCCCAAACACCTTTGATAGCTTTAGCAAAACACTTACAACACTCTACCTTTGATATAATAACATCAGTAACATTTCAATACTTGCGTCTTTGGGATCACTCCAAACACTTTGATTGGAACACAATATTATTGACACATTACCTATGCTTAAATGAATTAAATTAAGCTATCTCAATCTCTCCAAAAACCTCCTTGACAACGCTACCCCAGCCACTATTTGATGATCAATTTCACAGACATTAGAACGACTTGACCTATCTCACAATCGTATTGATACTTTAAATGAATGATTTGCTCAACTAGGTAAAGACAAGATAATTACCTATCTTAATCTTTCTTATAACCTACTCACCAATGCGGATATCCTAAAATGATTTACTAGTCTTCAAACACTCAAAATTGCAAATAATATGCTTACAAACATGATTGATTTTGAAAAATGATCTGCACTTGTTTTTTTAGATTATAGCTTCAATCCACTTAAATCATTTCCATTGGACCAATTATTAACGATTTCAGAAACATTAGAATATCTATTTCTCAGACACAACACCCTTACGTGACCTATTCCAAAAGAACTTATTAACCTCGAATTTCTCGTAGATAAATGAAGCCAAATTGACTACAATTTTCTGGAGGAATATACAACAACCGATCCAGATCTTATTGCTTTTTTGGATAAAAAATTTGTAGCGAATGAAAACTATACTGTATATGATAGAACAAATCGAAGAGATCAATATAGAGACATTGATAACAAAATCACTCTAGCACTTAGCGATCCAAACAAAATATATTATCCTTGAGATGAAATAGAGATCAATATGACATACAATAATCATTGAGTCTCTGCAACAAAAGAACTTAGTTTTTCTTTATACAAAGATACAGAAAATTTTATTCCCAAAGATGATATTCCTTTTGAAAACATGATATACGATGCAACTTTTGATGATACTGATCCATGCTTGGAACAACTAAACATAACAACAAGCTGACCATACCTCAAAGAGTTAAATATTCGAGCACAAAACAACAATCGACCAGATTTCTATAATTATTTACAATCTTCTTATCGTGATAGTGATTTTTATCCTATGGGTTCAGCAAATACATTGTATTCTCATACACGAGGAGAGTCTTTCACTGATTGGATAACAAAATACAACATCTATGATGCAATAATACCTAATTCTTTTGTAGAGGGACTTTCTAGTCGATTATTTACCATCAAAGCTAAGACCATAGCTAGTCAATGATGTGGAATATGATGAACGCCTGTATATACATATACACTCAATCCACTTAACTATAATGACGTACAAACATTTACCTTTACTATCATTACTGATCCAGATACACCTATAGGAACCCTTTCTCTTAGATGAGTCTTAGAAAGTGAAAATCCTTTGTATATACAAAATCTTGCAGATTCTAGCCTTGATATTGAGATAAAAGAAAATCCAATATTTTGTGGTAATGATATTATCGATACATGAGAGGTCTGTGATTGAAATGATTTATTATGAAAAGACTGTACTGATTTTTGATTATTAAATCCTGAAAGAGAGAAACCATTGGGAGATGTTTCTGTAAGTATATGAGGAGATGTTTCTGTAAGTATATGAGAAGATCTATCAGGTAAAGCAAAAATAGTTACATACCTTTCTTGTTCTTCTGATTGTCTCTCTTTTGACACTTGATCATGTATTGCTCCTACCAATTGTACATGAACATCCATCAATGCCAACTGCGAAACCATTTCTTTAGCAAATACAACCACACCAGCTACTACCTGCAGTAGCTATTATAGCACGACTCAAAATACCCAATGTATATATCAAATCAATACTTCTACTAGCAAAGAAAAACCAATTTCCACACAACCTATCAAATCAGTCAACACAACCAATAATTGATCATGATCATGCGTAAAGGGAAATATTTGTATAGTTCCTAGTGTTTCTGCCAAAATACAACCTGCATGATGAGGAGGTTGATGAGGAGTATCAATTTCTCTTAAAAGAGATATTTGTCTTGATTGAGATTTTTCATCAAGTTATTATGATGAAACATGTTGAACTAAACCAGTAGTTCAAGCTCACACACAGGAATCAATAATATGACAAACAAGCACCCAAACGCATAAAGAATGAAACAATGACAGACCAATTACTAGACTAGAACTTGCCCAACTTGTGGTACCATTTGCATCTACTATTTTACATATTACTGAAGATTCAAATAAAGTATGTAGCTATCCTGACATACATAATCTATCAAAAAACGATCAAAATACGATAATCCTTTCATGTCAACTTTACCTTATGGGACTTGAGCCTGATGGTAAAACAAAAAAGGATGTTTTTATCCCAAACACACTCGTCCAATTTAATGAATTTGTAACTGTATACTCTAGGCTCATATATGATAACCTTTACAATGTTCCCTTATCATCAAATAAAAAGTGGTATGAAAATCATCTTGCAGCATTAAAAGATGGTGGAATTGTGGGAAATCCTGTAATTGTCACACAATCCTACGCACAATCTATGCTTGCAAACATACAGCAGAACCCATTACTTGTACAAAGACATGATGCAAATATATGACATGCGGCTGCTGAATCTATTACTGACTTTTCTATTCTCAAAAAAACACCATTTATTAAGAATACACTAAAGGCTCTCTTATGAATATTTAATGCATTTACCATTAATAAATAA
PROTEIN sequence
Length: 1121
MGKTQHFIHSLSFIMQKHYHKTKTHLKKFHKHYLLGFFGSFAMIKMIILFLGFFSSIQHIQHNASASGPIFITNGFCSTVTDISYDECIGIISLYNNTNGEYRTKNNGRGVSYEACSRYGVSCNGGNIYSLTLDNNNLVGNIPTDFSKLLYLKSLNISHNTINTIAPGFKGLKSLTAFDASYNIIGDTLGIDSLIGLPSLRYLNLSHNNISIIPNTFDSFSKTLTTLYLGYNNISNISILASLGSLQTLGLEHNIIDTLPMLKGIKLSYLNLSKNLLDNATPATIGGSISQTLERLDLSHNRIDTLNEGFAQLGKDKIITYLNLSYNLLTNADILKGFTSLQTLKIANNMLTNMIDFEKGSALVFLDYSFNPLKSFPLDQLLTISETLEYLFLRHNTLTGPIPKELINLEFLVDKGSQIDYNFLEEYTTTDPDLIAFLDKKFVANENYTVYDRTNRRDQYRDIDNKITLALSDPNKIYYPGDEIEINMTYNNHGVSATKELSFSLYKDTENFIPKDDIPFENMIYDATFDDTDPCLEQLNITTSGPYLKELNIRAQNNNRPDFYNYLQSSYRDSDFYPMGSANTLYSHTRGESFTDWITKYNIYDAIIPNSFVEGLSSRLFTIKAKTIASQGCGIGGTPVYTYTLNPLNYNDVQTFTFTIITDPDTPIGTLSLRGVLESENPLYIQNLADSSLDIEIKENPIFCGNDIIDTGEVCDGNDLLGKDCTDFGLLNPEREKPLGDVSVSIGGDVSVSIGEDLSGKAKIVTYLSCSSDCLSFDTGSCIAPTNCTGTSINANCETISLANTTTPATTCSSYYSTTQNTQCIYQINTSTSKEKPISTQPIKSVNTTNNGSGSCVKGNICIVPSVSAKIQPAGGGGGGGVSISLKRDICLDGDFSSSYYDETCGTKPVVQAHTQESIIGQTSTQTHKEGNNDRPITRLELAQLVVPFASTILHITEDSNKVCSYPDIHNLSKNDQNTIILSCQLYLMGLEPDGKTKKDVFIPNTLVQFNEFVTVYSRLIYDNLYNVPLSSNKKWYENHLAALKDGGIVGNPVIVTQSYAQSMLANIQQNPLLVQRHDANIGHAAAESITDFSILKKTPFIKNTLKALLGIFNAFTINK*