ggKbase home page

ar4r2_scaffold_8270_4

Organism: ALUMROCK_MS4_Thiothrix_nivea-related_50_537_curated

megabin RP 46 / 55 MC: 7 BSCG 48 / 51 MC: 12 ASCG 13 / 38 MC: 6
Location: comp(803..4261)

Top 3 Functional Annotations

Value Algorithm Source
hypothetical protein n=1 Tax=Thiothrix disciformis RepID=UPI00036724B0 similarity UNIREF
DB: UNIREF100
  • Identity: 35.2
  • Coverage: 1206.0
  • Bit_score: 537
  • Evalue 3.30e-149
  • rbh
PBS lyase HEAT-like repeat {ECO:0000313|EMBL:EDN70822.1}; TaxID=422289 species="Bacteria; Proteobacteria; Gammaproteobacteria; Thiotrichales; Thiotrichaceae; Beggiatoa.;" source="Beggiatoa sp. PS.;" similarity UNIPROT
DB: UniProtKB
  • Identity: 44.9
  • Coverage: 539.0
  • Bit_score: 456
  • Evalue 8.00e-125
hypothetical protein similarity KEGG
DB: KEGG
  • Identity: 29.9
  • Coverage: 395.0
  • Bit_score: 165
  • Evalue 5.70e-38

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Beggiatoa sp. PS → Beggiatoa → Thiotrichales → Gammaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 3459
ATGACCAACGAAGAAAGAAACCTCCTCTCTTCTGATCCTTATCCCGGTCTACGGCCTTATCATGAGGATGAGCAGGACAAGTTTTTTGGGCGCGATGCGGATGTGGAAGTCCTGATCGACAAGGTGTTGACCAATCGCTTGACGTTGTTGTTTGCGGCGAGCGGGGTGGGCAAAAGTTCGCTGTTGCAGGCGGCGGTGATCCCGCGTTTGAAATCGCCGACAGGGGAGAATCTGGAGGTGGTGTACCACATCGACTGGGTGAGCGAGTCAGTCAGCAGTGTGCGGGCAGCGGTGCTACAAGCTTTGCACGGCAGCGGCAAGTTGCCCGAAGGTGCGGCGGATGAGGCGGGGGAAACACTGGCAGAGTTGCTGGAATTCTGCGGGCTGTTTGTGCGTCATCCGCTGGTACTGATCCTTGACCAGTTTGAAGAGTTTTTCCGTTATCAGCGGGCGAGTGCCAACTTCCAGCCCTTCATTGAGCAATTGACAGCAATCATCACTAATCCGCAATTGCCCGTGAGTCTGGTGCTGTCGATGCGCGAAGATTTCGCACTGGAACTCAATGCGTTCAAGCCGCGCTTGCCGACCATCCTGTTTGAAAACTTCTATCGGTTGGAAAAGCTGGGGCGGGTGGGCGCACGCGAGGCGATTGTTACACCCGTGGAACAAGTCGGCTTCCGCTATGAATCTGTGCTGCTGGAAACGCTGTTGGATGATTTGCTCAGTCGTGAACTTGACCGCTTGCCGAACTCCCCAGCGGCGGAATTGCTGGAAACGGTCGAGCCGCCTTATTTGCAAATCGTGTGTGCGCAGTTGTGGGACTTAAACCGAGCCGACCCTGACAAGCTATTACGGTTGGTAACTTATGAGAAGGCGGGGAAGGCGAAAGGCATCCTGAGCAATTACCTGAAAGGGGTGTTGCAGGGATTCTCGCTCACGGAAAAACAGCTTGCTTCCAAAGCCTTTGACCATTTGGCAAGCCAGCGCGGGGTGAAGATGGCTTACACCGCTGCGGCGTTGGCGGAAACAGTGCGGGAAAGCGAAGCCGCTTTGGGCAAGGTGCTGGATAAGCTGGCGGCGGTGCGGATTTTGCGTACTCAACAGCGCGGAGAAGCGACGTGGTATGAGCTTTACCACGATATGTTTTCTGGCAGTATCGAAAGCTGGAATGGTGAGTGGAAAGACCGGATGCGGCGGCGCAAGTTTGCGCAGGTGACTGTGGTTGGATTTGTCAGCATTGCGGCTTTATTCGCAGGGTGGGATTACTACGTCAATGCCTCAAATTATCACTTACGTCTTAGCCCCAAACAGGGCATCTCTGACCGGGTGGAATTATGGCAGGGCAAGCTGGGGAGTTTTGACCTATTCCACCAACAGCGTTATTTGGCTGAAACTGATTTTGAGCGCAATCAGCTTGAGCCGGATAAACAGTTTAATCAAAAGCCAGTGGCAGATTACGACGAGTTGCAGACGGAACTGACAGGTAGCTTGCCTATCGAATCGCGGGTTACAGCTTATGCCAACGCTGGGGATTCCAAAGAAGCATTGATGTTGGCGGAAAAAGCCATCACCCCAGAACACAGTGAACTTGCTAAGCGAACATTGTCGTCTCTTGAAGTCATGACCGTACCTGATGCAGCGCGGATGATTTATCGCTTGCTGATAACTCCTGAACCACATAACGATCAAATTAAAGAGTCAATTTCTAATATTACTAATACTGCTATTGGGATTTTTTTGGTGCAGGAGGGGGCGGTTATTACGCCCGATATGTCCGAAGCAAGCAAGAGTTTTTGGGGGAAGCAATTGGTCGGTCAGGTTGCTACCAAAGTTGCACGAGATCAGTTGATGACGAATGGCGAGAGAAAAAGCGACAATGATCTAGTAGTATTATCTGACTTTTATGGTCTAATTGATGTCAATTCGTTTTTGCTTGGTTTACTTAAGGACAAGACGGACAAGTATGCTAATCGCCATTCAATGATGCTTATCTTGGGGGCGCGTCATGCGAAGGATGCTATTCCTATACTGCTGGGTTTTCTCAAGGATGAGGATGCCCGTGTTCGCGTATCAGCGGCAAATGCCTTGGCTGATTTGCAGGTCAAAGATGCTATTCCCATGTGGTTGGGCTTGCTCAAGGATCAGGCGGTCGAAGTCCGCATATCAGCGGCAAGTGCTTTGGCTGATTTGCAGGTCAAGGATGCCATTCCCGTGCTGCTGGGCTTGCTCAAGGATCAGGATACTAACGTCCGCTTATTAGCGGCAGATGCATTAGCTAAGTTGCAGGTCAAGGATGCCATCCCCGTGCTGTTGGGCTTGCTCAAGGAACAGAATGCCGATGTCCGCTTATTAGCGGCAGATGCATTAACTAAGTTGCAGGTCAAGGATGCCATTCCCGTGCTGCTGGGCTTGCTCAAGGATCAGGATGAAAATGTTCGCTCATTAGTTGCAGGGGAGTTGGTTAGGTTGCAGGGTCAGGATGCCCCCTCCATGCTGTTGGGCTTGCTCAAGGATCAGGATAAAAATGTCCGCTTATCAGTGGCAAGTTCTTTGGCTGATTTGCAGGTCAAGGATGCCATTCCCGTGCTGCTGGGTTTGCTCAAGGATCAGGATGCTGAAGTCCGCTCATCAGCGGCAAGTGCCTTGGTTAAGTTGCAGGGCAAAGATAGCATCCTCATGCTGTTGAGTTTGCTCAAGGATCAAGATGCCAGTGTCCGGGATCTAGCAGCAAGTACCTTGGGGAAATTGCAGGTCAAAGAATTTATATCTGGTCATTATCCATCCATTGACGGTGAAGGAAAAGAAAAAATCCAGAAACTTCCCATTAACCAACCCGTAGTGGAAGATAAGCCGCTGACGCTGGACGAACTCAAAGCCAAACTGGATGGTTTCGACCATGCCTACGCCGACTGGCGCGAACGCCGCGATGCTGATGCGTCGACTCGCAACGAAGTCGATAAACTCGCCGATTCCGCCCTTTTCATCTACGAATACGCCTATGCCATCGCGGGGATGGACGAAGCAGAAGGCATCAAACTGTTAAGCCACAACCTCTGCAAAGTCCGTGAAGCCGCCGCCCATAGCCTCGCCGACAGTCACTTCCTCGGCGTGCCGCTGCTGCAAAAGCTTGAACAGGCATGGCTCACCACCGACAACCCCATCACCCGCCAAAGTTTGTTCCACGCCATCGATCTTGCACTACGGGCGATGGAACGCAATGGCAGAGGCAAGGAACTTGTTGCGTTGAAAACTTACGAACCTATCCTGACCAACGGGCGTTCCGCCACCTCCATCAAACCACGTGTCGAATGGACAGTCGCACAACTCCAATGGCGCGAAAATGCACGGGCAGAATTGGCGCAGGAAGCAAGAGACCTCCCCGCCGATACGCTAAAAGAATACTGCCTCAACCCTGATGGCACGGAGATCAAGCCAGAGGAATGCACGTTTAAACGCTAA
PROTEIN sequence
Length: 1153
MTNEERNLLSSDPYPGLRPYHEDEQDKFFGRDADVEVLIDKVLTNRLTLLFAASGVGKSSLLQAAVIPRLKSPTGENLEVVYHIDWVSESVSSVRAAVLQALHGSGKLPEGAADEAGETLAELLEFCGLFVRHPLVLILDQFEEFFRYQRASANFQPFIEQLTAIITNPQLPVSLVLSMREDFALELNAFKPRLPTILFENFYRLEKLGRVGAREAIVTPVEQVGFRYESVLLETLLDDLLSRELDRLPNSPAAELLETVEPPYLQIVCAQLWDLNRADPDKLLRLVTYEKAGKAKGILSNYLKGVLQGFSLTEKQLASKAFDHLASQRGVKMAYTAAALAETVRESEAALGKVLDKLAAVRILRTQQRGEATWYELYHDMFSGSIESWNGEWKDRMRRRKFAQVTVVGFVSIAALFAGWDYYVNASNYHLRLSPKQGISDRVELWQGKLGSFDLFHQQRYLAETDFERNQLEPDKQFNQKPVADYDELQTELTGSLPIESRVTAYANAGDSKEALMLAEKAITPEHSELAKRTLSSLEVMTVPDAARMIYRLLITPEPHNDQIKESISNITNTAIGIFLVQEGAVITPDMSEASKSFWGKQLVGQVATKVARDQLMTNGERKSDNDLVVLSDFYGLIDVNSFLLGLLKDKTDKYANRHSMMLILGARHAKDAIPILLGFLKDEDARVRVSAANALADLQVKDAIPMWLGLLKDQAVEVRISAASALADLQVKDAIPVLLGLLKDQDTNVRLLAADALAKLQVKDAIPVLLGLLKEQNADVRLLAADALTKLQVKDAIPVLLGLLKDQDENVRSLVAGELVRLQGQDAPSMLLGLLKDQDKNVRLSVASSLADLQVKDAIPVLLGLLKDQDAEVRSSAASALVKLQGKDSILMLLSLLKDQDASVRDLAASTLGKLQVKEFISGHYPSIDGEGKEKIQKLPINQPVVEDKPLTLDELKAKLDGFDHAYADWRERRDADASTRNEVDKLADSALFIYEYAYAIAGMDEAEGIKLLSHNLCKVREAAAHSLADSHFLGVPLLQKLEQAWLTTDNPITRQSLFHAIDLALRAMERNGRGKELVALKTYEPILTNGRSATSIKPRVEWTVAQLQWRENARAELAQEARDLPADTLKEYCLNPDGTEIKPEECTFKR*