ggKbase home page

L1_008_000M1_scaffold_32_30

Organism: dasL1_008_000M1_concoct_25_sub_fa

near complete RP 49 / 55 BSCG 51 / 51 MC: 2 ASCG 12 / 38 MC: 1
Location: comp(31622..34621)

Top 3 Functional Annotations

Value Algorithm Source
Uncharacterized protein n=1 Tax=Collinsella sp. CAG:166 RepID=R5ZMV7_9ACTN similarity UNIREF
DB: UNIREF100
  • Identity: 99.0
  • Coverage: 999.0
  • Bit_score: 1954
  • Evalue 0.0
Uncharacterized protein {ECO:0000313|EMBL:CDA35676.1}; TaxID=1262850 species="Bacteria; Actinobacteria; Coriobacteriia; Coriobacteriales; Coriobacteriaceae; Collinsella; environmental samples.;" source="Collinsella sp. CAG:166.;" similarity UNIPROT
DB: UniProtKB
  • Identity: 99.0
  • Coverage: 999.0
  • Bit_score: 1954
  • Evalue 0.0
SARP family transcriptional regulator similarity KEGG
DB: KEGG
  • Identity: 40.6
  • Coverage: 985.0
  • Bit_score: 738
  • Evalue 2.40e-210

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Collinsella sp. CAG:166 → Collinsella → Coriobacteriales → Coriobacteriia → Actinobacteria → Bacteria

Sequences

DNA sequence
Length: 3000
ATGGTAAATGATTTGAGCGGTATAGACGACCTCGAGCTGATGCCGCTGGGCGGAGGCTATATGGTGCGACGCGAACGCTTGATGAATGAGCTTATCGCCAAGGGCCGAAAGGCCGGCATCGTGGTGCTTTATGCGCCCGACGGGTTTGGGAAGACCTCGGTGCTGCTGCAGTACACCCATGAGGTTAAATGCGACCCGACGCGAGGCCCCGTGCGGATTATCGAGGCCGATCGCGCCACTGGGCGCGAGGTGTTTATGCAGCTCGAGGTGGTTACCGAAGAACTCAAAGACAAACCGCACTCCCTTATCGCGATTGATAATGTGCCAAACCTCGATCAGCACGATACCGAAGACCTCATCGACAGGATTCGCGGCCTGCGCGCTATGGGGATAGGGGTCTTTATCTCCTGCCGTCCTTCAAATCGTCAGCTGATTCATGGGCTGGGCGATTCGGTTAAGATCAATGCGCAGATGCTCAAGGTGCATGCCTATGAGTATTCGGCGTGGGCATCGGCGTTTTCGATCAGCACATCGCTCGACTTTTATCAGCTCACGCAGGGCGTGCCCGAGCTCGTTTCGGCATTGCAGACGGGTCTGTATGGCCAAGGCGACGTAGCCGGTCTGCTCGAAAACGAGATTGTCAACGTATATGGCGCGGCACTTGCCGATCTGGCTTCGCTCGACAATAATGCGCTGTTTTGCGTGGCATGTCTCATGGTTTTGATGGGCGAGGGCAATATCGCAGAACTCGAGGCTTGCGGGGTGAGGCTGTCGATGGTCGACCAGTCCTACTTTGTACGCGACTACCCGATCTTTGGCTTGGATCCCGCCGAGCGGAGTTTTACGTGTCTGGGCACGGAGGATAACGGACGCTTGCGCTTGCGTAAGTTGGTGGCCGAGGTTCGCCCCGAACTTGTTCCGAGGGCGGCCCGGATTTTGCTTAAGGCGGGCCGATGCGATGCGGCGATGGGCCTGGCGGACGCCTTTTTGGACCGTGAGGCGGTTCTTGAACTTGCTGGGCAGTATGGGGTCGATTTTGCTCTGACGGGGCACGGGGCCGATATCTGCCGAGCAGCGCTGGGCACGATCGACGCTGACGATCCGGCTCCAGAGCCAACGCCTGCCGAGGCGCTTGGCGTATATCTGGCAGCGGTCAGCGTGTCCAACACAAAGCTCGCCCGCTATATGGCATCGGTCATCGAGCGCGGGGGCGAACGGGCGGCCTGCGAAATAGACCCTGTTTCATGGAGGGTGGCCCAGACACTGACGGCTGTTTGCTATGGGGACAACGGGCTTGGGCTTCCTACGAATCTGGCCGTGCCAGAAATCGAGACATCCCATACCGCACTCGATATGTTGTCTGCGCATATCGAGGTCAAGCGCTGTCTGACCGAGCGGGGAAATGACGGCGGCGTTCTACAGCAGCTCAAAACGAGCAGGGCCTACGACTGCGAGCTCGACATCGCGGGGATTGTTATACGGGCCGATAGCATGCTGGTGGAGCTCTTTGATGGGTCGTTTGCGGGAACCGACGAGCGCGACGACGAGATGACGGTGGTGCGGGAGGCGCTCGACGTACGTGGGCTTAAGGCGATGGCTATCTGGGTGCGCATGGTGTTGGCGGCACGGCGGCTCCTTTCCGGCTTGCCGGTAACCGACGATGCGGCGTTCAACGAGCTCGATCGTTTTGCCGTGCGCATGCGCGACAACCAGATTCAGCTGTTTGGCCTGTTGCTAGAAGGTTGGCAGTCGCTGGCAGAGGGACGGCCGGTTAATGCAAAGTTCCGTGCGGTCCAAGTGCTTAAACTTGCCGAGGAGTCGATTGCGTACATGCGCGATAACGCATTGCTGCTGGAGCGCGCAGCGTATCTGCGTAACACGTCGATGGTTTCGGTACGCGAGGAAGCGGAGCTACTCGATATAAGCCAGACGCAGGTTGGTGGCGCAGAAGCCTGGATGGTGGCGCTGCATTTGGCCTGTGCGGGCCGAGACAGCGATCTTGCGGCCTGGATGTCCATGAATCGTGCGGGAATTCTAGAGCCATCGTATCGGCTCTTTGCGCGCCTTGCCATGCATTGTCTGGGCGAGCCGGCGGCCAGGATACGCAAGAAGCTTCCCGTGCGCGAGCTGTCGCGGTATTCGCTGCGCGACGATTTAGAAGGGGAGGGCGAGCGCCTGTTCCAGGCCGGCGAGGTTGAAGCCGTTGATACCATCGACCATATCGAGATTCGTATGTTTGGGGCGTTTCGCGCCGAGCGCGACGGGTTTCCCATCACGGACAAAATGTGGCGCCGCAAGAAGGCGGCGACACTTGCCGAGCGGCTTGCGCTGGGCATGGATGCGCTCGTCGATCGTGAGACGCTGGCCATGGAGCTGTGGCCCCATGCCGAGTTCAATAGCGCCCGCAACAACCTGTACTCGACGATATCGCGCTTGCGGTCGGCTTTGGGGCCGACGCCGGATGGCAAGTCATGCGTGCTCATCCAAAATGAATGCATTGGGCTCAACGGCGACTACGTCAAGACCGATGTACGGCTGTTTGACCAGATAAGCCGCGAGGTTTTGGGAAATCGCATGGGTGCGCGCGGGCCCCATCTGGTCGAGTTGTGCCTTAAGATTGAACAGCTTTATGCCGGACCGTTGTATGTTCCCAATGGCTGTAATCCCACCTACTTTTTGCGCATGCGGCGCATTATGCAGTCAAAGTACATCGATTGCATGATTAAGGGAGCAAATGCCGCTCTAGAAGAAAACGATCTGCAGTCGGCGATTTGGCTGGCGGAGTCGGGGCTTCGACAGGAAACGGCGCGTGAGGATATGGTGCGCTGCGCCATGCGCGTCTACAGTGCGGCGGGGCGACGGCGCGATATCGTCGAGCTGTACAGCGGGCATATGCACCACCTGAGGGAGCAGGTCAATGGCGTGCCCGAGCCCGAGACGCGGCGCCTATACGAGCGCTTGGTGGAGGGGCGGCTCAATCGCGTGCTGGTCGAACGATAA
PROTEIN sequence
Length: 1000
MVNDLSGIDDLELMPLGGGYMVRRERLMNELIAKGRKAGIVVLYAPDGFGKTSVLLQYTHEVKCDPTRGPVRIIEADRATGREVFMQLEVVTEELKDKPHSLIAIDNVPNLDQHDTEDLIDRIRGLRAMGIGVFISCRPSNRQLIHGLGDSVKINAQMLKVHAYEYSAWASAFSISTSLDFYQLTQGVPELVSALQTGLYGQGDVAGLLENEIVNVYGAALADLASLDNNALFCVACLMVLMGEGNIAELEACGVRLSMVDQSYFVRDYPIFGLDPAERSFTCLGTEDNGRLRLRKLVAEVRPELVPRAARILLKAGRCDAAMGLADAFLDREAVLELAGQYGVDFALTGHGADICRAALGTIDADDPAPEPTPAEALGVYLAAVSVSNTKLARYMASVIERGGERAACEIDPVSWRVAQTLTAVCYGDNGLGLPTNLAVPEIETSHTALDMLSAHIEVKRCLTERGNDGGVLQQLKTSRAYDCELDIAGIVIRADSMLVELFDGSFAGTDERDDEMTVVREALDVRGLKAMAIWVRMVLAARRLLSGLPVTDDAAFNELDRFAVRMRDNQIQLFGLLLEGWQSLAEGRPVNAKFRAVQVLKLAEESIAYMRDNALLLERAAYLRNTSMVSVREEAELLDISQTQVGGAEAWMVALHLACAGRDSDLAAWMSMNRAGILEPSYRLFARLAMHCLGEPAARIRKKLPVRELSRYSLRDDLEGEGERLFQAGEVEAVDTIDHIEIRMFGAFRAERDGFPITDKMWRRKKAATLAERLALGMDALVDRETLAMELWPHAEFNSARNNLYSTISRLRSALGPTPDGKSCVLIQNECIGLNGDYVKTDVRLFDQISREVLGNRMGARGPHLVELCLKIEQLYAGPLYVPNGCNPTYFLRMRRIMQSKYIDCMIKGANAALEENDLQSAIWLAESGLRQETAREDMVRCAMRVYSAAGRRRDIVELYSGHMHHLREQVNGVPEPETRRLYERLVEGRLNRVLVER*