L1_007_000M1_scaffold_37_22

Organism: L1_007_000M1_public_UNK

megabin | RP: 52 / 55 (MC: 52) | BSCG: 51 / 51 (MC: 51) | ASCG: 19 / 38 (MC: 17)
Location: 25047..26045

Top 3 Functional Annotations

Value | Algorithm | Source

Transcriptional regulator AraC family n=1 Tax=Collinsella sp. CAG:166 RepID=R5ZMW8_9ACTN | similarity | UNIREF
DB: UNIREF100
  • Identity: 95.8
  • Coverage: 332.0
  • Bit_score: 650
  • Evalue: 5.90e-184

Transcriptional regulator AraC family {ECO:0000313|EMBL:CDA35691.1}; TaxID=1262850 species="Bacteria; Actinobacteria; Coriobacteriia; Coriobacteriales; Coriobacteriaceae; Collinsella; environmental samples.;" source="Collinsella sp. CAG:166.;" | similarity | UNIPROT
DB: UniProtKB
  • Identity: 95.8
  • Coverage: 332.0
  • Bit_score: 650
  • Evalue: 8.30e-184

putative AraC-type DNA-binding domain-containing protein | similarity | KEGG
DB: KEGG
  • Identity: 29.5
  • Coverage: 325.0
  • Bit_score: 157
  • Evalue: 4.50e-36

Taxonomy

Collinsella sp. CAG:166 → Collinsella → Coriobacteriales → Coriobacteriia → Actinobacteria → Bacteria

Sequences

DNA sequence
Length: 999
ATGCATCGGGCCGCGACCGACATAACCGAGCTCTATGCACCACAGTTTGAGCAGTTTGGCCTGGAGCTTTCCGGCAAGGGCGCCGTCTTTACCGGCGAGGTGGCAAACGATCGGGCACACGGCCGCGCATGGATCATGCCCCTCTCCCCTGCCTGCATCGTCATGGAGCATTTCATCACCCCGACGCACGATATGTACCTGGCCGAATACACACCCGAACCGTACGCGTGTGTGAGCGAAGTCAGCATGCCCACGCTCATGTGCATGCCCGAGGCTGGCATAACGCCCGCGAACCTTAAACCCTTGCATGGACCGTGGCCAAACAATGCCGTTTGCAGCTTTATCCAAGATAGCTGTGGCGAGGAATTAAGCCCGCTGTTTGCGGGGGAGCTCTATCACTCGTGCTCGGTGCTCTTTTTGCCTGGATATTTTGACGAACTGGAGCACCGCTATCCCACCGAGTTCACGGGAATCTTTGAGGCGTTTGCCGAGCCATGGCACGAGGAAGCGACGTCTGCCATCTGCCACACGCTGCGCCGGATTAACGAGGAACGCGCCCGCACCGTAGGCGGACACGTCTATATGCAGGGCATCGTGGAGACCATGGTCGCGGAGCTCGCCTGTTCGCGCGCGGCCCACAAGCAGGCGCGGCAGGCGGCAGACACACGCGCCAGCATAACCATTGCCGAAGAAGCAACGGCAATGATTGAGCGCGCGCTCGACAAAGGCAGACGTGTGGGTGTCAACGAAGTGGCCGAGAGGCTCTACACAAGCCGCTCCAAACTATGCGCCACCTTTAAGGCCCAAACTGGCGAGTCCCTGGGCGCCTACATTCGCAGACGCCGCATGGAGCGAGCACAGGACCTTTTGGCCGATAGCGCGCTCACGATAGCGCAGGTGGCAGAGCGTCTAGGCTACCCTCAGCAGGCCGCCTTCGCCCAAGCCTTCAAGCAACACACCGGCACCACCCCCACCACTTGGCGAAGCAAGCATCGCTAA
Protein sequence
Length: 333
MHRAATDITELYAPQFEQFGLELSGKGAVFTGEVANDRAHGRAWIMPLSPACIVMEHFITPTHDMYLAEYTPEPYACVSEVSMPTLMCMPEAGITPANLKPLHGPWPNNAVCSFIQDSCGEELSPLFAGELYHSCSVLFLPGYFDELEHRYPTEFTGIFEAFAEPWHEEATSAICHTLRRINEERARTVGGHVYMQGIVETMVAELACSRAAHKQARQAADTRASITIAEEATAMIERALDKGRRVGVNEVAERLYTSRSKLCATFKAQTGESLGAYIRRRRMERAQDLLADSALTIAQVAERLGYPQQAAFAQAFKQHTGTTPTTWRSKHR*
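
The figures in this record can be cross-checked with a few lines of Python. The sketch below is illustrative only and rests on two assumptions that the page does not state: that the Location coordinates are 1-based and inclusive, and that the protein was translated with the standard amino acid assignments (the ones shared by NCBI translation tables 1 and 11). The helper names feature_length and translate are hypothetical. Under those assumptions, 25047..26045 spans 999 bases, which is 333 codons, so the listed protein length of 333 appears to count the trailing "*" stop symbol (332 residues plus the stop).

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G base order.
# Assumption: the record was translated with these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def feature_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; stop codons are written as '*'."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


if __name__ == "__main__":
    length = feature_length(25047, 26045)
    print(length, length // 3)                        # 999 333
    # First ten codons and last four codons of the DNA sequence above.
    print(translate("ATGCATCGGGCCGCGACCGACATAACCGAG"))  # MHRAATDITE
    print(translate("AAGCATCGCTAA"))                    # KHR*
```

The two translated fragments match the start (MHRAATDITE) and end (KHR*) of the protein sequence listed above. Passing the full 999-base sequence to translate would give the complete comparison.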