ggKbase home page

L3_105_000M1_scaffold_4187_21

Organism: dasL3_105_000M1_concoct_33_fa

near complete RP 51 / 55 BSCG 51 / 51 MC: 1 ASCG 12 / 38 MC: 1
Location: 16330..19686

Top 3 Functional Annotations

Value Algorithm Source
SCP-like protein n=1 Tax=Collinsella aerofaciens ATCC 25986 RepID=A4ED70_9ACTN similarity UNIREF
DB: UNIREF100
  • Identity: 87.8
  • Coverage: 1119.0
  • Bit_score: 1822
  • Evalue 0.0
Uncharacterized protein {ECO:0000313|EMBL:KGI72394.1}; TaxID=742722 species="Bacteria; Actinobacteria; Coriobacteriia; Coriobacteriales; Coriobacteriaceae; Collinsella.;" source="Collinsella sp. 4_8_47FAA.;" similarity UNIPROT
DB: UniProtKB
  • Identity: 93.7
  • Coverage: 1118.0
  • Bit_score: 1976
  • Evalue 0.0
hypothetical protein similarity KEGG
DB: KEGG
  • Identity: 25.5
  • Coverage: 1038.0
  • Bit_score: 209
  • Evalue 3.30e-51

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Collinsella sp. 4_8_47FAA → Collinsella → Coriobacteriales → Coriobacteriia → Actinobacteria → Bacteria

Sequences

DNA sequence
Length: 3357
ATGACCCAGGGAAAGCACTTCGCTGCCGGTGGCGATCACTTCGCCGCACTGCCAAACAAAAAGATTCACGCCCCGAAGGGGATCGCGCGTCTGACCGTCACCGCGGCGACCATGGCCGCCATGACCGGATCGAGCCTCATCTCACCGTTCACGGCATTCGCCCAGACCGGTGACGGGGGCACGCAGCACCCAGCCGTGATGTCGCCGATTGCCGCTCACGCCGGCGCGGCATCCGGTACCGGCGCGAAATCCGCCGCACAGACCATCGCGGACCTGCAGAAGGCAGTCGACGAGGCGAAGGCGAAGGAGGACGCCGCCAAGGCATCCTACGATGAGGCCGCCGGTCCCTACAACGAGGCGGCTTCCGCCCGCGACCAGGCCAAGGCATCCTACGATTCGGCAGTCAGCGCCGGCACGGCAGCCGACCGAGCGGCAATGGACGAGTACGCCCGTCAGGTCGCGGAGGGCAAGGACGCCGCCGATGCCGCCGGCAAGGACCTCGAGCAGGCGAAGGCGGGTCTCGCAGACGCTAAGGCGGATGCGTCCGAGAAGGACGAAGCGTACCAGTCCGCCCTCAAGGCGGCTCAGGATGCCAAGGATGCCCTCGATAAGGCCAAGGCAGATGCTGTAAGCGCCACCCCGGAGGCAATCCGTGCCGCCGAGCAGGCCGTGCGCGACGCCCAAGATGCCGTGAGCAGGGCACAGGCCGACCTCGCCAACGCCAACGCGACGCTTGCCGATGCCCAGTCCAAGCTGGTTGCGGCCCAGTCTGCAAAGGATTCCGCAGATACCGTCCTCGCCGCAGCCCAGCAGAACAAGGACACGGCGGATGCGAAGGCCGCAGCCGCCAGCGCCGCCTACGAGCAGGCGAAAGCCGACCTCGCCGCAGCGGAGGCCGGTGCCAGCGGTCCGGAGTACGATGCGGCAAAACAGAAGGTCGCGGACGCAGAGGCAACCCTCGCCGCCGCACGAGCGGTGCAGTCCCAGTGCGAGTCCGAGCTTGAGCAGGTGCAGTCCGCCGCGGCAACGGCACAGACCGAGCTAAATGATGCCCAGGCATCCCTCTCTGCGAAGCAGCAGGCCGCGGCCGATGCCGAATCCGGTGTGACCGCCGCCCAGTCCGCTCTCGCTGCCGCGAACGCCGACCTCGATGCCGCCAAGCAGGCGAACGTCGACGCCATCGCCAAGCTGGATGCAGCGAAGCAGGCTGTCAAGGATGCCGAGTCCGCCAAGGCAGCCGCCGACGAAGAGCTTGCGAACGCCAAGACCGCAAAGGATACCGCTGACGCGGCCGTGAACGCCGCCCAGCAGAAGGTCGACGAGGCACAGGCAAAACTCGATTCCGCCGACGCGCAGCTCAAGCAGGGAGCCATCGGCTTCTTCCGTGCCATGGGTGCCGAAGATGCAGCCAACATCATCCTGAACGCCAAATATGCCGGAAAGACCGAAGTCGGCAACTCCAAGGACGCCACATCCCTTGATAACATGCTCAATGCGATCAGGTGGATGAAGTCCGTCAACGACTACCGCAAGTCGGTCGGCCTGTCCGAGCTCCATGTCACCTACAAGCTCATCGCCGGTGCGATCGCAGATGCCAACTACAGCGACACCGTGCTCGATCATGCCCGTCAATATGATTTTGCCGAAAATCTGGCATGGAATTACGGAATCGATCCGAGTGGGCAGTGGATCGAGCAGGAGAAGGGCTTTTTCGACAAGGCAACGGAGGCCCTTTATGGAGTCACCGGTCTTGTCGGCAAGGATGCCTATGATTTCTATGCAAAGAACGGTGTCGCAATCAACCATTGGATTGCAGACAACTGCCACTGGGAGAACGGCAGTAGCGGCACCGTCGGCCATTACATGAACATCATCAATCCCGAACTCGCCGTTATGGGAATGGCAACCTGCACCAAGGGCACCATGTCCGGGCTTCAGACGCAGTGCTACACCGCAGAGATCTCCGGCTGGTTCGGCTCCGGTTGGAACACGAACCCCATCTCCGTCGACGAGTACGAGCAGAAGCTGACCTCCTATATCAACGGCCTCAAGAACGCCAAGAGCGCACTCGACGCAGCCAAGACCAATCTCGCATCCAAGAAGCAGGCAGCCGCCGGCGCCGCCGCAACCGTCCAGCAGAAGCAGGTCGCAGCGGATTCCGCACAGGCAGGTGTCGATGCCGCAAAGCAGGGTGTTGCCGATGCCCAGCGTGCCGTCGATGTCGCAAAGGCCGATCTCGCATCCAAGCAGCAGGGCGTCACGGATGCCCAGACCGAGCTTAATGCGGCGAAATCGGACCTCGATGCCGCCAACGCGGCAGTCGATACGGCAAAGTCGACCGTCCAGCAGAAGCAGGTCGCCTTTGATGCCGCCAATGCGGTCGTAACGGCCGCACAGACCAAGCTGGATTCCGCCAAGGCGGACACCGAGGCCAAGCAGCAGGATGTCATGGATGCGAACGCCGATCTCGCGAAGTTCTTCCAGGACGTCGCCGACGCCAAGAAGGCGGTCGATACAGCAAAGAGCGTCCATGATGCGGCGGTAGCCGATCAGGTTGAGAAGGCGGCGGTCCTCGCCGCCGCCGAGCAAAAGGCGGACGCCACCGCACGCGACCTCGCGGATGCCCAGCGTGCCGTCGATGCCGCAAAGGCCGACACCGGCGTTGTCGCCGACAGGCTTACCGGCTCCCAGACCGATCTCGAGGACGCACAGTCGAACCTCGACATCCTCACCGGCCTTGCCGCGAAGCTCGCAGAGGCACAGCAGCGCGAGCAGGATGCAGTAAAGGCCGTCAATGACACCAAGGCCGCGCTTGATGCCGCGAAGGCGAATACCATCGCCGCCGAGTCCCTGGTCTCCGCCGCCGAGCAGGCAAAGGCGCAGGCAGACGCCAAGCTGTCGAAGCTGAACTCCATCGATGCCGGCGCGGCGATCGCTTCCGGCCATGATGTGAACGCGGATGACGCCCTCAACGCGCTTTTCGCCGCAGCAGTCGAGGCACGTGCCAAGGTCGCGCCCGCCAAGGCCATCCTGGACGAGAAGCAGGCCGCGGTGGACGAGCTCCAGCCCGGCTATGATGCGGCACTCGCCGCCTACGAGTTGGCGAAGTCCGACCGCATCGCAGCGGAGCAGAAGCTTTCCGATGAGATCGCACGACAGGAGGCAGAGGAGGTCGCCAAGCAGCAGGCGGCATACACCCCGAAGCACCTCGCCGGCACGGATACCGCCCAGACCGGCAGCCTCGCCCAGACCGGTGACTGCGCGGGACTCATCGGCGAGACGTTCGTCATCGGCGGTACCGTCCTCGTGGCGGCCGGCGTCTTTCTCGATCGGAAGAAGCGCCGCGAGCAGATGTAG
PROTEIN sequence
Length: 1119
MTQGKHFAAGGDHFAALPNKKIHAPKGIARLTVTAATMAAMTGSSLISPFTAFAQTGDGGTQHPAVMSPIAAHAGAASGTGAKSAAQTIADLQKAVDEAKAKEDAAKASYDEAAGPYNEAASARDQAKASYDSAVSAGTAADRAAMDEYARQVAEGKDAADAAGKDLEQAKAGLADAKADASEKDEAYQSALKAAQDAKDALDKAKADAVSATPEAIRAAEQAVRDAQDAVSRAQADLANANATLADAQSKLVAAQSAKDSADTVLAAAQQNKDTADAKAAAASAAYEQAKADLAAAEAGASGPEYDAAKQKVADAEATLAAARAVQSQCESELEQVQSAAATAQTELNDAQASLSAKQQAAADAESGVTAAQSALAAANADLDAAKQANVDAIAKLDAAKQAVKDAESAKAAADEELANAKTAKDTADAAVNAAQQKVDEAQAKLDSADAQLKQGAIGFFRAMGAEDAANIILNAKYAGKTEVGNSKDATSLDNMLNAIRWMKSVNDYRKSVGLSELHVTYKLIAGAIADANYSDTVLDHARQYDFAENLAWNYGIDPSGQWIEQEKGFFDKATEALYGVTGLVGKDAYDFYAKNGVAINHWIADNCHWENGSSGTVGHYMNIINPELAVMGMATCTKGTMSGLQTQCYTAEISGWFGSGWNTNPISVDEYEQKLTSYINGLKNAKSALDAAKTNLASKKQAAAGAAATVQQKQVAADSAQAGVDAAKQGVADAQRAVDVAKADLASKQQGVTDAQTELNAAKSDLDAANAAVDTAKSTVQQKQVAFDAANAVVTAAQTKLDSAKADTEAKQQDVMDANADLAKFFQDVADAKKAVDTAKSVHDAAVADQVEKAAVLAAAEQKADATARDLADAQRAVDAAKADTGVVADRLTGSQTDLEDAQSNLDILTGLAAKLAEAQQREQDAVKAVNDTKAALDAAKANTIAAESLVSAAEQAKAQADAKLSKLNSIDAGAAIASGHDVNADDALNALFAAAVEARAKVAPAKAILDEKQAAVDELQPGYDAALAAYELAKSDRIAAEQKLSDEIARQEAEEVAKQQAAYTPKHLAGTDTAQTGSLAQTGDCAGLIGETFVIGGTVLVAAGVFLDRKKRREQM*