ggKbase home page

ar4r2_scaffold_376_1

Organism: ALUMROCK_MS4_BD1-5_24_44

near complete RP 48 / 55 MC: 1 BSCG 45 / 51 MC: 1 ASCG 1 / 38
Location: 158..3694

Top 3 Functional Annotations

Value Algorithm Source
Bacterial surface protein 26-residue PARCEL repeat (3 repeats) n=1 Tax=Sphingobacterium spiritivorum ATCC 33861 RepID=D7VPP4_9SPHI similarity UNIREF
DB: UNIREF100
  • Identity: 40.3
  • Coverage: 429.0
  • Bit_score: 273
  • Evalue 1.20e-69
hypothetical protein Tax=GWF2_Bacteroidetes_35_48_curated similarity UNIPROT
DB: UniProtKB
  • Identity: 49.1
  • Coverage: 281.0
  • Bit_score: 269
  • Evalue 1.80e-68
PARCEL domain-containing protein similarity KEGG
DB: KEGG
  • Identity: 57.5
  • Coverage: 268.0
  • Bit_score: 260
  • Evalue 2.90e-66

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

GWF2_Bacteroidetes_35_48_curated → Bacteroidia → Bacteroidetes → Bacteria

Sequences

DNA sequence
Length: 3537
ATGCAAAACAAGAATACAGAATCTAGAATAAAGACTAAAGATAAAAGATTAAACTGTCTAAGTACTAAATTCTATCAACTAAATTCTAATAAAACTGGTTTTACTCTAGTAGAATTAATAGTAACAATAGTTATACTAGCAATTTTGTGAACTATAGCATTCATTAGTTTTCAATGATATTCAAGGAATTCAAGAGATAGCGTAAGAGTAGCAGATCTAAACAATTTAGAAAAATCACTATGAATATATGTAGTAAAAACATGATCATATCCAATTCCTGATAGTAGTGCAGAGATAACATATGCTTGATGAACAGCATGGATAGAATGAACAATCTGAGATACTGTGATGAGAAATATAGAAAGTATAAGTAAAAAAACATTAGATCCATTAACTGCAAACGAATATACATATAGTATAACAACATCAAAAAAAGAATATGAAATAGGAGCTATAAATGAATGATGAGAATTAACATATAATCAATTACCAACAGGTACAACATATGCTGCAGATTTAAGTAAAACAAATGCAGTAGCATATATGAAATGAAATTATAATGAAAAAATAATAAAAATAAGCACATGATGACAAAACTACATATTAGCAATTCCAACAATAATAGTAACAGATATAAGTAATCCGACAATAGAAAGCCTATTATCAGAGAAAAAACTAGTATACAGAAACTATAATAATCTACCTCATTCATATAATCCAACAAATACTGGTTGATTTAATTATGAAACAAACAAATTAGTAGTGTATAGTTGAGCAACAATAAACTTAGAAGATAATAACAATAAAATAGACTTTGTAGAAAACCTACAAGAAGCATATAGTGGAACAATACTATGATGAACAGAAAACTATAAAGATATAACAAGTATAAATACAACAAGTATAAATACAACAAGTGAACCTGATAAAGCAGTAGATTTAGTTAATAACTATATAACAAATAATGTATGATGAATAACTTGAAAGATAACAACAGTAACATACAATTCTTGTACTTTAGACTGACAAACAATAAATCATAATCAAACTATTACTGCTTATAGTGAAAATAGTATATTATACTGAGCTAGTTATGAGTGTAGTGATAGAAGTCAAGAAAGAACTTGTACAAACTGAGTACTAAGTTGAGATGATAGTTATCAATATAAAAGTTGTGTAAAATGAGCTCCAAATAATTGTAGTGAAAACTCAAATTATACATATCTTACACATATATACTCAATTCCAGCAATAAATCATGGAGAAACAGCTACAAATATAAATTCACAAGTAGTGACAATTCCAAATTGAACACAAGTATATAAATTAACAAGTATCTGATGTAATGACTGAGTACTAGTAAATGAAATAGAAGAAGCAACTCCAACAGTAACATGTGATAGTTGATATGTACAAAGTTGAAATAGTTGTGAAATTGCAACATATACAGTAAGTTGAGATTTTGGAATAAATGCAAATGCAGCAACAATAAATGTCTGTGGAACAAACGAAATAGCAGATGCAAACTGACAATTCACAACAACAAGAAATTATGGAAGTGTATGTGATACAATAACAGCAACAAGAACAAATTATACATGTAGTACAACAATACAATGACCAGCAAGTTTAACTTCTAATATATCTAACATTGCTTGAAGTTGTAGTGCAAATAGTTATACAGTAACATTTGATTGAAACTGATGAGTATGACATAATCCAACAACAATGAATGTAACATACAATACTGCAATCTGAACACTACCAACAAATCCAACAATGACATGATACACATTTAATTGATGGTATACACAAGCTTCATGATGAAGTCAAGTAACAACAGCAACTGTAGTAACTTGAAACGCAACAGTATATGCACAATGGTGAATAAATAACTATACAGTAACATTTGATTGAAACTGATGAACATGACATACTCCAACAAGTAAATCAGTAACATATAATACTGCAATTTGAACATTACCAACTAACCCAACAAGAGAATGATACACATTCAATGGTTGGTATACATCAAACACAGGATGAATACAAGTAACAGAATCAACAACAGTATTATGAGATGCAACAGTGTATGCACAATGGTGAATAAATAACTATACAATAACTTTTGATTGAAACTGATGAACTGGGCATACTCCAACAAGTAAAACTGTAGCATATAATACTGCAATTTGAACATTACCAACTAACCCAACAAGAAGTTGATATACATTTGCTTGATGGTTTACGACAAGTACATGATGAACACAAATTACCACATCAACAGTTGTATTATGAAATTCAACAGTATATGCTCATTGGAATATAGTACTTTGAAGTATGACTTTAAAATATGAAGGATTAGTTTCCTGAGATATTATTAAATTACCTTTTAAATGAACAGTAAATATAACAAATATAGATTGGTGAGATGATTGAGTAAATTCTTGTCCAACTATAGCTACTTGATCTATTTCTTGTACTTATACAAATGCATCTAAATGAAATTATACAATTCAAGTAACTTGATTAACAACTTGATTTTGAAATGCATCAACAGTTTCTACATGAATTGCAAAATTAACTCAAATAACTCAGTGGCAATGAATGTGATTAACAGATTTAAGTTATGCCTTTTATTGAGCAACTAATTTTGTATATTTAGATCCAAACTTGGATACTTCCAATGTTACTAATATGAGCTCTATGTTTTATCAAGCAAGTAAATTTAATAGTTCTATTAGTAATTTTAATACTTCTAATGTTATTGTTATGAGTTATATGTTTTATCAAGCAAGTATTTTTAACCAATCAGTAAGTAATTTTAATACATCAAAAGCTACTAGTATGAGTTATATGTTTCGGAATGCATCAGTTTTCAATCAATCTGTAAGTAATTTTAATACTTCTAATGTTACAGATATGAGTTTTATGTTTTATCTAGCACTTGCATTTAATCAACCTGTAAGTAATTTTAATACTGCAAAAGTTACTAATATGAGTTTTATGTTTAATTCAGCAAAAGCTTTTAATCAATCAGTAAGTAATTTTGATACATCTAATGTTACAGATATGAGTTTTATGTTTTATCTAGCACTTGCATTTAATCAACCTGTAAGTAATTTTAATACATCCAATGTTACTAATATGTATGCAATGTTCCAATCTGCAGCAATATTTAATCAATCAGTAAGTAATTTTGATACATCAAAAGTTACTAATATGTGATCTATGTTTAATTATGCAACGGCATTTAATCAACCAGTAAGTAATTTTAATACATCCAATGTTACTAATATGTATTTAATGTTTCAGGAGGCTAAATCTTTTAACCAATCTTTAGCTAATTTTAATACTATTAAGGTTAATAATATGTGATATATGTTTTTTGGGGCAACAAATTTTAATAAAAATATATCATGTTGGAATGTATGACTTATTGTATCGGAACCTACAAGTTTTGCAACAAGTTCAGCATTGATATCATCAAATAAACCATTATGGTGAACAACTGGTTCAACTTGAACTTGTGAGTAA
PROTEIN sequence
Length: 1179
MQNKNTESRIKTKDKRLNCLSTKFYQLNSNKTGFTLVELIVTIVILAIL*TIAFISFQ*YSRNSRDSVRVADLNNLEKSL*IYVVKT*SYPIPDSSAEITYA**TAWIE*TI*DTVMRNIESISKKTLDPLTANEYTYSITTSKKEYEIGAINE**ELTYNQLPTGTTYAADLSKTNAVAYMK*NYNEKIIKIST**QNYILAIPTIIVTDISNPTIESLLSEKKLVYRNYNNLPHSYNPTNTG*FNYETNKLVVYS*ATINLEDNNNKIDFVENLQEAYSGTIL**TENYKDITSINTTSINTTSEPDKAVDLVNNYITNNV**IT*KITTVTYNSCTLD*QTINHNQTITAYSENSILY*ASYECSDRSQERTCTN*VLS*DDSYQYKSCVK*APNNCSENSNYTYLTHIYSIPAINHGETATNINSQVVTIPN*TQVYKLTSI*CND*VLVNEIEEATPTVTCDS*YVQS*NSCEIATYTVS*DFGINANAATINVCGTNEIADAN*QFTTTRNYGSVCDTITATRTNYTCSTTIQ*PASLTSNISNIA*SCSANSYTVTFD*N**V*HNPTTMNVTYNTAI*TLPTNPTMT*YTFN*WYTQAS**SQVTTATVVT*NATVYAQW*INNYTVTFD*N**T*HTPTSKSVTYNTAI*TLPTNPTRE*YTFNGWYTSNTG*IQVTESTTVL*DATVYAQW*INNYTITFD*N**TGHTPTSKTVAYNTAI*TLPTNPTRS*YTFA*WFTTST**TQITTSTVVL*NSTVYAHWNIVL*SMTLKYEGLVS*DIIKLPFK*TVNITNIDW*DD*VNSCPTIAT*SISCTYTNASK*NYTIQVT*LTT*F*NASTVST*IAKLTQITQWQ*M*LTDLSYAFY*ATNFVYLDPNLDTSNVTNMSSMFYQASKFNSSISNFNTSNVIVMSYMFYQASIFNQSVSNFNTSKATSMSYMFRNASVFNQSVSNFNTSNVTDMSFMFYLALAFNQPVSNFNTAKVTNMSFMFNSAKAFNQSVSNFDTSNVTDMSFMFYLALAFNQPVSNFNTSNVTNMYAMFQSAAIFNQSVSNFDTSKVTNM*SMFNYATAFNQPVSNFNTSNVTNMYLMFQEAKSFNQSLANFNTIKVNNM*YMFFGATNFNKNISCWNV*LIVSEPTSFATSSALISSNKPLW*TTGST*TCE*