ggKbase home page

SCNpilot_bf_inoc_scaffold_2453_curated_1

Organism: scnpilot_dereplicated_Xanthomonadales_2

near complete RP 52 / 55 BSCG 51 / 51 ASCG 13 / 38
Location: 280..3348

Top 3 Functional Annotations

Value Algorithm Source
YD repeat protein id=3751892 bin=GWF2_Geobacter_54_21 species=Delftia acidovorans genus=Delftia taxon_order=Burkholderiales taxon_class=Betaproteobacteria phylum=Proteobacteria tax=GWF2_Geobacter_54_21 organism_group=Deltaproteobacteria organism_desc=Good similarity UNIREF
DB: UNIREF100
  • Identity: 33.0
  • Coverage: 719.0
  • Bit_score: 318
  • Evalue 2.20e-83
Uncharacterized protein {ECO:0000313|EMBL:KHD08918.1}; TaxID=1003181 species="Bacteria; Proteobacteria; Gammaproteobacteria; Thiotrichales; Thiotrichaceae; Thiomargarita.;" source="Candidatus Thiomargarita nelsonii.;" similarity UNIPROT
DB: UniProtKB
  • Identity: 31.4
  • Coverage: 692.0
  • Bit_score: 290
  • Evalue 1.20e-74
hypothetical protein similarity KEGG
DB: KEGG
  • Identity: 30.0
  • Coverage: 767.0
  • Bit_score: 268
  • Evalue 1.00e-68

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Candidatus Thiomargarita nelsonii → Thiomargarita → Thiotrichales → Gammaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 3069
ATGTATGCGTACGATCGCGAAAACCGTCTGGTCAGCGAAACCCGCGCCGGCCTGCTGCTGCAGGCGTTGGAACACGATCCCGAAGGCAACATACGCCAGCACACCGACGCGCTCGGCCGCATCACGGCATCCACCTACGACAAGGCCAACCGCAAGCTCAGCGAAGACCGCAGCGGCCTGGCCGTCGAGCGCTGGACCTACACGCCGTCAGGCGACATCGCCACATACACCGATCCCGACGGCCGCATGACGGCGCACACATATACGCCGCGGCGCCTGCTCGAATCCGAATCGCTGGCTGGCGAGACGACACGCTACACCTACGACGGCGCCGGTCATCGCCTGTCGCGCGAACGGCCGAACGGAGCAGCCTCGACATGGACGTACGCCTACGACGCGGCCGGCAATCTGGCCGCCGTCACCGATCCGGACGGTCACTCGACCACGTTCGACCATGATGCGAACAACAACCGCACGCGCGTCGTCGACGCCAACGGCCACGCGACGGCCTTCGCCTACGACGAACGCAATCGTCTCGACAGCAAGACCTATCCCGACGGTACCGCCTGGGCTTGGCGCTACGACGGCGACAACAACCGCATTCGCAGCCAGGCGCCGAACGGCCGCGTCAGCGAAACCGCCTACGACGCACTGAATCGCCCGACCCAGACGACCTACCGCGATGCCCCGGCCGGCGAGGTGCAATCGACCGCCTACACTTACGACGGCAACAGCAACGTCCGCACGATCACCGAGACGAGCTCGACCGGCACGCGCACCGAAACACGCGACTACGACGACTTCGATCGCCTGACCGAGGTAAGCGACGGCGATGGACGCCACCTCAGCTACGCCTACGACGATGTCGGCAACCGAACACGGATGAGCGATTCCGACGGCCACGATACCGTCTGGACCTACAACGACCTGAATCAGAACACGCGCGTGACCGTGCCGGGCATGGGCTCGACGAGCCTGGGCTACGCCCCCAGCGGCCGCGTGACCGAGATCAGCCGTCCGGACGGTTCGGTGACCGAGCAGACGTTCTTCGACAATGGCCGCCTGCAGAGCATTCGCCACAGCAGTGCTGGCCAGACGCTGGCCCGGTACGACTACGTCTACGATCCGAACGGCAACCGCACCGAGCAACGCGAACTCAACGGCGCGACGACGGCCGATACGACGCAACGCACGCGCTACGTCTACGACGATGCCGACCGGCTGGTCGAAGTCCAGGAACCGAATCGCACCACGACTTATACGCTCGATGCCGTCGGCAACCGCATCACCGAGCGGGTCGTCGACGACAGCGGCGGCGTCATCAGCGACTCCACGCTGACCTACGGCGAGCGCGACCAGCTGACCCGCCGCAGCGATCCGGCCGCCGACGTCCACGTCGACCAGACCTGGGACCCCAACGGCAATCTCGCAACACAAACCGTCGACGGCCAGCCGCCGCGCGTCTACACCTACGACGCCCGCGATCGCCTGATCGGACTGACCTTGCCGAACTCGCCGAGCGGCCCGACCACGTTGCGCTTCGCCTATCACGCCGACGGCCTGCGCCGCGAGAAGACCGACGGCATCGCGACGACGCGGTACCACTATGACGGCCAGAGCCTGCTCGCCGAGACCAACGCGATCGGCAACACCCTGCGGCAGTTCCACTACAGCGCGACGCAGCTGATCGCTCAGACCCAGACCGGAACGACGCCGGCCCATCGCCACGTGCTGCTCGATGCCCTGCGCTCACCGATCGCGCTGCTCGACCCGACGGGCCTGGTCACCGCCAGAACCAGCTACGACGCCTTCGGCGAGATTCGCGCGCAACTCGGCACGAACGGCACGCTGGCGACACCGAGCCGCGATGCCGCCAACGCGGAACTCATCAGCACCGACAACCAGCCCGTCGGCTTCACGGGCTATCTCAAGGACACCGAATCCAGCCTCTACTACGCGAAGGCGCGCTACTACGACCCGGCGACAGCGCGGTTCACGACCGAAGACCCCGAAGCCGGCAAGGATCTTGAGCCGCCATCCTTGCACCGATACTTGTACGCGTACGCCAATCCGCTGGCTTATGCAGACACGACCGGGCGTCAAGCTGAATTGCTCAGCATGCAGAGCGCGGCGGCGCAACTGGATCTCGATCCGGAGTTCCGCCGAACCAGGGATCTCCATAGCCGTGCCGACTTCGCTGCAACAGGGGAATTCATAGTCGATTCGATCAAGTCGACAGTCAAACTTGCCGGGTTTCTGCTGGCTTCGCAGGCAGAGACCAATATGCGCGATGAGAACGGCGCTGCCACGCAACAGCTACGCGCGGGACTCGACAAGGCCTGGGAAGACGTAACCCAACACCCCATCGAGAAAGCGTTCGCGGCAATCCATGCGCAAAGCGAAAAAGCCGATGCCTATCAGGCCGCAGGCGATGACTTCAACGCCAGAAAGACCCGCACCAAGCAGAAACTCGAAATCGGCAGTGTGGTTTCCGGCGGCTATGGGCTTGCACGATCAGCCGTTAAGAACATCGCCAGACTGAGCACGAGGGGGCTAGCAGCTACTGCAGCCACGACAGGCAGCGAGGCGATCACCGCAACTGTCATGGAAGGCGCAGACGGGTCGGTTATGGCTGCACCCTATCTTTTCAGGGGCACTTCAGAAGGATATCCAGGCAGCCCAGGCTTGCGGCGCATTGGCGTCACCCCTGCATCTATCGACCCGATCGTATCGACGCTTTTCGCTACTGAAAGTGAGAACTACGGCCGAGGGGTTCTGCACATTGCTTCGCCGGACGACCTCAAGGGCATCGATATCGACAAAGGAAACGTACTCGCCGGGAAGGAGGCGGAAGTCAGCGTTGGGATCTCTCCGCTCGAGTTTGGACAGCGGGCATCTACGACGATCTCCGCCGCACAAGCACGAGCCAGCCTGCGCGAGATGGGCATTGACCTTCCCGAGAAAATCTACGACAAGGCCGGACTGGAGAGCGACATTCAAACGACCCCACGATTGGATGCCGAGCAGATACACATATTTGTTGAACGAGCCAGAAGCGGTAGTGACTAA
PROTEIN sequence
Length: 1023
MYAYDRENRLVSETRAGLLLQALEHDPEGNIRQHTDALGRITASTYDKANRKLSEDRSGLAVERWTYTPSGDIATYTDPDGRMTAHTYTPRRLLESESLAGETTRYTYDGAGHRLSRERPNGAASTWTYAYDAAGNLAAVTDPDGHSTTFDHDANNNRTRVVDANGHATAFAYDERNRLDSKTYPDGTAWAWRYDGDNNRIRSQAPNGRVSETAYDALNRPTQTTYRDAPAGEVQSTAYTYDGNSNVRTITETSSTGTRTETRDYDDFDRLTEVSDGDGRHLSYAYDDVGNRTRMSDSDGHDTVWTYNDLNQNTRVTVPGMGSTSLGYAPSGRVTEISRPDGSVTEQTFFDNGRLQSIRHSSAGQTLARYDYVYDPNGNRTEQRELNGATTADTTQRTRYVYDDADRLVEVQEPNRTTTYTLDAVGNRITERVVDDSGGVISDSTLTYGERDQLTRRSDPAADVHVDQTWDPNGNLATQTVDGQPPRVYTYDARDRLIGLTLPNSPSGPTTLRFAYHADGLRREKTDGIATTRYHYDGQSLLAETNAIGNTLRQFHYSATQLIAQTQTGTTPAHRHVLLDALRSPIALLDPTGLVTARTSYDAFGEIRAQLGTNGTLATPSRDAANAELISTDNQPVGFTGYLKDTESSLYYAKARYYDPATARFTTEDPEAGKDLEPPSLHRYLYAYANPLAYADTTGRQAELLSMQSAAAQLDLDPEFRRTRDLHSRADFAATGEFIVDSIKSTVKLAGFLLASQAETNMRDENGAATQQLRAGLDKAWEDVTQHPIEKAFAAIHAQSEKADAYQAAGDDFNARKTRTKQKLEIGSVVSGGYGLARSAVKNIARLSTRGLAATAATTGSEAITATVMEGADGSVMAAPYLFRGTSEGYPGSPGLRRIGVTPASIDPIVSTLFATESENYGRGVLHIASPDDLKGIDIDKGNVLAGKEAEVSVGISPLEFGQRASTTISAAQARASLREMGIDLPEKIYDKAGLESDIQTTPRLDAEQIHIFVERARSGSD*