ggKbase home page

scnpilot_p_inoc_scaffold_369_curated_63

Organism: scnpilot_dereplicated_Planctomycetales_1

Completeness: near complete | RP: 50 / 55 (MC: 2) | BSCG: 50 / 51 (MC: 1) | ASCG: 14 / 38 (MC: 3)
Location: comp(81907..85245)
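
The location string can be checked against the sequence length reported further down in this record. The snippet below is a minimal sketch, assuming that `comp(start..end)` denotes a reverse-strand feature with 1-based, inclusive coordinates; the function names `parse_location` and `feature_length` are hypothetical helpers, not part of any ggKbase tool.

```python
def parse_location(loc: str):
    """Parse 'start..end' or 'comp(start..end)' into (start, end, strand).

    Assumption: coordinates are 1-based and inclusive, and 'comp(...)'
    marks a feature on the reverse (complementary) strand.
    """
    text = loc.strip()
    strand = "+"
    if text.startswith("comp(") and text.endswith(")"):
        strand = "-"
        text = text[len("comp("):-1]
    start_str, sep, end_str = text.partition("..")
    if sep != ".." or not start_str.isdigit() or not end_str.isdigit():
        raise ValueError(f"unrecognized location: {loc!r}")
    return int(start_str), int(end_str), strand


def feature_length(loc: str) -> int:
    """Length in nucleotides of the span described by the location string."""
    start, end, _ = parse_location(loc)
    return end - start + 1


print(parse_location("comp(81907..85245)"))
print(feature_length("comp(81907..85245)"))
```

Under these assumptions the span covers 85245 - 81907 + 1 = 3339 nucleotides, which agrees with the DNA sequence length listed in this record.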

Top 3 Functional Annotations

Value: hypothetical protein n=1 Tax=Gemmata obscuriglobus RepID=UPI00016C41AC
Algorithm: similarity
Source: UNIREF
DB: UNIREF100
  • Identity: 35.2
  • Coverage: 967.0
  • Bit_score: 485
  • Evalue: 1.90e-133

Value: Thioredoxin H-type {ECO:0000313|EMBL:EMI16178.1}; TaxID=1265738 species="Bacteria; Planctomycetes; Planctomycetia; Planctomycetales; Planctomycetaceae; Rhodopirellula.;" source="Rhodopirellula maiorica SM1.;"
Algorithm: similarity
Source: UNIPROT
DB: UniProtKB
  • Identity: 29.5
  • Coverage: 675.0
  • Bit_score: 255
  • Evalue: 4.50e-64

Value: sigma-70 family RNA polymerase sigma factor
Algorithm: similarity
Source: KEGG
DB: KEGG
  • Identity: 28.0
  • Coverage: 996.0
  • Bit_score: 243
  • Evalue: 3.90e-61

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Rhodopirellula maiorica → Rhodopirellula → Planctomycetales → Planctomycetia → Planctomycetes → Bacteria

Sequences

DNA sequence
Length: 3339
ATGGGCGGGGGGACGGCGGATCGGACGCAGCGGCGGATCCAGGCGCTGTATGCGACGGGGCCGCTGGGGGCGCTGGGGGACGCGGAGCTGCTGGGGCGGTTCGCGGCGGGCGAGGGCCTGGACCGCGAGGACGCCTTCGCGGCCCTGGTGCGCCGGCACGGGCCGATGGTCCTGTCGACCTGCCGTCGCATGCTGGACGGCGACCGCGCCGGGGCCGACGACGCGTTCCAGGCCGTGTTCCTGGTCCTGGCCCGCAAGGCCGGCTCGTTGCGACGCCCCGACGACCTGCGGCCGTGGCTCTACGGCGTGGCGGTGAAGGCCGGGAAGGAGGCCCGCCGCCGCGCGGCGCGGATCCGCGCCAGGGAAGGGGGGGCGCTCGCCGACCTCCCCGCGCCCGGGGCCGACGCCGACCTGTTCGACCTCCGCGCGGCGATCGACGAGGAGCTGGAGCGGCTGCCGGGGCGGTATCGCGAGCCGATCCTGCTCTGCGAGCTGGAGGGCGCCTCGCGCCGCGACGCCGCCGAGCGCCTGGGCCTGGCCGAAGGGACCCTCTCCAGCCGCCTCGCCCGCGGGAGGTCGCTGCTGCGCGACCGCCTGGCGCGGCGCGGCCTGGCCGTGGGCGCCCTGGGCGCCGCGCTCGCGCCGACGGCGAAGGCCGCCGGCCTCGACGCGCTCGCCGACGCCTCCGTCCGCCTGATGCTCGCATCACCATCCCTGCACGGACCGCCCGGGACGGCCTCGGCGGCCGTGGCCGTCGCCGAAGGAGTCCTCGCCATGCTGTCCGCCGCCAAGCTCAAGGCCCTCGCCGCCGCCTCCGCCGCGGCGCTGGGGGTCCTCGTCGTCACCACCGGATTGGCCTGGGGCCTGTCCCATGGCGGCCCGAATCCGGAAGTCCCCGCGCAGGCCGAGGGGCCTGAGCCCGGGGCGAAGGCGGGCGAACTGCGACTCCGCGGCGTCGTGGTCGACGAGGCCGACCGGCCGCTCGCGGGCGCGGAGGTGCGACTGATGCCCTTCACCCCGCGCGAGGTCGGGGTCAGGACCGGGCCGGACGGCTCCTACGCCCTGACGTCCCCGGGCCGGAACGTCGGGCTCCAGATGATCCTGGCCCGCTCCGGCGACGGCCGACTCCTCGGGACCTTCACCTACGGATTCGACCTGACCCCGCCCGCGGCCCCGACCCGGATCGTCGCGAAGCCCGCCAAGGAAGTCCTCGTCCGCGTGGCCGATTCGGCCGGGAAGCCGGTCGAGGGCGCGACGGTCGAGGCCGCGGGCGACTATACCGCCCTGGCGAGCGCGACGAGCGCCGCCGGAGAGGTCCGGATCGCCATCCCGGCCGACCACCGCGTCCAATGGATCGTGGCGAAGAAGGCGGCCGTCGGCTGCGACTATGCGGAGTTCGGCGACTTCGACAAGTATCCCGGCCGGAGCGAGGGCGTCCGGGCCGCCGACTTGCCCCCGTTCGTCGCGCTCACGCTGGGCGAGCCTCGGACCGTGAAGATCCGGGCCCTGGATGACGACGGCACCCCGGTCGCGGGCTCCTCGTTCAGCCTCTGGCTGCTCAAGAAGGAGGGGAAACAGGGAGAGGTCAATTACGGGAGCGGGCTCCAACGCGAGGTGGCGGGCCCGGACGGGGTCGCCGTGTTCGACTGGCTCCCCAGAAGCTCGAAGGGCCTGCTGCAATTCTGGCCCGAAGGCGGCGCGATCGCGCGCCGCAGGGTCGTCGTGGAGCCGGACCAGTCCGAGGTGGTCGCCCGGCTGGTCCGGAACGTCGCGATCCGGGGCCGGGTGGTCCTGCCCGATGGCTCGCCCGCCGTGGGGGTGAAGGTCCAGGCCGACGGCTCCGGCCGCGAGTTGGACAACGGCCGGGGATCGGCCGTCACGGCCGCGGACGGCCGCTACGAGATGGCCGTCCCCCCGGACGAAGCCTACGCCGTGCGGGTCGAGGAGCCGGACTGGGCCGCGGCGGCGCGGATGGACGTGGTCGTCCGACGGGGCCGGCC
GGTTGAAGGGGTCGATTTCCACCTGGCGCAAGGGACGGTGCTGAAAGGCACGGTGACCCTCGGCCAGGACGATCGCCCGGCGACGGGAGTTTACGTCCAGATTCAGCAGGAGGGGATGGACGCCCCGGACGACCTGCGCGAGCCGGGGGATCGGTTCGGCCACCGGGTCCGCTGGTCGACCTCGGGCAAGGTGGACGATCGGGGACGCTATGCCTTGCGGCTGGGCCCCGGACGGTACAGCCTGCTGGAGCCCGGAGCCATGAAGTGGGTGGAGATCGAGATCGCGGATCAGCCCGAGCTGATCCACGACGTCCACATGCCCCGGCCGGAGCGGGGCCCAATCGCCGGTCGCGTCATCGACTCGGCCGGCAGGCCGGTCGCGGACGCCTCGGTGGAGTTCGCGCCGGTCGACTTCGCCGGCGCCCGTGTCTTGGCGACGACCGACGCCGACGGCCGGTTCGCCACCGAGCGGCGGCTGCTCCGGACGTTCGTCTGCGCCAAGAGCCCCGACGGGAGCCTGGGAGCGCTGGTGGAGATCGGCCCCGACGACGCCGAACTCGGCCTGGTGCTGGCCCCTACGGCGACGGCGACCGGGCGTCTGCTCGACGAGAATGGCGAGCCCGTACGGCGGGAGAAGCTCCACTGGGGACGCCGCATCCCCTGCAACGACGACCCGGACGGCCCGAGTTACAACGCCTTCGCGCCAGCGGTCGTCACCGACGACGACGGCCGGTTCACGCTGCCGTCGCTGGTGGTGGGCGAGACCTACCGTATCAACATCGAGGGCGAGAACTTCTATCCCTCGGCGGGCGTCGTCCAGCCCTCGCGACCGGGCCCCATGGACGTTGGGACCCTGCAACTCGGCTCCGGCAGCGGCGGAGACGCCATCTTCCTCAACGTCGCGCCCAAGGTCGGAGCCGTCGCCCCGGCGTTCGAGGCGACGACGCTGGACGACAAACCGCTCTCGCTGGGGGACTTCGCCGGCAAGTACGTCCTGCTGGACTTCTGGGCGACGTGGTGCGGCCCCTGCCTCGGCGAGATGCCCCACATCCGCGCCGCTTACGACGAATTCGGCAAGGACGGCCGGCTGGCGGTCGTCAGCCTGAGCCTCGACGACGCGATCGACGCCCCCCGGAAGTTCCAGGAAGAGCGCAAGCTCCCTTGGACGGTCGGCTGGGCCAAAGGGGGGATCGCGAGCGGCCCCAACGCCGCCTACGGCGTCCGCGCCATCCCGGCCCTGTTCCTCATCGGCCCGGACGGCAAGGTCGTCGACCGCGGGATGCGCGGGGAGGGGATCAAGCAGACCGTCGCGAAGGCGCTGAAGCGGCCGTCCCGTTGA
Protein sequence
Length: 1113
MGGGTADRTQRRIQALYATGPLGALGDAELLGRFAAGEGLDREDAFAALVRRHGPMVLSTCRRMLDGDRAGADDAFQAVFLVLARKAGSLRRPDDLRPWLYGVAVKAGKEARRRAARIRAREGGALADLPAPGADADLFDLRAAIDEELERLPGRYREPILLCELEGASRRDAAERLGLAEGTLSSRLARGRSLLRDRLARRGLAVGALGAALAPTAKAAGLDALADASVRLMLASPSLHGPPGTASAAVAVAEGVLAMLSAAKLKALAAASAAALGVLVVTTGLAWGLSHGGPNPEVPAQAEGPEPGAKAGELRLRGVVVDEADRPLAGAEVRLMPFTPREVGVRTGPDGSYALTSPGRNVGLQMILARSGDGRLLGTFTYGFDLTPPAAPTRIVAKPAKEVLVRVADSAGKPVEGATVEAAGDYTALASATSAAGEVRIAIPADHRVQWIVAKKAAVGCDYAEFGDFDKYPGRSEGVRAADLPPFVALTLGEPRTVKIRALDDDGTPVAGSSFSLWLLKKEGKQGEVNYGSGLQREVAGPDGVAVFDWLPRSSKGLLQFWPEGGAIARRRVVVEPDQSEVVARLVRNVAIRGRVVLPDGSPAVGVKVQADGSGRELDNGRGSAVTAADGRYEMAVPPDEAYAVRVEEPDWAAAARMDVVVRRGRPVEGVDFHLAQGTVLKGTVTLGQDDRPATGVYVQIQQEGMDAPDDLREPGDRFGHRVRWSTSGKVDDRGRYALRLGPGRYSLLEPGAMKWVEIEIADQPELIHDVHMPRPERGPIAGRVIDSAGRPVADASVEFAPVDFAGARVLATTDADGRFATERRLLRTFVCAKSPDGSLGALVEIGPDDAELGLVLAPTATATGRLLDENGEPVRREKLHWGRRIPCNDDPDGPSYNAFAPAVVTDDDGRFTLPSLVVGETYRINIEGENFYPSAGVVQPSRPGPMDVGTLQLGSGSGGDAIFLNVAPKVGAVAPAFEATTLDDKPLSLGDFAGKYVLLDFWATWCGPCLGEMPHIRAAYDEFGKDGRLAVVSLSLDDAIDAPRKFQEERKLPWTVGWAKGGIASGPNAAYGVRAIPALFLIGPDGKVVDRGMRGEGIKQTVAKALKRPSR*
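
The two reported lengths are consistent with each other: 3339 nucleotides correspond to 1113 codons, and the protein length of 1113 counts the trailing `*` (the stop codon). The sketch below shows one way to check this relationship and to translate a DNA fragment. It assumes the standard codon table and an in-frame, forward-oriented sequence as listed above; `translate` and `lengths_consistent` are hypothetical helper names introduced only for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
# Assumption: the standard table applies to this gene.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; stop codons are written as '*'."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def lengths_consistent(dna_len: int, protein_len: int) -> bool:
    """True if the DNA length is exactly three times the protein length
    (with the stop codon counted as one position, as in this record)."""
    return dna_len % 3 == 0 and dna_len // 3 == protein_len


print(lengths_consistent(3339, 1113))
print(translate("ATGGGCGGGGGGACGGCGGATCGG"))  # first 8 codons of the DNA sequence
print(translate("CCGTCCCGTTGA"))              # last 4 codons, ending in the stop
```

The first eight codons translate to `MGGGTADR` and the last four to `PSR*`, matching the start and end of the protein sequence listed above.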