ggKbase home page

scnpilot_solids2_trim150_scaffold_863_6

Organism: SCNPILOT_SOLID2_TRIM150_UNK

megabin RP 53 / 55 MC: 52 BSCG 51 / 51 MC: 51 ASCG 16 / 38 MC: 16
Location: comp(8330..11368)

Top 3 Functional Annotations

Value Algorithm Source
hypothetical protein id=12557494 bin=CNBR_ACIDO species=Azoarcus toluclasticus genus=Azoarcus taxon_order=Rhodocyclales taxon_class=Betaproteobacteria phylum=Proteobacteria tax=CNBR_ACIDO organism_group=Acidobacteria organism_desc=why is coverage listed as 1? similarity UNIREF
DB: UNIREF100
  • Identity: 65.2
  • Coverage: 990.0
  • Bit_score: 1297
  • Evalue 0.0
  • rbh
Uncharacterized protein {ECO:0000313|EMBL:KJR65360.1}; TaxID=528244 species="Bacteria; Proteobacteria; Alphaproteobacteria; Rhodospirillales; Rhodospirillaceae; Azospirillum.;" source="Azospirillum thiophilum.;" similarity UNIPROT
DB: UniProtKB
  • Identity: 64.2
  • Coverage: 999.99
  • Bit_score: 1269
  • Evalue 0.0
hypothetical protein similarity KEGG
DB: KEGG
  • Identity: 67.6
  • Coverage: 917.0
  • Bit_score: 1248
  • Evalue 0.0
  • rbh

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Azospirillum thiophilum → Azospirillum → Rhodospirillales → Alphaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 3039
ATGATCACACCACCCGTCAAGACACGAAAGAAGCTCATCGAAGTGGCGTTGCCACTCGACGCCATCAACACGGCCTCGGCACGTGAGAAGAGCATCCGCCACGGCCACCCAAGCACGCTCCACCTGTGGTGGGCGCGCCGACCCCTGGCCGCCGCGAGAGCCGTGATATTTGCGCAGATGGTCGACGATCCCTCGGAGTACGTCGACGAGCTCCTTTCAGATTCCACCCTGATTTCGCAAGCTCAGCGCGAGCTGGAGCAGCGCCGCGTTCTTTGGGAGAAACGCAGAAGCGCGCGCGAAGAGGCCGAGCTAGTGGGCGACACTACGGTGCCCGAGGCCGGATCACCTCCGACGCTTGAAGAGTGCGCTGCCGACCTTGAGCGGCAACGACTCTTCACCCTCATCGAAGAACTCGTCCAGTGGGAGAACACCACGAACGAGCGAGTGCTCGACCGCGCAAGAGCTGCGATAAGTAAGAGTTGGCGCCGCACCTGCCGCGACAACGCCGACCATCCTCAGGCGGCAGAACTATTCGACCCCGAGCGGCTACCCGCCTTCCACGACCCCTTTGCGGGCGGCGGTGCGCTTCCGCTCGAGGCGCAGCGCCTCGGACTCGAAGCCCACGCCAGCGACCTTAACCCCGTCGCGGTGCTGATCAACAAGGCCATGATTGAAATCCCGCCCAAGTTCGCGGGACTACCTCCGGTCAACTCCGAGTCGCGAAGCGACAAAGAACTCTTTACGCGCGATTGGAAGGGCGCCCAGGGCCTGGCTGAGGACGTGCGTTTTTACGGGAAGTGGATGCGCGATGAAGCCGAGAGGCGCATAGGCCACCTCTACCCCAAATTGAGAATCACACCCGAGCTCGCGCGGGAGCGCCCCGACCTGAAGAAGTACGAGGGCCGCGAACTCACAGTAATAGCCTGGCTCTGGGCCCGCACCGTCAAGAGCCCCAACCCCGCCTTCGCCCACGCCGATGTGCCGCTAGCGTCTACCTTCATGCTTTCAACAAAAGAAGGGAAAGAGGCGTACGTCCAGCCGTTGGTCGAGAACGGTAACTATCGCTTTAAAGTCGAGATTGGAATGCCAAAGGATCCTGAGTCGACGAAAGATGGCACCAAACTCGGGCGAGGGGCGAACTTTCGTTGCGTCATCTCAGGGGCGCCGATTTCCGGCGATTACATCAAGTCGGAAGGAACGGCCGGGCGAATGGGCTCACGAATGTTGGCGATTGTCCTTGAGGGCGATCGCGAACGCGTGTACCTGGCGCCAACGGCGGAGATGGAAGAGATTGCGAGCAACGCCAGGCCGACTTGGAAGCCAGAAGGCGACGTGCCGGCAAAACTCACGGGTGGTACTTGCGTCCCATACGGGTTGACGACATGGGGTGACCTCTTCACCGATCGCCAGCTCGTAGCGTTGACCACCTTCTCCGACTTGGTAATAGAAGCAAGGGAGCGGGTATTGCGGGACGCCGTAGCGTCGGGCTTACCCGACGATAACCAAGGACTGGAAGTTGGAGGTGAAGGGGCACGAGCTTATGCCGAAGCTATCCAAGTATACTTGGCTTTCTCCGTAAGCAAAGCGGCAGACAGAAACTCTGCGCTCTGCGTTTGGGAGAATAGGATGGACCGCCTTCGTGGGACGTTCGGGCGTCAAGCGCTTCCTATGGTGTGGGACTATGCCGAAACAAACCCCTTCGCGGGCGCGGGCGGAGACATCTATGGAACAACAGTTTCGCTGACAGAAGTGCTTGCAAAGCTCGGCAGCGCAAGAGGTTCGAGCGCATCTCAGAAGGATGCTAGCCGGCAAAATCTCAGCAACAATAAGCTCGTCTCAACAGATCCACCCTACTATGACAACATCGGTTACGCGGATCTAAGTGACTACTTCTACGTTTGGCTTCGTCGGTCCTTGAGACCCCTACTGCCTGAACTATTCGCCACTCTAGCGGTACCAAAGACCGAGGAACTGGTTGCCAGCCCCCATCGGCATGGCAGTAAGGAGGAGGCTGAACAGTTCTTCTTAACTGGCATGACCCAGGCTATAGGCCGACTAGCTGAGCAGGCACACCCTGCGTTTCCCACTTCCATCTATTACGCCTTCAAGCAGACGGAAACTACAAGTTCAGGAGGAATCTCTAGCACTGGATGGGAAACGTTTCTGGACGCCGTGATTAAGTCTGGTTTTACAATTGTGGGCACATGGCCAGTTCGATCTGAGCTCGCTACAAGGAACATCGGTAGGGGTACGAACGCTTTAGCGTCCAGCATCGTGCTTGTCTGCCGCCCACAAGCGAAGGACGCCCCGACCGCCACACGCGGGCAGTTCATCAGCCAGCTAAAGACAGAGTTACCGAAAGCTCTTAGTCAACTCCAGCGCGGAAACATCGCACCCGTTGACCTCGCCCAGGCATCGATCGGCCCAGGGATGGCCGTCTACACGCGATTCTCGCAAGTACTTGACGCTGCTGGGAACCGCCTTTCAGTTCGCGACGCTCTGGCTGTCATTAACCACATGCTTGACGAAGTCATGGCGGAACAGGAAGGAGACTTTGAGTCCGAAACACGTTGGGCTCTAGCCTGGTTCGAGCAACAGGCTTTCGATGAGGGCGATTATGGAGTGGCCGACACCCTTTCGAAAGCGAAGAACACAAGTGTGGCGGGGATGGTTCAGGCCGGACTCATCCAAACTAGGGCCGGCAAAGTCAGGTTGCTTCGGCCGGAAGAGCTCCCTTCCGAGTGGAATCCCTTGGAAGATGCTCGCCTAACTGTGTGGGAAGCGGTACATCACCTCATACGCGTTCTCGAGTCCAAAGGTGAACAGGGCGCCGCGGAACTTGCAGCGCAGTTGGGCCCCCGCGCCGAGTTAGCACGCGAACTTGCTTATCGCCTTTACAGCATCTGTGATCGGAAGAAATGGGCCCAGGAAGCCCGTGTTTACAACGCACTAGTACAGAGCTGGCCCGAGATTCAGCGGCTATCCCAGGAACTGCGTGCAAACACGTCCGAGGCGGTTGGCCTCTTCACGGGAGTCTAG
PROTEIN sequence
Length: 1013
MITPPVKTRKKLIEVALPLDAINTASAREKSIRHGHPSTLHLWWARRPLAAARAVIFAQMVDDPSEYVDELLSDSTLISQAQRELEQRRVLWEKRRSAREEAELVGDTTVPEAGSPPTLEECAADLERQRLFTLIEELVQWENTTNERVLDRARAAISKSWRRTCRDNADHPQAAELFDPERLPAFHDPFAGGGALPLEAQRLGLEAHASDLNPVAVLINKAMIEIPPKFAGLPPVNSESRSDKELFTRDWKGAQGLAEDVRFYGKWMRDEAERRIGHLYPKLRITPELARERPDLKKYEGRELTVIAWLWARTVKSPNPAFAHADVPLASTFMLSTKEGKEAYVQPLVENGNYRFKVEIGMPKDPESTKDGTKLGRGANFRCVISGAPISGDYIKSEGTAGRMGSRMLAIVLEGDRERVYLAPTAEMEEIASNARPTWKPEGDVPAKLTGGTCVPYGLTTWGDLFTDRQLVALTTFSDLVIEARERVLRDAVASGLPDDNQGLEVGGEGARAYAEAIQVYLAFSVSKAADRNSALCVWENRMDRLRGTFGRQALPMVWDYAETNPFAGAGGDIYGTTVSLTEVLAKLGSARGSSASQKDASRQNLSNNKLVSTDPPYYDNIGYADLSDYFYVWLRRSLRPLLPELFATLAVPKTEELVASPHRHGSKEEAEQFFLTGMTQAIGRLAEQAHPAFPTSIYYAFKQTETTSSGGISSTGWETFLDAVIKSGFTIVGTWPVRSELATRNIGRGTNALASSIVLVCRPQAKDAPTATRGQFISQLKTELPKALSQLQRGNIAPVDLAQASIGPGMAVYTRFSQVLDAAGNRLSVRDALAVINHMLDEVMAEQEGDFESETRWALAWFEQQAFDEGDYGVADTLSKAKNTSVAGMVQAGLIQTRAGKVRLLRPEELPSEWNPLEDARLTVWEAVHHLIRVLESKGEQGAAELAAQLGPRAELARELAYRLYSICDRKKWAQEARVYNALVQSWPEIQRLSQELRANTSEAVGLFTGV*