ggKbase home page

qh_2_scaffold_3360_6

Organism: QH_2_UNK

megabin RP 54 / 55 MC: 47 BSCG 45 / 51 MC: 42 ASCG 38 / 38 MC: 38
Location: 5131..8550

Top 3 Functional Annotations

Value Algorithm Source
Thrombospondin type 3 repeat family protein n=1 Tax=Haloarcula sinaiiensis ATCC 33800 RepID=M0JMN8_9EURY similarity UNIREF
DB: UNIREF100
  • Identity: 46.1
  • Coverage: 964.0
  • Bit_score: 732
  • Evalue 7.00e-208
Thrombospondin type 3 repeat family protein {ECO:0000313|EMBL:EMA09598.1}; TaxID=662476 species="Archaea; Euryarchaeota; Halobacteria; Halobacteriales; Halobacteriaceae; Haloarcula.;" source="Haloarcu similarity UNIPROT
DB: UniProtKB
  • Identity: 40.2
  • Coverage: 999.99
  • Bit_score: 765
  • Evalue 1.00e-217
Thrombospondin type 3 repeat-containing protein similarity KEGG
DB: KEGG
  • Identity: 34.7
  • Coverage: 999.99
  • Bit_score: 555
  • Evalue 2.50e-155

Lists

This feature is not on any list.

Notes

This feature has no notes.

Taxonomy

Haloarcula sinaiiensis → Haloarcula → Halobacteriales → Halobacteria → Euryarchaeota → Archaea

Sequences

DNA sequence
Length: 3420
GTGACGGTGTATGCCGAGGATAACAACGGAAACGTTGGCGAGGAAACGGTCACGTTCACCGTCGATACGACCGCGCCGACGGTCAGTGTCTCCGAACCGACTGACGGAGAGACCTACGGGAGCGAGGACGTTTCGCTCAATGTTAGCGCGGACGAGGACGTCAGCGGGTGGAATTACAGCCTGGACAATGGCAACGATCAGACGTTCGACCCCGAGACGGAGACGACCCTGACCGGTCTCGACGACGGCGAACACACGGTGACGGTGTACGCCGAGGACGACAGCGGGAACGTCGGCGAGGAAACGGTCACGTTCACCGTCGATACGACCGCACCGTCGATCACCTTCCTCGACCCGGTCAACGGAACGACGTACGACAGCAGCAACGTGCCGCTCAACGTCAGCGCGTACAGCCTGGACAACGGCGGCAATCAGACGTTCGTCTCCAGTACGACCCTGACCAATCTCTCCGACGGCGAGCACACGGTGACGGTGTACGCCGAGGACGACGGCGAGAACGTCGGCACCAACACCTCCACGTTCTTCGTCAGTACGTCGACCGAGACGAAAGACACGGATGGCGACGGTATCCCCGACTCGGAGGAGGGTGACGGCGACATCGACGGCGACGGTGACCCGAACCTCGACGACACCGACTCGGACGGCGACGGCATCCCCGACTCGGAAGAGGGCACTGGCGACGCCGACGACGACGACACGCCGAACTACAAGGATACGGACTCGGACGGCGACGGCATCCCCGACTCGGAGGAGGGGATCGACGACACCGACGGCGACGGCACGCCGAATTACAAGGACGAAGACTCCGACGACGACGGCATCCCCGACTCCGAGGAAGGGGTCGACGATACCGACGGCGACGGCACGCTGAACTACAAGGACGGAGACTCCGACGGCGACGGTATCCCCGACGCCGACGAAGGGGCCAGTGACCTCGACGGCGACGACACGCCGAACTACAAGGATACGGACTCGGACGGCGACGGCATTCCGGACGCGCAAGAGGGCAGCGACGACGTCGACGACGACGGCATTCCGAACTACAAGGATACCGATACGGACGGCGACGGTATCCCCGACCTAGAGGAGGGTGACGGCGACGTCGACGGCGACGGCACCCCGAACTACAAGGACGAAGACTCCGACGGCGACGGGATTCCCGATGCCGAGGAAGGAACCGGCGACGCCGACGACGATGGCGTCCCGAACTACAAGGATACGGACTCGGACGGCGACGGTATTCCGGACGCCGAGGAGGGCACCGGCGACGCCGACGGCGACGGCGTGCCGGACTACCTCGACGACACCGACGGCACGTCCGACGAAGACGAGGATCTGGACGACGGCGACGCGGACGGTGACGGCATTCCGGACGCCGAGGAAGGCGCGGGCGACCTCGACGGCGACGGCACGCCGAACTACGAGGACCCCGATACCGACGGTGACGGGATCCCGGACGCCGAAGAGGGCACCGGCGATGTCGACGGCGACGGCACGCCGAACTACAAGGACGAGGACTCCGACGGCGACGGTATTTCCGACGCCGAAGAGGGCACCGGCGACGTCGACGGCGACGGCACGCCGAACTACAAGGACGAAGATTCGGACGGTGATGGCGTCCCCGACGCCGAGGAGGGCACCGGCGACGCCGACGACGACGGCGTGCCGGACTACCTCGACGACGACACCGACGGCAGCGCCGAAGAGGGCGACGACTCAGAAGAGGACGGGGACACCGACGATGACGGCATTCCCGACGATGAAGAAGGGACCGGTGACAGCGACGGCGACGGCACGCCGGATTATCAGGACGGAGATTCGGACGGCGACGGGATTCCTGACTCCGAAGAGGGGACCGGCGACACCGACGGCGACGGAACGCCGGACTACAAGGACGAAGACTCCGACGGCGACGGCATCCCTGACGCCGAGGAAGGCACCGACGACACCGACGGTGACGGCACGCCGGACTACAAGGACGAAGACTCCGACGGCGATGGCATCCCCGACGCCGAGGAAGGGGCTGGCGACGCCGACGGCGACGGCACCCCGAACTACAAGGACGAAGATTCGGATGACGACGGGATTCCCGACTCGGAGGAGGGCACCGGCGACGCCGACGACGACGGCACCCCGGATTACCTCGACGGCGACACCGACGGCAGCGCCGAGGAGGGCGACGACCCACAAGAGGACGGAGACTCCGACGGCGACGGGATTCCCGACGACGAGGAGGGGACCGATGACACCGACGGTGACGGCACGCCGGATTATCAGGACGGGGACTCCGACGGTGACGGCATCCCCGACGACGAGGAGGGCGCCGAGGACACCGACGGCGACGGCACGCCGGACTACAAGGACGGAGATTCGGACGGCGACGGCATTCCGGACGCCGAAGACGGAACCGAGGACACCGACGGCGACGGCACGCCGAACTACAAGGACGAGGATTCGGACGACGACGGGATTCCCGACTCGGAGGAGGGCACTGGCGACACCGACGGCGACGGCACGCCGGACTACAAGGACGGAGACTCCGACGGCGATGGGATTCCCGACGAGGAGGAGGGCACCGGCGACGCCGACGACGACGGCACGCCGAACTACAAGGACGGAGACTCCGACGGGGACGGGGTTCCCGACGCCGAAGAGGGCACCGGCGACGCCGACGACGACGGCACCCCGGATTACCTCGACGACACCGACGACAGCGCCGACGACGGCACCGACGACAGCACCGACGACGACACCGACGACAGCACCGACGACGACACCGATGACAGCGCCGACGACGACACCGACGACAGCGCCGGCGGCGACGGCGACGATGACGCGGCGTTCGACGGCGATGACGACGAACGTTCATCGGGCGCCGGCGGAACCGGCGATGGCACGCCGGACGGGAACGAAACCAACGGAACGTCGGGCGGCGAGGACCCCACCGAAGGGACTAATAGGAGCGAATCGAACGAGCAGCAGACGATAATCAACGCGTCTCTGAGCCGGTACGAGGTGACGGTCGGCGAGGAAATCGAGGTCATCGTCGAGGTCGAGAACGCGGCCGAGGAGCGCGACGAGTTCGTCCTCCAGGTGCGGAACGACGGCGATCTGGAGAAAACGGAGACGTTCACCCTCCCGGCGAACGGGACGACGACCTTCCGGGTGCCCTACCGCGTGACCGAGCCGGGGAACCACACCATCGAGGCGAACCAGACCGTGGCCGGCATCGTCCGCGCCGAGGCTGCCACCGCGACGGAGGAGGGTGGAGAGGATGACTCCGGTGGGGTGCTGCCCGGCGGCGGCGGCGTGCTCCCGACACTCGCTCTGGTCGTGTTGATCGGTGTGATACTAGCGATCATGGCATTCGCGCGGTACCGCGACCGGACCGACTCCTGA
PROTEIN sequence
Length: 1140
VTVYAEDNNGNVGEETVTFTVDTTAPTVSVSEPTDGETYGSEDVSLNVSADEDVSGWNYSLDNGNDQTFDPETETTLTGLDDGEHTVTVYAEDDSGNVGEETVTFTVDTTAPSITFLDPVNGTTYDSSNVPLNVSAYSLDNGGNQTFVSSTTLTNLSDGEHTVTVYAEDDGENVGTNTSTFFVSTSTETKDTDGDGIPDSEEGDGDIDGDGDPNLDDTDSDGDGIPDSEEGTGDADDDDTPNYKDTDSDGDGIPDSEEGIDDTDGDGTPNYKDEDSDDDGIPDSEEGVDDTDGDGTLNYKDGDSDGDGIPDADEGASDLDGDDTPNYKDTDSDGDGIPDAQEGSDDVDDDGIPNYKDTDTDGDGIPDLEEGDGDVDGDGTPNYKDEDSDGDGIPDAEEGTGDADDDGVPNYKDTDSDGDGIPDAEEGTGDADGDGVPDYLDDTDGTSDEDEDLDDGDADGDGIPDAEEGAGDLDGDGTPNYEDPDTDGDGIPDAEEGTGDVDGDGTPNYKDEDSDGDGISDAEEGTGDVDGDGTPNYKDEDSDGDGVPDAEEGTGDADDDGVPDYLDDDTDGSAEEGDDSEEDGDTDDDGIPDDEEGTGDSDGDGTPDYQDGDSDGDGIPDSEEGTGDTDGDGTPDYKDEDSDGDGIPDAEEGTDDTDGDGTPDYKDEDSDGDGIPDAEEGAGDADGDGTPNYKDEDSDDDGIPDSEEGTGDADDDGTPDYLDGDTDGSAEEGDDPQEDGDSDGDGIPDDEEGTDDTDGDGTPDYQDGDSDGDGIPDDEEGAEDTDGDGTPDYKDGDSDGDGIPDAEDGTEDTDGDGTPNYKDEDSDDDGIPDSEEGTGDTDGDGTPDYKDGDSDGDGIPDEEEGTGDADDDGTPNYKDGDSDGDGVPDAEEGTGDADDDGTPDYLDDTDDSADDGTDDSTDDDTDDSTDDDTDDSADDDTDDSAGGDGDDDAAFDGDDDERSSGAGGTGDGTPDGNETNGTSGGEDPTEGTNRSESNEQQTIINASLSRYEVTVGEEIEVIVEVENAAEERDEFVLQVRNDGDLEKTETFTLPANGTTTFRVPYRVTEPGNHTIEANQTVAGIVRAEAATATEEGGEDDSGGVLPGGGGVLPTLALVVLIGVILAIMAFARYRDRTDS*