SCNpilot_expt_500_p_scaffold_3575_curated_2

Organism: scnpilot_dereplicated_Caedibacter_1

near complete; RP 52 / 55; BSCG 51 / 51 (MC: 1); ASCG 11 / 38 (MC: 2)
Location: 1815..9149

Top 3 Functional Annotations

Value: Filamentous hemagglutinin family N-terminal domain protein id=2719303 bin=GWC2_Alphaproteobacteria_42_16 species=Shigella flexneri genus=Shigella taxon_order=Enterobacteriales taxon_class=Gammaproteobacteria phylum=Proteobacteria tax=GWC2_Alphaproteobacteria_42_16 organism_group=Alphaproteobacteria organism_desc=Rhodospirillales related? Good +
Algorithm: similarity
Source: UNIREF
DB: UNIREF100
  • Identity: 24.8
  • Coverage: 999.99
  • Bit_score: 243
  • Evalue: 1.60e-60

Value: Tax=RIFCSPHIGHO2_02_FULL_Alphaproteobacteria_42_30_curated
Algorithm: similarity
Source: UNIPROT
DB: UniProtKB
  • Identity: 29.6
  • Coverage: 999.99
  • Bit_score: 498
  • Evalue: 3.90e-137

Taxonomy

R_Alphaproteobacteria_42_30 → Alphaproteobacteria → Proteobacteria → Bacteria

Sequences

DNA sequence
Length: 7335
ATGTTTAAGAAATTTCTATGTTGGCTGCTGATTTTTCAGCACATAAATGTCTATGTTTTGCACGCAACGCCTGGGGACATCAAGGTTCTTTTTGAAAGAGAAGATGAGCATCATTTAGCTTCTCAACAGGGCAAGCCCTCTGCCTTGCACTTGAGTATCCTTAAAGAAGGATCGCAAGGGCAAATGGCTCCTTTGTTTCAAGGTACGTTTGAAACAGATCCGAACTCTCCTTTAACATTCAATCTTAAGCAAAGCTTGCCTAAAAGTCTTGAAGGATTCGAGGTGAAATTAGGTCTTGACAACCAAAGCCTTCATCTTTCCTCAACCCTTGAGGGAGAGGCTTTTTCGCTTCATTTAAACCCCGATGGGAAAGTCAATCTTCAAGAGATCAATTGTCTGCGGTATCTAGGGATTCATACATTTGGCGATATTCATACCCAGAGAAAAATTGAAGCAAAAGAGCTTTCTCTTGAAGGGAGAGAGATCAGAACATCCGTTCCTCTTTTTGTTGAAAAATTCCATGTATCTGGAGCTTTAAAAAACACATCTGAAGGCCAACTTTATATTGGAAAAGAGGCCCATATTCGAGACGGAAATTTTGTTAATGAAGGACGCATTTGGGGTAGTGATCAGAGCTTTCTTGATCTTTATGGGAACAATTTTACGAATGAAAACATCACACAGAAGAAGTTTCTCGCTCCGGATCAGTCAAACACTCCCCATAGGTATACGAGTGGGAAAAAAAGATATTCTTTTCAAACGCAAATCATGGGGGCTGGTCGCTTTTCTATTAGGGGGGTTGGGACATTTTTAAATAAATCCCTTATTCTTCCCGAAGAAGGCAAAAAGGATGTTTCCCTTACGATAGAAGCCAAAGAACTCCATAACACGGGTCAAGGACTCATTGCGAGTGAGGATCTCACCTTAAGGTGTCTTGGGAATCTCCTAAATGAAGGAGAGATGCGAGGCATAACAGATCTTACCTTAGATGTGAAAGGTAACCTTGAGAACAGGAGTAAAATTGAAAGTTTAGGTTCTTTAAGAGCGGATGTTCTAGGAAATCTGATGAATCGGCGGGGGGCTACATTCACAGGAGAAAGACTAATCCTGAATGTCGAAGGCTTAAATCTTAATGAGGGAAGACTCTCTGCACGCTTTGCGCTGATCAAAGGAAAGCTAGAGAATTTTGGTGGAATCACGATTGAAGAAGAAGGAGAGATTGATAGTGGTGACGGTGACCTTATCAATAGGGATACGGCCCATATTATGTCCCATGGCGTTTTAAGGCTCTATGGGCAGTCACTTCTCAATGAAAGAGGACCTTCGTCTCGTGCTCTTATTGATGCGCGCCATCTTATGATGAAAGTTGGGCATGTTCATAATGATGGAGAAGTCATAGCCCGTGAATCAGCCAATCTTCATTTTCCCCTTTTTGTGAATGGTCCTTTAGGGATTTGGAATCTTCCCGGGTCAACCCAAACATCTACAGATAACTTTAAAAATCATGAAAGTGTTTATAGCGAAAACCTTCAAATGACAGCTGGGGTTTTTGAAAATTTTGGCAGAATTATTTCTGCTGGAATATTTACGATTATTGTTGAAGAAGCATTTAGAAATGTTAAAGACGCTCTTCTTTCAGCTTTTCAGGGAATTTTTCTCAAAGGTCAAGGTAAGGTTCACAATCAAGGGATTGTGGCAAGTCACGGAAAGGTTGAAATTGAAGCGGCTCAGACATTGAATCAAGGACAAATCCTTTCAGATGAAGGTGTTGGTATTCGAACAGAAGGGGGGAGAAGGTCTGACCCAAATAATATTCCTGATTTTATGAATGAGGGATTGATTTTAAGTCCTGATATCAGTGTGGACGTCCAAAAAGGCCTTAATAAAGGAGACGGTTTAATTGCGAAAAGATCTCTTACGCTTACAGCGCATGACACCTTTATAAATGAAAGCTCTGCCCTTGGAGGCACATCCATAGATCTTTTAGGAGAAGGCATCTT
CACGAATGCTCGGACACTTATCAGTCAGGGAATTTTAAGGTCTCAAGTGAGACATTTAGAAAATATAGGACATATTTTAGCCAAAACCGGGATTTTAGAAGGCAGCTCTCTTCATCATACAGGCTCACTGCATTTAGAAGACGCTTCAAGAGTCACATTTAAAGATATCCAAGTTTCTGAAGGGGCTAGCATGACTTCCCAAGGAGGACTGCACTTTCAGTTTTCTAGGCTTGATAATAAGGGGGAAATTGAAACTGGTGAAGTGACAATTTTTGAAGGTCAACATTTGAAGAATGAGGGAGAAATCCAAAGTGCTGTTTTACAGGCAAGAGGCCTTTTAGGACATGAAAGTACGTTTTTAAATCGTGGTGTGTTTCATGGAGTTCAAAACGTTAGGATAACAACTTCTGAAATTGAGAATCAAAATGAGATATCTTCTGATGGATCAGTGACAGTTGATGTTTCCAAATCATTTCTCAATTCTGCGGATGGGGTGGTTGCATCCTCAAAAGACCTTATTATTTCTGGGGATGGAAAAGTTAAAAATTATGGAAGCCTTGAGAGTGATGGACAAGTTTCTCTTGCCATGCTGCAGGTCGATAACAGGGGGGCAATCATTGCAGATGAAAGTGTGATGATCAGGACGGGAGGAGATGGTTCTAAGTCAGAAGGTGATCTCGATTTTATGAATGAGGGAGCCATCTTAAGTCCTGAGATCATCCTCAGATCGCACACAGGATTCAATGGGGGAGAAGGCCTTATTGCGAAAAGATCTCTCACTCTTACGGCCCATAGGACTTTTACAAATCATGGAGATATTTTTGGAGGAAAATCCGTAGATCTTTTAGGAAAAGGCACCTTCACAAATACCAAGATACTTGCCAGCCAGGGAGAATTGAATTCCGAAGTGAGACATTTAGAAAATACAGGAGATATTCAAGCCCAAACGGGGATCTTAAAAGGAGACTCTCTTCATCATGCAGGATCGATGCATTTGCATGAACCATCAGGAATAGCCTTTAAAAATATTGAGACTTCTGAAGAGTCTAGCGTTACTTCACAAGGAGGAATTCGCTTTGGATTTTCTACGTTTGACAACAAAGGACGAATTTTAACGGAAGGTGAGACCTCTTTTACAGGTCAAGGCTTGAGGAATCAAGGAGAGGTTCAAGCTTCTCTTTTGCAAGTTAGAGATGGCACCGGAGCTGAAAATATATTTTCCAATGATGGGGTCTTCCATGGATACCAAGGAGTCACTGTTACAGCATCCCACGTTGAAAATCAGGATGAAATATCTTCGCAAGGTTCTCTTTTTCTTGATATTAGAAAATCTTTTACAAATGCCCTTGGGGCTTTGGTTTCATCTCTAAAGGAGATGGTCGTCTCTGGGAAAGGAGATGTGCAGAATCATGGAGACATGGAAAGCCAAACATCTATTGATATTGCCACATCAACTACACAAAACAAAGGTCGAGTGATTGCAGAAGAGAGCGTCTCCATCCACACGAAAAAAACGACTCTACCCACAGGAGAAAAACAAAAAGACTTTGAAAATGACGGGCTTGTGATGAGCTCACATGTTACGGTTCACTCTGACACAGGGTTAAATGCAGGCTCTGTGAACTACTCTATTGATGCGAACTCGGATGATCTTCAAGGTCTTATCGCAACGGATCGTTTAACTCTTTTGGTAGGAAATAAGTTTACAAATCACGGACAGATGGTGGGAGGAAGTCTTGTTAACCTCGAAGGGGATGGCACTTTTGACAATGAAGGGGATATTCATAAAAGTGATCTGATAGAATCCCGTCTTCAAGGATTTGATAATAAGGGTGTGGCGCGTGCTCATCATTCCGTTTCCTTTGATCAACTGATGCAAGATCTGAAAAACAAAGGGCTGATTCTTTCAGAAGGAAGAGTTTCAATAAACTCGACATCTCGTGTTTTAAATGAAGGGGTCATTGAAGGAAGAGACGGGGAAACATCTCTAAGAGCGGAAG
AGCTTGAGAATCGTGGACTCTTGAGTGCCCTTGGGAGTCCTCTTGTTGTCACAGCAACACAAGCAAAAAACCATCATCTTATCTTTGGAAAGAAGGGGGTGGATTTAACAATTGGCAAAATTTTTACAAATGAGAATGGTGAAGATTCTGCATTTGGAAGAGTCGAGAGTCATGGATCCCTGACAGTCCAAGGAAAAGGAAGTGTACACAACAAAGGATCGATTGCGTCCCAAGGAGATGCTGATATCAGGACAGATCTTGACAATGAAGGTATATTGCGAAGTGCGCATGATGTTGTCTTTTCAGCCTCTGATAAAGCTCTCACCAATACGGGTGAAATCCTAGCAGATGAGACTGTCCGCATTGCAACTGAAAGACCGTTGGAGAATACAGGTCTTGTCCAAGGACAAGAAACGTCCCTTGCAGCTTTAGGGACGTTTCACAATGCAGGAGAGATTCTAGCAACAAATGGTCAAGCCCTTTTAACCCTTGGAGAAGGTTTGAATAAAGGACGGGTGATTGGAAGTGAGGGCGTTGATATTACTGTTGTGAAAACTCTCACCAACCAAGGGCACGTTCTTTCTCCAGGATCAACCGTCTTTCATGGAACAGGCTCATTAGAGAATAGTGAGAGTGGTGTTTTAATGGGAGATAAGAGTTTAGACATAAGTAATCTTGCTCTTGAAAATAAAGGTCTTTTTGGATCAGCTCGGGATCTTGCTCTTCACCATCTGCAAGGCCCCTTACAAAATACAGCTTCTATTATCTCTCAAGGTAGTATTCGTATTGCAGAGACAGCTTCAGTTCATAATTCCGGACATATTCAAGGGACTACCCACACAGCCGTGGTTGTTAAAGAGTTAGACAATAAAGGCGAGATTTCATCGCTAGAAGGATCCGTTGATCTCATTGTTCAGAAAGGGGCGAATAGGGAAGGTAGTGTTATCGTTGCAAGGACAAATCTTGCTGTGAATGCGCAAACAGCATTTGTGAATAATGGTGTAATGAGTGGCGAAGCTTCAACCACTCTTAATGGAGAGAGGTTTACAAATCATAACCTCTTACAAAGCAGAGGAGCACTGAAAGTTGATCTTCAGAGCTTAACAAACCATAAGGAGATCCTAGCAAAGGAAGCAGATTTTTCTCTCCATACGCTTGAGAATGCGAGAGATATTTTTATTGGAGATGATTTACGTCTTGATGTTCAGCAAGGGGTCAATGCAGGGGATATTCAAGCTCAAACGTTAACGCTAGATGTAAGGGCTCCTGTCAGCTCTACAGATCAATTTCACAATAAGAAAACCTTGTATGCATCCTCTCATTTAGCTCTTCAAGGAAAAGGAATGGTTTCCAATGAAAGCGATATCCTTTCTGAAGGAACGCTAGAGGTTCACAGTAAAGCCCTTATCAATGGAGAGAAAGGCCAGCTTCAAGCGACGAGAGGGATTAACATTTCAAGTACAACACACGTTGAAAACGCTTTTGGCGCTAAGATTTTAAGTCAATCAACCATTGAAGTCGGGAGAGAGGCGAGTGTTACCAATCAAGGATTGATCTCAGGAAAGTCTCTCGTTTTTCAACAACCCTCCTATACAGTTAGTGGAAGACTCGAAGCTAAAGGAGATATTACGTTTCCTCATATTCAAACACTTCACACCTCAAAAGGAAGTTCTCTTTATACTGAGGAAGGACAAATTGTCCTGCCTCAAGTTCAGGTCGTGGATCATGCAGGCTCACTTTTAAGTCAGAAAAGTATCTCCTTACCAAACCTTCTCAGTTTTAAGAATAATGGATCTTTTCAATCGAATGGAGAAGTTACTCTTGTCTCTTCTGGAGAGTTGGCTAATCACCATCGAATCGTGGGGCGTTCGGGCGTCTCACTCAAGGCACAAAAAGTATCCAATCAAGGAGGTGTTGTCTCTGAGGACACGATTACCATTCAATCACCTCAAATCGATAACTCTCGTCTTATTGTCGGAGCAAAAGGGGTAAAGGTTGTC
TCGTCCCAGCCGATTGTCCAAGGCCAAACAGGTCGAATGGAAAGTAGGGGAGGCTACCTTAGGCTTGAATGTCCAGAGTTTAAGGGGGCGGGAGAGTTTGTAGCAAAAAAGGTTGAGATTCACTCCACTGGCAAAAAGCCTTTTTCCTTAGATGGCATTCAGATAAAATCCCTTGAAGATTTAGCACTCACCGTTTCTTATGGATGGGATTTAAAAGGCAAAGCCCAAAGCTTTGGTCATGGGATTCATTTAACAGGGCCTCTTTTGAATCCCTCCCATTTGACGATTGATGGGGATTTTATATGGGATCACGCGGGAGGAGATCTTCAAAACAAATTCAATATGATTGTAGCTGGCCTTTTGTCCTTACGCTTGAAGGGTCAGTGGATCAATTTGGCAGATGTTCAGGCTCGAGGAGGAGCAGCCGTTGTTGCAACCCAGATCCTTAACAAAGGAACGTTTTATTCTTATGCTTCCACAAGCTTTGAATGCCCAAAAGGTTTTGAGAATTATCATCAACTGGAGGTTTGGGGAGATATGAATCTTCTGGCATCCCAGGGTTCAATTGTCAATTATCCTGGATCTCACTTCCATCAAAAGACAATTCCAGGTGGAAAAAACCACGTTGTTTTTAAAGCCCTTCAAGATATTAAGGATATTAGTGGAACGGTTGTGATCGAAGGGGTGCTGGATAGCACATCGAAGGTGTTTTCCAATGAAGTGGCTCCTCCGACTCAACAGAATGGAACAAAAATTATTATATTTAAAGGTCAGAACACGTCCGTTCCTGCTACGCGAGAGGTTCCTGCTCCTCCCGCACGCTTTTATGCAGGCAGTGCCAATGTGAGGGCAGAAAGTCTTCAAAATCTAGGAGGGACTCTCTCTACATCTGACGGGTTTTATTTTGAAGGGAATAAGGTCAAGAATCTATCCCGTACTTATAAGGAAGTTAGCCAAAAGGAACACTCATGGACAGAAAAAAAGAGGAAGAGGTGGTATAAAAAGAAGAAACGGATAACAAAGTATGAATTAATTCCTTCTACAACTGTCATTGATGATCTCACAAGCGTTGCTAAACTTTACTCTCGAGGCGTCCTTAAACTTGTTGTCAATGGAACGGTCTTTGAGAAAACATTTCCTAAACAGAAAGAAGAATCCACTCAACGACGCATCAGAAGAAAAAGGCAAGGAGAAGATCCTTTTAAGGCTTTGGTGGAAAATACAGGCGATATTCGGGCATCTCTTCTGACAATGAATGTTCGGGAGTATTTAATCAACTGTGTCAGTTCTGATAATGTAATGATTATTTTGAAGGAAGGACTGACAGATGGAAGT
Protein sequence
Length: 2445
MFKKFLCWLLIFQHINVYVLHATPGDIKVLFEREDEHHLASQQGKPSALHLSILKEGSQGQMAPLFQGTFETDPNSPLTFNLKQSLPKSLEGFEVKLGLDNQSLHLSSTLEGEAFSLHLNPDGKVNLQEINCLRYLGIHTFGDIHTQRKIEAKELSLEGREIRTSVPLFVEKFHVSGALKNTSEGQLYIGKEAHIRDGNFVNEGRIWGSDQSFLDLYGNNFTNENITQKKFLAPDQSNTPHRYTSGKKRYSFQTQIMGAGRFSIRGVGTFLNKSLILPEEGKKDVSLTIEAKELHNTGQGLIASEDLTLRCLGNLLNEGEMRGITDLTLDVKGNLENRSKIESLGSLRADVLGNLMNRRGATFTGERLILNVEGLNLNEGRLSARFALIKGKLENFGGITIEEEGEIDSGDGDLINRDTAHIMSHGVLRLYGQSLLNERGPSSRALIDARHLMMKVGHVHNDGEVIARESANLHFPLFVNGPLGIWNLPGSTQTSTDNFKNHESVYSENLQMTAGVFENFGRIISAGIFTIIVEEAFRNVKDALLSAFQGIFLKGQGKVHNQGIVASHGKVEIEAAQTLNQGQILSDEGVGIRTEGGRRSDPNNIPDFMNEGLILSPDISVDVQKGLNKGDGLIAKRSLTLTAHDTFINESSALGGTSIDLLGEGIFTNARTLISQGILRSQVRHLENIGHILAKTGILEGSSLHHTGSLHLEDASRVTFKDIQVSEGASMTSQGGLHFQFSRLDNKGEIETGEVTIFEGQHLKNEGEIQSAVLQARGLLGHESTFLNRGVFHGVQNVRITTSEIENQNEISSDGSVTVDVSKSFLNSADGVVASSKDLIISGDGKVKNYGSLESDGQVSLAMLQVDNRGAIIADESVMIRTGGDGSKSEGDLDFMNEGAILSPEIILRSHTGFNGGEGLIAKRSLTLTAHRTFTNHGDIFGGKSVDLLGKGTFTNTKILASQGELNSEVRHLENTGDIQAQTGILKGDSLHHAGSMHLHEPSGIAFKNIETSEESSVTSQGGIRFGFSTFDNKGRILTEGETSFTGQGLRNQGEVQASLLQVRDGTGAENIFSNDGVFHGYQGVTVTASHVENQDEISSQGSLFLDIRKSFTNALGALVSSLKEMVVSGKGDVQNHGDMESQTSIDIATSTTQNKGRVIAEESVSIHTKKTTLPTGEKQKDFENDGLVMSSHVTVHSDTGLNAGSVNYSIDANSDDLQGLIATDRLTLLVGNKFTNHGQMVGGSLVNLEGDGTFDNEGDIHKSDLIESRLQGFDNKGVARAHHSVSFDQLMQDLKNKGLILSEGRVSINSTSRVLNEGVIEGRDGETSLRAEELENRGLLSALGSPLVVTATQAKNHHLIFGKKGVDLTIGKIFTNENGEDSAFGRVESHGSLTVQGKGSVHNKGSIASQGDADIRTDLDNEGILRSAHDVVFSASDKALTNTGEILADETVRIATERPLENTGLVQGQETSLAALGTFHNAGEILATNGQALLTLGEGLNKGRVIGSEGVDITVVKTLTNQGHVLSPGSTVFHGTGSLENSESGVLMGDKSLDISNLALENKGLFGSARDLALHHLQGPLQNTASIISQGSIRIAETASVHNSGHIQGTTHTAVVVKELDNKGEISSLEGSVDLIVQKGANREGSVIVARTNLAVNAQTAFVNNGVMSGEASTTLNGERFTNHNLLQSRGALKVDLQSLTNHKEILAKEADFSLHTLENARDIFIGDDLRLDVQQGVNAGDIQAQTLTLDVRAPVSSTDQFHNKKTLYASSHLALQGKGMVSNESDILSEGTLEVHSKALINGEKGQLQATRGINISSTTHVENAFGAKILSQSTIEVGREASVTNQGLISGKSLVFQQPSYTVSGRLEAKGDITFPHIQTLHTSKGSSLYTEEGQIVLPQVQVVDHAGSLLSQKSISLPNLLSFKNNGSFQSNGEVTLVSSGELANHHRIVGRSGVSLKAQKVSNQGGVVSEDTITIQSPQIDNSRLIVGAKGVKVV
SSQPIVQGQTGRMESRGGYLRLECPEFKGAGEFVAKKVEIHSTGKKPFSLDGIQIKSLEDLALTVSYGWDLKGKAQSFGHGIHLTGPLLNPSHLTIDGDFIWDHAGGDLQNKFNMIVAGLLSLRLKGQWINLADVQARGGAAVVATQILNKGTFYSYASTSFECPKGFENYHQLEVWGDMNLLASQGSIVNYPGSHFHQKTIPGGKNHVVFKALQDIKDISGTVVIEGVLDSTSKVFSNEVAPPTQQNGTKIIIFKGQNTSVPATREVPAPPARFYAGSANVRAESLQNLGGTLSTSDGFYFEGNKVKNLSRTYKEVSQKEHSWTEKKRKRWYKKKKRITKYELIPSTTVIDDLTSVAKLYSRGVLKLVVNGTVFEKTFPKQKEESTQRRIRRKRQGEDPFKALVENTGDIRASLLTMNVREYLINCVSSDNVMIILKEGLTDGS
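
The coordinates and lengths reported above can be cross-checked with a little arithmetic. The sketch below is not part of the ggKbase record: the helper names `span_length` and `translate` are hypothetical, and it assumes inclusive 1-based coordinates and the standard genetic code. It checks that the location 1815..9149 spans 7335 bases, that 7335 bases correspond to 2445 codons, and that the first ten codons of the DNA sequence translate to the first ten residues of the protein sequence.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# (assumption: the record was translated with the standard amino-acid
# assignments; the record itself does not state the table used).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate range such as 1815..9149."""
    return end - start + 1


def translate(dna: str) -> str:
    """Translate complete codons of an unambiguous DNA string (A, C, G, T only).

    Trailing bases that do not form a full codon are ignored; any other
    character raises ValueError from str.index.
    """
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        first, second, third = (BASES.index(base) for base in dna[i:i + 3])
        protein.append(AMINO_ACIDS[16 * first + 4 * second + third])
    return "".join(protein)


if __name__ == "__main__":
    dna_length = span_length(1815, 9149)
    print(dna_length)        # 7335, the reported DNA length
    print(dna_length // 3)   # 2445, the reported protein length
    # First 30 bases of the DNA sequence above.
    print(translate("ATGTTTAAGAAATTTCTATGTTGGCTGCTG"))  # MFKKFLCWLL
```

The DNA sequence shown ends in the codons for Asp-Gly-Ser, matching the last three protein residues (DGS), so the 7335 bases divide into exactly 2445 codons with no stop codon included in the displayed span.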