Monarch geneset OGS2.0

DPOGS213855
Transcript: DPOGS213855-TA (3585 bp)
Protein: DPOGS213855-PA (1194 aa)
Genomic position: DPSCF300361 - 30133-79299
RNAseq coverage: 427x (Rank: top 29%)
Annotation
Heliconius | HMEL007128 | 0.0 | 55.11%
Bombyx | BGIBMGA009663-TA | 2e-166 | 58.67%
Drosophila | E(Pc)-PA | 2e-124 | 49.08%
EBI UniRef50 | UniRef50_E0VB69 | 2e-129 | 55.08% | Enhancer of polycomb, putative n=2 Tax=Pediculus humanus corporis RepID=E0VB69_PEDHC
NCBI RefSeq | XP_972128.2 | 2e-131 | 56.88% | PREDICTED: similar to PX domain containing serine/threonine kinase [Tribolium castaneum]
NCBI nr blastp | gi|270009665 | 7e-131 | 56.88% | hypothetical protein TcasGA2_TC008956 [Tribolium castaneum]
NCBI nr blastx | gi|270009665 | 1e-131 | 56.52% | hypothetical protein TcasGA2_TC008956 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain | [6-141] | IPR019542 | 2.7e-25 | Enhancer of polycomb-like, N-terminal
Orthology group | MCL25634 | Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213855-TA
ATGTCGAAGCTCTCGTTTAGGGCGAGGGCCCTGGATGCGTCTAAACCTATGCCCATATATCTCGCCGAGGAGCTCCCGGATTTACCGGACTACTCGGCGATTAATCGTGCTGTACCTCAAATGCCCTCTGGTATGGAGAAAGAGGAGGAAAGTGAGCATCACCTCCAGAGGGCTATATCAGGCACGGGCCTCATCATACCGACGCCGGAGGTATGCCAGGTGTCGGACGTGGAGTTTTACGAGGCCTGCTACCCGCCGGACTACAAGATGCCCAAACAGCATATACACATGCAGCCGCTATGGGAGGAACAAGAGGCGCCGGAGTATGACATCGACACAGAGGACGAGAGGTGGCTGAAACAACAGAGGCATCCAGAGTTGACAGACTTAAAGTTCGAGCAAATGATGGACAAGTTGGAGAAGAGCTCCGGTCAGACGGTTGTGACCCTCAACGAGGCCAAGCTTCTGCTGGAGAGGCACGACGACCTGGTCATAGCCGTGTACGACTACTGGCTCAACAAGCGGCTCAGCACTCAACATCCGCTGGTGCTATCTGTGAAGACGGAAAACCGCCCCGGACAATCCACCAACAACCCCTACCTCGCGTTCAGAAGACGGACGGAGAAAATGCAGACCAGGAAAAACAGGAAAAACGACGAGAGTTCATACGAGAAAATGCTGAAGCTACGCCGTGATCTGGCGCGAGCTCTGTCTCTGTTGGAGTTGGTGGCGAGGAGGGAGAGAGCCAAGCGGGAGCTGGTGCGGCTCACGGCGCTGCTGGCTGAGAGGAGGTACGGCGCTGGGGACTACACGCACCCCGCCGCTACCGACAACACACACAGGCCTACATACCAAGTACCGATCACAGCGACCAGCTTCAGACGGGAGTACGCCGCGCCCTACCCGCCACCCGCGCCGCTAGACGCCAGACAGCGTGAGAAACGTCCCTACAAGAGACGGAAGCATCGCCACTACGTACCCGCGGTTCCGCACAGAGATTCCGGCGTCTGTACGTCCTCGGAGGAGGAGGTCGCCCCGCTCGACGACGGACCCTTCGCCTTCAGGCGCAAGCCGGGGTGTTTCTATGAGATGCCGACGGCCACCTTATACGGTGACCCTGTGGACCCCGACGATACAAGCAAGGACGGCCTATTCCAACACGAACTGGACGAGAAGACCAGGTTCACGCTGACATCTCTGCGTCTGCCGTACTCGCATTGCGTGGGCTTCGCTCGCCGGAGACGAGGTCGGGGCGGCCGCGTGATGCTGGACCGGATACGGACTCCCCTCGACGACCTGTGGAGGAGAGAGTGGAAGTGTGTGTGCCTCCCGCACGAGGAGAGGGAGCGGAGGGCCGAGGAGGTCGAACTGGAGACTAAGCCGCACAGAAGCCCGAAGGAAATGAAGACGGACTACCACGCGGGCGGGAAGTACCCGTGGCGACACGCGTTCAGGCGACATCTGGCCGATAACCCCCACCTGTGGACCGAACCTGTCGGCGATGACGTCTTAGACGTCAAGGCCGACGTCCACGTGAAGATCAACGGCGAAGACGTCAAAATAGACGTCGACGAGGTCAAGATAGAGCCGATGGACGTCGACAGTGAAAGAGTTTTACCCGAAACGGACATTAGTGATAGTGTTGAGGACGTTAGTGAAAAGAGAACTATAGACAGACTAGTTACTGACAATCTCAATAGGGTTATAAGGAAGAGGACCTGGAGCGGCTGCACCGACAGCAGCTACGACTCAGATGACAGTCTGCAGCCTGTAGAGAAGGAGTTCGAGAAATTCATCAACAAAGTCAATAGGAAATGGTTACATTTCCGACCGAAAACCCCACCGCCGTCACCTCCATACGTGGATAACCCCGCGGAGGATCAACTTCCGCTGGCCGTGGACACGCCGCTCGCCGTGGAACTCACCTCCAAACCCTCGGTCGGCGCCCTCGACACGTTCACCACCTCCGAGTTCACGCTCTCCGACCTCTACGACATCAG
CGTGCCGGAAGCCAACGGCCCGTCGGAGATCAGCGACGACCTCCCCGAGAACTTCACGGGCTTCACGGACGATCAGGTCGAGAGCATCCTCTCGGACACGGATCTGAAGGCGCTCGAAGACAAGAAGTCGACGGACGACCTGCTGGAGGAGCTGGTGAGGGATGTGGACACGGGGAAATCTTTTTTATGTCAAGGCCGGAGCCCTGGGACGCGGGCTGGCTCTGGCCGCAACGAAGCGTTCGGCTGCAGCGTAGTGGACGTGCGATCGGAGCGGGAAGCCGTATACGTGCCGGTGGAGACCCGGCCTCCACCCCCGGCGACTCCGCCCCCGCCGCCCAGGAACGAAATCCCGGCGACCCTCAGGAAACCAGCGCCCCCGCCGCCCAGGCGACCGCCCAGCGACCCAGACACCACGCAGATCGTCACGGTCGCCGTCTCCGACAGTCTCAAGGTGCGTCTGGCTAGTCAAGGAGCGACGGCGGCCACCGCGGCCGGGACTGTGGTCGGCTTACTGCAGAACGGACCATTCGCCACTATGCTGCCAGTCGCAAGCGTCGCGAGCGTGGCGAGTGTAGCGAACGTCACGGCCGGAAGCGGGAAAATGACGAATGTCGCAAACGTGGCTAACGTGGCCAATGTAGCTAATGTTGCTAACGTGGCTAGCGTCGCCAACGTGAACGTGTCAGTGGCGAACGCCAATCGTCGCGTCACGCCGTTCGTCCAGTTAGCTCCGGCCGCGTTAGGTCACAAACCTCTCCAGCTGCACCACTCGCCGTCGGTAGTGGTCGGGCCGCCCGTGCACCACGTGACGCCGAGCAAGCTGAAAGTGTTGCACGCACACCCACTGACCAACTCGCAACGAGCACAACTGTTCGCACAGAACCGCTCCCTGGCCCAGCTGCCGGGCATGGTGTCCCTCGCCGCTCTAGGCGACGCCAAGATAAAACCTCACGGCTCAGTGGCGCAGTACTACGAGATAAAGGGCGGCCAGCTGGGCAAACCGCACCTGGTCAACGTGCTCCGACAACCGCCGCCCAAGACACAAGCAGCGAGGATCGACCTGAACGACGCCAAGAAACGACCGTTCATATTCGACGGCACGCTGAAAGGGAACATGGCCGCCGCGAGTAGACCGCACCGGGTCAGCGTGAGCCTGGACGGGCGGCAGCTGGTGCGGGCGGCGCTGCCCGCGGTGCGGCACCAGATACAGCTGAAGAACAGGACGCTGCAGGTGGCCGCGCCCAGCGCCCGCTCCGCCGCCGAGCCCTCCACCAGCATCACGATAGCGCCCACCAAGACCGCCACCATCGCCAGCTCGGTGGTCGCCAATCTTCTCCAGAAGAACGTCCAGCTGCCCAAGGGTCAGAAGATCGCCATATCCGGCCCCGGCGGACAGGCGCTCGCGGCCAACGTCCAGGCCATCGCCTTCACCACGGCACAGCTCAAGGCGCGACAGAGCAGGATGATGCCGCAGACAAGACCGCCTCCTGTAGCCGAGATAGTGGAAACCTCCTCCGCACCTAGCGCGGCCCCCGACGACGAAGACGGCGTCCGGCGCACCGAGACTATGATGGAGGTCACGTGA

Protein sequence:

>DPOGS213855-PA
MSKLSFRARALDASKPMPIYLAEELPDLPDYSAINRAVPQMPSGMEKEEESEHHLQRAISGTGLIIPTPEVCQVSDVEFYEACYPPDYKMPKQHIHMQPLWEEQEAPEYDIDTEDERWLKQQRHPELTDLKFEQMMDKLEKSSGQTVVTLNEAKLLLERHDDLVIAVYDYWLNKRLSTQHPLVLSVKTENRPGQSTNNPYLAFRRRTEKMQTRKNRKNDESSYEKMLKLRRDLARALSLLELVARRERAKRELVRLTALLAERRYGAGDYTHPAATDNTHRPTYQVPITATSFRREYAAPYPPPAPLDARQREKRPYKRRKHRHYVPAVPHRDSGVCTSSEEEVAPLDDGPFAFRRKPGCFYEMPTATLYGDPVDPDDTSKDGLFQHELDEKTRFTLTSLRLPYSHCVGFARRRRGRGGRVMLDRIRTPLDDLWRREWKCVCLPHEERERRAEEVELETKPHRSPKEMKTDYHAGGKYPWRHAFRRHLADNPHLWTEPVGDDVLDVKADVHVKINGEDVKIDVDEVKIEPMDVDSERVLPETDISDSVEDVSEKRTIDRLVTDNLNRVIRKRTWSGCTDSSYDSDDSLQPVEKEFEKFINKVNRKWLHFRPKTPPPSPPYVDNPAEDQLPLAVDTPLAVELTSKPSVGALDTFTTSEFTLSDLYDISVPEANGPSEISDDLPENFTGFTDDQVESILSDTDLKALEDKKSTDDLLEELVRDVDTGKSFLCQGRSPGTRAGSGRNEAFGCSVVDVRSEREAVYVPVETRPPPPATPPPPPRNEIPATLRKPAPPPPRRPPSDPDTTQIVTVAVSDSLKVRLASQGATAATAAGTVVGLLQNGPFATMLPVASVASVASVANVTAGSGKMTNVANVANVANVANVANVASVANVNVSVANANRRVTPFVQLAPAALGHKPLQLHHSPSVVVGPPVHHVTPSKLKVLHAHPLTNSQRAQLFAQNRSLAQLPGMVSLAALGDAKIKPHGSVAQYYEIKGGQLGKPHLVNVLRQPPPKTQAARIDLNDAKKRPFIFDGTLKGNMAAASRPHRVSVSLDGRQLVRAALPAVRHQIQLKNRTLQVAAPSARSAAEPSTSITIAPTKTATIASSVVANLLQKNVQLPKGQKIAISGPGGQALAANVQAIAFTTAQLKARQSRMMPQTRPPPVAEIVETSSAPSAAPDDEDGVRRTETMMEVT-
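
Consistency check (illustrative sketch, not part of the original record):

The transcript and protein lengths listed above should agree if the transcript is a complete coding sequence: 1194 codons plus one stop codon gives 3585 bp. The Python sketch below checks that arithmetic and translates the first 15 and last 9 bases of DPOGS213855-TA, assuming the standard genetic code (NCBI translation table 1). The function names `translate` and `expected_cds_length` are hypothetical helpers introduced here for illustration. The stop codon is printed as `*`, whereas the record writes it as `-`.

```python
# Standard genetic code (NCBI translation table 1), codons ordered by base T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_cds_length(protein_length, with_stop=True):
    """Length in bp of a CDS encoding protein_length residues, optionally plus a stop codon."""
    return 3 * (protein_length + (1 if with_stop else 0))


print(translate("ATGTCGAAGCTCTCG"))  # first 15 bases of the transcript -> MSKLS
print(translate("GTCACGTGA"))        # last 9 bases of the transcript -> VT*
print(expected_cds_length(1194))     # -> 3585
```

Both translated fragments match the start (MSKLS) and end (VT followed by the stop marker) of DPOGS213855-PA.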