Monarch geneset OGS2.0

DPOGS204704
TranscriptDPOGS204704-TA5352 bp
ProteinDPOGS204704-PA1783 aa
Genomic positionDPSCF300170 + 600040-605463
RNAseq coverage18x (Rank: top 80%)
Annotation
HeliconiusHMEL0031970.067.47% 
BombyxBGIBMGA007483-TA0.062.82% 
Drosophila% 
EBI UniRef50UniRef50_UPI00022C8E343e-15126.76%UPI00022C8E34 related cluster n=2 Tax=unknown RepID=UPI00022C8E34
NCBI RefSeqXP_002430728.11e-13126.28%hypothetical protein Phum_PHUM496580 [Pediculus humanus corporis]
NCBI nr blastpgi|3504014361e-15026.76%PREDICTED: hypothetical protein LOC100740935 [Bombus impatiens]
NCBI nr blastxgi|3504014362e-16426.80%PREDICTED: hypothetical protein LOC100740935 [Bombus impatiens]
Group
KEGG pathway 
Orthology groupMCL17377 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204704-TA
ATGGCAGAGATTTTAAAGGAGTGGCTGTCAAAAAGATTACAACGTCCTATAAAATGGGAAGCTGAAGAATTTGGGGACATCATGAAGAATGGATATATAATATCCTGTGTACTGAGAAGTTACTATGTTATCGACGACGAGAAACATTACCTGATGAGATCTAGCAATTTAAAGGAAGATATAACAAGCAATTGGAAACTTTTGAGGGAATGGTTACGAGGTATCGAAATAAATTTGAGTGACACAGATTTAACTAATATTATGAATGGAAAAGGTTCAGCACTACTTCGTTTATTCTATCAACTGTTCCTTCATTTGGATAAACGAGATAGGATAAATTTTATCAAAGAAGAAAGGAAAAAAGCATCTGCCTTAGTAGAAAAGTTGGATCATCGATTCACAGTCAATAAAGTGGATGAAGAGCGGCCATCAGTTGTTGATGACCTTTCGAAACCGCTTCTAGATCAAAGACAATTTATTGACTGGCAGAGAAAGAAGGCCAAGGAAGTACAAGAAATTTACGATTTTATAAGACACAAATACTCCAAGATGCTTTCAAGAATAGATGAATCTAAAACTCCCTTTCAGAACCCCCCAGTTAAGCCGAAAAAAAAACCGAATAAAGACAAAAAGGAAATGGATAAATTTGAAAAAAGATATCCTTGTGAATTTCATAACTACACTTATGAGGAATTATTAAACTTAGAAGAAAAGGCGCTTGAAAAAAAAAATTCACTTATAGATACAGAATGGGCTAAAAACTATATGGACAACTTATTTACGCGAATGCATAACAAATCGGATTCAGAAGAGTTCCAGAAGCAATTGAGAAACGTTTTGAGCGGCTCATTATGGGACCTTTCAGTTGCTGAAGAAGAATCAAAATTAGATACCGATCTTGCAAAAACAGTTATGAAACTGTCACAATTTGAAAAGCAAATGTGTACTCAAATAATGGAGACTAAACAACAGGCCCGAAACTTAGTAAAGAATAGAATATTAGGTGAAGTCGAATTCGCTGATCAAAGAACACAACAATTTAACCAGTTTCTAGATAACGTTAAAGAAGAAATTAATTTGGGTGTAGCTGAAATAGATTTTGAAAAACGGAGGCAAGATATGTTACATCAAAGACTATATGCTGAGAAAATGAAAAGGAAAAGGCAACATTACTATGACATATGTTACGATACTATGCTGTCCATTGTAGATTATGCTACGAAGTACGCTTACTTTAAAAAACTATTAGGAGATGAAATACCGGATCATTTTATTCACGAGTGGAAAATTCTGTATTTCAAGAAACAACCTATCTTCGATTTGCTTAGTCCCATGGAAGAAATTTTAAAAGATGTAGAATTTGAAGCCGAGAAAACTCCTGATGAGGAAGAAATAATACGTTTAGAGTTAGATAGACAAGAGGCTTTAAATGAAGAGGAATTTATAGAATATCACAATTACTTACCACCATGGAAATTAGAATTGTTAATACCAAACTATGATAGTGAATGTGAAGATAGAAAATATGAATATTTGGGGTCAAGGATTCTAGGACATTTAATCTATACTTTACTCGAAATAAAGTACCCTTATCCACCACAAAGACCACCAGCTGAGTTACCAAATTTTTTTGCAAAAGCCATTTTACGGGGGCTCCCTGATAGATCTATAACAGTTGCTATGCAGATATTATTAAATACTCAAAAAATACACGTAGTTCGATTGGAATCGGCTATAAACTTCTGTTTAAGAAAGTTCAAAGTTGAGATGATTGGTTGCACTGACATAGAACTGTCATTTGATAAATTTATCGCCGCTGCTCAGGAGGAAGAGACAAGGGAATTGATAAAATTGGTGAAAGCTGAAGATGAAATCATATGCAAAAGTAGCGAAACCAACTCTTTATTATTGCTTGGACCTACACCAGCCAATACGAAGCAGACCCAAACACCAAAAACTATACCAGAAGAAGACATAACTTTGTCAACTGCAGCAGAACTTGGTAGATATGCGTACGAATCATTAAATTTTGGCGATACGCTTACAGATCACTTACTTTCAGCCATGATTGTTGAGTACATAAAAGATCAAGAGGATATTAATGGATTTGTTATAATAAATTATCCAAATTCTTATCGTGAAGCACAAATTTTAGAGGAAACTTTTTCTGGTCGTCCACCACCAGATGAAGGTGAATTGGACGACAGAGATGATATTTATCTGGAAGAAAGTATTAGTAAACATAGGAAAAAGGAGAAAGACCCCTTCAAAGATATTAGAATTTCAAGACTAGTTAATGATCCTCACAAAAAAAGACTTGACAACCCGTTTACTAGTTATTTCACATGTTATCTTAATTTGAAAGAAACAGAAAACATCTTACATGAAATATTTATCTGGAATTTGGCTGAAGATAATTCTGAGTTAATAGATAGATTTTATGCTGCTCTAGGTATCAATTACAGCATGTATTATGAGGTGATAGAGAAAGACCAGCTGGCTTTGATATGCAAATATATAATTGGAGATTTTACTATGCCTCTCAAGTCTTACGACAATTTGTTTGGAGACAATGTTTTAAGCGTTTTAGAATTTCCTACTTCAGAAGACAAAAGAACAAAATCAAAAGTTGTGAAACCAGAAACATCCAATGGCAAATCTAAAGAAAGATTACGACGAAGTTCTAAATTAAGTAAAATATCTTTTACTAATGATTTGGAAGTAGTGAAAGCCCCAGAGTCTTTAGAGGAGGTTCATGATGATATAACACGTATAGAAATAATCGAAGAAGAAGAATTTCACAGTAAAACATCGTCAGGTGTCTTTAGTGTGGAAGAAATCAAATTGTTAGCGGGAGAAGAAGATTGGGATTATGGTGAATTACCTATACCAGAAATAATAGGAGTGGCGTTAGCCACATGTTGGGAAGTCATAGAAAAGTCCTATATTAACGACATAAAGCAATTATTATTCGGAAAAAGATTGCAAATGAACTGTCTTGTTCCATATACAAGGTTCATAAAGGATAAAATGGAACAAATTATAACGCTACCATCAAAAAAGCAAGATTTAATCAGCGATTTCCAAAAAGAATATAATGATTTTGAAAACGATTGGCGCGATGTACATTCAACTAAAAATGAATGGCATTGCAGGATTAAAGATTTGCAACACCATCTATACCGAATATGTGACGAAAGAAAGTTACATGCTGAACAACAACGAAAATCACTTATTTGTGAAAATTGGGCTATGGAGGAATTGACCTCTATGGTCAATACCTACATATCTTGTATGCAAGCTGAATTAAACAGATCTATTTTAACGTATCAAGCGTTGCATGATTTCTACTTCACAATGATAAAACGTTCTCCACCAAATGACAGATTGTCATCTAAAGAATTAACAAAACTTTTAAAGGAAAGTGACGAAAGCTCTGGAAACAGAAAAGGTGGTGAAGATAAAGTGTTCAGACAACTCAAATCTGCATTCCTAGACTTGCAACTACGAAATATAGAAATTGATTATACTAACAACCCCTTTAATACTATTATTGACAATAATATAAAATTTGCTCTTAAAATAATTAAAGACACCAACGATAACTATAGGTCCATTATAAGTCGTGAGTATAGTGAAATAGCTAAAATTGTTCCAACTGCAAAGAAAAAGGATGAAAGCACATCTGAAGATTCAATACATACTGAAATGAATTTTAAAGACAACGCTTTGAAGTGCATTGAAGAATGGACTATGGGAATAAATGGGGAAATGTTCAGAGCGGATTTACGACTGCTCGCAATACAGTATATGTGCTATAAAGATATGAAATTGTTTAATGATCAAGTTTATAGAACCTTTACGGAAATTCAAGAATTTATAAACAGCTACTACTTAAATGAGATAAAGTCAGTGGATCGGCTATGCAAATATTTACAAATGGCAGTAGAAGACGGCAAAAAGGTTCCCGAAACGCTTATTCTAGAACAAGACACTTTTATAATAGATCCTAATCTAATTCAATTTTCTGAGCCAGAACCAGAAAGAGATACTGGAATTTTAAAGGAATTCGTGAATGAAACAGAGTTTAAAGTTGCGCAGCTAGCAAGATTACGAACTCAATTCAAAATTGTGGCTCCGAAGGGAATAATTTTATTACAAGCTTTTATATATTTGATACAGGACTTTATATTCTTCGGAAAAGAATCATGTGAAGGTCCCATTTTTCCGGATGCATGGAAACAGGTGAATCCAGAGCAAATTCCAAAATTAGTGTATACCATGTTTGGTGATACAGTTTATGTCGATTGGCGGGATTTTCTTATTTACTGTCTTAACATTAGATTTCCTACAATAGATGAATTATTGATACTGCGAAAAACCTTTCGCTGTATTGATCTTGAGTCAACAGAGACAATTTCTAGGCAAGATTTCGTATCTGAGAAACTTTGGTTCGAAGACGATTTTGATTTAGAAGATTCTCAAGCTGTATTGCGAATCAACTTGATCAAACATTTCTTATTTGAGCTTTATGAAATTGCTGAGGATGTTATGAATTATTCTGCATTCCTGCTAGCACTTTGTAAAAGTTCAGATCCTGTTGAGGGATTTGCTATGGCTTTATCTATGGCTGTTGGCAAAAAGGTTTGCTATTTGATGGAAGAATGCCATGAAGTTGTATGTAAGCTGATCAAGGAAAAGAAATATCGAGATGAATGCTACACCTGTGCTCTCAAATGTACTAACCAATTCTTGGATAAGGTTATTGCGAATGTGATTAGCACTTGTGAAGGTACCACTATTATAGAATTAGAGTATTCAGAACCACTACCGGAGACCGATAAGAAAGGAAGAAAAGGTAAAGGTCAAAAAGCTAAAAAGATTGAAAGCACTCTCAGTGCTAGGTTACCAAAACTGCAAAAAAGTGGCATCAGTCGTAGTAAGATCACGCAATCTGCAATTAACGTGAAAACTACTTTTATTTGTCCGCCGTGTGAAGAAGACACTGAAATTACTGAGGAGAAGCCTCCGGAGAAAGAAGAAGTGGAAGAAGAACAAAAATATGAACCGCAGGTAGATGTGAATTTAGCTTATACTATTAGTCAAGCTGCTTTGTGGAACGTGTTAAAAATTTGCCTGCCGTGGCATTTTGAGTTAATACCAGAAGTAAAAGTTACTCCATATCTTGAACAAGTGAATGCAGTGATAAAGGAATTAGAAGTTTACACAGATAATAAAGATATATATGTATGCAAGTTTGTAAAAGATCCAAACGTTTGTACCATTATACACAAATCCAAAAAATTTGAAGCTCTTAATCTAGCGCAAGAAATTAGGAAAATCATTATGTAA

Protein sequence:

>DPOGS204704-PA
MAEILKEWLSKRLQRPIKWEAEEFGDIMKNGYIISCVLRSYYVIDDEKHYLMRSSNLKEDITSNWKLLREWLRGIEINLSDTDLTNIMNGKGSALLRLFYQLFLHLDKRDRINFIKEERKKASALVEKLDHRFTVNKVDEERPSVVDDLSKPLLDQRQFIDWQRKKAKEVQEIYDFIRHKYSKMLSRIDESKTPFQNPPVKPKKKPNKDKKEMDKFEKRYPCEFHNYTYEELLNLEEKALEKKNSLIDTEWAKNYMDNLFTRMHNKSDSEEFQKQLRNVLSGSLWDLSVAEEESKLDTDLAKTVMKLSQFEKQMCTQIMETKQQARNLVKNRILGEVEFADQRTQQFNQFLDNVKEEINLGVAEIDFEKRRQDMLHQRLYAEKMKRKRQHYYDICYDTMLSIVDYATKYAYFKKLLGDEIPDHFIHEWKILYFKKQPIFDLLSPMEEILKDVEFEAEKTPDEEEIIRLELDRQEALNEEEFIEYHNYLPPWKLELLIPNYDSECEDRKYEYLGSRILGHLIYTLLEIKYPYPPQRPPAELPNFFAKAILRGLPDRSITVAMQILLNTQKIHVVRLESAINFCLRKFKVEMIGCTDIELSFDKFIAAAQEEETRELIKLVKAEDEIICKSSETNSLLLLGPTPANTKQTQTPKTIPEEDITLSTAAELGRYAYESLNFGDTLTDHLLSAMIVEYIKDQEDINGFVIINYPNSYREAQILEETFSGRPPPDEGELDDRDDIYLEESISKHRKKEKDPFKDIRISRLVNDPHKKRLDNPFTSYFTCYLNLKETENILHEIFIWNLAEDNSELIDRFYAALGINYSMYYEVIEKDQLALICKYIIGDFTMPLKSYDNLFGDNVLSVLEFPTSEDKRTKSKVVKPETSNGKSKERLRRSSKLSKISFTNDLEVVKAPESLEEVHDDITRIEIIEEEEFHSKTSSGVFSVEEIKLLAGEEDWDYGELPIPEIIGVALATCWEVIEKSYINDIKQLLFGKRLQMNCLVPYTRFIKDKMEQIITLPSKKQDLISDFQKEYNDFENDWRDVHSTKNEWHCRIKDLQHHLYRICDERKLHAEQQRKSLICENWAMEELTSMVNTYISCMQAELNRSILTYQALHDFYFTMIKRSPPNDRLSSKELTKLLKESDESSGNRKGGEDKVFRQLKSAFLDLQLRNIEIDYTNNPFNTIIDNNIKFALKIIKDTNDNYRSIISREYSEIAKIVPTAKKKDESTSEDSIHTEMNFKDNALKCIEEWTMGINGEMFRADLRLLAIQYMCYKDMKLFNDQVYRTFTEIQEFINSYYLNEIKSVDRLCKYLQMAVEDGKKVPETLILEQDTFIIDPNLIQFSEPEPERDTGILKEFVNETEFKVAQLARLRTQFKIVAPKGIILLQAFIYLIQDFIFFGKESCEGPIFPDAWKQVNPEQIPKLVYTMFGDTVYVDWRDFLIYCLNIRFPTIDELLILRKTFRCIDLESTETISRQDFVSEKLWFEDDFDLEDSQAVLRINLIKHFLFELYEIAEDVMNYSAFLLALCKSSDPVEGFAMALSMAVGKKVCYLMEECHEVVCKLIKEKKYRDECYTCALKCTNQFLDKVIANVISTCEGTTIIELEYSEPLPETDKKGRKGKGQKAKKIESTLSARLPKLQKSGISRSKITQSAINVKTTFICPPCEEDTEITEEKPPEKEEVEEEQKYEPQVDVNLAYTISQAALWNVLKICLPWHFELIPEVKVTPYLEQVNAVIKELEVYTDNKDIYVCKFVKDPNVCTIIHKSKKFEALNLAQEIRKIIM-