Monarch geneset OGS2.0

DPOGS208078
TranscriptDPOGS208078-TA2658 bp
ProteinDPOGS208078-PA885 aa
Genomic positionDPSCF300282 + 62980-66007
RNAseq coverage5x (Rank: top 88%)
Annotation
Heliconius% 
BombyxBGIBMGA007788-TA8e-11539.98% 
Drosophila% 
EBI UniRef50%
NCBI RefSeqXP_001181196.11e-0825.00%PREDICTED: hypothetical protein [Strongylocentrotus purpuratus]
NCBI nr blastpgi|3504239271e-0626.37%PREDICTED: hypothetical protein LOC100743163 [Bombus impatiens]
NCBI nr blastxgi|3320222348e-3624.47%Zinc finger protein DZIP1L [Acromyrmex echinatior]
Group
KEGG pathway 
Orthology groupMCL30521 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208078-TA
ATGCAATATAATAAATGTAATTACTGTGAAAAAGTTTTTATGAATCAACTCTATTTAAAAAGTCACATTTCTAGGCGACATGAACAAGTTTTAGAAATACCTCAAAAAGATATAACAGATAATAACCAAAGTACAGATAATATGAATGCAAAATTAAACAATGAAATTGCCGAGTTAAAAACGAAACTAAAACAAATGGAAGAATTAATGGAAAATATGCAAAAATGCAACGATGCAACGAAAGAAGAAGCCTCCTCCTCGACTTTAGCTGTGAAAATTAGTAATGTAAATGAAGAGAAAATATCAAACACCCAAACAAAAGAGATGAAAGATGTAGAAGTGATTGCCAATGAAGAAGGCTGTATTCTAGACAAAATAGAAGAGTGGAAAAAAGAGGAGCATAACAAATACAATGAAGAAATTAAACTTTTGCGTCAGCAGATAATTGACATTATAAGCAATAAAGACAAACAAGATCACATCTCTATAAAAAACGAATTAAAATTAATGGAGGAACTTCAAAACACAATCAAGCAACAAGGAGATGAAATATTGTCACTTAAAGATAAACTTCTAAACGAACAAAACAATGAAGTTGAAAAAAGAAAGGAAATTGAGACACAAATGGCATATTGGGTTAAAAGAGCTGAAATGCAATCAAATGAATACAAATCATTACTGCAAAAATTGAATGAAGTCGCCCATGAAGCCAGGGAATATAAAGTCAAAGCAGAAACGGAAAAAGAAAAGGCAGACAACTTACAAAAGTTGTTATTACAGCAACAATACGACACATCTCCAAAACGCAATCAACGTTCTCAAAACAAGGATTCTCACGATGAAAAAAACACAGAAGACAAAATAGAGAAAGATGAGTCTGAGAAGAGTATGAACATACAAATAACAAATCCCCATAAAAGTCCAACAGCTGACATACTGACGCTAAAAAAGCTTCAACAAAAAGCACAGGAACTTTTAAACATTGATCAATCCACTACTAGTGATAATTCAATCACAAGCGATCATAAGTCAAAATCTAAGCAGAGCAACAAAGAAAATAAAATTAGCAATAAGAAAAGACGTAAGCGTAAAGACCCTATTTTAATCAACTCTAAGAGAAACTTTAAAACTCACGAAAATGGTGGTAACTTACAGAAAAAAGATATAGTTGAAAAAAAAGTTAAAACTAAGTCTAAAACCATCGATCAGGGATCTAAAAAAATAAACGGTTTTGCCCATACTCCTGCAAGCCCCATAAAAGTAGTCAGGGCAAAAATAACCGAGGAGGTCAATAATCGTTTAATTACATTAGGAGTTGATCCACTTAAAAATCGAATTCCACAAAACGTTTTTAAGAAACAACGCAAGAACTTGCTGGAACAACAACAAGCTAAAACTAAGAAAATTCCATCCCGAGAAAGAATAATGCATTCTATAATAACTCATTTGGATGAAAATGCAACAAATGCAAGTTATGATAAAAGATTTCAAGAAGTTTCTCCTAACAAATCGAAGACTTTTAGCCTTTCCTCTGTTTTATCAAACGTGAAGACAAAAGCTTTATCTTTAGTCAAACAAAATGAGGCAAATAATAAATTTAACAAAACTCAAGATGATTTAGCCAAAACTGCAATAGCTTTACTAAAGACACCTCCAGAGTCCATTAATTCAAGCCCCGTTATACAGCGACGGGCCAACTTCAATTTGTCAAATAAAACTGTAGGTGAGATGAAAGAAAATCAAAAAAGCAGTTCTCGACAAAAATCTAAATTATTGCCCTTAACTAGAAACGCAAGACATGAAAATGAGCATGAAGAAAGCAGTACTGAAAGTACTGAGAATAGTGAATATAACGACCAACACGAATCAAACCTAGACAACACTTCAAAAATAATAAACAATCTTATTAAATCACCTGTACGTAAACCGACTGACAATGTAACTACCAGCCTTGATGTAAATCGCAAAAATGTTACGTCTCCTAAAAGTCAGGGGATTAATCTTGACAAAACATCCGTCATGGACACTTCTAAAATCAGCAGTGATGATATAGAGTCCATATCTTCTCCCAAAAAGAATTCATCTTTAGATAACATTTCTAATACAAAACAGACGAAGGGGGTTTTAAAAAGTGCATCTTCTATTTCATCACTCAATAAGAAGAAGGTATTGTTTGACATGGATGCGATACAGATGAAGCTAATGAGTGCATCACCATCACAGAGTATAACGGATAAAAGTGATAAAAACGATCAATACGTATTAGGGATAGAAAATCTGGATACAGAGGAATGGGATATTTCAAGCATAGAAAACGAACCTGCAAACACGACCGCAAAAATTCAAATCTCAACACGTACGAGTCCGAAGATAGCAGAGTTGAAACAGACTATTGAATCGCAATTAACGAAAAGGAATCCAAAACTATCTACAACTATCGTCGGCGGAGTGGATGTTCTAGCTACTCCCATACAGAAAGCCAGTTTCGGCGGCAGCAACACTAGTCTAGGCAGTTCAATTCTAGACGATGACAGTCTCCCTCTCCCAACACGTAACGCTTTTGTGAAACCGAAGAAAGTGACAGAAAAAGATGACAGTGAAATTGAAATTTCCGATTTAATTGAAAACAGTATGAGCAACAAGAAATATAATAAGTAA

Protein sequence:

>DPOGS208078-PA
MQYNKCNYCEKVFMNQLYLKSHISRRHEQVLEIPQKDITDNNQSTDNMNAKLNNEIAELKTKLKQMEELMENMQKCNDATKEEASSSTLAVKISNVNEEKISNTQTKEMKDVEVIANEEGCILDKIEEWKKEEHNKYNEEIKLLRQQIIDIISNKDKQDHISIKNELKLMEELQNTIKQQGDEILSLKDKLLNEQNNEVEKRKEIETQMAYWVKRAEMQSNEYKSLLQKLNEVAHEAREYKVKAETEKEKADNLQKLLLQQQYDTSPKRNQRSQNKDSHDEKNTEDKIEKDESEKSMNIQITNPHKSPTADILTLKKLQQKAQELLNIDQSTTSDNSITSDHKSKSKQSNKENKISNKKRRKRKDPILINSKRNFKTHENGGNLQKKDIVEKKVKTKSKTIDQGSKKINGFAHTPASPIKVVRAKITEEVNNRLITLGVDPLKNRIPQNVFKKQRKNLLEQQQAKTKKIPSRERIMHSIITHLDENATNASYDKRFQEVSPNKSKTFSLSSVLSNVKTKALSLVKQNEANNKFNKTQDDLAKTAIALLKTPPESINSSPVIQRRANFNLSNKTVGEMKENQKSSSRQKSKLLPLTRNARHENEHEESSTESTENSEYNDQHESNLDNTSKIINNLIKSPVRKPTDNVTTSLDVNRKNVTSPKSQGINLDKTSVMDTSKISSDDIESISSPKKNSSLDNISNTKQTKGVLKSASSISSLNKKKVLFDMDAIQMKLMSASPSQSITDKSDKNDQYVLGIENLDTEEWDISSIENEPANTTAKIQISTRTSPKIAELKQTIESQLTKRNPKLSTTIVGGVDVLATPIQKASFGGSNTSLGSSILDDDSLPLPTRNAFVKPKKVTEKDDSEIEISDLIENSMSNKKYNK-