Monarch geneset OGS2.0

DPOGS208041
Transcript: DPOGS208041-TA, 5814 bp
Protein: DPOGS208041-PA, 1937 aa
Genomic position: DPSCF300203 + 85394-92657
RNAseq coverage: 376x (Rank: top 32%)
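The lengths in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration, not part of the database record: it assumes the transcript as listed is coding sequence only (the nucleotide sequence below starts with ATG and ends with the stop codon TAA), and it assumes the genomic coordinates are 1-based and inclusive. The function names are hypothetical helpers introduced for this example.

```python
# Values copied from the record; everything else is an illustrative assumption.
transcript_bp = 5814          # length of DPOGS208041-TA
protein_aa = 1937             # length of DPOGS208041-PA
genomic_start, genomic_end = 85394, 92657  # on scaffold DPSCF300203, + strand


def cds_length_bp(protein_len: int) -> int:
    """Coding length in bp: one codon per residue plus one stop codon."""
    return 3 * (protein_len + 1)


def genomic_span_bp(start: int, end: int) -> int:
    """Span of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


expected_cds_bp = cds_length_bp(protein_aa)
span_bp = genomic_span_bp(genomic_start, genomic_end)
non_transcript_bp = span_bp - transcript_bp

print(expected_cds_bp == transcript_bp)  # transcript length matches protein + stop
print(span_bp, non_transcript_bp)
```

Under these assumptions the 1937 residues plus a stop codon account for all 5814 bp of the transcript, and the genomic span is longer than the transcript, which would be consistent with intronic sequence in the gene model.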
Annotation
Heliconius    HMEL004046    0.0    65.21%
Bombyx    BGIBMGA001470-TA    0.0    57.09%
Drosophila    CG14200-PA    4e-16    39.05%
EBI UniRef50    UniRef50_UPI0002063DA8    6e-59    38.95%    UPI0002063DA8 related cluster n=1 Tax=unknown RepID=UPI0002063DA8
NCBI RefSeq    XP_001866907.1    6e-49    42.34%    conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastp    gi|328784181    2e-58    38.95%    PREDICTED: hypothetical protein LOC551566 [Apis mellifera]
NCBI nr blastx    gi|328784181    2e-91    24.07%    PREDICTED: hypothetical protein LOC551566 [Apis mellifera]
Group
KEGG pathway 
Orthology group: MCL26674, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208041-TA
ATGTGTGAACAGAGTGAAGTTAATTATTTGGATGGAGATATAGTATGGGTAAAGCTAGGATCTTGCTGGTGGCCAGGTGAGGTGGTGGGTACAGAGAAACTACCACCGGACCTCCTACCATCATTTCGAAAGCCACCTATTGCTGTTGTCAAGTTTTTTCAAGAAGATTCTTATGAATATGTGAAAAACCTTAATTCAATATTCAAATACAACTGCAGTAGAAAAAATGAATTCATTAATAAAGGGCTATATATGTATAGAACAAAACATGGTCTCATGGAGAAATTCCCTGAGGATGTTATCAGGGCGGAAACTGCTGTGGGTGGTGATATAACGATATTAACTAGAGATGAATTCCAAGAAACAAAGAAAGAAAGCTATGCGGGGCTTTTCGGTGATCCCGCAAAGAAAAATACCCCCATCCACAAGAAAGGTAAAGGTGGAAGAGGACGGCCGCCTGACACAGCTAAGTTATCAACACCAATGAAAAAGTTCAAGGAAAAGACAAACTATAAAGTACATATATTATTACAAGGATCAAAAACTCCCAGCCAAAATAATGAGACATTATCAACACCCTCTACTAGCAGGGCCGGATCGGAACCCACAGAGATGTCACCTGAAAAATCCATTGAAAAGAGTGAAAATCAAAGTGAATCTGAGGAAGCTGATAAAACACCTCAACCAGCCTCATTTAGTACTCCAACAATGTCTAGTTCGGGACTGTATGTATGTCATGCGTGTCAGTTTTCAACACAACGCCTCAATGTACTGATTCTACATAACAAAACACATAGTGTATCATTTACACCATACTCACCATCACCAGTCAAAAAGAAACCGCTCAGTAAAATTAAATCAACGCCCGTTACGTCTAAAACTCCTAAAGAACGCAAGCGTAGAACGGAGAAGTCCGAAAAGAAAGAAAGAAAGACGCAGAGCAATAAGCGTTCAGCTGAAACAGAGACTATATCGGAAGTGAAGAAACCTAAAACGGATGAAGAAATTAAAAGCAGTCTTTTAGCAGACTGGGACGATGGTGATGAAGAGTCAGCTGATGAGTCCTCGACTATAGTAACGGCCGGATCCCCGGAAGTGCCTATGGCGGCAGAATCGCCAGCAGTGCCAACGCCAGCAGAACTGCCAAACAATCAGGATACTGTCGAGGATAAAAAAGATTCACCAACTCAAAAGCCGGAGTCATCCAGCGATTCCAAATACGAATTCTGTGAAGACGAAGATTGGCCGATTGAAACAGATATTGGAAGGAAAATACCTCGCGTCAAGAGCTCGTCGAAACGTAACGGCGATAAGAAAAGTGTGAGCATCGATGAAGAAGAAATGGCCAGGGAGGTGGCTGAGCTGCTCAACAAAACTGCGCTTCCGGAGCTGCCGTCGGCACCGGAACCATTGAAAGTCGAGGAAAATTTTCCGGAGGGCTCGATAGCGAAATCTCCAGATAAAAAGACCGATAAATCACAAGATCAAAGTTCCCCGGAAAAAAAAATACAAAATGACAACCAACCACCTAAAACAATATTCAAAACGAAAACTTTTTTTCGAAGTCGACACTCTAGAAGCCAGGATGCAATAGGAAAATATGTTGCGGAACAATTAAACGCAGCAGAAAGAATGGATCTGTCTGAGAGTGAATTAAACGGGTCTGAAGTGGCTTCGTCCCCAGAAATAAGGGAATCGCCTCAACAGATAAAGGTTGCACGTCTAGCGCCAAAGATACAATTCAAGAAAAGCAAAGCGGAAGCTGCACAACAAAAGGAAATGGAAAAACATAAGACGGAAAAGATTGACGAAGAATCTACGAATGCTATTAAGGATGTAGATATTCATGAAGATATGACTCATATAGAAGACAAAGATAAAGATGATAACTTAATGAGTGATATAAGCATATCAACCGACGAAAAATTGTATAAAAATAAACAAAAGCATAACTTAAAAGACAGCACAAATGATGTTTTAAATGATGAAGAAAATTTTGA
TTCCCCTAATCATTCTGATTCACCGTCAAAAAATGTGGTTCCGAAGAAAGAGGATAGTTCAAAAATATTAAGTTTTTCAGAAAAATCATTTGAGCCCTATATGAATGAGTCGACAGCATCGGCCGTTGATGCGTTACTGAGCGTGTCGAGAGAAGCTGATCGTGTTACTAAAGTTATAAGTGACGATCCTCCCGAGGATTTGTTTGAAGACGACGTAAAAGACAGCATATCCGTTAACATTAATGGTTTCAGCGACAGTGACCACAACAATATAACACAAAACGAAGATAACATCGAAAAAACAAATGATTCGACTGAAAAATTGGTGGATAACGAAGTATCTGATGCAAAAGTGTGTGATGAACAAGAGGAAAAACTTTCACAAAATAGTATAAATGATAATATTTCTGTGAAAGCTCATAAAGACATGGAAGACATAGCAGAAGTTGTGGATACGACTAAACTTGATGTAGTTAATTCTAAATTGATCTCAATTGAACCGGACAATAATTTTCCCGTAGAAAGTATCCCTTCTGAATCTGATTTGCAAGTAGCAGAAGCTCTCATTAATTTACCTACGACGACATTAAATAACAAACTGCCCGACGGCCACACAAACGAAACAAGTGTCAAAGAAGATAAAGAAACTACTAATATCACTAATGACGTACCTTTACAAAGCAGTCAAAATTTCTCCTTAGAAGAAGAACCAGACATCACACCAATCCAAACAAAAATTATGGTTAGTCCAAACAAAGAAATTAATTCGCGATATGAAACAGAACAAGAGAAAAGTGAAAATTTAAACGCAGCGAAGTCTTTAGTACAAATGTCGGAATCCATAGACCATAAAATTAAAATGTCTGAAAATAAGTCACCAAAGGAAAAACTTAGTATTTCACAACGAAAAGATGACTTATGTAATGAAAGTTCCATTGAAGTAACCCAAAAGCTAAGTGTGTCTACCAATGACTTCAGCAATGAATCTAAGACACATTGCGTCTCACCAGATTTACATTTACATTCACCGAAATTGTTGAAAATTCTGGAAGAACCGGGTTTGCCTAAGATAGCTGCTAGGAGAACTGTTACAAAACAAATTATTGTACCTCGCAAAGAGAAAATTTTAAATGTTGAAGCTGGAAAATCGCCTTTAAAGCCAAAAACGCAATCACCCAAACAAAAAATCATTATTCGTAGGACAACACCAAGTAAAAACTTGCTAAACAATATTGGTGAAATAACAACACCTGATAAAATAATTTTATCCCGAACAAATAAATCGTCACAAGATGGTTCGTCTGTACAAACCTATACAATTCAAACCTCTCCCGACATATCGCCCACAAGCGATCCTAATACTATCATAATTCAGCCAAAACTTCGTCAAGTTGTGAAACCAGTGTCTAAATTACAAAAAATTAAATCTCAACCCCAAACAATAATTGCCCCTAGTAAAGAGAGTAAAATTACACAAAACACGAAAAGTAAGGATGACTCTGTATTTGATATTAATTCTATGCCAATCGTGTTGACACCAGAGAGTATTGAGAAAATGCCCATCGTTATGTCCGATGGAAATATCATTACGAACTCTAGTAATCCACCAAAACTAGTAAAGACCAAACAAACTATAGCAGACAGTGGAAAGATGTCGCCTGGGCCTATCAAAGAAATAAAACCCATGATAATGAGTAACGAAGTAAGTAAAGCTACTACACCTAATATTCTCTCTAAATCACAAAAATTACGAGGAACAAAACCAATGCTTGTGATAGATAAGACTACAGGCAAACAAAAAATTATAATGACGAAGACAGAACAATCGAAAGAAGTTAAACAGCAAGCGACATTAATACAATCAGCACCTCAGAATTCGCAGAAAGCGGAAAAGTTCATAATTTTACCATCACAGAATTCTCCTCGCTCTGGAAGGACGCAAAAAATTGTTATCGATCCTCAAACCGGTAAGGCGCATGTTCTTGTAGGAAAATCAGAAT
CGCAATTAAGTACAGCTGAAAGTAATAAACCGGTTTCAGCGAAACTGATACCATCGCCATCAGATTCTAACACCCCCGGTAATACCGTTATGATAATTACGAACGCCCAAGGAGGACAATCTAGAATAGTCCTGACACCCGAACATGAAAAAATATTGTTCCCAAACAAGCAACAGCCAGCGATGTCTCAACTTAAGCCTGTAACGCATCGTATCACATCAGGTTCGGGAACTGTACAAAAAACTATAGTTTCCACAGCTACTGGGTCTACGAAAACTCAAACGCGAATCGTCCCTAAACAAAAGAGTGCTATAATTACGTCCAAGGGTCAACTAATAGTGGGTGGCCGTGTGGCGACTACTACTCAAAACATTGCACCATTACCTGAAATCAGACCAGCTCCAAAACGAATATTGGCCTCGGAGCCCAAGAGATTAGTCCAAACAATACAAAAAAATTCCTCGGAACCATTAATATTCTTACGCCAGAATTCCAGCGCTGTGATGCAACTTACTGTAGCCCAATTTGAACATCTCCAAAGAACCGGTCAAATAATACAGAAAGCCCCCACGCCTGTTCAAGAGAATAAAATAGTTGTCCAGAAGTCAATTACCATTTCACCAAAAGAACCAGTGTCCTCGATACAAAAGCAAAGAGTCAGAAAACAGACAAACGAGTCTCCGGCCCCTATGAAAAAAATAAAACATGAAATAGCGATAGCACCTGCACCCGCGCCTGTGACTATGCCAGCTCTAACACCAATCGCACCACCACAAGTACCGAACGTTTCGAGTACTACTACCAATATGTCAACCTCTAACTACTCAGATTTAGAAAACCTAGAAGAACTTCTGCCATCAACGGCAATAGTAAGACATTCGGAACCGACACTAATACAGCCTCAGTCTGAACTCAATCAGCCTCCACCTGCCGCTCTCTCTGATGGACAACTGCTGGCAGTGCCTGGAGAACACTTCGGAGGTCCGACGGGCTCATTTTACCTATGTGTAGAAGATAATGGCAATTTCACCGCGATAGACAATCGTCCTTTAATACTTGAAAACAATCAGTTAGTGCCGATGCCCGATCCTTTGCCGGTGCCAGTCGCTCACCCCGAACGTAGGGACATTTTAGAGGCCGCGCTGGCTAATAGTGATGTTTTTCACGGTGAAACAACGCGTGACGAGGCTCCAGATTTTAGAGATTTAAATGCGAACGTTTCGGTTCACTGTCGAGTCTCAGAAACTAGCACAACACTCAACCAGCCCATCATGACGCCGGTCGAAGTGCCTTCAAAAGTCGACAGTGAACCAACAACAGTCCCGTCTAACTTGGAGGATGGATTGGCCGTGATAGGTGTCACACCACACACCGTGCCGACCTCCCTCGAGCTGCCGATAACTGTAACAGATCCAAGGATAGCACCTAAAACAACCGATCCGCTTAGCAACAATAATTACGGAACATCCTTACTACCTTCTCCGAACACCGAATTGACGTTTTCCACAACAGAAGACGCTGATATATCCATGGTCGGTCCAATATCGATGCCAATTCTCACAGATGATGATAACGTTGGGGGGAAGTCCATGCCGATTCTGACGGATGAGGTTACGGAACGAACAGTATCCTCAGTGGACTCTACGATTGGATCTCCTTCATCCATAGATGTAAGGGAATCTGAGAACGAGGACAGCAGTCAGTGGCCGCGACGACTCCTCACTCCGTGTTCAGACACGTCAGAAACGTCATCAGAAATACCCTTACAACCCGTCATGCAACTATCAGTTAATGATCTGTCCCACGACGGCTAA

Protein sequence:

>DPOGS208041-PA
MCEQSEVNYLDGDIVWVKLGSCWWPGEVVGTEKLPPDLLPSFRKPPIAVVKFFQEDSYEYVKNLNSIFKYNCSRKNEFINKGLYMYRTKHGLMEKFPEDVIRAETAVGGDITILTRDEFQETKKESYAGLFGDPAKKNTPIHKKGKGGRGRPPDTAKLSTPMKKFKEKTNYKVHILLQGSKTPSQNNETLSTPSTSRAGSEPTEMSPEKSIEKSENQSESEEADKTPQPASFSTPTMSSSGLYVCHACQFSTQRLNVLILHNKTHSVSFTPYSPSPVKKKPLSKIKSTPVTSKTPKERKRRTEKSEKKERKTQSNKRSAETETISEVKKPKTDEEIKSSLLADWDDGDEESADESSTIVTAGSPEVPMAAESPAVPTPAELPNNQDTVEDKKDSPTQKPESSSDSKYEFCEDEDWPIETDIGRKIPRVKSSSKRNGDKKSVSIDEEEMAREVAELLNKTALPELPSAPEPLKVEENFPEGSIAKSPDKKTDKSQDQSSPEKKIQNDNQPPKTIFKTKTFFRSRHSRSQDAIGKYVAEQLNAAERMDLSESELNGSEVASSPEIRESPQQIKVARLAPKIQFKKSKAEAAQQKEMEKHKTEKIDEESTNAIKDVDIHEDMTHIEDKDKDDNLMSDISISTDEKLYKNKQKHNLKDSTNDVLNDEENFDSPNHSDSPSKNVVPKKEDSSKILSFSEKSFEPYMNESTASAVDALLSVSREADRVTKVISDDPPEDLFEDDVKDSISVNINGFSDSDHNNITQNEDNIEKTNDSTEKLVDNEVSDAKVCDEQEEKLSQNSINDNISVKAHKDMEDIAEVVDTTKLDVVNSKLISIEPDNNFPVESIPSESDLQVAEALINLPTTTLNNKLPDGHTNETSVKEDKETTNITNDVPLQSSQNFSLEEEPDITPIQTKIMVSPNKEINSRYETEQEKSENLNAAKSLVQMSESIDHKIKMSENKSPKEKLSISQRKDDLCNESSIEVTQKLSVSTNDFSNESKTHCVSPDLHLHSPKLLKILEEPGLPKIAARRTVTKQIIVPRKEKILNVEAGKSPLKPKTQSPKQKIIIRRTTPSKNLLNNIGEITTPDKIILSRTNKSSQDGSSVQTYTIQTSPDISPTSDPNTIIIQPKLRQVVKPVSKLQKIKSQPQTIIAPSKESKITQNTKSKDDSVFDINSMPIVLTPESIEKMPIVMSDGNIITNSSNPPKLVKTKQTIADSGKMSPGPIKEIKPMIMSNEVSKATTPNILSKSQKLRGTKPMLVIDKTTGKQKIIMTKTEQSKEVKQQATLIQSAPQNSQKAEKFIILPSQNSPRSGRTQKIVIDPQTGKAHVLVGKSESQLSTAESNKPVSAKLIPSPSDSNTPGNTVMIITNAQGGQSRIVLTPEHEKILFPNKQQPAMSQLKPVTHRITSGSGTVQKTIVSTATGSTKTQTRIVPKQKSAIITSKGQLIVGGRVATTTQNIAPLPEIRPAPKRILASEPKRLVQTIQKNSSEPLIFLRQNSSAVMQLTVAQFEHLQRTGQIIQKAPTPVQENKIVVQKSITISPKEPVSSIQKQRVRKQTNESPAPMKKIKHEIAIAPAPAPVTMPALTPIAPPQVPNVSSTTTNMSTSNYSDLENLEELLPSTAIVRHSEPTLIQPQSELNQPPPAALSDGQLLAVPGEHFGGPTGSFYLCVEDNGNFTAIDNRPLILENNQLVPMPDPLPVPVAHPERRDILEAALANSDVFHGETTRDEAPDFRDLNANVSVHCRVSETSTTLNQPIMTPVEVPSKVDSEPTTVPSNLEDGLAVIGVTPHTVPTSLELPITVTDPRIAPKTTDPLSNNNYGTSLLPSPNTELTFSTTEDADISMVGPISMPILTDDDNVGGKSMPILTDEVTERTVSSVDSTIGSPSSIDVRESENEDSSQWPRRLLTPCSDTSETSSEIPLQPVMQLSVNDLSHDG-