Monarch geneset OGS2.0

DPOGS209646
TranscriptDPOGS209646-TA3096 bp
ProteinDPOGS209646-PA1031 aa
Genomic positionDPSCF300015 + 1145953-1154952
RNAseq coverage982x (Rank: top 13%)
Annotation
HeliconiusHMEL0170546e-7248.53% 
BombyxBGIBMGA006708-TA1e-6942.00% 
DrosophilaCG7971-PF4e-1134.25% 
EBI UniRef50UniRef50_E2B7G21e-1453.61%Putative uncharacterized protein n=1 Tax=Harpegnathos saltator RepID=E2B7G2_HARSA
NCBI RefSeqXP_002429393.16e-1546.85%hypothetical protein Phum_PHUM431970 [Pediculus humanus corporis]
NCBI nr blastpgi|3072126345e-1453.61%hypothetical protein EAI_02470 [Harpegnathos saltator]
NCBI nr blastxgi|1951353873e-8628.19%GI16794 [Drosophila mojavensis]
Group
KEGG pathway 
Orthology groupMCL34809 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209646-TA
ATGCAACGTGGTAGAGCGTCGCGTCGGGAACAAAACAATCGCGTTGTATTCGAGCGGCTATGGCGGGGCACATTCCAAGCCGTCCTCGCCGGTCCGCAGCGCCGGCCGCGAGCCTCCTCCCACGATCCCAATCACAATCTATCGATCGAACCCGTGCAGAGCACTTCTACCATAACCGACCCACTAGAGCCAGCTATACGGACTCTCCTCCAAGAACTAAGAAGTTGCTGTTGTAGATGTAATTGCAACAACTATAGAAACAACGTTAACAGCAATTTTCAGGTCTGTCTCCCAAATGACGAGGAGAGGAAAATGCGCTCCGTTATCAGTGTTCGAGAGACCCATGCGGTGGCTGAGGCGCAACAGGAGAAAAATGCCCGTCTTAGGGACGCCTTTGGCATATCGCCTAGGTTCGTGGAGGGGACGAGTCTGGACCCCGAAAGACGCGCGCGGGAGGAGGCTCTCAAATACCCCCTCGTGAGGACCCCCAGCCATGAGAGAGATGTTCACGTCACTAAGAAGAAGAAGAAGAGAAGCACGTCGCCTGAAGCTAAAAAATCTAAGAAGAAAAAGTCAAAGAAGAACAAAAAAGAGAAGCAAGAATCCAGTAAGAAGAAGAAGAAGCGTTCCAGGTCTCCGAACACTGACAGCTCCAGCAGCTCAGAGGACAGCGACTCCAGCTCTGAAGGCGAAAAGCCGAAGAAAAAGAAGAAAAAGTCAAGTAGTAGACGTCGTGCGAGCAGCGTACCTTCAAGCCGCGAAAGTTCGCCAGTGAAACAGAATAAAAACAAGCAGTGGGAACGTCCAAATGAGTTTCAAGAGAACGATGTACGTAATAGAAATCGCAACCGCGAGGAACGCAACAACAAATATGATGATCCCTCAAGAAATTCCCATCAAATTGATTCAAAAAGACGCGGAGATCGCAGTCGGTCTACAGAACGGAGAACGTCGCCCAACTATAGACATAGAGATGATCGCAGGTCCCAAGAATACAAACAAAAATCTTACGTCAATAAAATAGCTAATCGAAAGGAGCCCAAAGATGATTCAGCTAATTCTAAAAACTCCAAGTCTAAACGTCAACGAACACCAACTGAATCCGAATCTGAAGAAACGTCTAAGAAGACTGATCGAAGAGATAAAAGGCGGAGCAGAGATGTTAGCGTTTCACCACCGCCAAGCAAAAGACGTAGAAGCTCTTCGGCCGATAATAAAAGATATGACTCGTATTACAGTAAGAAGGAATTACCTAGAGAAGAATCTCGATACGAAAGTAAAAGAATGCGTAAGAGTGAATCGCCTGACTGTAGAAGCAAACGGCATGAAGCATCGCGTAACAACAATGACAGGAATAATAGATCGGACCATGAAAAGAATCATAGAGATGATGAATCTAGAAAAAGAAGGAAAGACAGAAGTTCTTCAAAAGAGAGACCAGAAACTTCTTATAAATATAAAGATCGATCACCAGTTAATAAAAACAAAAAGGATTCAAAGAAAGAACCTGAGAGATATTACAAAAATAGTCGCGTGGAAGACTCGGAATCAGAATCCGAAGCGGACACACGAAAACAAAAGGATATCAAAGGGTCTAACAGTAAATCTAAAGGTTCGAAAAGTAGAAACAAGTCACCACCTAGAGACAAAAGGAAAAGAAGCCCTTCAATTGAAAATAGAAAAGACACCCCAGACCGGAGGGAGCGTAAAAATTCTCCTGATAGAAGACAAGAAGATAAGAAGACAGACAAATATAAATCGAGTTCAAAACTAGAAGAGAACAAAAGAGATGAAAGTAGGAAGAGAAAATCTTACACACCGGAAAAGAACTCGGCACGGAAACAATCGAGAAGCAAATCTAAATCTGTATCACAGGAGCGGAAAAGAGCAAAGAAAGATAAACAGAGAAGCAGATCACCAACTTTGAAATCTAAGAGAAATCCAGATAGACGGAATTCTTCTGCGAGGTATGATAATAAAGATAAACATGATAAAGATAGAAATGACAAAGAAAAAAATGATCGCGATAATAACAAAGAAAGGAAAAAATCTCGATCGAAGGAACGAAGACGGAGCACTTCCAGTCTGAGTTATTCTCCCGCCAGACGCAGCCCAGAAAGGTATCGCGATATAATAGAAAAGTTACCAGAAAAGGACAAAAGGAAGTATATAAAATCACCCTCAGATTTGAAGCCAAAGGCAAAGAAGAGTAAAGAGATAGCTGATAAATATAAACCCAAAGCCGTCTGCATGATAAGTTCGTGTAGTGACAGTGATCAGTCTGAGGAAGATCTAGACGTACGGGCCAGGGCGAGGCAGGAGGAACTTGATATTAAAGAGTTGATTAGATTGAAAGAAAAACTAGCGCAGATGGCCAAAGCGTCCATAGAAAGGATGAGGACTGAAGAACAGGTTCAATCCACCTCCAATACTAATAACAAACAGAGTCCCATAGTAGTGGAATCTTCTAAGGTAGATGTCACTCCCAAGAAAATTGAAACTCCCAAGAAGGACAGTCCAGTGAATGTAGAAAGCTCTCCTGAGGCGAAACGCGAAAAATCACGGGGCTGGTCGCGCTCGTCGTCACGTCGCTCGTCGCGGTCTAGGTCCAGATCTAGATCCAGGTCGTCACGTTCCAAGTCACGCTCGTATTCATCTTCAAGGTTCGCTGAATTTTTTTCACTCTTACATAAGCATGACAGTTATTATATCGCCACAGCGTATTTTCTAGACGTTCTCGGTCGAGTCGATCCAGTTCGTACAGCAGCAGGAGTTCCTCGCGATCCTCTCGCAGCTCGTCACGATCCACGCGATCATCAAGGTAGCAAGGTAATGTATGTAGCGGGTGAAATACGCAGAACTATGACGGGAGTGTACGGCGATTACCGCAGATCGAGCGGTTCTCGGGCTTCCCGCTCGCCCTCCATCCCGAGGCGCGCCGGATCGCCCAGCTTCCTTGATAGGAGACGCATCACCAGTGCACGAAAACGGCCAATTCCATACCGTCGGCCGACTCCATCATCACCATCGAGCGGAAGCTATTACAGCAGTAGATCGTCTCATAGGAGCTGGACAAAATCACCCCCACCTGAATAA

Protein sequence:

>DPOGS209646-PA
MQRGRASRREQNNRVVFERLWRGTFQAVLAGPQRRPRASSHDPNHNLSIEPVQSTSTITDPLEPAIRTLLQELRSCCCRCNCNNYRNNVNSNFQVCLPNDEERKMRSVISVRETHAVAEAQQEKNARLRDAFGISPRFVEGTSLDPERRAREEALKYPLVRTPSHERDVHVTKKKKKRSTSPEAKKSKKKKSKKNKKEKQESSKKKKKRSRSPNTDSSSSSEDSDSSSEGEKPKKKKKKSSSRRRASSVPSSRESSPVKQNKNKQWERPNEFQENDVRNRNRNREERNNKYDDPSRNSHQIDSKRRGDRSRSTERRTSPNYRHRDDRRSQEYKQKSYVNKIANRKEPKDDSANSKNSKSKRQRTPTESESEETSKKTDRRDKRRSRDVSVSPPPSKRRRSSSADNKRYDSYYSKKELPREESRYESKRMRKSESPDCRSKRHEASRNNNDRNNRSDHEKNHRDDESRKRRKDRSSSKERPETSYKYKDRSPVNKNKKDSKKEPERYYKNSRVEDSESESEADTRKQKDIKGSNSKSKGSKSRNKSPPRDKRKRSPSIENRKDTPDRRERKNSPDRRQEDKKTDKYKSSSKLEENKRDESRKRKSYTPEKNSARKQSRSKSKSVSQERKRAKKDKQRSRSPTLKSKRNPDRRNSSARYDNKDKHDKDRNDKEKNDRDNNKERKKSRSKERRRSTSSLSYSPARRSPERYRDIIEKLPEKDKRKYIKSPSDLKPKAKKSKEIADKYKPKAVCMISSCSDSDQSEEDLDVRARARQEELDIKELIRLKEKLAQMAKASIERMRTEEQVQSTSNTNNKQSPIVVESSKVDVTPKKIETPKKDSPVNVESSPEAKREKSRGWSRSSSRRSSRSRSRSRSRSSRSKSRSYSSSRFAEFFSLLHKHDSYYIATAYFLDVLGRVDPVRTAAGVPRDPLAARHDPRDHQGSKVMYVAGEIRRTMTGVYGDYRRSSGSRASRSPSIPRRAGSPSFLDRRRITSARKRPIPYRRPTPSSPSSGSYYSSRSSHRSWTKSPPPE-