Monarch geneset OGS2.0

DPOGS211813
Transcript: DPOGS211813-TA, 3312 bp
Protein: DPOGS211813-PA, 1103 aa
Genomic position: DPSCF300031 - 312544-321753
RNAseq coverage: 51x (Rank: top 70%)
Annotation
Heliconius: HMEL004276, 0.0, 56.75%
Bombyx: BGIBMGA008152-TA, 2e-164, 52.62%
Drosophila: CG30263-PB, 6e-48, 61.74%
EBI UniRef50: UniRef50_D6WHW5, 3e-51, 40.05%, Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WHW5_TRICA
NCBI RefSeq: XP_969178.1, 5e-52, 40.05%, PREDICTED: similar to Y97E10AL.1 [Tribolium castaneum]
NCBI nr blastp: gi|91079198, 1e-50, 40.05%, PREDICTED: similar to Y97E10AL.1 [Tribolium castaneum]
NCBI nr blastx: gi|221330511, 5e-56, 25.40%, CG30263 [Drosophila melanogaster]
Group
KEGG pathway:
Orthology group: MCL25434, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211813-TA
ATGGCCCAAATGGCTACACTACCCCCACCGGGGCCGGTTCATAGACCGCCGAGAGATTTTCTTAAAGAAAACATGGAAGAAATTAGAGAGCTCAGTGAATTGAACAGAGAAAAGAATGAAGCCGAAGCCGAGAAGAAGAAAGTAGAAGAAGAAATAGCTCTGTTGAAGGAAATGGGTCTTATAGACAAGAAAGGTGAAATAAAAAGCGTAGTAAATTCCAGAGCAAATTCTCGAAGCAGTTCGCCAAATAAGTTCATTTTACGATCAAGATCCAACTCACCTTCAGCTATTCTACTCGAAGCTAGGAGTTCCGAGTATCTTAACAAAGATGGAGGGAAACCTGTATCTCATCGTTCTAGAGTTCGATCGATATCTAAATCTCAACCTCAATCTGGTAATGGATCTCCAAAACATTCTAAAATACCGACGAGACAAAATTCAGTTTCTCCAACCAGGTCGAACTCAAGACAATCTATGGATAAAAGCCTTTCAAAAAATCAAAGGCACATATCAAACAGTACCAGTAGTATTCACGAGTCCATAAGGATAGGAAGTACGGTTCATGACAAAAGGGCCAGTCAAAGTACTGATCTTTTGAATATCGGACCGACTAATATTAAAAGTAAGCCACCGATTTCTCCTGGCAGAATTGGACCACCTCCCTCCAATAAATCAATAAACCCAAAAAGATTATCACCAATAGTTGGAACACCAAGTAAGAGTCCTCTGGAAGATTCTAAGCCTTCGTCAGCAAAAATATCGGTTAACTCAACAGCTACCAAAGGCACTAAACCTGTTAAAGCCTCTGGCAGTACTCCGGCTGCTTCTAGATTAAATTCTAGACAGGCAAGTAAAGCTACCAGCCGTGACACCAGTCCAGACAAACGTAAAACGAGTACTGGAGTATCTAAAACGAACAGTATTACGAAAACTGCAACAAAGCCTCCTAATACACGTTCCAGTCAGAAACCTCCTGTTGCCAATAAATCTGAGCCTAAAAAACCTATAAGTAGAACGAACAGCGTAAAAAACTTAAGTAGAGCTCCAAGCACTAAAGCATTAAATGAGAAGCCTCCTTTAAGTAGACAATCGAGTAAAAAGGATATAACTGAGAAAGCAAAACCGGTAAGTAAAAAGAACGAGATGACGTCCAAAGTCGATGTTACTGAAAAAACTGTGATGAAAACAGATGAAAGTAAAAATGAAGTCATTGAAAATAAAGACAAGAAAGACGATGACAATAAAGTGGATGCTAATGTTGGAGACTCGATCGAAAAACATGACAATGAAACTCAATACGATAAGAAGCCAGCCGACACGGAAGAACTTGTTATAATGACTAAGAAAAATGTTGTCTCTATGACCACTGCGGCAATTACCTCTCAACCATTAGAAGTTGTAGCGACAGTGACGAACCAGTTACCAGCAGCTCTGGAGAAAGCCAGGGAAAAGGCGGAGATTGAAAGGATCAGTTCTAAAGATTCATTACTAGCAGTGGAAGATGAAAAAGTGACAAAAACTCCAGATGAAACCAAAGTAGAAAAAGAAGACAAGAGAACATCAAAATCAAAGTCGGAAAAAGCTTTTGATGAAAGTGTTAAACTTAGACCGTTACAACCGCCCTACAACAACCCTCAAGTAGAGAGAGTGAAACAAAAAATAGACTCCATATTAAAAGAGCCCGAAATATCGACAGAAAATATTTTAGCAGCTGCAAAACAGAAGGAAACACCGAAAGCCACTGCCGATAAAACTAGAGACTCCATTAAATCATTAAAAAGTGATATAATAGAAAAGAAAGAAAAAATAAAGAAGCAAAGTGAAGAAATTACAGAATTCAAAAAAGAAGCGAGCAAAATAGTTGACAGCATCATAACACCTGTTGAAGAACCGAAAGAATTAATGGAAAAAACAAAAGATGAGATTAAAAAAGACATAGAACCGATAGTGAAGGTGGTGAGTGAAAGGAAGAAGGAGGTTAAGGCAGACGTGGAGAAAAT
GACGGAGACCTTAGTACAGGGGGGGCCCGAGCTGGAGGTCCTCAGCTCCAACGTGTCCACACCTGGAGCGGCTGGGAAGAGGAAGGTGGCTGACGGCGCATCAGACAAGTCTCACAGCAATGGAGGGGTCGGCGGTGAATCAGCTCTTATGAGGTTCCCTCAAAGTACGAGCACCACACCAAAACCACCGCCACGAGCACATCGAGAAGCGAAAGATAAATCACCACCAGCTCAGGAGACACAGGCGCCCCAGGACGGAACCACCAAGACGAACATTTGTACCAGGTTTATTGGAAAATGTAGAACTGCGTGCTCCTGTTGCACCAAAACACAATTAGATGGGATAGAAGAACAGAGAGACATAGACGAGGAGCAACCAGCGAAAAAACATTTTCTCCAGAGATTGAACTGCTTCAAGAAGAAGATTCCTGAAGAAGATATAGAAGCTGCAGCTGGGAAAGGAACTACGATAGAGTTTGAAAGCGAAACGAAAAGAAAACGTAAAATACGAGACGTCTTATGTGGGTGTTGTCGTCGTGACCGTGTCGCCGACGTGTCGGAGCCGCGTGCCGTGGACGTGTCGCCACCCGTCACGACATCGGACGTGGTCCAGGAGGGAGGATGCTGCGGGAAGAGGAGGGAGATCGAGAGGAGGGACAGTATCCTCAGCGAGCAGCCGACATCCAGCTGCTGCAGCGCTTTCAACCGCTGGATCGTTGGTGCTTGTCGTCGCTCGTCCGAGGGATCCTCTAGCCGTCGCACCAGTCTGTTCTCCAAGAACAAGAGTCTGTCACCAACACTACCGCCTGAGGACATATCTTCATCACTTCCATCACCGAAACCAAAAATAGTTCTCACTGATGACCAAAAGAGTGACATTACATTAGTCAGTGAAGATGATGAGGATACTCGCAAGAAGTTGGATTCATCTCTGATCGAACACACGAGTGCGATGCGCGGCGCCATACCCGTACTGGCCTTACCGCTGGCCGTCTTCTGTTTGATTTGTAACATCCTGATACCGGGACTTGGTACTATATTCAGCGGCTTGTTTTGTCTATGTTTTGGGATTCCCCGTTTCGGTGTTTACGACGGAGCAAAACATAGAATAGGATCCCTGGTGATCAACCTGCTGGTGGGTTGCAGCCAACTGTTCACTGTTCTGTTCTGTCTGGTGGGATGGGGCTGGGCTATCTGGTGGGGAGTCATCATGGTGCAGGTTTCTCGTAAATACAAAAAATTGAAAGCAGACGCCGCTGCAGCGGAAGCGGAAGCTCCTCCTGTCACCAACAACAACCACACAAGACCCTGA

Protein sequence:

>DPOGS211813-PA
MAQMATLPPPGPVHRPPRDFLKENMEEIRELSELNREKNEAEAEKKKVEEEIALLKEMGLIDKKGEIKSVVNSRANSRSSSPNKFILRSRSNSPSAILLEARSSEYLNKDGGKPVSHRSRVRSISKSQPQSGNGSPKHSKIPTRQNSVSPTRSNSRQSMDKSLSKNQRHISNSTSSIHESIRIGSTVHDKRASQSTDLLNIGPTNIKSKPPISPGRIGPPPSNKSINPKRLSPIVGTPSKSPLEDSKPSSAKISVNSTATKGTKPVKASGSTPAASRLNSRQASKATSRDTSPDKRKTSTGVSKTNSITKTATKPPNTRSSQKPPVANKSEPKKPISRTNSVKNLSRAPSTKALNEKPPLSRQSSKKDITEKAKPVSKKNEMTSKVDVTEKTVMKTDESKNEVIENKDKKDDDNKVDANVGDSIEKHDNETQYDKKPADTEELVIMTKKNVVSMTTAAITSQPLEVVATVTNQLPAALEKAREKAEIERISSKDSLLAVEDEKVTKTPDETKVEKEDKRTSKSKSEKAFDESVKLRPLQPPYNNPQVERVKQKIDSILKEPEISTENILAAAKQKETPKATADKTRDSIKSLKSDIIEKKEKIKKQSEEITEFKKEASKIVDSIITPVEEPKELMEKTKDEIKKDIEPIVKVVSERKKEVKADVEKMTETLVQGGPELEVLSSNVSTPGAAGKRKVADGASDKSHSNGGVGGESALMRFPQSTSTTPKPPPRAHREAKDKSPPAQETQAPQDGTTKTNICTRFIGKCRTACSCCTKTQLDGIEEQRDIDEEQPAKKHFLQRLNCFKKKIPEEDIEAAAGKGTTIEFESETKRKRKIRDVLCGCCRRDRVADVSEPRAVDVSPPVTTSDVVQEGGCCGKRREIERRDSILSEQPTSSCCSAFNRWIVGACRRSSEGSSSRRTSLFSKNKSLSPTLPPEDISSSLPSPKPKIVLTDDQKSDITLVSEDDEDTRKKLDSSLIEHTSAMRGAIPVLALPLAVFCLICNILIPGLGTIFSGLFCLCFGIPRFGVYDGAKHRIGSLVINLLVGCSQLFTVLFCLVGWGWAIWWGVIMVQVSRKYKKLKADAAAAEAEAPPVTNNNHTRP-
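
The lengths reported for this entry (a 3312 bp transcript and a 1103 aa protein) can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the transcript consists entirely of coding sequence (no UTRs) and that the trailing "-" in the protein sequence marks the stop codon. The function names are hypothetical, and the short test sequence is a toy built from the first 12 nucleotides of the transcript plus its terminal TGA.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a complete CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def looks_like_complete_cds(seq: str) -> bool:
    """True if seq is in frame, starts with ATG, and ends with a stop codon."""
    seq = seq.upper()
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# 3312 bp / 3 = 1104 codons; removing the stop codon leaves 1103 residues.
print(expected_protein_length(3312))

# Toy sequence (assumption): start of the transcript followed by its stop codon.
print(looks_like_complete_cds("ATGGCCCAAATGTGA"))
```

If the transcript did include untranslated regions, the first function would not apply directly. The coding region would have to be extracted first.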