Monarch geneset OGS2.0

DPOGS215171
TranscriptDPOGS215171-TA1164 bp
ProteinDPOGS215171-PA387 aa
Genomic positionDPSCF300143 - 565588-585819
RNAseq coverage142x (Rank: top 55%)
Annotation
HeliconiusHMEL0157673e-7995.36% 
BombyxBGIBMGA008647-TA1e-7796.58% 
DrosophilaDys-PH2e-8075.25% 
EBI UniRef50UniRef50_UPI00022C97746e-9276.02%UPI00022C9774 related cluster n=2 Tax=unknown RepID=UPI00022C9774
NCBI RefSeqXP_394154.38e-9180.20%PREDICTED: similar to dystrophin CG31175-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3838624159e-9277.78%PREDICTED: dystrophin, isoforms A/C/F/G/H-like [Megachile rotundata]
NCBI nr blastxgi|3838624151e-8980.39%PREDICTED: dystrophin, isoforms A/C/F/G/H-like [Megachile rotundata]
Group
KEGG pathwaybfo:BRAFLDRAFT_1257444e-46 
 K10366 (DMD)maps-> Dilated cardiomyopathy
    Viral myocarditis
    Arrhythmogenic right ventricular cardiomyopathy (ARVC)
    Hypertrophic cardiomyopathy (HCM)
Orthology groupMCL20488 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215171-TA
ATGGAAGGCCGGGGTGGCGATGATTTAATGATTCTATCGGAAGGTTTAAGGTGTCCTAATTTAAAGGGTACCTTAATAAATAAGCAAACAGTTGAATCAGTGATCTCGAAATGGAGATGGCGGCCCGAGCCAGCTGATAGGAACATCGAAGAGTGGAAGCTGCAAAGGAGTCCACAAGGGCTTCGGCCTCGGAACAGCTCGGGGACATGGCCCGGCCGCCCTCGCTCGGGGGCGGCCTGGCCTCTGGAGAAGTTCGTCACTGAAGGCCTGTGTCATCGATGCGGTGGTGACTACAGTTACAAAGGTGCTCTTAAAGTGAACGTTGACAGTTCGGAGTATACGGGTTACAAGTATTTTTATAACAAGCCTGGCTATCGAGTGAGAGCCCGGTCCTACGAAGACGCCAGGTTCATTCATATCAACTTCCTCCTGCTGGCTGCGATGAGACATTTCGAATCGGATCTACAATCAGAGATAGAAACTCATCGCGATGTGTACGCGTCTCTCACGGGAACTGGCCGCCGTCTGCTGGGCTCGCTCTCATCCCAGGAAGACGCCGTGATGCTGCAGAGAAGATTAGACGAGATGAATCAGAGATGGCATCACCTTAAAGCGAAGAGCATGGCCATCAGGAACCGTCTGGAGAGCAACGCTGAACACTGGTCCGCGCTGTTGCTGTCGTTACGAGAACTCACCGAGTGGGTCATCAGGAAGGACACCGAGCTGAACGCCCTGGCTCCGCCGAGAGGAGACCTCAACGCTCTCATAAAACAACAGGACGACCACCGTGCCTTCCGCCGCCAATTGGAGGATAAGCGCCCAGTGGTTGAGAGTAACCTGCTCTCTGGGAGGCAGTACGTGGCCAACGAACCTCCGCTCTCTGACACCAGTGACACGGAACCGAGTCGTGACTCAGAAGGTGACTCCCGAGGATACCGTTCTGCTGAGGAGCAGGCTCGGGAATTGGCGAGGTCCATCCGAAGAGAGGTCGCAAAGTTAGCTGATAAATGGAACTCCCTGGTCGATAGGAGCGACGCCTGGGGCCGCTGTCTCGATGATGCCGTGCAGGAGTCGCCTGTAGTGATTGTGCGAGGTGAGACTCTCTCTGTGAATACTATTATTAATACAGCGGACCCGGAGTCGGCGGACAGATGCTGTCGCTAA

Protein sequence:

>DPOGS215171-PA
MEGRGGDDLMILSEGLRCPNLKGTLINKQTVESVISKWRWRPEPADRNIEEWKLQRSPQGLRPRNSSGTWPGRPRSGAAWPLEKFVTEGLCHRCGGDYSYKGALKVNVDSSEYTGYKYFYNKPGYRVRARSYEDARFIHINFLLLAAMRHFESDLQSEIETHRDVYASLTGTGRRLLGSLSSQEDAVMLQRRLDEMNQRWHHLKAKSMAIRNRLESNAEHWSALLLSLRELTEWVIRKDTELNALAPPRGDLNALIKQQDDHRAFRRQLEDKRPVVESNLLSGRQYVANEPPLSDTSDTEPSRDSEGDSRGYRSAEEQARELARSIRREVAKLADKWNSLVDRSDAWGRCLDDAVQESPVVIVRGETLSVNTIINTADPESADRCCR-