Monarch geneset OGS2.0

DPOGS214337
TranscriptDPOGS214337-TA1905 bp
ProteinDPOGS214337-PA634 aa
Genomic positionDPSCF300020 - 446744-451335
RNAseq coverage612x (Rank: top 21%)
Annotation
HeliconiusHMEL0200402e-10055.15% 
BombyxBGIBMGA004139-TA0.053.39% 
DrosophilaDys-PH2e-3731.85% 
EBI UniRef50UniRef50_E2AT981e-4528.22%Dystrophin, isoform B n=2 Tax=Camponotus floridanus RepID=E2AT98_CAMFO
NCBI RefSeqXP_001863120.16e-3732.05%dystrophin major muscle [Culex quinquefasciatus]
NCBI nr blastpgi|3071715024e-4528.22%Dystrophin, isoform B [Camponotus floridanus]
NCBI nr blastxgi|3407201587e-4726.21%PREDICTED: dystrotelin-like [Bombus terrestris]
Group
KEGG pathwaybfo:BRAFLDRAFT_1257442e-34 
 K10366 (DMD)maps-> Dilated cardiomyopathy
    Viral myocarditis
    Arrhythmogenic right ventricular cardiomyopathy (ARVC)
    Hypertrophic cardiomyopathy (HCM)
InterPro domain[34-152] IPR0151532.4e-21EF-hand domain, type 1
Orthology groupMCL22059 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214337-TA
ATGGACCTTCTAAAGTGGAAAAAAGTAGAAAATGAAGCAGGATATCCTTGTTACATTGATGAAACTAGCGGCAAACAAAGTTATGACCACCCTCAATTTAGCAAAATCTTGAAAACCTTGGACGAGTATAATGATATTAAGTATAGTGCTTATAGAATAGCGTTTAAAATATTCGCTCTTCAACGGAGCCTTAAAGTTCCACCACTACGGATAAGTTCTGGTGTCTTTGCTCGTCACCAGTTAAGTTTGTCTGAAACTAGTCTTTCTCTCGATACCGCAGAGCTTGAAGCTGTACTCGCAGACATTTATTTTGCAGCGGAAAAAGAAGGACTGTTTACTGGTGATGTGGATTTATCTGTGGATCTGTTAATTAACCTATTACTTAATGTTTATGATGGAGAGAGAAAATCACCAATCCGTGTTCTAGCAGCCAAGACTTTATTAATATTACTTAGTGAGGATTCAATATCAGAGAAATGGTTGGCATTAGCAAATTGTTGTGCTGATCATAATGGTTGTGTGTCACGGAAGAGACTTGCCGCGTTATTGATACATGTCACTGCATTACCTAAATACCTCGGCTGCCAATGTGATTATTTAGAAAATGATGTGGATAACTGCTTTGAAAAGAGTGCTGGTATGCTTGGTATAAGTGCTCACACAGTGGCTGAGTGGGGGACTCACTGTGGTAGCACAAGGTGGCTGGGGGTGGTGCAGCGTGTGCTGGACAGTAGAAACTGTGCTACGGCTAGCGCTGCATGCGCTATCTGTGCACAGCCCCTGATACAGGTATTAAAATTCAGATGTTCGAAATGTCACAACATATACTTCTGTGAGAAGTGCTATCTTTATGGCAAAGATTTGACAGTTGTATCTGGACATAAGAAAACTCACTCGATTCATGAAATTATCGATGGGGAGATCAAACCTCCAGAACACTTAGGTTTCATTGAAGGTATGAGTAGATTTGTAGAAGATATGGAAAGATTATTTTTATGTATGGGAACTAAAAAGAAGAAGCCAAACAGGCGTGGTCAAGAGAGAAATGACATACTAACAAGCCAATCCATCAAAACAAAGGAAGTGACGCCGAGTATGTTTACCTCAACTGTGGGAAAGGCTTCCGGGAATAAGGCTAACAATCCAGTGGTCACACTGCAAGATATTATAACCCAGTTGGAAAATCAAAATAAGGCCTTAATGGAACTATCAGGCCAATTGCAAGATGGAGGGAAGGATACAAATAACGAACTGAAAGAAAGGGTAGATGCACACTATAGTCAAATATCTAAACAGATCAACAGGTTGAAAAGTTTAAAGGACAATATAGCGAATCCAGACTCGTCAAATGAGTTAGTTGTAGAAAAAGACGTATGGCCCAGAGCCTTCGAAATTTTCAGTCCGATCCCAGTTGCCGACAAACAGTCGAAGGTAACCAAAAATTTAGACCAGAAAAAAATTCTCAGTATGGACTCAGAGAATTTAATGGCCACCTCAAAACATCACAGCCAGGATCTACTCACTATGTCCGGGGATTTTTGGAAACCTGTGGTCGGTCATACGTCTGATAGCGTTTCCACGGTGTCAATGAATGATATAAGCAATTGGTACAATGAGACTCAGCCGAGTCGCGTGGACAGTATATCTCGTACAGAACAACGTTACTTGTGTGACGCGCGCTCGGTACAGAAGGGTACAGAGCGGGTGAGAGAACTCAGCGCAGACATCGACTCTGTGTTGGACAGGCTCCAGGAGATAGTCACACAAACATTCACTGTAGACGGTTCCTGCTTCGACAACGCACAACTAAAGAAAACGGCTCACGAATTAGAGGGATTACTGGGAACACTCATACGTGGGGTCGAACAGCGCGCCACACTCAAGGCCACTAAGACTCTAGTTTAA

Protein sequence:

>DPOGS214337-PA
MDLLKWKKVENEAGYPCYIDETSGKQSYDHPQFSKILKTLDEYNDIKYSAYRIAFKIFALQRSLKVPPLRISSGVFARHQLSLSETSLSLDTAELEAVLADIYFAAEKEGLFTGDVDLSVDLLINLLLNVYDGERKSPIRVLAAKTLLILLSEDSISEKWLALANCCADHNGCVSRKRLAALLIHVTALPKYLGCQCDYLENDVDNCFEKSAGMLGISAHTVAEWGTHCGSTRWLGVVQRVLDSRNCATASAACAICAQPLIQVLKFRCSKCHNIYFCEKCYLYGKDLTVVSGHKKTHSIHEIIDGEIKPPEHLGFIEGMSRFVEDMERLFLCMGTKKKKPNRRGQERNDILTSQSIKTKEVTPSMFTSTVGKASGNKANNPVVTLQDIITQLENQNKALMELSGQLQDGGKDTNNELKERVDAHYSQISKQINRLKSLKDNIANPDSSNELVVEKDVWPRAFEIFSPIPVADKQSKVTKNLDQKKILSMDSENLMATSKHHSQDLLTMSGDFWKPVVGHTSDSVSTVSMNDISNWYNETQPSRVDSISRTEQRYLCDARSVQKGTERVRELSADIDSVLDRLQEIVTQTFTVDGSCFDNAQLKKTAHELEGLLGTLIRGVEQRATLKATKTLV-