Monarch geneset OGS2.0

DPOGS215169
Transcript: DPOGS215169-TA, 885 bp
Protein: DPOGS215169-PA, 201 aa
Genomic position: DPSCF300143 - 676804-681353
RNAseq coverage: 137x (Rank: top 55%)
Annotation
Heliconius       HMEL013701              2e-92   94.74%
Bombyx           BGIBMGA008642-TA        2e-88   89.60%
Drosophila       Dys-PH                  2e-52   58.18%
EBI UniRef50     UniRef50_UPI00022C9774  1e-63   69.33%   UPI00022C9774 related cluster n=2 Tax=unknown RepID=UPI00022C9774
NCBI RefSeq      XP_002426445.1          2e-66   68.00%   hypothetical protein Phum_PHUM253780 [Pediculus humanus corporis]
NCBI nr blastp   gi|242011413            3e-65   68.00%   hypothetical protein Phum_PHUM253780 [Pediculus humanus corporis]
NCBI nr blastx   gi|242011413            5e-63   70.91%   hypothetical protein Phum_PHUM253780 [Pediculus humanus corporis]
Group
Gene Ontology    GO:0005515              1e-10    protein binding
KEGG pathway     bfo:BRAFLDRAFT_125744   6e-27
                 K10366 (DMD) maps->
                     Dilated cardiomyopathy
                     Viral myocarditis
                     Arrhythmogenic right ventricular cardiomyopathy (ARVC)
                     Hypertrophic cardiomyopathy (HCM)
InterPro domain  [40-142] IPR018159      1.2e-14  Spectrin/alpha-actinin
                 [39-142] IPR002017      1e-10    Spectrin repeat
Orthology group  MCL19561                Insect specific

Nucleotide sequence:

>DPOGS215169-TA
ATGAATGTCGTCGAGCACTTCGATGCTGTAATGACAGCCCTCCGCGAACACTGGGATGAGGCGAACGCCCGCGTGCTACAGCGGAAGGCGCAGCTGGACGCTATGCTGGGAGACTCGCAGAGGTATGAAGCGAGGCGAAGAGATGCAGACGCCTGGCTCACCAGGATGGAAACGCGACTGGCTGAGATGACTCCACCTGGACACACCGCTGATGTGCTGGAAATGCAGCTCCGGGAACAGAAGTCGTTCCACGCGGAAGTCCATCAATACAAGCACCAGATAGAGCTGTTCGGTCAGCTGACGCAGCGTCTGATAGCGGTCTACAGAAACGACGACACGACGCGCATCAAACGAGCCACGGAGGCCATCAACCATCGATACAGTGAACTTAACAATAGTATCGTGGCACGCGGTAAAGCTCTACATTCGGCGGTGTCATCGCTGCAGAATTTTGATCGCTCGCTGGAACGCTTCGTGGGATGGCTGAGTGAGGCCGAGTCGCTACTGGAGGCGGCCGAGAGAGACCCGCATCTGTTAAAGGTAGGCCCTTCAATGAAAATAAATACTCTGTGGCTGATACGCGGAGTTGAAAAACTTCGCTTATAGGTGGTAGAGCCTAGCTGTGGTCGCCCATACCCTGTCCCTTTCCTGCAATTCCCTGCCTCTTCCTCAACAATCAAGGTCGGCAACGCATCCACAGCATCTCTTATGTTGCGGATGTGCGGATGTCCATGGGCAACCATGACAACTGCCCATCAAGTAGACCGTCTGCTCGTTTACCACCTTTCACATTAAAAAAATGAAAAGGAAACGAAAGAAGCTGACGATAGTTTAAGAACCAAGCAAGCGAATCGGTTTGGTGTCGTAGGTCAAGTAATATTTTAA

Protein sequence:

>DPOGS215169-PA
MNVVEHFDAVMTALREHWDEANARVLQRKAQLDAMLGDSQRYEARRRDADAWLTRMETRLAEMTPPGHTADVLEMQLREQKSFHAEVHQYKHQIELFGQLTQRLIAVYRNDDTTRIKRATEAINHRYSELNNSIVARGKALHSAVSSLQNFDRSLERFVGWLSEAESLLEAAERDPHLLKVGPSMKINTLWLIRGVEKLRL-