Monarch geneset OGS2.0

DPOGS201499
TranscriptDPOGS201499-TA1434 bp
ProteinDPOGS201499-PA477 aa
Genomic positionDPSCF300006 + 779447-786098
RNAseq coverage1195x (Rank: top 11%)
Annotation
HeliconiusHMEL0154970.094.72% 
BombyxBGIBMGA002590-TA4e-14589.14% 
DrosophilaMef2-PD1e-9960.95% 
EBI UniRef50UniRef50_G6CUC80.099.79%Myocyte enhancing factor 2 isoform A n=3 Tax=Obtectomera RepID=G6CUC8_DANPL
NCBI RefSeqNP_001036905.10.089.71%myocyte enhancing factor 2 isoform A [Bombyx mori]
NCBI nr blastpgi|1129827550.089.71%myocyte enhancing factor 2 isoform A [Bombyx mori]
NCBI nr blastxgi|1129827550.089.95%myocyte enhancing factor 2 isoform A [Bombyx mori]
Group
Gene OntologyGO:00056342.2e-38nucleus
GO:00063552.2e-38regulation of transcription, DNA-dependent
GO:00435652.2e-38sequence-specific DNA binding
GO:00037002.2e-38sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain[86-145] IPR0021002.2e-38Transcription factor, MADS-box
[186-245] IPR0221022.4e-06Holliday junction regulator protein family C-terminal repeat
Orthology groupMCL13061 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201499-TA
ATGTGTGCATTACACGCACACATGCAACAGAGAGAAGACTGCGAACCTTTGCCATGGCCACGCACACCATCAAAGCACATCGCACAGAACAATGACATATTCAAGAATTTACACACATTGAGACAGGGAAGCACTCGCAACGCAACAGGCTCTAGAAATAAGGAAAATATAGCAGCTGTTGACGTCCGTAGTAGCCAGCCACCGGCACATTTGTCTCCCCCCGCTCCTATAACCCCCTCCAAGATTGCTGCCACCATGGGCCGGAAGAAAATACAAATATCACGTATCACGGATGAACGGAATCGACAGGTGACTTTCAACAAACGCAAATTCGGAGTAATGAAGAAGGCTTACGAATTGAGCGTCCTCTGTGATTGTGAAATTGCTCTCATCATCTTCAGCTCCAACAACAAACTCTACCAATATGCCAGCACCGACATGGACAAGGTTTTATTGAAATACACGGAATATAATGAGCCGCACGAGTCACTGACGAATCGCAACATCATCGAGGCACTAACGAAGAAAGAGCATAAGAACGGAGTGATGTCACCAGACAGCCCTGAAGCCGAACCAGAATATAACTTGACCCCAAGGACTGAAGCAAAGTATTCAAAGATTGATGAGGAATTTCAAATGATGATGCAACGCAACCAACTCAATGGAAGCCGAGTCGGAGTTGGTGTTCCTGGAACGAACTACAACCTTCCTGTCAGCGTACCCGTCGGAAGCTATGATCAGTCCCTTTTGCAAGCTAGTCCTCAAATGCATACCTCTATCAGTCCACGGCCATCGTCTTCAGAAACTGATTCAGTTTACCCGAGCGGCGCAATGCTAGAGATGTCAAATGGTTACCCGGGATCTGGCTCACCGTTGGGTGGAGGGTGCACTCCCTCCCCCTCCCCCGGACCAGCTCCCTCCCCCCACCGACACCCGCACAAGCCCCACCATGCTCCGCCGCCGCATCATTCGCCCCGTCACAACAACTTACGCGTCGTCATACCCAGTTCCATGCCACCACCGCAAGATGATATATCTTATACCGGAGAGACACCACTAAGCTATTCAGGACTCGGAAACTTCGGCGGACCACAGGATTTCAGCATGAGCTCAGACATCGGCATAGGACTGTCATGGGGAGCGCACCAACTGCAGACACTACAACACAACAGTCTGCCGGTGCTAGGCGGTACACCGCCTCCGGCCGCATCTCCTAGCAACGTGAAGATCAAAGCCGAGCCAGTTTCCCCACCGCGTGAACATCTGCATCGTCCACCAGCACCGCCAGCCCACCTCGCCGGCATCGATGGATCGGTGACTTCAAGTAACATGGGTTCGCCAGCCGGGCAGGACATGAGGCATGCCAACACCGTCCCGCTAGACTACGAACAGCCTCACTCAAAACGGCCGCGGATCGAAGGCTGGGCCACATAG

Protein sequence:

>DPOGS201499-PA
MCALHAHMQQREDCEPLPWPRTPSKHIAQNNDIFKNLHTLRQGSTRNATGSRNKENIAAVDVRSSQPPAHLSPPAPITPSKIAATMGRKKIQISRITDERNRQVTFNKRKFGVMKKAYELSVLCDCEIALIIFSSNNKLYQYASTDMDKVLLKYTEYNEPHESLTNRNIIEALTKKEHKNGVMSPDSPEAEPEYNLTPRTEAKYSKIDEEFQMMMQRNQLNGSRVGVGVPGTNYNLPVSVPVGSYDQSLLQASPQMHTSISPRPSSSETDSVYPSGAMLEMSNGYPGSGSPLGGGCTPSPSPGPAPSPHRHPHKPHHAPPPHHSPRHNNLRVVIPSSMPPPQDDISYTGETPLSYSGLGNFGGPQDFSMSSDIGIGLSWGAHQLQTLQHNSLPVLGGTPPPAASPSNVKIKAEPVSPPREHLHRPPAPPAHLAGIDGSVTSSNMGSPAGQDMRHANTVPLDYEQPHSKRPRIEGWAT-