Monarch geneset OGS2.0

DPOGS200140
TranscriptDPOGS200140-TA2550 bp
ProteinDPOGS200140-PA849 aa
Genomic positionDPSCF300128 - 366549-376530
RNAseq coverage13x (Rank: top 82%)
Annotation
HeliconiusHMEL0075560.084.40% 
BombyxBGIBMGA002785-TA0.091.92% 
DrosophilaDCX-EMAP-PC0.071.46% 
EBI UniRef50UniRef50_Q9VUI30.071.46%Echinoderm microtubule-associated protein-like CG42247 n=30 Tax=Neoptera RepID=EMAL_DROME
NCBI RefSeqXP_969211.20.079.28%PREDICTED: similar to IP09257p [Tribolium castaneum]
NCBI nr blastpgi|1892410160.079.28%PREDICTED: similar to IP09257p [Tribolium castaneum]
NCBI nr blastxgi|1892410160.079.28%PREDICTED: similar to IP09257p [Tribolium castaneum]
Group
Gene OntologyGO:00055151.8e-32protein binding
GO:00355563.5e-10intracellular signal transduction
KEGG pathwaypbe:PB000833.01.05e-10 
 K10599 (PRPF19, PRP19)maps-> Ubiquitin mediated proteolysis
    Spliceosome
InterPro domain[243-687] IPR0110472.9e-41Quinonprotein alcohol dehydrogenase-like
[566-846] IPR0159431.8e-32WD40/YVTN repeat-like-containing domain
[217-289] IPR0051081.4e-28HELP
[618-845] IPR0110463.9e-26WD40 repeat-like-containing domain
[68-132] IPR0035333.5e-10Doublecortin domain
Orthology groupMCL14563 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200140-TA
ATGGTCGACAGTGACGAAGAACCCTTCACGGTCATAGCACTCCCAGTTAACACTCCAGAACCACCAGCAGCGAGTTACGGTAAGAAGAACGGCATGTGGTACGGCGGCACAGCCACGGGAACCAGCGGATGGTCGCGGGCTGGTACTCGCAAACAGTCGGTGGCCGAGTCTGATGCACCCCCGCCCGGAGGTGGCAAGCCTGCCAGTGGCAGAGTGATTCGGATCATCAACAACATGGACCATTCTATACAGTGTCGTGTTTTGTTAAACTTACGTACAACCCAACCATTCGAAGAGGTTTTAGAGGATTTAGGGCAGGTTTTAAAAATGAGCGGAGCAAAGAGGATGTACACAATTACGGGACAAGAGGTAAGGAGCTTTTCTCAGTTGCGTAACGAATTCGCTGATGTAGAGACGTTTTATTTGGGAGCAGCGATGGTACCTCCAGCACTCAGTCCAGGAATAAGCGCACCTCTACCAATTGAGTCGCCCATTAGAAGATCCAGGTCTAGGGGGAACGTATCTGCTGTATCAGTGTCTGAGGAAGGGCGCGGTCGGCGCGCTCGCAGCAAAAGTCGCCCGCGCGTACTATACGCTCCCGAAGGAGAGATAATAAGAAACTCAGATTACACCCTCCTCGAGGTTCTGAAAGAAGAGCCTATCCGTGTAACAATACGCGGTTTACGACGCACCTTCTACCCTCCAATACACCACGCGCCCATAGACAACAGTCCTCCAGATAAAAAGATGCAACTGGAATGGGTATACGGTTATCGAGGCTCGGACTCACGTCGCAACCTGTGGGTGCTGCCCACCGGCGAGCTGCTATATTACGTCGCAGCGGTTGCCATCATGTATGATAGAGACGAACACGCTCAGAGGCACTACACGGGACACACCGAGGATATACAGTGTATGGAGCTGCACCCGTCCCGCGAGCTGGTGGCGAGTGGTCAGCGCGCGGGCCGGGGGCGCCGGGCGCAGGCCCACGTCCGCATCTGGAGCACCGACACGCTTCAGACCCTGCACGTGTTTGGGATGGCCGAGTTCGAGGTCGGCGTCTCCGCGGTCGCCTTCTCGCAACTGAACGGAGGTAGTTACGTATTAGCCGTCGACGCCGGTCGTGAAAGTATTCTGTCCGTGTGGCAGTGGCAATGGGGACATCTTCTAGGCAAAGTTGCGACTCTTCAAGAGGAGCTGACAGGCGCGGCGTTCCATCCTCTCGATGACAACCTGCTGATCACACACGGTAAGGGACACCTCGCCTTCTGGAACAGGAGGAAAGACGGATTCTTCGAACGAACGGATATTATTAAACCGCCGGCTCGCACACAAGTGACAGCCTTACAGTTCGAACAAGACGGTGACGTAGTGACGGCGGATAGTGATGGGTTCATAACCATATATAGCGTCGATAGTGATGGTGCTTACTTTGTACGAATGGAGTTTGAGGGTCACATAAAAGGAATTTCCTCGTTAGTAATGCTCTCAGAAGGTACTCTTATATCTGGCGGTGAGAAAGATAGAAAAATAGCCGCTTGGGATTCCTTACAAAATTATAAGAGAATAACTGAAACAAAGCTACCCGAATCAGCTGGTGGAGTCCGAACAATTTATCCCCAGAGACCTGGAAGGAACGATGGCAACCTGTACATAGGAACAACGAAAAATAACATTCTAGAAGGATCGCTACAGAGGAGATTCAATCAAGTTTTGTTCGGTCACCACAAACAGTTGATGGGCGTCGCGGTGCATCCTGATGATGAAATGTTTGCCACTGCCGGCCACGACAAGAACATAGCGCTCTGGAAGGGGCATAAGCTAGTGTTCGCGACACAGGTTGGATATGAATGTGTATCTCTGGCGTGGCATTCCGGCGGGGGGGCGTTGGCGGCCGGCAGTACCGAGGGTCACCTGGTGATACTGAACGCTGATGCTGGAGCCCACGTCGCCACCATCAGGGTCTGTGGATCGCCTCTCAGCTGTTTGCAGTACAACACTGCTGGAGACATATTAGCCATTGGATCCCAAAATGGCAGCATATATTTATTCCGTGTGTCACGTGATGGTTTTTCTTATAAGAAATCGAATAAGATCCGAGGAGCTCAGCCTCTCGTGATGCTGGATTGGAGTCTCGATGGAAACTACTTACAGACAGTCACCGCTGACTATGATTTATCATTTTGGGACATCAAAGCTCTGTCACCTGAGAAGAGTCCGATAGCTATGAAAGACGTCAAGTGGGCTACATTTAATTCGACAGTCGGCTTTCTTGTTTCAGGGATGTGGAACAACCGTTTTTATCCTATGACGTCACTGATAACAGCCGCGAGTCGCTCTGCGGCTCACGATCTACTTATAAGCGGAGATTCAGAAGGCCATCTCCGTCTTTTCAGATATCCCTGTGCGAGTCCAAAGGCCGAGTACAATGAGATAAAGGTGTATTCTGGTGCGATCCACTCCGCTCGGTTCTTGTTCAACGACCGCTGCCTGGTGACCTCTGGCGGCTCTGACGCGGCGCTCATGTTGTGGGAACTAGTGGACGACTAG

Protein sequence:

>DPOGS200140-PA
MVDSDEEPFTVIALPVNTPEPPAASYGKKNGMWYGGTATGTSGWSRAGTRKQSVAESDAPPPGGGKPASGRVIRIINNMDHSIQCRVLLNLRTTQPFEEVLEDLGQVLKMSGAKRMYTITGQEVRSFSQLRNEFADVETFYLGAAMVPPALSPGISAPLPIESPIRRSRSRGNVSAVSVSEEGRGRRARSKSRPRVLYAPEGEIIRNSDYTLLEVLKEEPIRVTIRGLRRTFYPPIHHAPIDNSPPDKKMQLEWVYGYRGSDSRRNLWVLPTGELLYYVAAVAIMYDRDEHAQRHYTGHTEDIQCMELHPSRELVASGQRAGRGRRAQAHVRIWSTDTLQTLHVFGMAEFEVGVSAVAFSQLNGGSYVLAVDAGRESILSVWQWQWGHLLGKVATLQEELTGAAFHPLDDNLLITHGKGHLAFWNRRKDGFFERTDIIKPPARTQVTALQFEQDGDVVTADSDGFITIYSVDSDGAYFVRMEFEGHIKGISSLVMLSEGTLISGGEKDRKIAAWDSLQNYKRITETKLPESAGGVRTIYPQRPGRNDGNLYIGTTKNNILEGSLQRRFNQVLFGHHKQLMGVAVHPDDEMFATAGHDKNIALWKGHKLVFATQVGYECVSLAWHSGGGALAAGSTEGHLVILNADAGAHVATIRVCGSPLSCLQYNTAGDILAIGSQNGSIYLFRVSRDGFSYKKSNKIRGAQPLVMLDWSLDGNYLQTVTADYDLSFWDIKALSPEKSPIAMKDVKWATFNSTVGFLVSGMWNNRFYPMTSLITAASRSAAHDLLISGDSEGHLRLFRYPCASPKAEYNEIKVYSGAIHSARFLFNDRCLVTSGGSDAALMLWELVDD-