Monarch geneset OGS2.0

DPOGS213097
Transcript: DPOGS213097-TA, 3264 bp
Protein: DPOGS213097-PA, 1087 aa
Genomic position: DPSCF300016 - 74048-82099
RNAseq coverage: 183x (Rank: top 49%)
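
The transcript and protein lengths in this record can be cross-checked: a 3264 bp coding transcript holds 3264 / 3 = 1088 codons, which is 1087 amino acids plus one stop codon. The sketch below does that arithmetic. It assumes the transcript is coding sequence only and that the genomic coordinates are 1-based and inclusive; the record states neither, and the variable names are illustrative.

```python
# Values copied from the record; variable names are illustrative.
transcript_bp = 3264
protein_aa = 1087
genomic_start, genomic_end = 74048, 82099

# Assumption: the transcript is CDS only (start codon through stop codon).
codons, remainder = divmod(transcript_bp, 3)
encoded_aa = codons - 1  # the final codon is the stop codon

# Assumption: coordinates are 1-based and inclusive.
genomic_span = genomic_end - genomic_start + 1

print(remainder == 0, encoded_aa == protein_aa, genomic_span)
# prints: True True 8052
```

The genomic span (8052 bp under the stated assumption) is longer than the transcript, which would be consistent with introns, although the record does not list exon boundaries.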
Annotation
Source          Hit                E-value  Identity  Description
Heliconius      HMEL006556         0.0      47.83%
Bombyx          BGIBMGA007470-TA   8e-36    38.22%
Drosophila      DLP-PA             3e-21    23.44%
EBI UniRef50    UniRef50_E9JEH3    2e-33    38.22%    Daxx n=1 Tax=Bombyx mori RepID=E9JEH3_BOMMO
NCBI RefSeq     XP_001842789.1     5e-29    31.03%    conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastp  gi|321400092       6e-33    38.22%    daxx [Bombyx mori]
NCBI nr blastx  gi|195443352       5e-36    23.16%    GK18723 [Drosophila willistoni]
Group
KEGG pathway: cfa:474873, 4e-21
    K02308 (DAXX) maps-> Amyotrophic lateral sclerosis (ALS)
                         MAPK signaling pathway
InterPro domain: [330-882] IPR005012, 1.2e-13, Daxx protein
Orthology group: MCL21992, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213097-TA
ATGGCTAGTGAAGATGTTATTGAGTTGGGATCTTCTGATGATGAAGCTGAACCAGCTCCAAAGAAGAGAAAACCGATGCCCAATGCCATGGTTGTTATCCCGAATAAGTTTCCTGGTTTAACTATTAAACCATCTCATTCAAATCAGTTTCGTAAACAGAAGGATATTTTAAATAAACCAATTATATTTAATAAATTGGTAAGTAATGTAAATTCAAATGTTCATAGGAAAAAAGTAGTAACTGGGTCTCTCCAATCTCCACTTAAAAATTATACTACTACCAAAATTGTGAAAACTGCAATACCTATACAAAACAAGTTCTTAAACCCATTGAGTATTGCAAAGAATTTATTTAATAATCAAGTATCAATTCCTAAACCACAAACAAGAGGTTTAAAGAAACAAACAGAAGCTAAAGGCTCAAGCCTTTTAAATAATTTACCTCCAGGTATTACTATTAAATTAGTACAGAATTCTTGTCCACTAGTTAATCAAGACAAAATTAGGACATCCCAAAGTACTGTAGGAGAAGTATTGACTGTTGAAATAGATGATGAGGAGACATCTGAAACATCAACATCAAGCCCACAGTGGTATATACGACCCGAGGATCAAGTAGATGAGAATGAAAAAAAAATTGAAGACAACAATGAACTTAAAGTAGAAACTTCAATACAAGAACAGAATAACAAAGAGCCTGACAAATCTAAATATGTTGAAATCACAATAGAAGATAGCCCTTTAAAGCCTCTAACACAAAAACAGGACAGTGAAATTGGTAAAGAGCTGGCGATTACAATTGAAGACAGTCCCGCTAAACCTACACCATATAAGTCTAATAACGGGGATGGTGATTTTGAAAACAGGAATATCAAAGAACCTCATAGTAAGAAGAAATTAGATTATCCCAAAGAAAACAACGACGAGAGACAGGTTGTTGAAATAGAAATAGATCTTAATGATACTAGTTATTCTGAGATAAAGGAAAATGATGATAGTACTGAAAAATGTCAACAAATTGATACCAGAAATGAAACACAAAAATGTAAAGGAATTGATACACCACCAGTTGGCACTAAATCTGAGTGTAATGAAGCTAAAACAAATAATTCTGAGAATTCTATTGAAAGTAAAGTTAGCACAGAAATAAGTGATACTGAATTCCATCCTGTGTATCAGAATTTCATCGACATATGTTTTGAACTAGAAAATTCTGATGATATGAAAAAGATAGTTGATAAGAAAATTAAAGCGTATTACAAACAGTGTCCAAAAGTATATGTTGAGTCTACCGATTTTATAGATATGGTCTCCACAAAAATAGTTTTGATGAAGGCCAGTCCAGAAAAAATGTATTTGTATATAAAAGATGTTGTGGATGAACTTAATTTACAAAGAAAAATGGCCAAATCTGCATGTACTACGGAAGACACACAACAAGATCCAGCAAATACTTTACCTGCTGAATCTGAGCGAGACAGCAAGAAAATGATACAGATTCGAAAACTGGAGAAGACTATCAAGAAATTACACAGAGCAATACAGAAACTCGAGCAGCAAGAGGTTGACTTTGATGATGACGAGGATTCAGTTTATTTATTGACAGAGAGATATAAAGAACGCATGGTACGAGTTCATAAGAAGTTTTGTCAGCTCACCAACACGAAGATGCCATCGGAACCAAGGATACACATTGAATCAAGACCTGGCCGTCCTACAGGTCCAGCAAAAAGAGTAGAGAAGTGGATCAATAAGAAAGTGCCTATTGGAGCTCCATTACCATTCCCAGATTTTCATGATGTACTACATTGTGTACGTGACGCCAATGACGATGATAGACTGGGATGGAATGAATATGAAATTATGGAGGAAGCGAGAGATTTGTTTACAAGATGTGGCAAGAAACTTCAAAGACGGCGACAAGAGAATGAATGGAGATTAGCTGTGTCAAGGATTACACAAGTATTGGATCCCGCAGAAGAAAATATTGATTTGAAGAA
ACGGCTAGAAGAAAACGAAGCTTTGGCCACGTCTAAAGAATTAGAACTATTTAAAAAATTTGTGGACAAACAAAATCAACTGAAACTAGAAGCCGTCGAAATCGGTGATAAAGAAGCAGAAGAGTCACCATTGGAGAGTGATGATGAAGAAGAAGTGGTTGAAAGTAAACATTCGGCAGAAGAAAAACAGAAGAGAAAGGAAAAGATTAAGGAGCTGCTGCAGGATAAAGGCAAAAAGACTTTAGAAGAGGAAAACTACGTTGATTCTATAAAAGAAAAAGATAACACTGGAGTAGAATCTGCTGAAAAACAAGATGGAGTTGATGGAAATGAAGACAAACACATGACAGGAACTAATCTAGAATCTAAAGAACAATATGAAACTAAAAATGAAGACGGAAATAGTGAAGTTACTGATGAAAATATCGATAAGATAGAGCAAGTAGACGGCGGTAGTGAGAGTGATAAAGTTGAATCCGATGTTGACGAATTACATTTATATCAGGAATCACGTAAATGTCATGACGACGAAGCGAATAGTTCGGCTCTGGAGTCCTCTGACTCTGAAATACCAATCGCGATTTCGGACACCTCAGAGTCGGAGATCGAAATGGAGAATGATAAGAAAATGAGTGACATAATCAGCATTGAAGATTCTAGTTACTCGGAGTCAGAATCTACTTACGATAGGTTTGCTTGTACGAAAAATGATTATATTTTAGGTAATGTTTATGGTGAATTTGAATATTCATGCATGGGCGTCGTTGAGTCCAAACAGGATGAAGTTGTTCTTGATGAAGATGTTGTTTTGGAATCATCAGACGAGGAATCAAATAAAGATGTGGCTTCAGGAATCGGTGACACTTGTATTAATTTAAAAGATGATAATATATCGATCTCAGAAACAATTGTAGATGGGGAAAGTATGGAACAAGATATTAGTATTAATGAAATAAATGGTACATTTGACAATAACAATGAATTGAAGGCATGTGTGGCAAATTCAACAAGTAGTGAAAGTGAATTGATCATACAAGAATGCTCTGAAAAAGTTGGAAGTGATGTTTTAAACAGAGTTCATGCAAAAGAATTCCCAGAATCGGCTGAAATGAAAAATGTACCGGATGAAAATTCAATTGTTCGAATGGAAACTGGTGTTAATGAGGATAAAGGGCAGTTTCAGAATTGTAGCACTTTGGCGACCCAAGAGGTTGTTAGTAGTCACTGTGAAAAAATATCAGAAACAGTGCAGCCAATAAGTTGA

Protein sequence:

>DPOGS213097-PA
MASEDVIELGSSDDEAEPAPKKRKPMPNAMVVIPNKFPGLTIKPSHSNQFRKQKDILNKPIIFNKLVSNVNSNVHRKKVVTGSLQSPLKNYTTTKIVKTAIPIQNKFLNPLSIAKNLFNNQVSIPKPQTRGLKKQTEAKGSSLLNNLPPGITIKLVQNSCPLVNQDKIRTSQSTVGEVLTVEIDDEETSETSTSSPQWYIRPEDQVDENEKKIEDNNELKVETSIQEQNNKEPDKSKYVEITIEDSPLKPLTQKQDSEIGKELAITIEDSPAKPTPYKSNNGDGDFENRNIKEPHSKKKLDYPKENNDERQVVEIEIDLNDTSYSEIKENDDSTEKCQQIDTRNETQKCKGIDTPPVGTKSECNEAKTNNSENSIESKVSTEISDTEFHPVYQNFIDICFELENSDDMKKIVDKKIKAYYKQCPKVYVESTDFIDMVSTKIVLMKASPEKMYLYIKDVVDELNLQRKMAKSACTTEDTQQDPANTLPAESERDSKKMIQIRKLEKTIKKLHRAIQKLEQQEVDFDDDEDSVYLLTERYKERMVRVHKKFCQLTNTKMPSEPRIHIESRPGRPTGPAKRVEKWINKKVPIGAPLPFPDFHDVLHCVRDANDDDRLGWNEYEIMEEARDLFTRCGKKLQRRRQENEWRLAVSRITQVLDPAEENIDLKKRLEENEALATSKELELFKKFVDKQNQLKLEAVEIGDKEAEESPLESDDEEEVVESKHSAEEKQKRKEKIKELLQDKGKKTLEEENYVDSIKEKDNTGVESAEKQDGVDGNEDKHMTGTNLESKEQYETKNEDGNSEVTDENIDKIEQVDGGSESDKVESDVDELHLYQESRKCHDDEANSSALESSDSEIPIAISDTSESEIEMENDKKMSDIISIEDSSYSESESTYDRFACTKNDYILGNVYGEFEYSCMGVVESKQDEVVLDEDVVLESSDEESNKDVASGIGDTCINLKDDNISISETIVDGESMEQDISINEINGTFDNNNELKACVANSTSSESELIIQECSEKVGSDVLNRVHAKEFPESAEMKNVPDENSIVRMETGVNEDKGQFQNCSTLATQEVVSSHCEKISETVQPIS-
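
The protein sequence appears to be the conceptual translation of DPOGS213097-TA under the standard genetic code, with the trailing `-` marking the stop codon. A minimal sketch of that translation follows. The `translate` helper and its `stop` parameter are illustrative names, not part of any database tooling, and the two test fragments are the first 30 and last 15 bases of the transcript.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna, stop="-"):
    """Translate an in-frame DNA string; stop codons are written as `stop`."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X for ambiguous codons
        protein.append(stop if aa == "*" else aa)
    return "".join(protein)


# First 30 and last 15 bases of DPOGS213097-TA.
print(translate("ATGGCTAGTGAAGATGTTATTGAGTTGGGA"))  # MASEDVIELG
print(translate("CAGCCAATAAGTTGA"))                 # QPIS-
```

Both fragments reproduce the corresponding ends of DPOGS213097-PA, so the same helper can be run on the full transcript to check the remaining residues.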