Monarch geneset OGS2.0

DPOGS200630
TranscriptDPOGS200630-TA1584 bp
ProteinDPOGS200630-PA527 aa
Genomic positionDPSCF300076 + 307818-309696
RNAseq coverage444x (Rank: top 28%)
Annotation
HeliconiusHMEL0032930.075.19% 
BombyxBGIBMGA011310-TA3e-17266.05% 
Drosophila% 
EBI UniRef50UniRef50_D7EIS55e-7037.69%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D7EIS5_TRICA
NCBI RefSeqXP_002429776.13e-3846.49%hypothetical protein Phum_PHUM450880 [Pediculus humanus corporis]
NCBI nr blastpgi|2700158022e-6937.69%hypothetical protein TcasGA2_TC016110 [Tribolium castaneum]
NCBI nr blastxgi|2700158028e-9440.70%hypothetical protein TcasGA2_TC016110 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[423-484] IPR0003134.8e-10PWWP
Orthology groupMCL25834 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200630-TA
ATGACCATTCAAAAGAATTCTAAGATTTTAGTAAATGTTGAAGAAGCATTAGAAGATTTAATTGTAGTATCCTATACAGACGTAAACAAAAAGTTTCAAGGTGTACTACTAGATTCTAATAAAGGAAATTTACCTTTCGGAGTATATAGTCTCCATCCATCGTTTACCAAGCAAGTTGAACAGGCAGACAGTAAAGATGAAAAGTTACATTCCGTTAGTCAAAGGTTTACTTATCAAGATCCTGAATATGAAGACGTCCAAAAGTCAAAAAAGCAGGGTTCAAAATCCAAATCCCAGGCGCGACAAAAGATGACGGTGAGATTACGCCCTAGGAAGGTGTTGTGTTCTAATTGCCAAGGTATTTGTAATGAGAATAGTGAAAACGTTGACGTGTCCAAGAAGCGAAAACATGACTTGGATGACGACTCACAGAGGGATAGTTCTGAGTCATCGAAGTTAGAGAAGAAGTACCTCATGGGTGGCATGTTGATGCCTAAACTATCTAAATTACAGCCTAATGATATTACGACTGCTACTAAAGCGAAAAATACGTCCAAATCTAGTGACAAAGGTAGAAGCTTAAAGAGATCTCACAATGACGATGGACCAACGAGGGAGCTTGCGTTTGTTAGTGTTTCGGACAATAGTGATAAAGAAGACGTGAAGGATGGCGAAAATGCATTTAGTAATTTATTTTCTAATGCCAGGACACTCAAAATAAGTTTCGGTGAGGGTGAAGGCACTGTAGTTAAAATACCACCACTGGCTGGTGATTTCAACGAAGACTCTGGTGTATCCTGTGATATGGCGAAACCAGATTCTAAAGCAGTTAAGAAAGCTTTGAAAAAGGCAAGGAAACAAGCCAAAAAGGGCAACACTTTAGAGAAAACCGTTAATCTGTGGATAGAAAAATCTCCAAAGCATATAGGGGCTCTGTCCCCACGAAACAGTGTTTGTAATAGTCCACCATCACTGGATTCCTTGGAGAAGAAGCATAAACACAAAGTCAAACATAAAAAGAAACATAAAATACAAAAGAAACGAGACGAAAACACAAGTAAAAGTGATGCAGATTCCTTGGATTGTTTCAGCAATGCTCAGGAAGAATGTTTGAAACAGAAGTTATCCATCAACCTACGTAGGCTTAGTAGCAATTCATATGAACATAGGACTGAAGAAAACGAGCTATCCGAAGACTGTGAAAGTGAAACTGTGCCGGATTTCCCAGCCAGTATAGAGAGTGTTGGGGGTAAATTGCTTAGGGTATCTGTGGGTGACATAGTGTGGGGTAAAATTGTTGGTTTTCCTTGGTGGCCGGGTAAAGTTATGAGTGTGACTCCTTCTTCTAGGGCCCATGTAGCGTGGTTTGCGTCTACCACATCATCGTTTATGCCCTGCGAAAGTTTGAGCCCTTTCCTGCAGGATTACAAGATACGATTCAACAAAAAGAAAAGAGGTCCATACAAAGAGGCCGTCAAACAAGCTACAATAGAAGCGAGAAGAATTGAATCTCTGAATGATCCGCTAGCAAGTCCAACCCATATGTCCGACCCCGCCCCACATACAATAGACGTGTTCTCGTAA

Protein sequence:

>DPOGS200630-PA
MTIQKNSKILVNVEEALEDLIVVSYTDVNKKFQGVLLDSNKGNLPFGVYSLHPSFTKQVEQADSKDEKLHSVSQRFTYQDPEYEDVQKSKKQGSKSKSQARQKMTVRLRPRKVLCSNCQGICNENSENVDVSKKRKHDLDDDSQRDSSESSKLEKKYLMGGMLMPKLSKLQPNDITTATKAKNTSKSSDKGRSLKRSHNDDGPTRELAFVSVSDNSDKEDVKDGENAFSNLFSNARTLKISFGEGEGTVVKIPPLAGDFNEDSGVSCDMAKPDSKAVKKALKKARKQAKKGNTLEKTVNLWIEKSPKHIGALSPRNSVCNSPPSLDSLEKKHKHKVKHKKKHKIQKKRDENTSKSDADSLDCFSNAQEECLKQKLSINLRRLSSNSYEHRTEENELSEDCESETVPDFPASIESVGGKLLRVSVGDIVWGKIVGFPWWPGKVMSVTPSSRAHVAWFASTTSSFMPCESLSPFLQDYKIRFNKKKRGPYKEAVKQATIEARRIESLNDPLASPTHMSDPAPHTIDVFS-