Monarch geneset OGS2.0

DPOGS206603
TranscriptDPOGS206603-TA1422 bp
ProteinDPOGS206603-PA473 aa
Genomic positionDPSCF300048 - 1303865-1305286
RNAseq coverage1072x (Rank: top 12%)
Annotation
HeliconiusHMEL0088364e-13361.73% 
BombyxBGIBMGA008327-TA2e-11154.96% 
DrosophilaPPP4R2r-PB1e-5759.30% 
EBI UniRef50UniRef50_D2A1H44e-6562.98%Putative uncharacterized protein GLEAN_08382 n=1 Tax=Tribolium castaneum RepID=D2A1H4_TRICA
NCBI RefSeqXP_001807993.18e-6662.98%PREDICTED: similar to AGAP002501-PA [Tribolium castaneum]
NCBI nr blastpgi|1892367742e-6462.98%PREDICTED: similar to AGAP002501-PA [Tribolium castaneum]
NCBI nr blastxgi|1892367742e-6734.66%PREDICTED: similar to AGAP002501-PA [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[4-264] IPR0152678.8e-60Protein phosphatase 4 core regulatory subunit R2
Orthology groupMCL25464 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206603-TA
ATGGAAAATGCTGAAGAGATTTTTCATTTTCTTGAAGATTTTTCAAAATGTGAACGGAAAACAATACCACAAGAACTAAACGATTATTTGGCCTATGTAGCGCGTACCGGTGATCCCGTCTATCAGTGGTCTCTTGTTAAGAGTTTGTTCAAAGAAAAATTACTTAACGTTATAACCGACTTTTACGAGACTACTCCAGGCATCGAAATACCGCCCTATCCTAATGTTAATGCGTTTAACTACGATAGTATGAAAAACAGCTTAGTAGAACGATTGGATTCATTCAACTCTGCTCCATTTACCGTTCAAAGAATCTGTGAGCTACTCACCTACCCGCGGAAACAGTACAATCGAGTTGATAAATTTATGAGGGCCATTGAGAAAAACATTTTAGTTGTAAGTACAAGAGAACCTGGTCATCAAAGACATCCAGAGAATGGAGAACCTATTGTAAATGGATCAGATAACAATTCTGATTATAATGTTGATGTTGAAATGGAGGATATGTCATGGAAAGATAATCCCGAGTCACGACAGAATTGCGAACCACAGCCTAGTTCATCAGAAGCCCACATTATTGTTGAGGATATCGAAACAAGGTTACACACCAAAAAAGACGCTATTGTTAACAGTGAGGAAAAATCTACAACCCAATATGCATCTGAGAGTCCATCTGACTTGAATGCAAATGAGGAAATGTTAGATAATGCCCCAAAACCAAAACCCTTGGATAATGCTGTTGCATCAACCAGTGATGATATCAAGAACAATAATACTGAAGAGCAACCTTCCGTAGTCAAAGTGGATGAAGAAATGAAATCGGAAGCCCCAAATGAAGTTTTAGTCATACCTGAAATTAAAGTTGATGATGCTGAAATGACGGAACAGAAGCTTGTAGAATCAAAAGAATCTGAAACCTTAGATTCTGTGGCATCTAAGGAAAATACACAAAATGTTATTGATAACCCCACATCCATGGACGACAGCAGCTCGGACTCTATACCGAAAGAAGAAGATAACAAAATTCATTGTGAAGTGTCATCATCAAGTGAAGAAAGTTCTTCATCAAGCGATATCATTGATGGCAACAGTAACTCACCCAAAACCGAAGAACAAATAGACATCCATGTAGAACATCCACCGCAAGCAATGCCCGAAAATCCTGTACCAGAACAAGATAAAGAAGACGATAAACCTATGGAAGACGCTAAGCCCGTGGAGACAGACAAAGAAACACCTACTTTAGAAGCGGAAAACTATTCAATATCTGATGTTAACTCTGAATTACCTGTAACTAATATACCGGACAGTGAAAAAACTCCAGTCGAGGCAGAAGTAAAAGAAGTATCTGACAAAAAAAAGGAAGACCCATCTCCTGATAATACAGAAAACTTGACAAACACTAAGGTTTCCGACACCTAA

Protein sequence:

>DPOGS206603-PA
MENAEEIFHFLEDFSKCERKTIPQELNDYLAYVARTGDPVYQWSLVKSLFKEKLLNVITDFYETTPGIEIPPYPNVNAFNYDSMKNSLVERLDSFNSAPFTVQRICELLTYPRKQYNRVDKFMRAIEKNILVVSTREPGHQRHPENGEPIVNGSDNNSDYNVDVEMEDMSWKDNPESRQNCEPQPSSSEAHIIVEDIETRLHTKKDAIVNSEEKSTTQYASESPSDLNANEEMLDNAPKPKPLDNAVASTSDDIKNNNTEEQPSVVKVDEEMKSEAPNEVLVIPEIKVDDAEMTEQKLVESKESETLDSVASKENTQNVIDNPTSMDDSSSDSIPKEEDNKIHCEVSSSSEESSSSSDIIDGNSNSPKTEEQIDIHVEHPPQAMPENPVPEQDKEDDKPMEDAKPVETDKETPTLEAENYSISDVNSELPVTNIPDSEKTPVEAEVKEVSDKKKEDPSPDNTENLTNTKVSDT-