Monarch geneset OGS2.0

DPOGS204951
TranscriptDPOGS204951-TA1728 bp
ProteinDPOGS204951-PA575 aa
Genomic positionDPSCF300160 + 531343-533280
RNAseq coverage9x (Rank: top 85%)
Annotation
HeliconiusHMEL0039592e-10692.08% 
BombyxBGIBMGA007650-TA8e-9893.58% 
Drosophila% 
EBI UniRef50UniRef50_Q8N3281e-12741.81%PiggyBac transposable element-derived protein 3 n=11 Tax=Simiiformes RepID=PGBD3_HUMAN
NCBI RefSeqXP_002155162.17e-12942.44%PREDICTED: similar to hCG32740 [Hydra magnipapillata]
NCBI nr blastpgi|3834082714e-12841.81%piggyBac transposable element-derived protein 3 [Macaca mulatta]
NCBI nr blastxgi|3834082712e-12741.14%piggyBac transposable element-derived protein 3 [Macaca mulatta]
Group
KEGG pathway 
Orthology groupMCL17604 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204951-TA
ATGGCAAATGACAGACCGTTAGCTGCGCATGAAATTTTAGATGCTTTAGAAAATGTTTCTGATAATGAAGAAGATTATAGAGAACGACTGATATGTATTCTACCTCCTCCTGTTGATCCTGACTGTCTCACTGACGAAGATTCGGGTGAAGAAGATAATGTAACTTTGAATAATTTGCCACGAAACATTCTGCTTCAACCGGCTGAAGTAATGATTCAAGGACAGATTATGGTGAGTGATACAGAAGAAGAACCTTCTAATTCTACAGGTCGAACTAAACGTAGGCGTACCTACGCATGGAGAAAAAGGGACTTGGCAAAAAATCCCGTGAATTGGCCAGATGTTCAAGGCGCTTGCCAAGATAAGCGCCCAATTGAGTGGTTTGAAAACTTCTTAGATGAAGATGTTATTTCGTTGTTGGTGTCAGAGAGCAATAAATATGCTGTCAAAAAGAATTTGCCTGGAGACATAACCACTGAAGATATGAAATGTTTCATCGGCATATTGTTGGTTAGTGGTTATTCATGGCTCCCCCGTAGAAGAATGTATTGGGAAAACTCCCCTGATACAAAGAATGAATTGATCAGCTCGGCTATGACTAGGGATAGATTTGACTTTATTTTTCGCCACCTTCATGTCAATGATAATCTGGATTTGCAAGACAAATACACAAAAGTACGCCCCCTAGTTACACTTCTAAATAAAAAGTTCTTAGAGTTTTCTCCTCTTGAAGAGCATTACAGTGTAGATGAGGCCATGATCCCCTACTATGGTAAACATGGCTGCAAACAGCACATAAAAGGTAAACCTATTAGGTACGGGTTCAAAGCTTGGGTTGGTGCTACACGGTTAGGGTATGTTTTATGGATAGAACCATACCAAGGTGCTACAACTATGTGCAATCCAATATATAAAGAATTAGGGCTGGGTGCAAGCGTTGTTCTCACTTTTTGCGATGTGCTGATTTCACGTGGCTTCGACCTTCCTTACCACGTAGTTTTTGATATTTTTTTTACTGGGACGCCCTTGCTGGAAGAGATAACAAAAAAAGGCCTTCGTTGCACTGGGACAGTTCGAGAAAACAGGACATCTAGTTGTCCTCTGATTACATCGAAGTTACTGAAAAAAAAGGAACGTGGTGCTGTCGATTACAGAACGACACATGACAACACGTTCATTATTGCTAAATGGCATGACAATAATATATTCAGTATTGCTTCTAATGCTGTAGGAATAAATCCTAAACAATCTGCCAAACGCTTCTCACAAAGTGAAAAGAGAAACATTGTCATAGAAGAACCACATATGGTGTCCATCTATAACAAATATATGGGAGGAGTGGATCGGTCTGATGAAAATATTTCACATTACCGAATTGGTATACGAGGTAAGAAATGGTACATGCCGTTGCTCACACACATGATTGATCTTGCCGAACATAATGCATGGCAGTTATATAAAATAAATCATGGAAAACTGGATCATCTTGGCTTTCGCAGAAGGGTAGCAATTGCTTTGATTGAATCAAACAGTAAAAATGGCAAAAGAGGGCCTAGCCGACCCTCCCATCATGAACATGCTGATAGTCGTAAAGACCAAATGAATCATCTAGTTATTCCTCAATCGAAGGAAACACACTGTCGTCAATGTCACAAAAAATGTTTGACACGTTGCAAGAAATGTGATGTTGGAGTGTGTGTCAAGTGTTTTGAAACATATCATTCATAA

Protein sequence:

>DPOGS204951-PA
MANDRPLAAHEILDALENVSDNEEDYRERLICILPPPVDPDCLTDEDSGEEDNVTLNNLPRNILLQPAEVMIQGQIMVSDTEEEPSNSTGRTKRRRTYAWRKRDLAKNPVNWPDVQGACQDKRPIEWFENFLDEDVISLLVSESNKYAVKKNLPGDITTEDMKCFIGILLVSGYSWLPRRRMYWENSPDTKNELISSAMTRDRFDFIFRHLHVNDNLDLQDKYTKVRPLVTLLNKKFLEFSPLEEHYSVDEAMIPYYGKHGCKQHIKGKPIRYGFKAWVGATRLGYVLWIEPYQGATTMCNPIYKELGLGASVVLTFCDVLISRGFDLPYHVVFDIFFTGTPLLEEITKKGLRCTGTVRENRTSSCPLITSKLLKKKERGAVDYRTTHDNTFIIAKWHDNNIFSIASNAVGINPKQSAKRFSQSEKRNIVIEEPHMVSIYNKYMGGVDRSDENISHYRIGIRGKKWYMPLLTHMIDLAEHNAWQLYKINHGKLDHLGFRRRVAIALIESNSKNGKRGPSRPSHHEHADSRKDQMNHLVIPQSKETHCRQCHKKCLTRCKKCDVGVCVKCFETYHS-