Monarch geneset OGS2.0

DPOGS214550
TranscriptDPOGS214550-TA2565 bp
ProteinDPOGS214550-PA854 aa
Genomic positionDPSCF300266 - 147162-154524
RNAseq coverage202x (Rank: top 47%)
Annotation
HeliconiusHMEL0161140.051.97% 
BombyxBGIBMGA003278-TA0.049.89% 
DrosophilaD12-PA1e-3528.80% 
EBI UniRef50UniRef50_UPI00020639BD1e-5033.91%UPI00020639BD related cluster n=3 Tax=unknown RepID=UPI00020639BD
NCBI RefSeqXP_392847.33e-4936.79%PREDICTED: similar to YEATS domain containing 2 [Apis mellifera]
NCBI nr blastpgi|3838633126e-5238.19%PREDICTED: uncharacterized protein LOC100881401 [Megachile rotundata]
NCBI nr blastxgi|3838633127e-5332.00%PREDICTED: uncharacterized protein LOC100881401 [Megachile rotundata]
Group
Gene OntologyGO:00056342.2e-40nucleus
GO:00063552.2e-40regulation of transcription, DNA-dependent
KEGG pathway 
InterPro domain[9-455] IPR0050332.2e-40YEATS
Orthology groupMCL15464 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214550-TA
ATGGAACATAAAGAAGAGTACCACGATCCAGACTATCCAGAAGCACCGGCGGTTGAAAAACCAGACAAGCCAAAAGTCTCTCAGGAAGATAACATTCAAACAATAAAGACTATAATACGCCGCGAGTTTCAAAACGAATTAGATGTTCGAGAGAGGGAAGTTAATCTGATCGACCAAAGGATGTCCCTAGCAAGGCGGTACCTCCACGAGTTGAGGTATGCTGTCGTGAACAGCTACTACAACAATCAAAAGCTACAATTATCAGCTACCCAGGTGGAAGACGAGGTTGCAGCACAAACGGAACCACGAGCTAGATCTGAGGTGTCCTCTATACTCCGTAACACACAGCCCAGGATACATCCGTCAGTACAGAAGCTGCTCGGCAAGAAATCTGTTGCCATCGAGGAGATATTCAAATCAAGAGCACCAAGGAAAACCAGGAGGGACTATGGGGCTATGGTGCAGAAGAGGAATTACACGATATCAGCTGATGAGACGAAGTCGCTCCGGCCGGACAAGAATGAGCCCGGCCTGAATGTGGTGAAGACGGAAAGCAACGAGCACGAGGACAGGTCGGAGGCCAAAGGTCAAGTCCCAAGCAGCAGCAGGCCAAAGAAGATCCCTCGCCAGATAGACCCGAAGGTGAACAATGTGATCACAGTGGACGAGGTCACTAGGAACCAAATGAAACACAGATATAGAGTCATTATAGGCAACACGTCAAAGTACGCGCCCCCGGCGTCCCGCTGTGACCGTTCCACCCACAAATGGTTGTTGTATGTCAGAGGAGCGCCCGTAGTGGAAGCCATCACTGTTAGGTTACACCACTCGTACGCGCCTCACGACACTGTACATATAGACAAGCCTCCATTTCAAGTGTGTCGCCGTGGTTGGGGCGAGTTCCCAGCGCTGGTTACTCTCCACTTCCTCAAGTCATATCTGAACAGACCGGCAACCATCACACACACCATCAAACTAGACAGACAGTACACCGGCCTGCAGACTCTAGGTGCGGAGACAGTTGTGGATGTATGGTTATACAGCACACCGGATATGATAGAACACCAGCAGAGGGACGAAGAAGTGAAGGAGATCAAAGAGGAAGTGAAAGAAGAAGTGAGAGAAGAAGAGAACAGAGTCAGCGGAGACGATAAACAAGACAGCTGGCTGGAGTTCTTTGCAAAAGACACGAGTCAAGTGAACGTTGATGAGATGTTAGTTAAGAATGAAATAAAAACCGAGACGGTGATGGACAAGCACGGTGATGACAATGATGAAGTGAGCGATGAAGTGAAGAACACACAGAACAAGAGGATAATGAAGTACATAGAGCCGACCACAGGGAAAATATACTATCTGGAAATGGACAGGGCCCTAGACCTGACCAAGGTGCAAGAAATAGTAATAAACTCGGAGGGGAATGTGAAGACAGCAAAAATAAGCCCGCTGAAGACAAACGGCCTGAAGACGACCAAGAACAAGGAGTCCATCTTAAGGTCGCTATTGAAAACGGAGGATTGTGACGAGTATACTTACGATCACATAGAGAACGATCACTGTTACCTAGCCAGCGACTGGTACAAGAGGGACCATAGACAAGCTAGAGTGGAGGAGGCCAGAGACAAGTCGAAGAGTCTAGTCTACAGCAAATACAAAGATATTATATCCAAATTCACGTGTGTCAAGTCTATGGTCAGTTACCTGTTGAAACACATGCCGTTGGTGAGCGAGGCGGCCGGCGACGCGGGCTACGTCTCAATGTTCCCGTTCGTTGTCACATCCGACGACAGATACTGGAAGTTGGACTTCGCTAAACGAAGGAATATGGAGTGGTCACGGGCCAAGTTGATCAACAAGTTACTCACAGAGACCTTCAAGGCTGATCCCGGTAAGGTCTGGAGGACGAAACAGATCCTGGTATACTCGAGATTACACGGATACTATCCAATAAGGCGCGAGAAGGCGGACCTCAGAACCGACGAGTGGTCCTCGTGGAACGATCTGGATGAAGGGAAATCAGAATCGAATATAAGAGAGGTGTTCCCTAACGAGAGCGACCTGTCCACGTTGAGCGTGTTCAATAAAAGTGATTACGTCACCGAGGGTGCGGTTGTGGATTTAGATGTGAGCGGTTCCGACGAAGAGATAGAAATAGTCGGTGATGTGAGCGGCCAGAAGAAGCCTGTGCTGGTGGAGCGGCCTGTGAGTGATGACGTGCTGCCCGTGGACAGCAGCGACCGGCTCAGGTTCCTGTTCATAGAGAAAGTTTGTGAAGACATCGGCATCGTATTGAGGAATGAGGACATAGGTCACGGTTACTCTTACAGTTCGGTCCACTCAGTCTTGTTGTCGGCCACCAAGTGTTTCGCTGAAGAGCTGATCAGGTCGTCTCTCGCCAGACAACTCACCTCAGAGCTGGGAGAGGGACGCGTCTGGGTCGGCTGGTCCAGGCCTCGCGTGTGTCTCCAGCACGTGTTCCTCGCCACCAGCGACTCCAGGTTACAGCTGGTGACGTCATCACACCTCGCAGCCGCCGCGCACACACACACACCGCCGCCGCTATAA

Protein sequence:

>DPOGS214550-PA
MEHKEEYHDPDYPEAPAVEKPDKPKVSQEDNIQTIKTIIRREFQNELDVREREVNLIDQRMSLARRYLHELRYAVVNSYYNNQKLQLSATQVEDEVAAQTEPRARSEVSSILRNTQPRIHPSVQKLLGKKSVAIEEIFKSRAPRKTRRDYGAMVQKRNYTISADETKSLRPDKNEPGLNVVKTESNEHEDRSEAKGQVPSSSRPKKIPRQIDPKVNNVITVDEVTRNQMKHRYRVIIGNTSKYAPPASRCDRSTHKWLLYVRGAPVVEAITVRLHHSYAPHDTVHIDKPPFQVCRRGWGEFPALVTLHFLKSYLNRPATITHTIKLDRQYTGLQTLGAETVVDVWLYSTPDMIEHQQRDEEVKEIKEEVKEEVREEENRVSGDDKQDSWLEFFAKDTSQVNVDEMLVKNEIKTETVMDKHGDDNDEVSDEVKNTQNKRIMKYIEPTTGKIYYLEMDRALDLTKVQEIVINSEGNVKTAKISPLKTNGLKTTKNKESILRSLLKTEDCDEYTYDHIENDHCYLASDWYKRDHRQARVEEARDKSKSLVYSKYKDIISKFTCVKSMVSYLLKHMPLVSEAAGDAGYVSMFPFVVTSDDRYWKLDFAKRRNMEWSRAKLINKLLTETFKADPGKVWRTKQILVYSRLHGYYPIRREKADLRTDEWSSWNDLDEGKSESNIREVFPNESDLSTLSVFNKSDYVTEGAVVDLDVSGSDEEIEIVGDVSGQKKPVLVERPVSDDVLPVDSSDRLRFLFIEKVCEDIGIVLRNEDIGHGYSYSSVHSVLLSATKCFAEELIRSSLARQLTSELGEGRVWVGWSRPRVCLQHVFLATSDSRLQLVTSSHLAAAAHTHTPPPL-