Monarch geneset OGS2.0

DPOGS209742
TranscriptDPOGS209742-TA4044 bp
ProteinDPOGS209742-PA1347 aa
Genomic positionDPSCF300105 + 390538-401704
RNAseq coverage245x (Rank: top 42%)
Annotation
HeliconiusHMEL0113405e-7570.21% 
BombyxBGIBMGA008956-TA0.050.77% 
DrosophilaCG9642-PA4e-5932.45% 
EBI UniRef50UniRef50_D6WV301e-8437.13%Putative uncharacterized protein n=3 Tax=Tribolium castaneum RepID=D6WV30_TRICA
NCBI RefSeqXP_966915.12e-8537.13%PREDICTED: similar to CG9642 CG9642-PA [Tribolium castaneum]
NCBI nr blastpgi|910880534e-8437.13%PREDICTED: similar to CG9642 CG9642-PA [Tribolium castaneum]
NCBI nr blastxgi|2420196903e-8535.39%histone-lysine N-methyltransferase ASHR1, putative [Pediculus humanus corporis]
Group
KEGG pathway 
InterPro domain[702-742] IPR0051721.1e-12Tesmin/TSO1-like, CXC
Orthology groupMCL16421 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209742-TA
ATGGATCATAATTTGGACGATTCTTTAAATTTGGAGAATGCTATGGGAGTTGATTTTGGACACACTGATGATGTGGAGGTCAGTCAAGCAAGTATAACGTTGATACAAAGTGGTCATGATGAAAGTATTCCCATGGAATTTGAGCACAGCGAAGAACAACCCATGCTTATGGATACAGGAAATGAGGAGATTATTGATTTCATGAGTGATCAGTTCACCCTTCAAGAATATTCTGTTCAAAGCGATCAAACAACAGAATTGACGACAACCGTCGAATCAAATCAACAGCACATGCTTACTTCACAAACAGACACATTGGTCGATATACAGTTCACTCCTCAAATAGTAAGCAACTCCACAATAACACAAGACAATGGGGAACCGAAGATGGTTGCTGTTAAGTCACTTTTGCCGAAAACTACAAAAAAAAGTGACTCAATGGCCACCCTTGCAACAATGCCTCGTCAAGTGGCCATTGCACCCAAGCCGCCGAAACTTGTGAGCAAAACATTTGCGCCCAAACAACTGGCTATAGCACCGAAACCGGTCACAATGATATCTAATAGAGGACAAAGCCTCGTCAAAAAGGTATCACTAGCCAATGTGATACAGGGAACCACTAAAGGGAAGACAGTTTTGGCACAAATCGGCAAGCAGCTTATCATGGTGCCTTCTGGATCACAGAAAATTAAACTTGTCACCGCGGCACCGGGAACTAACACAGTGCAGTACATCAGAGCTGACGGAGAACAAGCACAGCTTATTGTCAATAAGAATACTGGTGCCACACAAAACAAGCCGCTGCTGACCAAACTTATAACAGTGCCAGGAGCAACCAGCGACTCGAACGCGCCGGCCGTCATCACTAAAATAGAGCCGTCACGGTTTGTGGTACAACAGAAGACAATACCGCTTACTGTGGGAAATAAGGTTTTAATGGCAACACCGTCAAAACAGCCCGTGAGGCTGTCGAAGAAACAAGAAATAATAACTATCAAGTCGCCTACACCAAAATTGATGCCAGCGACAGCCATAAATACAGCAACAAAACAGAAAGTTATTATAAATCCTAGCGTGAATGCGATGCTTAAGGCAGCGGCAGCGCAGAAAAAAACACAAAATCCAGAAGATGGCGCCACAGTCGCGGAAGCCGGGAAGTCACAGCTGCATCAGATTAATGTACCGGGCAAAGGCATTCAATACATCCGCCTCGTCACCAACTCGTCGTCGGAGACGCCGAAGCCGGTTCCCAAGCAGCTGATGGCGCTGCCCTCGAAGACATTCGTGTTGACGGACAATAAAGGTAATCTGATACAGATGACTGCTGAGAAAATGTCAGCCGGCCAGCCGCCGTCACTCGTCGTCACCGGGAAAGGGACGGCGGTTAACAAATTGAATGCGTCGAAACCGCCACAAAAGTTAGTGAGAATAGCACCTATAGTGAAAACATCGCAAACTGTTACTCAGTCTCCCGCGAGCTTGTTGGCGCCGCTGTCCCCGGGCTCGCCCGAGCGTGATCCGAGTCCTGCCCAGGAGTCCAAGGTGACGCTCAGAGCCCTCGTGGAACAAGCCAACGACGACTGCAGCTTGGCCGAGGTCGAGGTGGACTTCAAGAAAGACGGTAGTCCGACACCCGAAGATAGTATGGACAGCATTGATCAATACAATCACACAAATAAATCTGAAGATCATCCCCTTATAGTGATACCGTCGGCCTACGATCAAATGATTGTACCAGACGACTCGGAGAGAATGGATGAGTCTCAAGACGCAAACAACACTTTGAATGTGGATATGGATATGACGAACTATCAGTCACCGACGACGCCTGCTCTCTCTGAAACGGATCAAGTGACCACTGAACTAGGTGGTCTGAGGCCTCGCAAAGCATGCAACTGCACAAAATCTCAATGCCTGAAGCTGTACTGTGACTGTTTCGCTAACGGAGAGTTCTGCAATAGATGTAACTGCAACAACTGTCACAATAATCTGGAAAATGAGGAACTCCGGCAAAAAGCTATCAGAGGCTGTCTGGACAGGAATCCTAACGCTTTTAGGCCCAAGATCGGGAAATCGAAGGCGGGCGGCCCGGAGATAATACGACGGCATAATAAGGGATGCAACTGTAAACGGAGCGGCTGTCTTAAAAACTATTGTGAATGTTACGAGGCTAAGATAGCGTGTTCGTCGATCTGCAAGTGTGTCGGATGTCGCAATGTGGAGGAAACCCTGGAGCGCGGCCGTCGCCGAGACGCGCCTCGAGCCCTCCAACCACTGGCCGGACCGCCCACCTACCGGCCGCACGTCGCACTCGCACACGCCAAGCAACCATGCAGCTTCATGACTTCCGAGGTGATCGAAGCTGTGTGCCAGTGTCTGATAGCGGCGGCTGTTGACAACAAGGAGGAGGCGGAGCGGCGGGACGCGGACCCCATGCGTGACGTCATCGAGGAGTTTGCGCGCTGTCTCCAGGACATCATCAGCGCTGCACATCAGAGCGCGCCGCTCGCCTTGCTGGACGAGGTGAGGGAGGGGGAGGGATATCTAGTCGCTGCAAAAGACATAAAGGCGGGTGAGAGGATACTATCCGATCAGCCTTTCGTGCTGGGTCCGAGTAGTGACACCTCCTTAGTTTGCTTCAATTGTTACTTGCCATTGATCAACAAGTTCCTTGTCTGTAAGAACTGTGCCGTTGCCCCGCTTTGTCCTGGGGATGGATGCCCTGATGAGTTTACAAAGTATCATAATAGACAAGAATGCGATGTTTTTCGTAATTTAAAACTTACGAAGGGTATAAGTCCTATGACTATGGTACAGAACGTCGGTTCCTTATCGGTCCTCCGAGCTTTGTTGAAGAAAGAAACTAATTTGCTGGAGTGGAAATTGTTTATGGAATTAGAAACTCATTTAGAGAGAAGAAGAGAAAGTAACGTCTGGCAATACTATGATAATACTGTAAAGTTTATTCAGTCGTTGGGACTCCTTGAAAACGGACAAAACCAGGATTTAGTTCAGAAGATTTGTGCAGCAATTGATGTAAATAGTTTCGAAGTGCGCGGTCCACCGATCCCAGCTATAGGGTGTGCTGAGATTTTGCGAGGGGTTTATTTGCAGGCAGCGTTACTCAGCCACGACTGTATTGCTAACACGCACATGTCCATAGACGACAATAATATGTTAGTGTGCCACGCCAGCGTTGACATTAAAAAGGGGGAATCGATAAATTATAATTATACAGATCCCTTAAAGGGTACAATACCAAGGCAGCAACATCTCATCGTTGGTAAATACTTTAAATGCACTTGTACCAGATGTACAGACAACACAGAGCTGGGAACTTTTATGAGTTCTGCTATCTGTCCCGGATGTAAGAAGGGATACATAACAAAAAATGGTGACGTATGGATTTGTTTTGACTGTAAAAAGGAAGTTGAGTCTATGGATATCGAACCTAAGATTAAATGTTGTTCTGACAAATTAGAAGTCATAAATAAGAAAGATGAAAAAGAGTTAGAGGAATACATAAAATATGTGTCCCTTGTGTTGGCGCCCGGTCACTATTTATTATTGGATGCCAAGCAACGTCTCGCGGGAGTCCTACGTGACACAATTAACAGAGAACCTCGGCCTACTAAGAAATTAATGGTTAGGAAGCTAGCACTCTGTGAGGAGATATTGCCTATACTAGAAATAGTTAGCCCTGGAATATGTAGGACGAAGGCGATCACCCTTTACGAATTACACGCGACCACAGTGCAATTAGCCAAAAAGTTATTCGATAGTCGAGAAATGTCTGGATCTGCCTACGTGGACGCGTTGCTTAAAGCGGAAAAGTATCTGAAACGCGCTTTGGAAATGTTAGTCATTGAACCGGGTAATTCCCCCGAAGGAGAACTCTGTGCTAAAGCGTTAGAAGAATACAGGGCATTAAAAGTTACGATTTCCAAGATAGTAGATTCTCTACATGCCCAGGGAAAGACCTTTGGTATGGACTCTACGACCATAGAAGAAACGGGAAATTCTACGTATCTAGATGTGGATTAA

Protein sequence:

>DPOGS209742-PA
MDHNLDDSLNLENAMGVDFGHTDDVEVSQASITLIQSGHDESIPMEFEHSEEQPMLMDTGNEEIIDFMSDQFTLQEYSVQSDQTTELTTTVESNQQHMLTSQTDTLVDIQFTPQIVSNSTITQDNGEPKMVAVKSLLPKTTKKSDSMATLATMPRQVAIAPKPPKLVSKTFAPKQLAIAPKPVTMISNRGQSLVKKVSLANVIQGTTKGKTVLAQIGKQLIMVPSGSQKIKLVTAAPGTNTVQYIRADGEQAQLIVNKNTGATQNKPLLTKLITVPGATSDSNAPAVITKIEPSRFVVQQKTIPLTVGNKVLMATPSKQPVRLSKKQEIITIKSPTPKLMPATAINTATKQKVIINPSVNAMLKAAAAQKKTQNPEDGATVAEAGKSQLHQINVPGKGIQYIRLVTNSSSETPKPVPKQLMALPSKTFVLTDNKGNLIQMTAEKMSAGQPPSLVVTGKGTAVNKLNASKPPQKLVRIAPIVKTSQTVTQSPASLLAPLSPGSPERDPSPAQESKVTLRALVEQANDDCSLAEVEVDFKKDGSPTPEDSMDSIDQYNHTNKSEDHPLIVIPSAYDQMIVPDDSERMDESQDANNTLNVDMDMTNYQSPTTPALSETDQVTTELGGLRPRKACNCTKSQCLKLYCDCFANGEFCNRCNCNNCHNNLENEELRQKAIRGCLDRNPNAFRPKIGKSKAGGPEIIRRHNKGCNCKRSGCLKNYCECYEAKIACSSICKCVGCRNVEETLERGRRRDAPRALQPLAGPPTYRPHVALAHAKQPCSFMTSEVIEAVCQCLIAAAVDNKEEAERRDADPMRDVIEEFARCLQDIISAAHQSAPLALLDEVREGEGYLVAAKDIKAGERILSDQPFVLGPSSDTSLVCFNCYLPLINKFLVCKNCAVAPLCPGDGCPDEFTKYHNRQECDVFRNLKLTKGISPMTMVQNVGSLSVLRALLKKETNLLEWKLFMELETHLERRRESNVWQYYDNTVKFIQSLGLLENGQNQDLVQKICAAIDVNSFEVRGPPIPAIGCAEILRGVYLQAALLSHDCIANTHMSIDDNNMLVCHASVDIKKGESINYNYTDPLKGTIPRQQHLIVGKYFKCTCTRCTDNTELGTFMSSAICPGCKKGYITKNGDVWICFDCKKEVESMDIEPKIKCCSDKLEVINKKDEKELEEYIKYVSLVLAPGHYLLLDAKQRLAGVLRDTINREPRPTKKLMVRKLALCEEILPILEIVSPGICRTKAITLYELHATTVQLAKKLFDSREMSGSAYVDALLKAEKYLKRALEMLVIEPGNSPEGELCAKALEEYRALKVTISKIVDSLHAQGKTFGMDSTTIEETGNSTYLDVD-