Monarch geneset OGS2.0

DPOGS213010
TranscriptDPOGS213010-TA2394 bp
ProteinDPOGS213010-PA797 aa
Genomic positionDPSCF300024 - 43929-46961
RNAseq coverage385x (Rank: top 31%)
Annotation
HeliconiusHMEL0134370.064.07% 
BombyxBGIBMGA006921-TA0.056.79% 
DrosophilaCG1553-PC8e-9636.40% 
EBI UniRef50UniRef50_E2C1318e-12134.35%PIH1 domain-containing protein 1 n=3 Tax=Harpegnathos saltator RepID=E2C131_HARSA
NCBI RefSeqXP_969728.14e-12140.66%PREDICTED: similar to CG1553 CG1553-PA [Tribolium castaneum]
NCBI nr blastpgi|3071969773e-12034.35%PIH1 domain-containing protein 1 [Harpegnathos saltator]
NCBI nr blastxgi|3071969772e-12734.01%PIH1 domain-containing protein 1 [Harpegnathos saltator]
Group
KEGG pathway 
InterPro domain[51-514] IPR0129815e-40Nop17p
Orthology groupMCL13689 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213010-TA
ATGGCAACAGGAGTAAAACCTCGTGATGAAGAACCCAATTTTACTAAAGCAGATCTCGAAGCAATTCAAGACGCTATGAAAAGCAAGAAATTTCGAGAACTACTTAACGAATACTGCGATGAAGTACGTGACCCGGCAAATCAAGCAATCTATCAAAAAGAAATGACCCAACTAGAAAAGGAACGTGGCTACGATGTAACTTTCATCAATCCTAAAGGCAGTTATGTTATTAAAACGAGCGTCGCTGGCGACAGAAAGGCTTTTATAAATATTTGCAGCAATGAGAATATTGAGAAACCATCTTGCCGTGTTCAGGAGATTAATGGTCAAAAAGGTATGAATTGGCAACTACCATATACCCTTATTCCTCCTCGGGAAGATTTCGATCACAACAAACAACGCTGCGTAATTTACGATGTTGTTTTTCACCCGGACACACTGCGCATGGCAGAGGTTAACAAAAAATTCAGAGAATTGGTAAACAAAAGTGCCTTTGATGGATTACAGAAAACATACAACATTCACTTAGATAGCAACAATTGCCGCTTTCCTAAGTCTTACTACAAAGGAATGTCGACCGCGGCTGTGATTAGGAAAGAAGACCCTAACTTTAAACCAGTTACTGATGAAGAAAATGAGGAATTATCCCAAGATGTAATTGACAAGTTATATCCTCAGCATTTATATCCAGACAAAAAACAATCCCCAGAAGTTAAAACAAATAAAACTATAAAAGAACATAGAAAAAAAGCAAAAAACTGTCATAAAGATTACACTCATGCCTCCGAAAATGGTTATACAATGCCAAAATATGTTATTAAACAGCAAAAAAATGTTGACATGCAGGAATTTACTAACAGTAGAGATTGTAAACACTACTCTGCTATACCAAGTCATATAGTTGTTGAAATAAACATGCCGCTTATCTCTTCAACACAAGATTGTTCTTTAGATGTTCGGGAGAAGACATTAAGTTTAATCACTGAAAAACCAGCAAAATATAAATTAGACCTTGAATTGCCATATCCTGTTAATAGCGATTGTGGCAATGCTAGATTTGACAAAACTAAACATACTCTAACTATAACTTTACCTGTAATCAGAAAGAGTTTATCCAGTACTTCATTGTCCCTGAAAGGAGACAGTGGGGTTGAAAGTGAAGATACTTCTAACAGTGACGAAGAATGTAAGACAAATCTTATAGAAGAAATATCATCTACTCCATCAGAACCCGCTGGATATAAGTCACTCAGCCCTATTGATGAAACTGGATTCTTGGATTCCTCAATTGGTTACACACTCCCTCCATATACATACAATGCATTAGATGATATTCTAGCTTTCACATTTCATGTTAAAAATACCGAACCTGATTCTGTGAAAGTAAAGCATGACAAAAACAGTATATATATCAAGTTCTCCTCTCTTGGTTCTGGGTTTGTTCCTTTACACTATGCAGCCCTAATTGTATTCAGCGATGAAATAAATTTGGAAAATGTATCTGGTGAAGCTTGGGACAATAATGTTATTTTACAATTAGAAGTGCTTGGAAACCTACCTGAAAAATTCCAAATTGGTTTGACAGAAAGTGACATGAAAACTGAATGCTTTGATGTCGTTCCAACTAAGCAAATTGTTAATACCGAAGAATTTGAAGAAAAGACGTCAGAAGAGGATACCCTAACTCCATTAATAGAAGTCACGAACTTTGGAAAAGAAACAAATATCGTTGTGTCATCAAAAAGCAATGAAGATAGCATCTTACGCAAGTCTATGCCCATGATGCGTAGTTTCTCTGAATCAAGCGCTGGAGATATAGCATCATCCATGGACTACATCAGCTCTGATTATATCCATGAGGAATCTAGTTTAAAAAAGACTGTTAGGTTTAATGATGTCATTGCTAAGCAATTTTACAGATATAATTCTTCAATTGAGGGCCAAAAAAAGAAAAATCAGCGTAAGAAGAGCAAAAAACGTAACCTTGAAAGGCGTAAGAGTGAAAGTGAAGCTGAAGATGACACTTCAAATTCATTAAAGTTTTCCAAACCAAGGAAGGCTTCTATCAAACAGCGTCATGACAGTGGTGTGCCGGACACTTCAGATGCTGAGGAAAGTAAAAATGTAATGTCCGACAGTGATTTTTACAGCAACCAGTATGATGAAAATGCCAATGAAAACACTATAAAAACAGAAAAAACAAAGAATTTGGACAGTAATGGTAACAAAAACATTGAAAATTGGCAGAGAGATGCAAAACCTTTGACACAAAAGAAGCAGTCATGTGATAGTATAAAACATAACACAAATCTCTATGATCCCGACAGATTAAACAAAGGCCAATATCTGGAAGTCCAATTCAAGAATGATCTCATATTTGATCTTGATATGTGA

Protein sequence:

>DPOGS213010-PA
MATGVKPRDEEPNFTKADLEAIQDAMKSKKFRELLNEYCDEVRDPANQAIYQKEMTQLEKERGYDVTFINPKGSYVIKTSVAGDRKAFINICSNENIEKPSCRVQEINGQKGMNWQLPYTLIPPREDFDHNKQRCVIYDVVFHPDTLRMAEVNKKFRELVNKSAFDGLQKTYNIHLDSNNCRFPKSYYKGMSTAAVIRKEDPNFKPVTDEENEELSQDVIDKLYPQHLYPDKKQSPEVKTNKTIKEHRKKAKNCHKDYTHASENGYTMPKYVIKQQKNVDMQEFTNSRDCKHYSAIPSHIVVEINMPLISSTQDCSLDVREKTLSLITEKPAKYKLDLELPYPVNSDCGNARFDKTKHTLTITLPVIRKSLSSTSLSLKGDSGVESEDTSNSDEECKTNLIEEISSTPSEPAGYKSLSPIDETGFLDSSIGYTLPPYTYNALDDILAFTFHVKNTEPDSVKVKHDKNSIYIKFSSLGSGFVPLHYAALIVFSDEINLENVSGEAWDNNVILQLEVLGNLPEKFQIGLTESDMKTECFDVVPTKQIVNTEEFEEKTSEEDTLTPLIEVTNFGKETNIVVSSKSNEDSILRKSMPMMRSFSESSAGDIASSMDYISSDYIHEESSLKKTVRFNDVIAKQFYRYNSSIEGQKKKNQRKKSKKRNLERRKSESEAEDDTSNSLKFSKPRKASIKQRHDSGVPDTSDAEESKNVMSDSDFYSNQYDENANENTIKTEKTKNLDSNGNKNIENWQRDAKPLTQKKQSCDSIKHNTNLYDPDRLNKGQYLEVQFKNDLIFDLDM-