Monarch geneset OGS2.0

DPOGS211918
TranscriptDPOGS211918-TA1758 bp
ProteinDPOGS211918-PA585 aa
Genomic positionDPSCF300011 + 125876-129823
RNAseq coverage148x (Rank: top 54%)
Annotation
HeliconiusHMEL0177140.064.58% 
BombyxBGIBMGA001061-TA0.058.12% 
DrosophilaCG17807-PA4e-11840.00% 
EBI UniRef50UniRef50_D2A2C24e-12142.88%Putative uncharacterized protein GLEAN_07671 n=1 Tax=Tribolium castaneum RepID=D2A2C2_TRICA
NCBI RefSeqXP_001657457.12e-12440.75%hypothetical protein AaeL_AAEL000938 [Aedes aegypti]
NCBI nr blastpgi|1571122464e-12340.75%hypothetical protein AaeL_AAEL000938 [Aedes aegypti]
NCBI nr blastxgi|1571122465e-12140.59%hypothetical protein AaeL_AAEL000938 [Aedes aegypti]
Group
Gene OntologyGO:00167063.6e-11oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors
GO:00551143.6e-11oxidation-reduction process
GO:00164913.6e-11oxidoreductase activity
GO:00081524e-11metabolic process
GO:00081684e-11methyltransferase activity
KEGG pathway 
InterPro domain[204-316] IPR0051233.6e-11Oxoglutarate/iron-dependent oxygenase
[386-476] IPR0132164e-11Methyltransferase type 11
Orthology groupMCL13088 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211918-TA
ATGGATGAATTGCGAAAAATAGAACGTAAAAAGAAGCGATTTGCTGTGAGACTTAAAAATTCTAAGAAAATTATATGTACAGATATTCCAGGATTGAATATAGTACTGTGTAACGTAGGCCAAGCCACTGGTTTTAAGAAAAATGATTTATTATTAATGTTAGAGAAGATAACACCCACACCGATAGTTTTAAAATTCATAGCAGAAAAAGGCGATTCACATTGTTTTATTGTGTTCAGTAGAATTGAAGAAGCGAAAATCTTTTACGATACATATGACGGCACTTTAAGCGGCAATAGTGATGTGCCTTTATACATGTGTTTTGTTGAAAGTGTTCCAAACAACGAGATAATTTGTTCCTCGAGCAATCCTGAAGGTTTAACATTAATTGAAGGATTTATCACTGAAGATGAAGAGAAGCAGCTATACCAACTCTTCGACTGGATTGATGAATCAAACTTGAAGAATAGACAAGTGAAACATTACGGCTATGAATTTAGATATGGCTCTAACGATGTAGACTTAAATCAACCCCTTGATGAGAAAATACCCCAGGAATGTGAAATTATTTGGAAGAGACTTGAAGATTATGGTATTAATTTTAGCATTCCAGACCAATTAACTGTCAATAAGTACAGTCCAGGACAAGGTATACCAAGCCATGTGGACAAACACAGTCCATTCGGAGATACGATTTTATCTCTGTCGTTGAATTCATCAGTTGTAATGGACTGGAAACATCACAGTAAGGCGTATGTACCAGTGGTGGTGCCGTCTAGATCATTGCTGGTGATGCAAGCTGAAGCAAGATATGACTGGCAGCATGGCATTCAACCGAGGACATGGGATCCCATCATAGAAGTCAGAAAAATCGATAATGGACTAGTGAAAGTGATCACGAGTGAAACAAAAGCTCGCGGGACGCGGATATCTTTGACATTCAGGAAGACGAGGCAGGGGCCGTGTAACTGTTGTTATGAGACACTCTGTGACAGCAGGGTGACGGCCGACTTGGATGAAGTCGCCTCACATTTAGAAGATCTCCATGTACATCAGGTATATGAGCAGATAGCTGGTCACTTCAGTTCGACTCGTCACAAGCCCTGGCCGAAGGTGGTGGAATTCCTCCAAGACATACCACCCGGCTCTATCGTACTGGACCTTGGAGCTGGAAATGGGAAGAACATACTCAATAGGAATGATCTATTGCAGGTCGCTTGCGAGTACAGCGCGGGCCTGTTGTCCGAGTGCAGGTCAGTCACGCGCGCGTGCTGCGTGCGGGCGGACGTGTTGCACGCTCCGCTCTGTGACTGCCGCGCCGACGCTGTGCTGTGCGTCGCCGTCATACATCACTTTAGTACTTACGCCAGGAGACGTCAAGCGATCGCCTCAATAGCTCGTCTTTTACGTCCCGGAGGAAGAGCACTCGTCACTGTGTGGGCGAAGGATCAGAGTAAATCGAACTATCTCTGTAAGGACAAAGTAAACACCGAATCGGATACTTACAAGACGGCCGGCATTCACCTACCGGTGCATCAGAACAGGACGCAGTTCAAACATAACGACGTTTTAGTGCCGTGGAAACTCAGGAAGGTCAAGGAGAACAAACTGGAGAATGAGTCCAGCCAGACCCTGCTGAGATATTACCACGTGTTCGAGGAACACGAGCTCGAGGAACTGTGCCGGAACCAAGGAGTCGTCGTCGAAAAGAGTTTCTATGAGGAAGGCAACTGGTGTGTCATATGTCAAAAGATTTGA

Protein sequence:

>DPOGS211918-PA
MDELRKIERKKKRFAVRLKNSKKIICTDIPGLNIVLCNVGQATGFKKNDLLLMLEKITPTPIVLKFIAEKGDSHCFIVFSRIEEAKIFYDTYDGTLSGNSDVPLYMCFVESVPNNEIICSSSNPEGLTLIEGFITEDEEKQLYQLFDWIDESNLKNRQVKHYGYEFRYGSNDVDLNQPLDEKIPQECEIIWKRLEDYGINFSIPDQLTVNKYSPGQGIPSHVDKHSPFGDTILSLSLNSSVVMDWKHHSKAYVPVVVPSRSLLVMQAEARYDWQHGIQPRTWDPIIEVRKIDNGLVKVITSETKARGTRISLTFRKTRQGPCNCCYETLCDSRVTADLDEVASHLEDLHVHQVYEQIAGHFSSTRHKPWPKVVEFLQDIPPGSIVLDLGAGNGKNILNRNDLLQVACEYSAGLLSECRSVTRACCVRADVLHAPLCDCRADAVLCVAVIHHFSTYARRRQAIASIARLLRPGGRALVTVWAKDQSKSNYLCKDKVNTESDTYKTAGIHLPVHQNRTQFKHNDVLVPWKLRKVKENKLENESSQTLLRYYHVFEEHELEELCRNQGVVVEKSFYEEGNWCVICQKI-