Monarch geneset OGS2.0

DPOGS210084
Transcript: DPOGS210084-TA (1899 bp)
Protein: DPOGS210084-PA (632 aa)
Genomic position: DPSCF300017 + 283469-287826
RNAseq coverage: 36x (Rank: top 74%)
Annotation
Heliconius:      HMEL005172              0.0     56.84%
Bombyx:          BGIBMGA011886-TA        3e-153  44.57%
Drosophila:      Pxd-PA                  1e-62   28.33%
EBI UniRef50:    UniRef50_UPI00020616C9  6e-70   31.23%  UPI00020616C9 related cluster n=1 Tax=unknown RepID=UPI00020616C9
NCBI RefSeq:     XP_967241.1             2e-71   29.37%  PREDICTED: similar to AGAP010734-PA [Tribolium castaneum]
NCBI nr blastp:  gi|91078176             5e-70   29.37%  PREDICTED: similar to AGAP010734-PA [Tribolium castaneum]
NCBI nr blastx:  gi|328707938            3e-68   31.06%  PREDICTED: peroxidase-like [Acyrthosiphon pisum]
Group
Gene Ontology:
GO:0006979  2e-109  response to oxidative stress
GO:0020037  2e-109  heme binding
GO:0004601  2e-109  peroxidase activity
GO:0055114  2e-109  oxidation-reduction process
KEGG pathway:
ecb:100070907  5e-34
K10788 (EPX, EPO)  maps-> Asthma
InterPro domain:
[58-625]   IPR010255  2e-109    Haem peroxidase
[206-600]  IPR002007  3.3e-100  Haem peroxidase, animal
[88-99]    IPR019791  1.5e-09   Haem peroxidase, animal, subgroup
Orthology group: MCL21146 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210084-TA
ATGGAAATCGGTCTTAGAAGTTTTTTGATTATTTCACTTGTTGGATTTAGTTTGTATTCGTGCCTTGAACCAACTTATTATGATACTTACACTGGATATGTTTTGTCGTTAGAAGAAGTCAAAGGCCACATTAAAAGAAATAGTTCAGAATGGTGTGTGAGTGAAGTGCTACCATGTAACCCACAAGAGACAAGACGACTGGATGGAACCTGTAACAACCTCAAGCACCCCAACAGAGGAGCCTCGCACACCCCGACATATAGACTGCTACCAGCCCATTATGATAAGGATTTTGAGCCAAGAAAATCCAAGAGCGGTAAACCTCTGTCGCTATGTAGGAAAATCCGTACGTCATTGTTAGCCGAGGGGAGAGTCCCTGACACCGAATTGACACAGCTATTGTTGCATTTCTGGGTATTTGTTTCGTCTGACGTGTTATCATTGCATGATACAGTAAACTACATTTTATGGAAACCATATTGCTGTCAAGAAAGAGGGAAGACAGACAAGGGTTGTATTCCGAACATAATCCCTGAAGATGATCCTGTTCATCGCTTCTCTTCTATCCGCTGCATGAACTTAACCAGACCTTGGAGTTACCAATCTACAGGGTGTTATAGAAATGACACTACCCCAGAAAGAATAATAACAGCTAGTCCCGCATACGATTTATCTCACGTGTATGGTCTCTCTTTAAAATTAATAAATGAAAAGCACAGAAGTTTTAAAAATGGTATGCTCAAGTTTGAAGTTGAAAATAATATGATATGGCCCCCAAGCACAAAGACTCCGGTCAACCTATGTCTTCTTAATCAAAAACCGAAGGAAACTCGTTGTCACGATACTCCTGAATCGGGCTCCAATAGTGTACTGGGTCTTAATCTTTTTGTCATCTGGACTTGGCGTTTTCACAATTTCGTGGCATCAGAACTATCAAGAATCAATCCTTGCTGGTCTGATGACAGACTCTTCTTCACAGCCAGAGATATCGTCATTGCTTACTATATGCAGATGTTCTATTATGAATTGTCACCGACACTGTTGGGTTACGAAAATCTTCTTCGAGACGGAGTTCTTTCACCTTTCAAAGACTTTAGAGATCTTTACAAGGAGGATCTCTTGCCACAAATATCTATAGAATATCCTGTGGTTCTTAGATGGGCTCACACTATAACTGAGGGAGTACTAAAAATGTACGACGCAAAAGGAAATTATTTGAACGAGACAAAGATCGTCGATTTAACATTAAGAACGGGATATTTAGTTGAAAATTTAGAATTCATCACACATGGTGCATACAGACAGCCGGCTGCTAAGAATGATGGAGTTGTCGATCCAGATATTTCAGAAAAAGGTCTCGGGCCTCATCAAAGAGCATCCGATTTACCAACAAGTGATATGTGCAAGAATCGTTACTTTGGGTTGGCACCGTATATTAAGTATAGAAAACTGTGCTCGGGAGTAGATTACCGGAGTTTTGATGATTTAATAGAAGTCATGGATCCAGAAAGGATAGAGATTCTAAAGGAATTGTATGAACACGTTGAAGACATAGATTTAATGGCTGGAATATATTCAGAGAGGTATGTTCAAGGAGGTCATGTTCCCCTCACCCTGTACTGTGTCGTCGTAGAACAGATGATGAGGACGATGATGTCTGACAGACATTGGTACGAGAGACCGAATAGACCGAACGCGTTTACCAGGAATCAGCTGTTACAGATTAGAAAGGCATCTGTAGCTCAGATGCTGTGTTTGGTTGGAGATGGAGTGACACATATACAGCCTCATGCTTTCTCTATGCCAGGGCCCGGGAATGAGATGTGTAGCTGTAAAATGATCGAGAAAATCAATTTTTGGGCTTGGAAAGATACAAGTTGTGGATTAAGCAACGCATAA

Protein sequence:

>DPOGS210084-PA
MEIGLRSFLIISLVGFSLYSCLEPTYYDTYTGYVLSLEEVKGHIKRNSSEWCVSEVLPCNPQETRRLDGTCNNLKHPNRGASHTPTYRLLPAHYDKDFEPRKSKSGKPLSLCRKIRTSLLAEGRVPDTELTQLLLHFWVFVSSDVLSLHDTVNYILWKPYCCQERGKTDKGCIPNIIPEDDPVHRFSSIRCMNLTRPWSYQSTGCYRNDTTPERIITASPAYDLSHVYGLSLKLINEKHRSFKNGMLKFEVENNMIWPPSTKTPVNLCLLNQKPKETRCHDTPESGSNSVLGLNLFVIWTWRFHNFVASELSRINPCWSDDRLFFTARDIVIAYYMQMFYYELSPTLLGYENLLRDGVLSPFKDFRDLYKEDLLPQISIEYPVVLRWAHTITEGVLKMYDAKGNYLNETKIVDLTLRTGYLVENLEFITHGAYRQPAAKNDGVVDPDISEKGLGPHQRASDLPTSDMCKNRYFGLAPYIKYRKLCSGVDYRSFDDLIEVMDPERIEILKELYEHVEDIDLMAGIYSERYVQGGHVPLTLYCVVVEQMMRTMMSDRHWYERPNRPNAFTRNQLLQIRKASVAQMLCLVGDGVTHIQPHAFSMPGPGNEMCSCKMIEKINFWAWKDTSCGLSNA-
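
The transcript and protein records above can be cross-checked against each other. The sketch below is illustrative only and is not part of the OGS2.0 pipeline. It assumes the transcript is coding sequence only (start codon through stop codon), that the standard genetic code applies, and that the trailing "-" in the protein record marks the stop codon. The function names `translate` and `lengths_consistent` are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol (assumed here to be "-",
    matching the last character of the protein record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


def lengths_consistent(transcript_bp, protein_aa):
    """Check that a CDS length equals 3 * (residues + one stop codon)."""
    return transcript_bp == 3 * (protein_aa + 1)


# First six codons of DPOGS210084-TA and the first six residues of DPOGS210084-PA.
print(translate("ATGGAAATCGGTCTTAGA"))
# Last four codons of the transcript, including the TAA stop.
print(translate("AGCAACGCATAA"))
# Reported lengths: 1899 bp and 632 aa.
print(lengths_consistent(1899, 632))
```

The reported lengths agree under these assumptions: 632 residues plus one stop codon is 633 codons, or 1899 bp. Passing the full nucleotide sequence to `translate` and comparing the result with the protein record extends the check to every residue.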