Monarch geneset OGS2.0

DPOGS202185
TranscriptDPOGS202185-TA2439 bp
ProteinDPOGS202185-PA812 aa
Genomic positionDPSCF300149 - 622277-630924
RNAseq coverage235x (Rank: top 43%)
Annotation
HeliconiusHMEL0091690.073.91% 
BombyxBGIBMGA013482-TA0.063.40% 
DrosophilaCG6969-PA1e-17543.75% 
EBI UniRef50UniRef50_D6W8630.047.38%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6W863_TRICA
NCBI RefSeqXP_001662610.10.044.79%oxidase/peroxidase [Aedes aegypti]
NCBI nr blastpgi|3407259890.047.41%PREDICTED: hypothetical protein LOC100646756 [Bombus terrestris]
NCBI nr blastxgi|3407259890.047.27%PREDICTED: hypothetical protein LOC100646756 [Bombus terrestris]
Group
Gene OntologyGO:00069797.8e-175response to oxidative stress
GO:00200377.8e-175heme binding
GO:00046017.8e-175peroxidase activity
GO:00551147.8e-175oxidation-reduction process
KEGG pathwaytgu:1002281472e-90 
 K10789 (MPO)maps-> Phagosome
InterPro domain[215-794] IPR0102557.8e-175Haem peroxidase
[360-768] IPR0020074e-172Haem peroxidase, animal
[246-257] IPR0197914.8e-36Haem peroxidase, animal, subgroup
Orthology groupMCL14516 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202185-TA
ATGCAGAGACCTCAGGGCAGCAGTGAGCGGACTCCGTTGGTGCCGCCCACTTACATGTTCGAGTCCAGCATCTCCAGGAGCTACCAGAAACGTCTCAGAAACTTCCAGTGCGCCGTCTGTGTCGTATTGATCCTACTACTGTCGGTAACATTGTTGGTGACCATATCCTACAACCTCACCCTCGGCGGCCCTGAGTTTCCGGAGGTGGTGTCTCCGACGACTCCGTCCTCTCCTGAGGGCAACCTCACCCGCCGTCTCATGCTCAGCCCTGACCTGATGCCGATCATGAACAGAACCTGGCCTCTCAATGGTCGCCCTATCCCAAAATGGAAAGCCGAAACAGTAAGTCCGGAAGCCATAGACGCCGCTGTACAGAAAGGCAAAGCTATGCTAGTGAAGCGTAGAATAATAGAGCGAAGCCTTACTCCTCTCGACTCAGAGTCGCCGGCCTTCAGAGGCCAAAGGGCGGCAGCTACGTCGGCCCTGGTTAAACCGATCGCCGAGACAGCGTACGCCGTAGAAGAAGCAACCAGAGAATTACTGAACAGCACTGAGATACCCGATGCGGTTGGCGCAGTCGGTGTGGGTCCAGCAACCAACGGGTCGTTCCCGGAGCCAGCGTACTGCCGCCCGCCCACCGCGCCCTGCGTCATCTCTAAGTACAGGACGCAGGATGGCTCTTGTAACAACCTGGACCATCCTCTACTCTGGGGCGTCTCCAATACACCGTTCAGACGAGTCCTCCCACCAGACTACGGTGACGGTGTAAGCTCCCCCCGCACTGGATGGAACGGCGCTCCTCTGCCCAGCGCTCGAGATGTCAGCGTAACGGTGCACAGACCCAGCTACGCTCACGACACACAGTTCACCGTGATGCTCGCCGTGTGGGGACAGTTCATAGACCACGACATCACAGCCACCGCTCTCAACAAGGGAGCCAACAGCACTCCCATCTCTTGCTGCACCGACATGACAATACACCCGGAGTGCTTCCCGGTGAAGCTGGACCCGGAGGACCCCTTCTACCAGGACTACAACCTCACATGCATGGAGTTTGTGAGGTCAGCGCCTGCGCCTACCTGCCATTTCGGTCACCGCGAGCAGCTGAACCAGGCGACAGCGTTCCTGGACGCGTCGACGGTCTACAGCTTCATGGAGAACAAGACCAACCAGCTCCGTGCGGGAGCCAACGGTCAGCTGCGGATGTTGAAGCTCGGCCCCTGGGAGCTGCTGCCGCCCTCCACCGACCCCAACGACGGATGTAACACGGTCGAGATGAACGCCAAAGGACGCTACTGCTTCGAATCGGGCGACGACCGCGCTAACGAGAACCTCCATCTGACGACGATGCACCTGCTATGGGCCCGACAACACAACCGCGTGGCAGCGCGCCTCCAGCAGCTCAACCCCGCCTGGGACGACCAGCAGCTGTTCCAGGAGACGCGCAGGATAGTCGGAGCCCAGATGCAGCATATCACATACGCAGAATTCTTACCATCTATACTAGGGGAGGACGTGATGTGGTCGTTGAACCTCACGCTGCAGGAGTCAGGGTACGCGACCGTGTACGACTCCGCAGTGGACCCTTCCATCGCGAACCACTTCTCCGCCGCAGCCTTCAGATTCGCTCACACGCTGCTGCCGGGCCTGATCCATAACGTGGACCTGAGCACGGGCACGGTGAGCTACACGCACCTCCACGAGATGTTGTTCAACCCGTACGCGCTGTACAACGAGCAGGGGTCCAAGAGGTCCGTGAGGTCCGCCATCTACACGCCCGTGCACGCCGTGGATCCCCACATCACCAGCGAGCTGAGCAATCATCTCTTCGAGCGCAGCGTCGCCAACAGCAGCAGCAGTGTGAAGGGTGCCAATCCCCTGCCGTGCGGACTGGACCTGGTGTCGCTGAACATCCAGCGAGGCCGCGACCACGGCTTGCCCGCCTACCCTGCCTGGAGGGAGCACTGCGGCCTCTCCCGCCCGCACACCTTCGAGGACCTGGAACCGATCTTTGACGAACTGTCCTTGAGCAGGATTTGCAAAATATACAAGAGCGTCGATGACATAGACCTGTACACGGGCGCCCTGGCTGAGGACCCCAAAGGCCGTCTCCTGGGCCCCACGCTCACATGTCTCGTAGCGGATCAGTTTCTGCGCATCAAGGTCGGCGACCGCTACTGGTACGAGACCTCGGATCCAGATATTAAATTTACTCCAGAACAACTGTACGAAATCCGTAAGACGACCCTGGCGGGAGTGATCTGCGCTAACGAGGGTCTGCTGGATCAGGCGCAGCCGCGCGTCATGGAGGCTCTGAGCGCCACCAACCCGCTGGTCGACTGCAAGGAACTCCCGCAACCTGACTTCAAACCTTGGAAGGATCCCGACCCGAACCAGCCGACCAAGAAACCATCGAGCAAAAACAACAACAAAGGATAA

Protein sequence:

>DPOGS202185-PA
MQRPQGSSERTPLVPPTYMFESSISRSYQKRLRNFQCAVCVVLILLLSVTLLVTISYNLTLGGPEFPEVVSPTTPSSPEGNLTRRLMLSPDLMPIMNRTWPLNGRPIPKWKAETVSPEAIDAAVQKGKAMLVKRRIIERSLTPLDSESPAFRGQRAAATSALVKPIAETAYAVEEATRELLNSTEIPDAVGAVGVGPATNGSFPEPAYCRPPTAPCVISKYRTQDGSCNNLDHPLLWGVSNTPFRRVLPPDYGDGVSSPRTGWNGAPLPSARDVSVTVHRPSYAHDTQFTVMLAVWGQFIDHDITATALNKGANSTPISCCTDMTIHPECFPVKLDPEDPFYQDYNLTCMEFVRSAPAPTCHFGHREQLNQATAFLDASTVYSFMENKTNQLRAGANGQLRMLKLGPWELLPPSTDPNDGCNTVEMNAKGRYCFESGDDRANENLHLTTMHLLWARQHNRVAARLQQLNPAWDDQQLFQETRRIVGAQMQHITYAEFLPSILGEDVMWSLNLTLQESGYATVYDSAVDPSIANHFSAAAFRFAHTLLPGLIHNVDLSTGTVSYTHLHEMLFNPYALYNEQGSKRSVRSAIYTPVHAVDPHITSELSNHLFERSVANSSSSVKGANPLPCGLDLVSLNIQRGRDHGLPAYPAWREHCGLSRPHTFEDLEPIFDELSLSRICKIYKSVDDIDLYTGALAEDPKGRLLGPTLTCLVADQFLRIKVGDRYWYETSDPDIKFTPEQLYEIRKTTLAGVICANEGLLDQAQPRVMEALSATNPLVDCKELPQPDFKPWKDPDPNQPTKKPSSKNNNKG-