Monarch geneset OGS2.0

DPOGS209262
Transcript         DPOGS209262-TA   4011 bp
Protein            DPOGS209262-PA   1336 aa
Genomic position   DPSCF300111 + 409437-430056
RNAseq coverage    523x (Rank: top 24%)

Annotation
Heliconius         HMEL016736         0.0   86.42%
Bombyx             BGIBMGA007042-TA   0.0   82.34%
Drosophila         CG10211-PA         0.0   54.91%
EBI UniRef50       UniRef50_B0WXB4    0.0   58.12%   Oxidase/peroxidase n=5 Tax=Endopterygota RepID=B0WXB4_CULQU
NCBI RefSeq        XP_001807949.1     0.0   59.31%   PREDICTED: similar to oxidase/peroxidase [Tribolium castaneum]
NCBI nr blastp     gi|189240397       0.0   59.31%   PREDICTED: similar to oxidase/peroxidase [Tribolium castaneum]
NCBI nr blastx     gi|270012708       0.0   59.24%   hypothetical protein TcasGA2_TC005493 [Tribolium castaneum]
Group
Gene Ontology      GO:0006979   3.5e-185   response to oxidative stress
                   GO:0020037   3.5e-185   heme binding
                   GO:0004601   3.5e-185   peroxidase activity
                   GO:0055114   3.5e-185   oxidation-reduction process
KEGG pathway       cel:F09F3.5   8e-98
                   K00430 (E1.11.1.7) maps-> Phenylpropanoid biosynthesis
                                             Phenylalanine metabolism
                                             Methane metabolism
InterPro domain    [668-1242]   IPR010255   3.5e-185   Haem peroxidase
                   [820-1216]   IPR002007   1.9e-177   Haem peroxidase, animal
                   [698-709]    IPR019791   8.2e-42    Haem peroxidase, animal, subgroup
Orthology group    MCL14699   Insect specific

Nucleotide sequence:

>DPOGS209262-TA
ATGGGATCCCATAAAATGTTAAAACGAAGTATATTGCTATACGCATTATTTTTAACTCAAATATTAGCACAGAAAACAAAAACAACAGATGACAATCTACAGACAGTTCCAAGTGACCTCAGAAGAGCTGTTAGCGAGGCACTAGCTTTAGAGAGGAGGTTTTTATTAGGAAGTAACGACGTGAGGAACTGTTCATACGACGAGATCAGCAATACGCCATGTCCCCCCAGCAAGTATCGAAGCGCGAGTGGAGAGTGCAACAACGTTCGACATAGACCATGGGGAAGGCGAGGCGACGTGTTCCTTAGATTATTACAACCACACTACGCTGATGGAATATCACAACCTTTCGCAAGTCCGAAGCTGCCGGAGCCAAGGCTGGCCGTCCAGGCAGTATCGCAGTTGGCTGAATCAGTCGGTCACGATTACGTAACCAGCTTACTCGCTGCCTGGGGACAGTTTCTTATGGATGATCTCATTGCCACCTCTAATCAGAACCAAAAGTGTGAATCTGGTTGCGAATATGTTCGCTCAGCGCCTACAAGAAATTACGATTCCTGTGGATTTGAATACCGTGATCAGATGAATTTAGCGACATCGGTACTAGATGGCTCTGCTCTGTATGGAAATTCTGAAAAAGAGCTGTTGTCTCTCAGACTATACGATGCTGGCAAAATGGATATATCCTCCTGCCGAAAATGCAACGAAAATTCACCTACCGCACCACTTTATAAGGCTCTCTTGACTGAACACAACCGTATTGCTGGCGAATTATTTTCGATGAATCCCTTCTGGGAAGATAATGCCCTGTTCTTAGAAGCAAGACGAACAATAGCAGCAGTTATTCAACATATTACATACAATGAATTTTTGCCTGTACTTTTGGGCGAAGTTGGAATGGCAAAAGCAGATTTAAAACTAACCACCCACGGTTTCTGGCGTGGATATTCAAGTGCAAATCGTGTCGGAGCTTATGCGGAGCTTGTTGCGGTTGCACCAATTTTTAACGCCATGATGAATGAGAAACTTATAAACACAACAATTCTTCTAAAAGACTTGGTCAAGACCAGCGCTCATCAAATCTCAAGATTCGCCTTATCAGCTCAATGGGATCTTAACCGAGCTCGTGACCATGGTGTACCTTCGTATCTAAAAGTCTTGCAGCTGTGTGATCCCATGGCAAATGTTAAGTCATACGCTGAATTTGAAAAATTAGGTTTTGACAGAAGACATCAAGAAATATTCGCTGATATGTATAGAAACGCTGAAGATATTGAACTGATGGTAGCTGGAGCCATGGAGAAACCAGCTACTGGCGCAGTTATTGGGTCTACCTTAGCATGTGTATTAGCACTCCAATTTGGAAACCTAAAGAAAAGTGATAGGTTTTGGTATGAAAATGACATTCCTCCATCATCATTTTCAATAGAGCAGTTAGCAGCAATCAGAAAAGTATCCATTGCTGGACTTTTATGTTCTGCTGATGAAGGACTTGATAATGTGCAACCTAGAGCTTTCGTAAAGGAAGATACATTCTTAAATGCAGCCCAACAATGCTCTCAACACCCTCGGCTGGAGCTTTCTTCGTGGCGCGATGAAAGTGGTGCCCGGGCTGCAGAGCGCCTCTCACAAGACATGCTGGCAGCTGCACTTCAGAAGGCAAAGCAGGAGATGGCTGACCGCAAGAAACTGGAGTACATGTTATGGGAAGCACATGGAGGAGCCGACCCAAAATCTCCTGTCGGCACAGCTGCATCTTTCTCAAAAGCAAATAAATATGCCCTTAAGTTGGCAAACACCTCCTTATTCTTCGAATTCGCTACTAACGAACTCGTTAACTCTCTTAATGGACGACGACGCAAACGTCAGATCTTCGATGACTCCCTCGGCTTTGGATCAACCGATTTTGTAGAGTCTCTTCAATCAGTAGACGTCAGTGGTTTTCTAGGGCAGGACCAACTTGGGCCGGTTATTGAACCGCAGTGCGATGATAATGGAAA
CTGCGATCCTGATAGTCCTTTCCGAAGCTACACTGGCCACTGTAATAATTTGAGAAATCCTAATTTGGGAAAAAGTTTAACAACCTTCGCGAGACTGTTACCTCCTGTTTATGAAGATGGTGTAAGCCGGCCCCGCATCAACTCAGTAACAGGCACCCCGCTACCTTCCCCCCGTATAGTTTCTACGGTCATACATCCCGATATATCAAATCTTCATACGCGCTACACTTTGATGGTGATGCAGTTCGCCCAATTCCTGGACCATGAACTGACAATGACTCCCATTCACAAAGGCTTCCACGAATCTATTCCGGACTGCAGATCTTGCGACTCTCCCCGTACAGTGCATCCAGAATGCAACCCTTTCCCAGTACCTCGCGGTGACCATTATTATCCAGAAGTTAACATAACTTCCGGAGAACGATTATGTTTTCCATTCATGAGAAGTTTACCAGGACAGCAGCAATTAGGACCGCGTGAACAAGTCAATCAAAATACAGCTTTCATTGATGGATCGGCGATTTATGGAGAAAACCCTTGCATTGTTCGTAAACTGCGAGGTTTCAATGGCAGGCTCAACGCTACTGCAAACCCGATTAATGGAAGAGATTTGTTGCCTAGAACAGATAACCACCCTGAATGCAAAGCAGCCAGTGGTTTTTGTTTTATTGCTGGTGACGGTCGAGCTTCAGAACAGCCTGGACTCACGGCTTTACACACAATCTTCATGCGCGAACACAACCGCATAGTTGAAGGACTTCGTGGTGTCAACCCTCATTGGGATGCGGAATTATTATTCGAACACACTAGGCGTATAGTTGCTGCCTCATTCACACATATCATTTATAATGAATTTTTGCCAAGACTTTTGTCTTGGAACGCTGTTAACTTGTATGGACTCAAATTATTACCTTCAGGTTACTATAAGGAATACTCTCCAACCTGCAACCCGTCTATTGTAACGGAATTTGCAGCAGCCGCCTTCAGATTTGGTCACTCGTTGTTGAGACCACACTTACCGAGACTCTCACCTTCCTATCAACCAGTTGATCCACCAATATTGTTGAGAGATGGATTTTTCAGGCCTGACATGTTCATGAATCATCCACCAATGGTTGACGAACTTATTCGTGGTTTATCTTCCACGCCCATGGAGACCCTTGACCAATTCATAACAGGAGAAGTTACCAACCATCTATTCGAAGACCGGAGAATTCCGTTTTCGGGTATAGACTTAGTAGCTCTTAATATCCAAAGAGGTAGAGATCACGGTATACCGAGTTATAACAACTATAGAGCTTTGTGTAACTTAAAGAGAGCAGCAACTTTCGAAGATTTGGCGAGGGAAATTCCCGATGAAGTAATTGCGAGATTTAAGCGAATATACGCTACAGTAGACGACATTGATCTATTCCCTGGTGGTATGAGCGAACGACCACTACAGGGCGGTCTAGTTGGACCCACCTTCGCCTGCATCATCGCTATACAGTTCAGGCAGTTAAGGAAATGCGATCGATTCTGGTATGAGAACGACAACAGAGCAGCTCGCTTCACCGAACAACAATTGTCGGAAATTCGCAAAGTAACATTGTCCAAGGTTCTATGTGACAATTTCGATTTGCCAAGCGACATTCAACGCGCCTCTTTCGATCTACCTAGCAACTTTTTGAATCCTCGCGTGCCATGCGCGTCTCTTCCAAAACTGGACCTTTCCGCATGGCGTGAGAGTTCAGCCCAGGGCTGTCTCATCGCGGGTCGCTCGGTACGACTTGGTGACTCCGCCTTCCCTTCGCCCTGTACATCGTGTATATGCACCGTTGACGGGGCGCAGTGCGCATCCCTACGCATCACAGACTGTGCACAGCTATGGCGTGAATGGCCACGAGAAGCTGTGCTAAGAGATGATGTATGCACAGCACAGTGCGGCGCCGCCCCCGCAGGTCAGAGAGCGCCGCGGAGACCACACGCTCACTTCAAATTCCCCGATCTTACACCATTCA
TCGCTAAATAG

Protein sequence:

>DPOGS209262-PA
MGSHKMLKRSILLYALFLTQILAQKTKTTDDNLQTVPSDLRRAVSEALALERRFLLGSNDVRNCSYDEISNTPCPPSKYRSASGECNNVRHRPWGRRGDVFLRLLQPHYADGISQPFASPKLPEPRLAVQAVSQLAESVGHDYVTSLLAAWGQFLMDDLIATSNQNQKCESGCEYVRSAPTRNYDSCGFEYRDQMNLATSVLDGSALYGNSEKELLSLRLYDAGKMDISSCRKCNENSPTAPLYKALLTEHNRIAGELFSMNPFWEDNALFLEARRTIAAVIQHITYNEFLPVLLGEVGMAKADLKLTTHGFWRGYSSANRVGAYAELVAVAPIFNAMMNEKLINTTILLKDLVKTSAHQISRFALSAQWDLNRARDHGVPSYLKVLQLCDPMANVKSYAEFEKLGFDRRHQEIFADMYRNAEDIELMVAGAMEKPATGAVIGSTLACVLALQFGNLKKSDRFWYENDIPPSSFSIEQLAAIRKVSIAGLLCSADEGLDNVQPRAFVKEDTFLNAAQQCSQHPRLELSSWRDESGARAAERLSQDMLAAALQKAKQEMADRKKLEYMLWEAHGGADPKSPVGTAASFSKANKYALKLANTSLFFEFATNELVNSLNGRRRKRQIFDDSLGFGSTDFVESLQSVDVSGFLGQDQLGPVIEPQCDDNGNCDPDSPFRSYTGHCNNLRNPNLGKSLTTFARLLPPVYEDGVSRPRINSVTGTPLPSPRIVSTVIHPDISNLHTRYTLMVMQFAQFLDHELTMTPIHKGFHESIPDCRSCDSPRTVHPECNPFPVPRGDHYYPEVNITSGERLCFPFMRSLPGQQQLGPREQVNQNTAFIDGSAIYGENPCIVRKLRGFNGRLNATANPINGRDLLPRTDNHPECKAASGFCFIAGDGRASEQPGLTALHTIFMREHNRIVEGLRGVNPHWDAELLFEHTRRIVAASFTHIIYNEFLPRLLSWNAVNLYGLKLLPSGYYKEYSPTCNPSIVTEFAAAAFRFGHSLLRPHLPRLSPSYQPVDPPILLRDGFFRPDMFMNHPPMVDELIRGLSSTPMETLDQFITGEVTNHLFEDRRIPFSGIDLVALNIQRGRDHGIPSYNNYRALCNLKRAATFEDLAREIPDEVIARFKRIYATVDDIDLFPGGMSERPLQGGLVGPTFACIIAIQFRQLRKCDRFWYENDNRAARFTEQQLSEIRKVTLSKVLCDNFDLPSDIQRASFDLPSNFLNPRVPCASLPKLDLSAWRESSAQGCLIAGRSVRLGDSAFPSPCTSCICTVDGAQCASLRITDCAQLWREWPREAVLRDDVCTAQCGAAPAGQRAPRRPHAHFKFPDLTPFIAK-
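
The transcript and protein lengths listed above are consistent with each other: 1336 codons plus one stop codon give 3 x (1336 + 1) = 4011 bp. A minimal sketch of how the two FASTA records could be cross-checked follows, assuming the standard genetic code and assuming both records have been saved to a text string. The helper names `read_fasta` and `translate` are hypothetical and not part of any database tooling. The sketch writes a stop codon as `*`, whereas this record writes the terminal stop as `-`.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def read_fasta(text):
    """Return {header: sequence}, joining sequence lines that were wrapped."""
    records, name = {}, None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:]
            records[name] = ""
        elif name is not None:
            records[name] += line
    return records


def translate(cds):
    """Translate a coding sequence codon by codon; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))


# The first six codons of DPOGS209262-TA, as listed in this record.
print(translate("ATGGGATCCCATAAAATG"))
```

To compare a full translation with the listed protein, the trailing `*` from `translate` would need to be matched against the trailing `-` in the `DPOGS209262-PA` record.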