Monarch geneset OGS2.0

DPOGS213267
Transcript        DPOGS213267-TA   1488 bp
Protein           DPOGS213267-PA   495 aa
Genomic position  DPSCF300264 - 146721-166987
RNAseq coverage   1x (Rank: top 94%)
Annotation
Heliconius       HMEL016675        8e-59  45.97%
Bombyx           BGIBMGA004678-TA  1e-17  22.54%
Drosophila       Cyp4e3-PA         7e-24  24.10%
EBI UniRef50     UniRef50_D2JLJ9   8e-66  53.33%  Cytochrome P450 CYP405A3 n=3 Tax=Ditrysia RepID=D2JLJ9_9NEOP
NCBI RefSeq      XP_001944092.1    4e-27  27.57%  PREDICTED: similar to p450 enzyme [Acyrthosiphon pisum]
NCBI nr blastp   gi|308316602      5e-69  55.24%  cytochrome P450 CYP405A2 [Zygaena filipendulae]
NCBI nr blastx   gi|308316622      6e-74  42.29%  cytochrome P450 CYP405A3 [Zygaena filipendulae]
Group
Gene Ontology
    GO:0009055  1e-51  electron carrier activity
    GO:0020037  1e-51  heme binding
    GO:0016705  1e-51  oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
    GO:0005506  1e-51  iron ion binding
    GO:0055114  1e-51  oxidation-reduction process
KEGG pathway     dme:Dmel_CG4105   6e-22
    K00517 (E1.14.-.-) maps to:
        Naphthalene and anthracene degradation
        Stilbenoid, diarylheptanoid and gingerol biosynthesis
        Limonene and pinene degradation
        gamma-Hexachlorocyclohexane degradation
InterPro domain  [165-491]  IPR001128  1e-51  Cytochrome P450
Orthology group  MCL21182 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213267-TA
ATGGATCCAGATCATAAAAGTTCTGATATAGAAATTGTTACTCTCTTGAATAAGAAAGCAGATGATGATGGGTGTATGCTAGACAAATTTTTACTAAACAAACTATCCAATGGAAATCCGATTCCAGATGATATAATTGAAGAGGAAATATCACTTATCTGTTTTACAAAATCAATTTTTATTTTGATAATGTTAAACAAAAATGTGTTTAATTATGCTCTTCTCAGGCTTATAAAGTCAAATTTGTTTAAGGGACATTATACTACAACATTGACGATTTGTCATACATTATATTATGTGGCAAAATATCCAGAAATACAAAACCGAATTATTGAAGAACAGCAGTGTATATTTAAGAATGATATGTCAAAGAAACCTACTCATAAAAATCTTAATGATATGAAGTATCTCGAGGCCGTAATAAAGGAAAGCATGCGGGTACTGCCACCAGTGACTAAAATCGGAAGACAATTGAAGAAGGATTTTCGATTCGATGAGGCGTCACCCAAATTAATGGAAATGTGGAATAGACATGGAAAACAAAATTTTCGTCTGACAATTGGCTCTGAAGACTGGATAATGCTTTCAGATCCTGATGATGTGGGAACCATTCTCAATCATCCGAGTGAACTTTCCAAACCCTTAGAAAGAAATGCAGCTATGAAACCATTCTTTGGGAATTCAATTTCTAGTTCGGAAGGTGAAAAATGGAAATCGACAAGAAAACTGATGACGCCAAGTTTTCACTTCAAAACATTAGAGAAGCGGGTGGAAGACGTGAACAAACACTGTGATCGTTTGTTTAAGATCTTGGACACCTTCAATGACAAGAGCACTATAAATTTATACACTTATTTAAGACCTTATATGCTGGATATTTTATGCAGCACCCTTATGGGGGTTGATAGCAATTTCCTTGGTAATATAAATCATCCATACCTTGAAGCGAGCGGAAGAACGATAAAAATAATTACGCAAAACTATTTCTCATATTGGAGAAACATTTCAAAAATTTTTGAGTTATCGCCGCAGTACCGAGTTATGATAAAAACTGTTAAGGCTCTTAGAGACACAAGCGCAACCGGACATTACACAACAACAATGACGATTTGCCATACATTATATTGTATCGCAAAGTACCCAGACATCCAGAACCGTATTATTGAAGAGCAGCGTTCTATATTTAAAAACAATTTCTTTAAATGTCCAACTAATCAAGATCTCAATGATATGAAGTACCTTGAGGCTGTATTGAAAGAAAGCGTTCGAGTAATACCAACTGTGACTAAGATCGGAAGACAATTACACGAGGATCTTAAATTTAAAGGGTTTCGTTACGCTTGGGTGGCCATGAAAGCAACACTATCCAACATGTTACGAAGATATGAAGTGTTCCCTTGTGATCCTGCTGACGAACCACAATTTGCACATCACTCAGTTAAGGAGTTTGAATATAAATTCATTGTAGAACTCTCTGCCAATACATAA

Protein sequence:

>DPOGS213267-PA
MDPDHKSSDIEIVTLLNKKADDDGCMLDKFLLNKLSNGNPIPDDIIEEEISLICFTKSIFILIMLNKNVFNYALLRLIKSNLFKGHYTTTLTICHTLYYVAKYPEIQNRIIEEQQCIFKNDMSKKPTHKNLNDMKYLEAVIKESMRVLPPVTKIGRQLKKDFRFDEASPKLMEMWNRHGKQNFRLTIGSEDWIMLSDPDDVGTILNHPSELSKPLERNAAMKPFFGNSISSSEGEKWKSTRKLMTPSFHFKTLEKRVEDVNKHCDRLFKILDTFNDKSTINLYTYLRPYMLDILCSTLMGVDSNFLGNINHPYLEASGRTIKIITQNYFSYWRNISKIFELSPQYRVMIKTVKALRDTSATGHYTTTMTICHTLYCIAKYPDIQNRIIEEQRSIFKNNFFKCPTNQDLNDMKYLEAVLKESVRVIPTVTKIGRQLHEDLKFKGFRYAWVAMKATLSNMLRRYEVFPCDPADEPQFAHHSVKEFEYKFIVELSANT-
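
The transcript and protein lengths listed for this entry can be cross-checked with simple arithmetic: a 495 aa protein plus one stop codon is 496 codons, or 1488 nucleotides. The sketch below shows that check, assuming the transcript is a coding sequence with no untranslated regions and that the trailing "-" in the protein marks the stop codon. The function names `expected_cds_length` and `read_fasta_sequence` are hypothetical helpers written for this illustration, not part of any database tool.

```python
def expected_cds_length(protein_length: int, has_stop_codon: bool = True) -> int:
    """Nucleotide length of a coding sequence for a protein of the given length.

    Assumes three nucleotides per codon and, optionally, one terminal stop codon.
    """
    codons = protein_length + (1 if has_stop_codon else 0)
    return codons * 3


def read_fasta_sequence(text: str) -> str:
    """Concatenate the sequence lines of a single-record FASTA string.

    Header lines (starting with '>') and blank lines are skipped.
    """
    return "".join(
        line.strip()
        for line in text.splitlines()
        if line.strip() and not line.startswith(">")
    )


if __name__ == "__main__":
    # Lengths taken from the Transcript and Protein fields of this entry.
    print(expected_cds_length(495))  # 1488
```

To apply the same check to the sequences themselves, pass each FASTA record through `read_fasta_sequence` and compare `len()` of the nucleotide string with `expected_cds_length` of the protein length (excluding the trailing "-").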