Monarch geneset OGS2.0

DPOGS207372
Transcript         DPOGS207372-TA    1155 bp
Protein            DPOGS207372-PA    384 aa
Genomic position   DPSCF300267 - 89950-96916
RNAseq coverage    27x (Rank: top 77%)
Annotation
Heliconius         HMEL012245         4e-86    50.33%
Bombyx             BGIBMGA009005-TA   1e-101   74.66%
Drosophila         Cyp4d1-PA          4e-37    30.15%
EBI UniRef50       UniRef50_Q7QEX2    4e-43    32.41%   AGAP000193-PA n=1 Tax=Anopheles gambiae RepID=Q7QEX2_ANOGA
NCBI RefSeq        XP_001602979.1     3e-47    34.96%   PREDICTED: similar to cytochrome P450 [Nasonia vitripennis]
NCBI nr blastp     gi|56710314        5e-44    33.91%   cytochrome P450 CYP4G25 [Antheraea yamamai]
NCBI nr blastx     gi|56710314        1e-43    33.45%   cytochrome P450 CYP4G25 [Antheraea yamamai]
Group
Gene Ontology      GO:0009055   2.4e-60   electron carrier activity
                   GO:0020037   2.4e-60   heme binding
                   GO:0016705   2.4e-60   oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
                   GO:0005506   2.4e-60   iron ion binding
                   GO:0055114   2.4e-60   oxidation-reduction process
KEGG pathway       dme:Dmel_CG3656   3e-35
                   K00517 (E1.14.-.-) maps to:
                       Naphthalene and anthracene degradation
                       Stilbenoid, diarylheptanoid and gingerol biosynthesis
                       Limonene and pinene degradation
                       gamma-Hexachlorocyclohexane degradation
InterPro domain    [32-382]   IPR001128   2.4e-60   Cytochrome P450
Orthology group    MCL30627   Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207372-TA
ATGATCATCTTCCTGATGGTAGTATTCCTGTTTTGTATGTACTGGTTGTTCTGGATGTCAAATACACGGCGCATTGAGAAGTTCACGGCATCTCTCCCTATGCCCCCATCCTTACCTGTCATCGGCCATGCAGCCCTTTTCTTAGGGAACACCGAGAAAATCCTTAAAAATCTAGAAGACATAGCCTCTATCGCGCTTAAAAACAACGGAGCTGTGAAACTTTGGTTGGGCCCAAAATTATACATTGCAATCGGCAACCCTGAAGATGCCCAACTCATTTTGGAGAACTGTTTAGATAAAGATGTCGTTTATAGATTTCTGCGTCCTTGGCTAGGACAAGGCTTGTTCATTGCCCCGCTCAGACTGTGGAAGATGCACAGAAAGATTCTGTTGCCAGTGTTCCACAACAAAGTAATAGAAGAATATATCGGTGTTATATCGAAACAAGCAGACGTACTCACAGAAAGGCTGGAAGAGCAATCTGGGAAGGAGACATTCGATGTTTTAAGTTACATTTCAGCCTGCACTTTAGATATTGTTTTTGAAACGTCTATGGGTGAGAAAATGGATGTCCAACATTGGCCTGATACTCCATACCTGCGAGCTCGTCACACAGTTATGGAGATTCTTAATAAACGACTATTTAAGGTTTGGCTCCAGCCTGACTGCCTCTTCAAGCTAACCAGATACGCTAAAGAACAAAAGAAGAATATCGACCTCACTCATAAATTTACAGACGAGGTTGTTCAGAAAAGGCGTCTACAATTTGAGGCAAAGGAAGCGATTGGAATTAACAACACAAAAGATTCAGCGTACAGTAAAGTGACAATACCATCAGGCGTTGGAGCGGTTGTTGGAGCTTTTGCGATACACCGTTCAGTTGATTTGTGGGGATCAAATGCCAACGAGTTTGATCCTGACAGATTCCTTCCGGAACGTTCTAAGAATAGACACCCGTGTTCCTTTATACCTTTCAGTCATGGCTCACGGAATTGTATTGGAAGAAATTTCGGTATGATCATCATAAAAGGCATCATATCGAACGTGATCAGATCGTTTAGAATACAAGCGGATGAGGTGGGACCATTGAAAATCGAGATGCTTTTATTTCCCATTAGAGGCCATCAAATTAAGATAACTAAGAGAATGAACTAA

Protein sequence:

>DPOGS207372-PA
MIIFLMVVFLFCMYWLFWMSNTRRIEKFTASLPMPPSLPVIGHAALFLGNTEKILKNLEDIASIALKNNGAVKLWLGPKLYIAIGNPEDAQLILENCLDKDVVYRFLRPWLGQGLFIAPLRLWKMHRKILLPVFHNKVIEEYIGVISKQADVLTERLEEQSGKETFDVLSYISACTLDIVFETSMGEKMDVQHWPDTPYLRARHTVMEILNKRLFKVWLQPDCLFKLTRYAKEQKKNIDLTHKFTDEVVQKRRLQFEAKEAIGINNTKDSAYSKVTIPSGVGAVVGAFAIHRSVDLWGSNANEFDPDRFLPERSKNRHPCSFIPFSHGSRNCIGRNFGMIIIKGIISNVIRSFRIQADEVGPLKIEMLLFPIRGHQIKITKRMN-