Monarch geneset OGS2.0

DPOGS207566
TranscriptDPOGS207566-TA1257 bp
ProteinDPOGS207566-PA418 aa
Genomic positionDPSCF300072 - 668054-671872
RNAseq coverage3393x (Rank: top 4%)
Annotation
HeliconiusHMEL0171483e-12964.72% 
BombyxBGIBMGA004717-TA1e-13168.01% 
DrosophilaCyp4c3-PA2e-5739.08% 
EBI UniRef50UniRef50_Q4R1I71e-12462.46%Cytochrome P450 n=4 Tax=Papilionoidea RepID=Q4R1I7_9NEOP
NCBI RefSeqXP_966563.25e-6138.86%PREDICTED: similar to cytochrome P450 [Tribolium castaneum]
NCBI nr blastpgi|675139584e-12462.46%cytochrome P450 [Papilio xuthus]
NCBI nr blastxgi|675139587e-12162.46%cytochrome P450 [Papilio xuthus]
Group
Gene OntologyGO:00090553.9e-83electron carrier activity
GO:00200373.9e-83heme binding
GO:00167053.9e-83oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055063.9e-83iron ion binding
GO:00551143.9e-83oxidation-reduction process
KEGG pathwaydme:Dmel_CG14382e-55 
 K00517 (E1.14.-.-)maps-> Naphthalene and anthracene degradation
    Stilbenoid, diarylheptanoid and gingerol biosynthesis
    Limonene and pinene degradation
    gamma-Hexachlorocyclohexane degradation
InterPro domain[72-415] IPR0011283.9e-83Cytochrome P450
[207-224] IPR0024012.2e-17Cytochrome P450, E-class, group I
Orthology groupMCL20928 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207566-TA
ATGATTTTGTGGTATTTGTTGTTAGCGCTGGCCTTTTGGACTGTTGCGTTCAAGTATAAGAGACGTAGGATGTACAAGTTAGCGGCACTTGTGCCAGGTCCCAAAAATGAGTATCCGATAATTGGTGTTGCTCATGATTTGGTGGGAACTACTGAGGTGGTAGTACATCCTACTGACTTGGAAATGATATTGAAAACATGCTTAGAAAAAGATGATCTTCATAGGTTTATACAAAAAGTCATCGGATATGGAGGAATATTTGCACCAGAAACTGCTATGGGAGTGAAAATAAATGCTCAAAGCGATCCCACTTCCTCACCCTTTCTGGCTTCAATGACAAGTCTCCTTAATATTGTATGTGCGAGGATTTTCCACCTGTGGCTGCAGCCAGATTGGCTGTTTAAACTTTTCCCACAATTCAATGAACATGAGAAATGTATAAAAACATTACATGACTTTACGGACGAGGTTATTGAAAAGAAAAGAATGGAACTCCAGGAGGATAAAACAAGTCAAACGGAAGTGGACCACCATTTAGATTTACAAGATTATCAAAGAAAAAGTTTTCTGGATCTATTAATTAAGTTGTCTGGTGGAGAGAAGGGTTATACAAATGTGGAGTTGAGAGAAGAAGTTATGACTCTGACAATCGCTGGAACAGATACATCTGCTGTTGCGATCGGATTTACTCTCATATTGTTAGGAAAATATCCAAAGATACAGGATAAAGTTTATGAAGAGTTGTATGGAGTGTTTGGAGATTCCAAACGTCCTTTAGTAAAAGAAGATTTACTGAAGTTAAAATATTTAGAGCGAGTGGTAAAGGAGTCATTAAGGCTATTTCCGCCTGTGCCTTTTATAATAAGAAAAATTGATAAAGAAATAGAATTGCCGACAGGTAAGCGTCTTCCAGCTGGAGCTGGGGCAGTTATATCAATTTGGGGTTGTCACAGAAACCCTGAATTCTGGGGTCCGGATGCTGAGTGCTTTGACCCAGACAGGTTCCTACCGGAGCGTTTCGACTTAGTAAAACCTGGTAGTTACTTACCTTTTAGTAACGGACCCAGAAATTGTCTTGGATACCAGTACGCGTTGATGTCAATCAAGACAGCTTTATGTGCAATATTAAGAAATTACAAAATACTTGGAGAACCCGAAGCCACTCCTATACCTCATATAAGAGTAAAATTAGACGTCATGATGAAGGCCGTGGATGGGTATCAAGTTTGTTTAGAAAAAAGAAAATCAGCTGCTTAG

Protein sequence:

>DPOGS207566-PA
MILWYLLLALAFWTVAFKYKRRRMYKLAALVPGPKNEYPIIGVAHDLVGTTEVVVHPTDLEMILKTCLEKDDLHRFIQKVIGYGGIFAPETAMGVKINAQSDPTSSPFLASMTSLLNIVCARIFHLWLQPDWLFKLFPQFNEHEKCIKTLHDFTDEVIEKKRMELQEDKTSQTEVDHHLDLQDYQRKSFLDLLIKLSGGEKGYTNVELREEVMTLTIAGTDTSAVAIGFTLILLGKYPKIQDKVYEELYGVFGDSKRPLVKEDLLKLKYLERVVKESLRLFPPVPFIIRKIDKEIELPTGKRLPAGAGAVISIWGCHRNPEFWGPDAECFDPDRFLPERFDLVKPGSYLPFSNGPRNCLGYQYALMSIKTALCAILRNYKILGEPEATPIPHIRVKLDVMMKAVDGYQVCLEKRKSAA-