Monarch geneset OGS2.0

DPOGS202736
TranscriptDPOGS202736-TA1500 bp
ProteinDPOGS202736-PA499 aa
Genomic positionDPSCF300284 + 69417-71867
RNAseq coverage74x (Rank: top 65%)
Annotation
HeliconiusHMEL0126760.068.66% 
BombyxBGIBMGA005356-TA3e-15754.45% 
DrosophilaCyp12a4-PA3e-8035.96% 
EBI UniRef50UniRef50_D5L0N52e-17257.75%Cytochrome P450 333A3 n=2 Tax=Obtectomera RepID=D5L0N5_MANSE
NCBI RefSeqXP_001604810.16e-8633.88%PREDICTED: similar to cytochrome P450 [Nasonia vitripennis]
NCBI nr blastpgi|2914641056e-17257.75%cytochrome P450 333A3 [Manduca sexta]
NCBI nr blastxgi|2914641052e-16957.75%cytochrome P450 333A3 [Manduca sexta]
Group
Gene OntologyGO:00090559.2e-76electron carrier activity
GO:00200379.2e-76heme binding
GO:00167059.2e-76oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055069.2e-76iron ion binding
GO:00551149.2e-76oxidation-reduction process
KEGG pathwaydme:Dmel_CG60422e-78 
 K00517 (E1.14.-.-)maps-> Naphthalene and anthracene degradation
    Stilbenoid, diarylheptanoid and gingerol biosynthesis
    Limonene and pinene degradation
    gamma-Hexachlorocyclohexane degradation
InterPro domain[30-496] IPR0011289.2e-76Cytochrome P450
[305-322] IPR0024011.6e-15Cytochrome P450, E-class, group I
Orthology groupMCL10325 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202736-TA
ATGGGTGTCTTATTTAGAAATTTAATGTTCAAAAACCATCGTTCTTGGAGTTCACTGAGGTGTTACTCATCTGAAGTAAAACCATTTACATCCGTACCAGGGTTGTCTTCTTTACCCATTTTGGGTCCCATTCATCATTTCCTTCCAGTGATAGGATCAGTTGGTCCGAGACCGAATTTTTATGATCTAGCCAAAGTTTTACATGAGAAGTTTGGATCAATTGTTAAATTAGATGGAATTTTCTCTAGATCTTCGATGCTTATTTTATATGACCCAAAAGATTTTGATCAGGTTTACAGATCTGAAGAGCCAATACCATCACGACCAGGATTCGACACTTTAAATTATTATAGAAAAAATTTGGAAAAGTCTAAATATGAAGGAATACATAGTCTGACTACATCTGAAGGTGTTCATTGGCGGGATACTCGTACTAAAGTTAATCCAGTATTACTAAAACCGAAGCTTGTGAAACTTTATTCGCCATTACTGGAAGAGATCGCTGATGAAATGGTTCAAAGAATAAAAAATTTAGACAAAGATGCTAGATATATGCAGGAGAATTTTGATTTCGAAATAACAAAGTGGTCTTTAGAATCCGTAGCCGTTATCGCGTTGGGTGCGAGGCTTGGGTGTTTTCAAAACGATTTACCCGAGAGTCACCCGGCACGTATTCTGATAGAATGCGCGAAGGATATTATAAATTTATCTTGGAAATTAGAATTTCGTCCAAGCATGTGGAAATATATAGCAACTAAGAATTTTAAAAAAATTATAAAGGCGTTTGATTTACAGTGGGATACGAGTGTTTACTTCATCAGAGAGGCTGAGAAAAAAATAAAAGAGCGAGGTCATGATGTACCTGAAGAAGATAAAAGTATAATTGAAAAACTACTCAGCATAGATGAGAAAGTTGCTCTTGCGATGGGAAACGAAATGCTTTTAGCAGGAATTGATACAGTTTCTTTTACAACTATTGGCCTATTATATTACCTGGCAAAATATCCCGAAGTGCAAGAAAAAATACGCAAGGAAATTAATTCCTCGGAGCCAAGTAAAAGATATCTCAAGGCTTGTTTGAAAGAGTCTTTGAGATTACATGCCGTTGTGCCGGCCAACTTGAGAAGAACAACGAGAGAACATGTTGTGGCTGGTTACTTGATACCGAAAGGAATTGATGTGATAGCACCGAATGAATATCTTTCAAAGCTGGATGAACATTATCCTCGAGCTAAGGAGTTTATACCAGAAAGGTGGCTGGTTGAAAAGTCTGATCCATTATATTATGGAAATTGCCACCCCATGATAACATTGCCGTTTGGTTTTGGAGTAAGATCATGTATTGGGAGAAGGATCGCAGAAATGGAAATCGAGATCCTTGTTACGAAGCTGTTGAAAAATATGAAAATTTCTTGGAACGGTCCACCGATACAAGTGGTGACTCGATTGATGAACTCGTTTAGAAAGCCGTTTTATTTTAAATTTGAAAGTATTTCATAG

Protein sequence:

>DPOGS202736-PA
MGVLFRNLMFKNHRSWSSLRCYSSEVKPFTSVPGLSSLPILGPIHHFLPVIGSVGPRPNFYDLAKVLHEKFGSIVKLDGIFSRSSMLILYDPKDFDQVYRSEEPIPSRPGFDTLNYYRKNLEKSKYEGIHSLTTSEGVHWRDTRTKVNPVLLKPKLVKLYSPLLEEIADEMVQRIKNLDKDARYMQENFDFEITKWSLESVAVIALGARLGCFQNDLPESHPARILIECAKDIINLSWKLEFRPSMWKYIATKNFKKIIKAFDLQWDTSVYFIREAEKKIKERGHDVPEEDKSIIEKLLSIDEKVALAMGNEMLLAGIDTVSFTTIGLLYYLAKYPEVQEKIRKEINSSEPSKRYLKACLKESLRLHAVVPANLRRTTREHVVAGYLIPKGIDVIAPNEYLSKLDEHYPRAKEFIPERWLVEKSDPLYYGNCHPMITLPFGFGVRSCIGRRIAEMEIEILVTKLLKNMKISWNGPPIQVVTRLMNSFRKPFYFKFESIS-