Monarch geneset OGS2.0

DPOGS208965
TranscriptDPOGS208965-TA1536 bp
ProteinDPOGS208965-PA511 aa
Genomic positionDPSCF300009 + 881697-891204
RNAseq coverage185x (Rank: top 49%)
Annotation
HeliconiusHMEL0157840.066.93% 
BombyxBGIBMGA002085-TA4e-5730.06% 
DrosophilaCyp304a1-PA4e-9937.65% 
EBI UniRef50UniRef50_D2JLK20.061.45%Cytochrome P450 CYP304F2 n=2 Tax=Ditrysia RepID=D2JLK2_9NEOP
NCBI RefSeqXP_001648726.12e-11544.20%cytochrome P450 [Aedes aegypti]
NCBI nr blastpgi|3083166380.061.45%cytochrome P450 CYP304F2 [Zygaena filipendulae]
NCBI nr blastxgi|3083166380.061.45%cytochrome P450 CYP304F2 [Zygaena filipendulae]
Group
Gene OntologyGO:00090553.1e-91electron carrier activity
GO:00200373.1e-91heme binding
GO:00167053.1e-91oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055063.1e-91iron ion binding
GO:00551143.1e-91oxidation-reduction process
KEGG pathwaymcc:6786831e-49 
 K07413 (CYP2C)maps-> Drug metabolism - cytochrome P450
    Arachidonic acid metabolism
    Linoleic acid metabolism
    Metabolism of xenobiotics by cytochrome P450
    Retinol metabolism
InterPro domain[24-510] IPR0011283.1e-91Cytochrome P450
[63-82] IPR0024014e-32Cytochrome P450, E-class, group I
Orthology groupMCL14586 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208965-TA
ATGATCGGGTTGCTCATATTAGTATTTTTTATTATATTCTATTGTGTACGGTGCTTTAAAAATGCGTATAAACGACCTTCGGACAAATTTCCTCCAGGACCTCCAAAGCTACCAATACACGGAGCCTACTGGATCGTTGTTTTGAAAAAAATTAATAATTTGGCTGCTTCTTTTAAGATGTTGTCAAAGGAATACAAAACTAAAGTATTGGGTCTCTACTTAGGAAATTTTGTCACAATCATAGTAGACGATCCCCATTTAATTAAGGAGTGCCTCAATCGTGAAGAATTTGATGGTCGCGTTGATTTTTTGGTCGCCCGTCTTAGATCGTTTTGGAAGAAATTAGGCATATTTTTTACTGATGGTTACTTCTGGCACGTGCAAAGAAGATTCTCATTACGTTATATGAGAGATTTTGGTTTCGGAAGACGTGAGGAAACTTTGGAAACATGTTTTTCTAACGATATAAAAGAAATGATTAATATGGCAACATATGGTCCTAATTATCCTGCCGAGAAGGCTATAATTAAAGGTGATTTAATATGTCTACAAGATTATTTTGCTGTTCCGTTTATTAATGGAATGTTACATGTCCTCGTCCGAAATCCATTGCCAAGATCTGAATACAAAGTACTATGGGAACTTTCTCGATATGCCCTTAAATTTCAAAGAGGCACAAACGACTTAGGAGGCGCTGTATCACTTACGCCGTGCTTAAAAGACCTTATGCCAAAACTTAGCGGATACCATGATTTACGTTCAGGAAACCAGTATCTGTTGGATTTTTTTAATAAAATTATTAAAGATCAGATGTCTACGTATGACACGACACATCTACGTCATTTTATAGATGTATACGTGAAGAAAATGAAGGAAGAAGAAAAAGATACGAAACGTTCTACTTTTTCAGTGGAACAGCTCCAATTAATCTGCACCGACTACATGTTCCCCGCGGCTAGTGCAGTTCAGGCTGTACTTACGTTCCTGGTGGAGCGATTACTGATTCAACCCGAAGTTCAAGATAGGATACACGAAGAAATTGACCGAGTGGTTGGCCGAGATAGGATGCCCTCGTTAGATGATCGGCAGAATATGCCGTACACGGAAGCTTGTATCAGAGAAATTATGAGATTTGAAACTCTAGTGCCACTAGGAGTGCTGCATCGGACTGTGAAGCCTACTACGATTGACGAATATCACATACCAGAGAATACAGTTGTTGCTTTCAATTATGTAAGTTTACACATGGACAAGAAATTATGGGGTGATCCCGAAAACTTCAGGCCGGAACGTTTCATTAAGAACGGAATATTGAATTTGTCTGCCGACAAGTCTCTTCCATTTGGATCAGGAAAAAGGTTATGTGCGGGAGAGACTTATGCAAGGCAGTCGATGTTCCTAGTATTCTCAGCGTTTATGCAAGCCTTTCATGTGTCAACTGTTACGGGAAAACCTTTGAAGAAACCAGCTTCAAGAATTCAGGGTATCATTACAACTATCAAGGACTTCTGGGTTAAAATCACGCCTAGATCGTAA

Protein sequence:

>DPOGS208965-PA
MIGLLILVFFIIFYCVRCFKNAYKRPSDKFPPGPPKLPIHGAYWIVVLKKINNLAASFKMLSKEYKTKVLGLYLGNFVTIIVDDPHLIKECLNREEFDGRVDFLVARLRSFWKKLGIFFTDGYFWHVQRRFSLRYMRDFGFGRREETLETCFSNDIKEMINMATYGPNYPAEKAIIKGDLICLQDYFAVPFINGMLHVLVRNPLPRSEYKVLWELSRYALKFQRGTNDLGGAVSLTPCLKDLMPKLSGYHDLRSGNQYLLDFFNKIIKDQMSTYDTTHLRHFIDVYVKKMKEEEKDTKRSTFSVEQLQLICTDYMFPAASAVQAVLTFLVERLLIQPEVQDRIHEEIDRVVGRDRMPSLDDRQNMPYTEACIREIMRFETLVPLGVLHRTVKPTTIDEYHIPENTVVAFNYVSLHMDKKLWGDPENFRPERFIKNGILNLSADKSLPFGSGKRLCAGETYARQSMFLVFSAFMQAFHVSTVTGKPLKKPASRIQGIITTIKDFWVKITPRS-