Monarch geneset OGS2.0

DPOGS206194
Transcript: DPOGS206194-TA  1794 bp
Protein: DPOGS206194-PA  597 aa
Genomic position: DPSCF300345 + 151806-156582
RNAseq coverage: 31x (Rank: top 75%)
Annotation
Heliconius: HMEL010935  1e-123  50.36%
Bombyx: BGIBMGA002307-TA  9e-90  39.55%
Drosophila: Cyp4c3-PA  2e-59  29.37%
EBI UniRef50: UniRef50_Q4R1I7  2e-65  33.87%  Cytochrome P450 n=4 Tax=Papilionoidea RepID=Q4R1I7_9NEOP
NCBI RefSeq: XP_001602979.1  9e-72  32.48%  PREDICTED: similar to cytochrome P450 [Nasonia vitripennis]
NCBI nr blastp: gi|117606212  1e-68  33.64%  cytochrome P450, family 4, subfamily V, polypeptide 8 [Danio rerio]
NCBI nr blastx: gi|117606212  6e-67  33.64%  cytochrome P450, family 4, subfamily V, polypeptide 8 [Danio rerio]
Group
Gene Ontology:
GO:0009055  2.1e-91  electron carrier activity
GO:0020037  2.1e-91  heme binding
GO:0016705  2.1e-91  oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:0005506  2.1e-91  iron ion binding
GO:0055114  2.1e-91  oxidation-reduction process
KEGG pathway 
InterPro domain:
[187-596]  IPR001128  2.1e-91  Cytochrome P450
[37-54]  IPR002401  3.5e-17  Cytochrome P450, E-class, group I
Orthology group: MCL15444  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206194-TA
ATGAAGGAAATGTCATCAAAAGATGAGAATTTTAATAGAACAAAATTTAAACCATTCATTCAACTGCTTTTGGAACTGTCTCACAATACGAAGGTCCTAACTGAAGATGAGATCAAAGAACATGTAGATACAATCATCGTAAGTGGTTCTGATACATCTGGTGGTACCATAACATACTGTATGTTACTAATCGGTTCTTACCCACGCGTTCAGAATAAAATTATTGAAGAATTACAAACTGTTTTTGGTAATGACGACAGAGATGTTACGAAAGAGGATCTCTCGAAATTGGTTTATTTGGATGCGGTGATTAAAGAATCAATTCGTTTATACCCCACCGTGGCTCTTACAGGAAGAGATATTGAAGAGGACTTGAAATTACGCAATTACACATTATCAAAAGGGGCTTCAGTGTATCTCTCGATATATGCGTTATACCACCATCCTCAGTGGGGCCCTGATGTCGAGGAATTCAAACCAGAACGTTGGCTTGATCCATCGTTACTACCATCATGGGCAAATTCAATAGGTTTTGGAGTAGGCCGAAGATTTTGTATAGTTGTAGCGGATCCTGATGATTTCGCAACTGTAATGAACTCATGTTTACTAAAACTTAAAATAGTTGACTTTGCTAAGCCGTGGCTCGGAAATGGACTTGTTACAGCGTCTCCATCAATATGGAAAGCCCATCGTAGATTAATAAACCCATCCTTTAATCAGCATGTAGTTGATAGTTTTTTGGGTATATTTAACAGCCAATCCCGTCGCTTTGTGAGAAGCCTAGAAGTTGAAGTAGGAAAAGGGCCATTTGATCACTACGTTTATTGTCATCGCATTGCGTTGGAAACAATATGCCAGACCGCGTTAGGAGATGACTTTATGAAAAATAGTGCTAAAAGCTCTGAATACGTCTGCACAATAGACAAAATGTTAAACATTCTCATATCAAGATTCCAAAAATTTTGGTTACATCCAGATATAATATATAACTTTTCTAGTGTCAAACGACAAGAAAAGGACTGCATTAAAATATTACACAAAGGTTCTACTGCAATTCTACAAAAGAAAAAAGAGAGTTACATGAAGGAAATGTCATCAAAAGATGAGAATTTTAATAGAACAAAATTTAAACCATTCATTCAACTGCTTTTGGAACTGTCTCACAATACGAAGGTCCTAACTGAAGATGAGATCAAAGAACATGTAGATACAATCATCGTAAGTGGTTCTGATACATCTGGTGGTACCATAACATACTGTATGTTACTAATCGGTTCTTACCCACGCGTTCAGAATAAAATTATTGAAGAATTACAAACTGTTTTTGGTAATGACGACAGAGATGTTACGAAAGAGGATCTCTCGAAATTGGTTTATTTGGATGCGGTGATTAAAGAATCAATTCGTTTATACCCCACCGTGGCTCTTACAGGAAGAGATATTGAAGAGGACTTGAAATTACGCAATTACACATTATCAAAAGGGGCTTCAGTGTATCTCTCGATATATGCGTTATACCACCATCCTCAGTGGGGCCCTGATGTCGAGGAATTCAAACCAGAACGTTGGCTTGATCCATCGTTACTACCATCATGGGCAAATTCAATAGGTTTTGGAGTAGGCCGAAGATTTTGTATAGGAAAGACTTATGCCCTCATGTCTATTAAGACAACAATAGTTCATGTTTGTCGAAACTTCCTAATATATGGCAACCATAAAAATATGAAACTTAAGTTGGACGTACTGCTGAAACCTGCTTCTGGGCATTATATTACTATTAAAAAGAGAACATAG

Protein sequence:

>DPOGS206194-PA
MKEMSSKDENFNRTKFKPFIQLLLELSHNTKVLTEDEIKEHVDTIIVSGSDTSGGTITYCMLLIGSYPRVQNKIIEELQTVFGNDDRDVTKEDLSKLVYLDAVIKESIRLYPTVALTGRDIEEDLKLRNYTLSKGASVYLSIYALYHHPQWGPDVEEFKPERWLDPSLLPSWANSIGFGVGRRFCIVVADPDDFATVMNSCLLKLKIVDFAKPWLGNGLVTASPSIWKAHRRLINPSFNQHVVDSFLGIFNSQSRRFVRSLEVEVGKGPFDHYVYCHRIALETICQTALGDDFMKNSAKSSEYVCTIDKMLNILISRFQKFWLHPDIIYNFSSVKRQEKDCIKILHKGSTAILQKKKESYMKEMSSKDENFNRTKFKPFIQLLLELSHNTKVLTEDEIKEHVDTIIVSGSDTSGGTITYCMLLIGSYPRVQNKIIEELQTVFGNDDRDVTKEDLSKLVYLDAVIKESIRLYPTVALTGRDIEEDLKLRNYTLSKGASVYLSIYALYHHPQWGPDVEEFKPERWLDPSLLPSWANSIGFGVGRRFCIGKTYALMSIKTTIVHVCRNFLIYGNHKNMKLKLDVLLKPASGHYITIKKRT-
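
Length check:

The reported transcript and protein lengths (1794 bp and 597 aa) can be cross-checked with simple codon arithmetic. The sketch below is illustrative only and is not part of the OGS2.0 pipeline. It assumes the transcript is a pure coding sequence with no UTRs and that it ends in a single stop codon, which matches the ATG start and TAG end of the nucleotide sequence in this record. The function name `expected_protein_length` is hypothetical.

```python
def expected_protein_length(transcript_bp: int, has_stop_codon: bool = True) -> int:
    """Return the protein length implied by a coding-only transcript.

    Assumption: the transcript contains only coding sequence (no UTRs),
    so its length must be a whole number of codons.
    """
    if transcript_bp % 3 != 0:
        raise ValueError("transcript length is not a multiple of 3")
    codons = transcript_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from this record: 1794 bp transcript, 597 aa protein.
print(expected_protein_length(1794))  # 597
```

1794 bp is 598 codons, and removing the stop codon leaves 597 residues, which agrees with the protein length listed for DPOGS206194-PA.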