Monarch geneset OGS2.0

DPOGS211423
TranscriptDPOGS211423-TA1527 bp
ProteinDPOGS211423-PA508 aa
Genomic positionDPSCF300115 + 486406-488136
RNAseq coverage2069x (Rank: top 6%)
Annotation
HeliconiusHMEL0075680.063.71% 
BombyxBGIBMGA010854-TA1e-16554.71% 
DrosophilaCyp6a2-PA3e-7933.27% 
EBI UniRef50UniRef50_D5L0N05e-16552.45%Cytochrome P450 6AN5 n=7 Tax=Obtectomera RepID=D5L0N0_MANSE
NCBI RefSeqNP_001073135.12e-14849.90%cytochrome P450 6AB4 [Bombyx mori]
NCBI nr blastpgi|2914640952e-16452.45%cytochrome P450 6AN5 [Manduca sexta]
NCBI nr blastxgi|2914640957e-15952.05%cytochrome P450 6AN5 [Manduca sexta]
Group
Gene OntologyGO:00090552.1e-104electron carrier activity
GO:00200372.1e-104heme binding
GO:00167052.1e-104oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055062.1e-104iron ion binding
GO:00551142.1e-104oxidation-reduction process
KEGG pathwaynvi:1001140238e-86 
 K07424 (CYP3A)maps-> Drug metabolism - cytochrome P450
    Drug metabolism - other enzymes
    Linoleic acid metabolism
    Steroid hormone biosynthesis
    Metabolism of xenobiotics by cytochrome P450
    gamma-Hexachlorocyclohexane degradation
    Retinol metabolism
InterPro domain[25-508] IPR0011282.1e-104Cytochrome P450
[300-317] IPR0024013.9e-24Cytochrome P450, E-class, group I
Orthology groupMCL10682 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211423-TA
ATGTTTTTAATAATATTGGCCTGTCTTGTTGCGTTGTATTTCTACACAACTAGAAATCATAGGTACTGGGCTAAGAAAAACATAAAACATGAACGTCCAATACCTCTGTTCGGGACGCATTTCCGGAATATAGTGGCGTTGAAAAGTGCCAGTGAACTAATACATGAATTTTACGTCAAGTTTGGTGATGAAAAGGTTATAGGAGCTTACCGCGGTAACGACAAGGAGCTGATTGTCAAGGATCCAGACATAATAAGAGACATTTTAATGGTCGACTTCAACAGTTTTCACAATCGAGGCTTAGGCAGGGATCCGGAAAAGGAGCCTCTGCTGAAAAATCTCTTCCACGCCGAGGACGACTTATGGAAATTGTTGAGACATAAATTAACACCAGCATTCACAACAGCGAAACTAAAGTCTATGTTCCCTCTTATAGTCAATTGTGCGGAAAAGTTGGTTGGCCTGGGAGAGAATATCATCGCTAAAGGTGGAGATTGTGACATACGAGAAGTTATGGCGAAATTCACAACTGAGTTTATAGGCGCTTGCGGCTTGGGAATCGAAATGGACACCATCAATAATGAAAGCTCTATATTTAGACAAATAGGAAAGAAAATGTTTTCACGGTCAGACATGGAAGTGTTTCTGTTTGCCGTATGGGACGTCTTTCCGGAAATAAGAAACCTTATGAATCTCTCCAACAAAAAAGTTGAACAGGTTTTTTTTGATATGGTGACAAAAATCTTCGAGCAGCGAAATTATAAACCTATTGGCAGAAATGATTTTGTCGACCTGTTGTTAGACTATTTATCTCAAGGCAAAATACAGAGCGAATCTATTAAATACAAAAACCCGGACGGCACAGCGAAACAGGTCGAAATGGAAATGGATATCGAGGTCTTGGTCGCACAGGTGTTTGTTTTCTTTGCAGCTGGATTTGAAACATCCTCTTCTACGACCAGTTACACGTTACACGAATTAGCTTACCATCCCGAAACACAGAAGAAAATTCAAGACGAAATAGATCGAGTTTTATCAAAATACGACAACAAGTTGTGTTATGATGCTGTTAATGAAATGACGCTATTAGATATGGCCATGAAAGAGGCGTTGAGGATAATGCCGGCCGTGGGGATCCTCAACAGGCAATGCGTTAAGCCATACACCATAAAACAAGTAGGACTAACTATAGATCCCGATGTCAATATAGTTATTCCTGTACAATCGCTGCATTTAGATGAAAAGTATTTTGATGAACCTTATGAATTCAAACCAGAAAGATTTGCCGACGAGGATTTTAACCAACGGACCAAGAACGTATACTTGCCTTTTGGTGCCGGACCGCGTGCTTGTATTGGTGCTCGTTTAGGACAGATGCAATCTCTGGCTGGACTAGCTGTGATGCTTCACAACTTTTCTGTCGAACCTTCGGAAAACACAAAGAGAACTTTAGACATCAACCCACGTCTTAATGTCGTACAAGGAGTACTACACGGGGTGCCGATTAAACTAGTAAAGAGAAAATAA

Protein sequence:

>DPOGS211423-PA
MFLIILACLVALYFYTTRNHRYWAKKNIKHERPIPLFGTHFRNIVALKSASELIHEFYVKFGDEKVIGAYRGNDKELIVKDPDIIRDILMVDFNSFHNRGLGRDPEKEPLLKNLFHAEDDLWKLLRHKLTPAFTTAKLKSMFPLIVNCAEKLVGLGENIIAKGGDCDIREVMAKFTTEFIGACGLGIEMDTINNESSIFRQIGKKMFSRSDMEVFLFAVWDVFPEIRNLMNLSNKKVEQVFFDMVTKIFEQRNYKPIGRNDFVDLLLDYLSQGKIQSESIKYKNPDGTAKQVEMEMDIEVLVAQVFVFFAAGFETSSSTTSYTLHELAYHPETQKKIQDEIDRVLSKYDNKLCYDAVNEMTLLDMAMKEALRIMPAVGILNRQCVKPYTIKQVGLTIDPDVNIVIPVQSLHLDEKYFDEPYEFKPERFADEDFNQRTKNVYLPFGAGPRACIGARLGQMQSLAGLAVMLHNFSVEPSENTKRTLDINPRLNVVQGVLHGVPIKLVKRK-