Monarch geneset OGS2.0

DPOGS202077
Transcript         DPOGS202077-TA   1569 bp
Protein            DPOGS202077-PA   522 aa
Genomic position   DPSCF300116 - 460511-464244
RNAseq coverage    146x (Rank: top 54%)
Annotation
Heliconius       HMEL005629         6e-154   63.85%
Bombyx           BGIBMGA003926-TA   1e-172   58.86%
Drosophila       Cyp9f2-PA          2e-104   39.92%
EBI UniRef50     UniRef50_B1AAB3    0.0      57.90%   CYP9G3 n=7 Tax=Ditrysia RepID=B1AAB3_BOMMO
NCBI RefSeq      NP_001108456.1     0.0      57.90%   cytochrome P450 9G3 [Bombyx mori]
NCBI nr blastp   gi|169234669       0.0      57.90%   cytochrome P450 9G3 [Bombyx mori]
NCBI nr blastx   gi|169234669       0.0      58.21%   cytochrome P450 9G3 [Bombyx mori]
Group
Gene Ontology
GO:0009055   1.6e-103   electron carrier activity
GO:0020037   1.6e-103   heme binding
GO:0016705   1.6e-103   oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:0005506   1.6e-103   iron ion binding
GO:0055114   1.6e-103   oxidation-reduction process
KEGG pathway
dwi:Dwil_GK11195   7e-105
K07424 (CYP3A) maps to:
    Drug metabolism - cytochrome P450
    Drug metabolism - other enzymes
    Linoleic acid metabolism
    Steroid hormone biosynthesis
    Metabolism of xenobiotics by cytochrome P450
    gamma-Hexachlorocyclohexane degradation
    Retinol metabolism
InterPro domain
[12-520]    IPR001128   1.6e-103   Cytochrome P450
[309-326]   IPR002401   3.1e-16    Cytochrome P450, E-class, group I
Orthology group    MCL24957   Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202077-TA
ATGATCACCGAAATTGTTGTGTTTCTACTGACTTCTCTCATAGTATATTTTATTTATATACACAAAAGCATTCATAAGTATTTCGATGATCGCGGCATAAAATATTTACCCGGTGTTCCTTTTTTTGGCAATAATTTAAGAAGCTCGTTTCTAAAAGTCCATGTTTTGGAAGACCTTGAAGCTGTATACAAAGCCTTCCCAGAGGAAAGATATGTGGGCTACATCGAAGGCACGACTAAGATAATAATTATTAGAGATCCTGAGGTCATGAGAAATATCACAGTAAAGGATTTTCAACATTTCACTGATCATAAAACTTTCGTCTCGGAAGATTCTGAGCCTCTCTTTGGACGAAGCCTTTTCTTTATGAAAGGTGAGAGATGGCACAGTATGCGAACAACCCTTAGCCCAGCGTTCACCAGCTCCAAGATGAAGCTAATGATGCCGTTCATGAGCGAAATCAGTTCCAACATTGTAGAATATCTAAAAGGTCACGTAAATGAAGACGTCGATGTTCAAGACCTGATGCGACGATACACGAATGACGTCATTGCGTCGGCTGCCTTTGGTCTTCCGGTGAATTCAGTTAAGGATCGAGACAATGAATTTTTCACCATCGGCAGAAATCTTTTTTCTTTTACTTTTTTCCAAAAAATTTATTCAATATTTGTAGCCTTGTTTCCTAACTTTATGAAGAAACTGGGTATCACGGTATATCCAACTAAAGTTATAAACTTCTTTAAAGGAATAGTTGTAAACACGATGCAAAATAGAGAAAAAAACAATATTCACAGACCTGACATGATACAATTACTGATGGAAGCTTCTAAAGGTATATTGAAAGATGAGAGTGATACAAACGGAAATAAATCACAAAATAACGAAACATCCAAACAAAAACCTATAAAAGAATGGTCAGTGGAAGAGCTATCGAGCCAAGTTTTCTTATTCTTTGCCGCTGGATATGAAAGCTCGGCATCAACTTTGGTCATGTGCGTTCATGAATTGGCCCTGAACCCCGATGTCCAAGAGAAGTTGTATCAAGAGATCAAAGAGTACAAAGAGAAACACGGAGAAATTACCTTCGAGCATATACACAATCTAAAATATCTAGATTGGGTTTTAAGCGAGACTTCCAGGAAATGGTCGACCATTTTCGTTTTGGATAGAACCTGCACTCTTCCTTATGAACTTCCACCACCCAGAGAAGGGTTGAAGCCTGTACAACTAAAACCTGGAGATGTTGTTTACAACATGGTTAATTGTATCCACAAGGACCCCATTCACCATCCCAATCCTGAAAAGTTTGATCCCGAAAGATTTTCCGATGAGAACAAGCATAAAATCAAACCATTTACATATATGCCTTTTGGCATGGGTCCGCGGAACTGCATTGGGATAAGATTCGCGCAACTCGAACTTAAAATTCTCATCTTTGACATTGTGTCAAACTACAAGATCGTCAAATGCACGAAAACTATGGACCCTGTGGTTTTGAAACCGCACGCTTTTAATTTACAACCAAGAGATGGCTCGATTGTCCGATTTGTGCCCAGACATGATAATTGA

Protein sequence:

>DPOGS202077-PA
MITEIVVFLLTSLIVYFIYIHKSIHKYFDDRGIKYLPGVPFFGNNLRSSFLKVHVLEDLEAVYKAFPEERYVGYIEGTTKIIIIRDPEVMRNITVKDFQHFTDHKTFVSEDSEPLFGRSLFFMKGERWHSMRTTLSPAFTSSKMKLMMPFMSEISSNIVEYLKGHVNEDVDVQDLMRRYTNDVIASAAFGLPVNSVKDRDNEFFTIGRNLFSFTFFQKIYSIFVALFPNFMKKLGITVYPTKVINFFKGIVVNTMQNREKNNIHRPDMIQLLMEASKGILKDESDTNGNKSQNNETSKQKPIKEWSVEELSSQVFLFFAAGYESSASTLVMCVHELALNPDVQEKLYQEIKEYKEKHGEITFEHIHNLKYLDWVLSETSRKWSTIFVLDRTCTLPYELPPPREGLKPVQLKPGDVVYNMVNCIHKDPIHHPNPEKFDPERFSDENKHKIKPFTYMPFGMGPRNCIGIRFAQLELKILIFDIVSNYKIVKCTKTMDPVVLKPHAFNLQPRDGSIVRFVPRHDN-
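
Length check:

The transcript and protein lengths listed in this record (1569 bp and 522 aa) agree with each other if the transcript is a pure coding sequence that includes its stop codon, which matches the sequences above (the nucleotide sequence starts with ATG and ends with TGA, and the protein ends with a `-` stop marker). The sketch below only illustrates that arithmetic. The function name `expected_protein_length` is hypothetical and is not part of any geneset tooling.

```python
def expected_protein_length(cds_bp: int, includes_stop: bool = True) -> int:
    """Return the number of amino acids encoded by a coding sequence.

    Assumes the sequence is a complete, in-frame CDS with no UTRs.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_bp // 3
    # The stop codon occupies one codon but encodes no amino acid.
    return codons - 1 if includes_stop else codons


# Values taken from this record: transcript 1569 bp, protein 522 aa.
print(expected_protein_length(1569))  # 522
```

A mismatch between the two reported lengths would suggest untranslated regions in the transcript or a partial gene model, neither of which appears to be the case here.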