Monarch geneset OGS2.0

DPOGS202078
TranscriptDPOGS202078-TA1047 bp
ProteinDPOGS202078-PA348 aa
Genomic positionDPSCF300116 - 434518-438598
RNAseq coverage13x (Rank: top 82%)
Annotation
HeliconiusHMEL0056296e-11755.01% 
BombyxBGIBMGA003926-TA4e-9051.70% 
DrosophilaCyp9f2-PA3e-5934.57% 
EBI UniRef50UniRef50_B1AAB31e-10251.33%CYP9G3 n=7 Tax=Ditrysia RepID=B1AAB3_BOMMO
NCBI RefSeqNP_001108456.13e-10351.33%cytochrome P450 9G3 [Bombyx mori]
NCBI nr blastpgi|1692346695e-10251.33%cytochrome P450 9G3 [Bombyx mori]
NCBI nr blastxgi|1692346692e-10051.47%cytochrome P450 9G3 [Bombyx mori]
Group
Gene OntologyGO:00090551.2e-71electron carrier activity
GO:00200371.2e-71heme binding
GO:00167051.2e-71oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055061.2e-71iron ion binding
GO:00551141.2e-71oxidation-reduction process
KEGG pathwaydwi:Dwil_GK111953e-62 
 K07424 (CYP3A)maps-> Drug metabolism - cytochrome P450
    Drug metabolism - other enzymes
    Linoleic acid metabolism
    Steroid hormone biosynthesis
    Metabolism of xenobiotics by cytochrome P450
    gamma-Hexachlorocyclohexane degradation
    Retinol metabolism
InterPro domain[20-345] IPR0011281.2e-71Cytochrome P450
[122-139] IPR0024011.7e-17Cytochrome P450, E-class, group I
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202078-TA
ATGTTTTGGAAGACCTTGAAGCTGTATACAAAGACTTCTCAAAGGAAAGGTCACGTAAATGAAGACGTCGATGTTCAAGACCTGATGCGACGATACACGAATGACGTCATTGCGTCGGCTGCCTTCGGTCTTCCGAAACTGGGTATAACGGTATTTCCAAAAAAAGTTATAAACTTCTTTAAAGGAATAGTTGTAAACACGATGCAACATAGAGAAAAAAACAATATTCACAGACCTGACATGATACAATTACTGATGGAAGCTTCTAAAGGTACATTGAAAGATGAGAGATATGCAAACGGTAACAAATCACAAAATAACGAAGCATTCAAACAAAGACCTGAAAAGGAATGGTCAGTGGAAGAGCTATCGAGCCAAGTTTTCTTATTCTTTGCCGCTGGATATGAAAGCTCGGCATCAACTTTGGTCATGTGCGTTCATGAATTGGCCCTGAACCCCGATGTCCAAGAGAAGTTGTATCAAGAGATCAAAGAGTACAAAGAGAAACACGGAGAAATTACCTTCGAGCATATACACAATCTAAAATATCTAGATTGGGTTTTAAGTGAGACTTCCAGAAAATGGTCAACCGTTTTGGTTTTGGATAGAACCTGCACTCTTCCTTATGAACTTCCTCCACCCAGAGAAGGTTTAAAGCCTGTGCAAGTAAGTTATTTACTTCCTACCCTATGTTTATGGTTGCAGCTGAAACATGGAGATCTTGTTTACAACATGGTGAAATCCATCCACATGGACCCCATCTACCATCCTAACCCTGAAAAGTTTGATCCTGAAAGATTTTCCGATGAGAACAAGCATAAAATCAAACCATTTACATATATGCCTTTTGGCATGGGTCCGCGGAACTGCATTGGGATAAGATTCGCGCAACTCGAACTTAAAATTCTGATCTTTGACATTGTGTCAAACTACAAGATCGTCAAATGTACGAAAACTATGGACCCTGTGGTTTTGAAACCACATGCTTTTAATTTACAACCAAGAGATGGCTCGATTGTCCGATTTGTGCCTAGACATGCTCATTAA

Protein sequence:

>DPOGS202078-PA
MFWKTLKLYTKTSQRKGHVNEDVDVQDLMRRYTNDVIASAAFGLPKLGITVFPKKVINFFKGIVVNTMQHREKNNIHRPDMIQLLMEASKGTLKDERYANGNKSQNNEAFKQRPEKEWSVEELSSQVFLFFAAGYESSASTLVMCVHELALNPDVQEKLYQEIKEYKEKHGEITFEHIHNLKYLDWVLSETSRKWSTVLVLDRTCTLPYELPPPREGLKPVQVSYLLPTLCLWLQLKHGDLVYNMVKSIHMDPIYHPNPEKFDPERFSDENKHKIKPFTYMPFGMGPRNCIGIRFAQLELKILIFDIVSNYKIVKCTKTMDPVVLKPHAFNLQPRDGSIVRFVPRHAH-