Monarch geneset OGS2.0

DPOGS213017
TranscriptDPOGS213017-TA1620 bp
ProteinDPOGS213017-PA539 aa
Genomic positionDPSCF300024 + 142071-145156
RNAseq coverage37x (Rank: top 74%)
Annotation
HeliconiusHMEL0050670.072.54% 
BombyxBGIBMGA006936-TA0.067.35% 
Drosophilaphm-PA8e-10841.17% 
EBI UniRef50UniRef50_D1FQH30.072.91%Cytochrome P450 CYP306A1 n=3 Tax=Endopterygota RepID=D1FQH3_SPOLI
NCBI RefSeqNP_001106222.10.067.35%cytochrome P450 monooxygenase [Bombyx mori]
NCBI nr blastpgi|2221427040.072.91%cytochrome P450 CYP306A1 [Spodoptera littoralis]
NCBI nr blastxgi|2221427040.072.91%cytochrome P450 CYP306A1 [Spodoptera littoralis]
Group
Gene OntologyGO:00090551.5e-113electron carrier activity
GO:00200371.5e-113heme binding
GO:00167051.5e-113oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055061.5e-113iron ion binding
GO:00551141.5e-113oxidation-reduction process
KEGG pathwayame:4083986e-125 
 K10720 (PHM)maps-> Insect hormone biosynthesis
InterPro domain[311-533] IPR0011281.5e-113Cytochrome P450
[56-75] IPR0024011.8e-40Cytochrome P450, E-class, group I
Orthology groupMCL15841 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213017-TA
ATGGACGTTATATTCTTATGGCTCGTAACGATCGTGCTGGGATTTTTGTTACTCAAACAACTTTATCGATGGCGGTCTTTACCCCCGGGCCCCTGGGGTCTGCCTGTATTTGGATACCTACCATTTTTAGACCCAGAACAACCTCATTTGACCTTAACTAAGTTGGCGGAGCGTTATGGACCAATTTATGGCATAGACATGGGGAGCACATACGCCGTTATTATTTCTGATTATAAATTAGTTCGGGAAGCGTTTGCCAAAGATGTTTTTTCTGGAAGAGCTCCGCTATATTTGACCCATGGAATCATGAGAGGCTTTGGCATAATTTGCTCTGAAGGAGCTCGGTGGAAAGATCAACGCAAACTAGTAACAACTTGGCTTAAAAGTTTTGGTATGAGCAAACATAGCGTATCCAGAGACAAATTAGAAAAGCGTATAGCTTCTGGTGTTCATGAAGTAGTTCAAAATCTTAGGGAAAACAATGGAAGTCCTGTTAATTTGTCGGATATGATAACGCATTCTTTGGGCAACGTTGTGAATGATATTATATTTGGTTTCAAATACGATGCCGACGACAAGACGTGGCGATGGTTCAGACAAATACAAGAAGAAGGTTGCCATGAAATGGGCGTGTCTGCTATCGTGAACTTCCTTCCGTTTATGAGATTCATATCCCCGTCTATTCAAAAGACAATGGAAGTTTTAATTCGAGGGCAAGCCCAAACTCATAGATTATACGCTAGCATCATAGCCCGCCGGCGAAAAATGCTTGGTTTGCAAAAACCTGCTGGAGCTGAGCGCCCTGCACACGATAAGCTGTTTGATGAACATCCTGAGGGGTTCATAAAATGCACAAAGTATAGTAAGAATGCATCAAACGACGAAGTACATTTTTTTAATCCCGATGTATTAATTCCATCCCAGGATGAATGCATTTTGGATAAATTCTTGATAGAACAAAAAAGGAGGTATGAAAATAAAGAAGAGAGTGCAATTTTTGTAACCGACGAACAGTTACATTTCTTGTTAGCTGACATGTTTGGGGCAGGACTTGATACTACGTCTGTGACACTATCGTGGTTTTTACTGTACATGGCTCTATATCCGGATGAGCAGGAACTCGTGCGTGAAGAAATATTATCTGTGTATTCAGAAGAATGTGAAATCGACAGTTCAAAATTACCTAAACTTATGGCTGCGATTTGCGAAACACAACGAATCCGGTCTATAGTACCTGTGGGTATACCTCATGGATGTCTGCAAGATACATATTTGGGTAATTATCGAATACCAAAAGGTGCAATGATCGTACCGCTGCAGTGGGCAATTCATATGGACCCTAACATTTGGGAAGATCCGCATATTTTCAAACCAAGCAGATTCCTCGACGAGAACGGCAAATTGTTGAAACCCCAAGAGTTTATACCGTTTCAAACAGGTAAGCGAATGTGCCCTGGCGACGAGCTTTCAAGAATGCTTACGGTTGGCTTCATGGTTCAATTGTTCCGATCTTTTCGCGTGCGACTTGAATCAAAGCCCCCCTCGACAAAGGAAATGCAAGGAAAAGTTGGTGTTACTTTGTCGCCACCTCATGTTCTTTTCGTTTGTGACTCTTTATAA

Protein sequence:

>DPOGS213017-PA
MDVIFLWLVTIVLGFLLLKQLYRWRSLPPGPWGLPVFGYLPFLDPEQPHLTLTKLAERYGPIYGIDMGSTYAVIISDYKLVREAFAKDVFSGRAPLYLTHGIMRGFGIICSEGARWKDQRKLVTTWLKSFGMSKHSVSRDKLEKRIASGVHEVVQNLRENNGSPVNLSDMITHSLGNVVNDIIFGFKYDADDKTWRWFRQIQEEGCHEMGVSAIVNFLPFMRFISPSIQKTMEVLIRGQAQTHRLYASIIARRRKMLGLQKPAGAERPAHDKLFDEHPEGFIKCTKYSKNASNDEVHFFNPDVLIPSQDECILDKFLIEQKRRYENKEESAIFVTDEQLHFLLADMFGAGLDTTSVTLSWFLLYMALYPDEQELVREEILSVYSEECEIDSSKLPKLMAAICETQRIRSIVPVGIPHGCLQDTYLGNYRIPKGAMIVPLQWAIHMDPNIWEDPHIFKPSRFLDENGKLLKPQEFIPFQTGKRMCPGDELSRMLTVGFMVQLFRSFRVRLESKPPSTKEMQGKVGVTLSPPHVLFVCDSL-