Monarch geneset OGS2.0

DPOGS215094
TranscriptDPOGS215094-TA1854 bp
ProteinDPOGS215094-PA617 aa
Genomic positionDPSCF300187 + 289858-300012
RNAseq coverage2101x (Rank: top 6%)
Annotation
HeliconiusHMEL0055537e-11159.75% 
BombyxBGIBMGA014043-TA8e-8151.27% 
DrosophilaCyp4ac1-PA2e-6240.95% 
EBI UniRef50UniRef50_Q8ISJ86e-11261.71%Cytochrome P450 CYP4S4 n=3 Tax=Noctuidae RepID=Q8ISJ8_MAMBR
NCBI RefSeqXP_001604548.12e-7145.07%PREDICTED: similar to cytochrome P450 CYP4AB1 [Nasonia vitripennis]
NCBI nr blastpgi|1566195061e-11464.13%cytochrome p450 CYP4S1 [Helicoverpa armigera]
NCBI nr blastxgi|1566195064e-11164.33%cytochrome p450 CYP4S1 [Helicoverpa armigera]
Group
Gene OntologyGO:00090551.5e-87electron carrier activity
GO:00200371.5e-87heme binding
GO:00167051.5e-87oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055061.5e-87iron ion binding
GO:00551141.5e-87oxidation-reduction process
KEGG pathway 
InterPro domain[315-617] IPR0011281.5e-87Cytochrome P450
[421-438] IPR0024012.5e-21Cytochrome P450, E-class, group I
Orthology groupMCL30481 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215094-TA
ATGCTGTGGCCGATATTTTTTGGATTACTGGTTTGTGTGGTGGTGTGGAGACTTATCAAACAGGAGCCCAATGACCTTGCACACCTGCCTGGACCTCCAAGGACACCAATATTTGGATCAGTTTTTATGTTCCTGGGAAAATCTCATAGCGAACTCTTCAAAATGTTGGTTGAGCTTCCAAAGAAATATGGAAATCGTCTTGTTATCAAGGCAATGCACCGGTATATTCTACATGTTTACAAAGTCGAGGACATTGAGATTGTTCTAACACATTCGAGAAACATCAAGAAGAATAAACCTTACACGTTCATAGAGCCGTGGTTGGGAACTGGTCTTCTTATTAGTAATGGCAGTAAATGGCAGAAACGGCGAAAAATCTTGACACCGACATTCCATTTCGACATTTTAAAGGGATTCGTAAAAGTATTCGAAGAGCAAAGTAGGAATCTGACAACAATGCTCAGGAAAAAACTGCAGGAGTCAAATGTTGTCGATACTATGGCCATCATGAGCGATTTTACACTTTATATTATATGTGAGACGGCTATGGGTATAAGATTAAATGCGGATAAAAGCGCTGAAAAAATGATGTATAAGAAGGCCATCATGGAAATAGGACAGATAGTGATGAAGAGGCTGACCACAGTGTGGCTTCACAGTGACCTGATCTTTTACAATATGCCCATCGGAAAGAAATTCACCAAGTGTCTGGAAAACGTGCATTCCTTCGCTGATAACGTGATCCTGGAGCGGAAAAAAAAATACGAGAGCGTCGCAAATGAGGATGGTGGGAGAAGGAGATTAGCGTTTTTAGACTTACTCCTTGAAGCGGAGAGGAACGGAGAAATAGATTTGGAGGGAGTAAGAGAAGAGGGTCATGACACAACAGCTACCGCTTTAGCATTTGGCCTGGTGTTGCTCGCCGACAGCGAGGAGGTTCAGACGGCTATGGGTATAAGATTAAATGCGGATAAAAGCGCTGAAAAAATGATGTATAAGAAGGCCATCATGGATATAGGACAGATAGTGATGAAGAGGCTGACCACAGTGTGGCTTCACAGTGACCTGATCTTTTACAATATGCCCATCGGAAAGAAATTCACCAAGTGTTTGGAAAACGTGCATTCCTTCGCTGATAACGTGATCCTGGAGCGGAAAAAAAAATACGAGAGCGTCGCAAATGAGGATGGTGGGAGAAGGAGGTTAGCGTTTTTAGACTTACTCCTTGAAGCGGAGAGGAACGGAGAAATAGATTTGGAGGGAGTAAGAGAAGAGGTTAATACGTTTATGTTTGAGGGTCATGACACAACAGCTACCGCTTTAGCATTTGGCCTGGTGTTGCTCGCCGACAGCGAGGAGGTTCAGGAACGTCTCTTCGAGGAGTGTCAGCGGGTTGGTCCTGAGCCGAGTGTGTCCGAGTTGAACGACATGAAGTATTTAGAAGCTGTGGTCAAAGAAATCTTGAGGTTGTATCCAAGCGTGCCGTTTATAGGACGAGAAATTACCGAGGACTTTATGTTAGATGACATCAAAGTAAAGAAAGGCTGTGAAGTAGTCGTTCATATATACGACGTACATCGAAGACCGGATCTATATCCGGATCCTGTAGCTTTCAAACCGGAAAGATTTCTGGACGAAGAGAAACGACATCCCTACTCCTATGTACCGTTCAGTGCTGGGCCACGAAATTGCATTGGTCAAAAGTTCGCCAAGCTCCAGATGAAGGTCGTCATTAGTGAGATAGTCCGTAATTTCAAGTTGTCACCGCTGGTCGCTGGCGCACGACCCGACCTCAAGGTCGATCTAGTACTGAGACCTGCTGAAACCATCTACGTGAAATTTTATCCTCGATAG

Protein sequence:

>DPOGS215094-PA
MLWPIFFGLLVCVVVWRLIKQEPNDLAHLPGPPRTPIFGSVFMFLGKSHSELFKMLVELPKKYGNRLVIKAMHRYILHVYKVEDIEIVLTHSRNIKKNKPYTFIEPWLGTGLLISNGSKWQKRRKILTPTFHFDILKGFVKVFEEQSRNLTTMLRKKLQESNVVDTMAIMSDFTLYIICETAMGIRLNADKSAEKMMYKKAIMEIGQIVMKRLTTVWLHSDLIFYNMPIGKKFTKCLENVHSFADNVILERKKKYESVANEDGGRRRLAFLDLLLEAERNGEIDLEGVREEGHDTTATALAFGLVLLADSEEVQTAMGIRLNADKSAEKMMYKKAIMDIGQIVMKRLTTVWLHSDLIFYNMPIGKKFTKCLENVHSFADNVILERKKKYESVANEDGGRRRLAFLDLLLEAERNGEIDLEGVREEVNTFMFEGHDTTATALAFGLVLLADSEEVQERLFEECQRVGPEPSVSELNDMKYLEAVVKEILRLYPSVPFIGREITEDFMLDDIKVKKGCEVVVHIYDVHRRPDLYPDPVAFKPERFLDEEKRHPYSYVPFSAGPRNCIGQKFAKLQMKVVISEIVRNFKLSPLVAGARPDLKVDLVLRPAETIYVKFYPR-