Monarch geneset OGS2.0

DPOGS206994
TranscriptDPOGS206994-TA1404 bp
ProteinDPOGS206994-PA467 aa
Genomic positionDPSCF300001 + 858554-864022
RNAseq coverage113x (Rank: top 59%)
Annotation
HeliconiusHMEL0021164e-14153.15% 
BombyxBGIBMGA009949-TA4e-10566.30% 
DrosophilaCyp4d1-PA9e-5129.30% 
EBI UniRef50UniRef50_UPI00022467C56e-6932.28%UPI00022467C5 related cluster n=1 Tax=unknown RepID=UPI00022467C5
NCBI RefSeqXP_001602395.11e-7134.81%PREDICTED: similar to cytochrome P450 [Nasonia vitripennis]
NCBI nr blastpgi|3330371737e-17463.71%cytochrome P450 [Bombyx mori]
NCBI nr blastxgi|3330371731e-16863.71%cytochrome P450 [Bombyx mori]
Group
Gene OntologyGO:00090551.2e-91electron carrier activity
GO:00200371.2e-91heme binding
GO:00167051.2e-91oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055061.2e-91iron ion binding
GO:00551141.2e-91oxidation-reduction process
KEGG pathway 
InterPro domain[6-462] IPR0011281.2e-91Cytochrome P450
[38-57] IPR0024011.2e-15Cytochrome P450, E-class, group I
Orthology groupMCL25672 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206994-TA
ATGTTGGATCTGGCAAAATCAGTTCCGGGACCCCCAGCTCTGCCTCTTGTAGGAAATGCCCTACTCTTCATGGTCAACCCTAAAGAACAATTAAAAATTGTAAACCAGTTATTAAATAAATACGGAGATTATGTGAAGTTTTGGTTGGGCCCAGACTTGAATATTTGCGTCAAAAATCCGGCGGATATAAGGTTCCTGTTGACAAGTAACAAAGTTACTCAAAAAGGTCCGGTGTATGAATTCTTCAAGAGCTTCGTAGGGCATGGTATTTTATCTGGCGGTAAGCATTGGAAGGCACATCGGAAAATTGTTATGCCATCGTACAACAAGAAAGCAGTAAATCTATTCAGTACAGTGTTCAATAAAGAGGCTGAAGATTTAGCGAAAGTTTTAAGTCAAAAAGACCCGCATCAGACATTCAACGTTTACTTTGACGTTGTTCATTGCACCACCCAAAGCGTGTGTCAAACACTAATGGGACTTACGAAGGAAGAATCCTTAAATGTTGCCCGTTTGAGAGAAGTGATGTTGGAGACACATAATATGTATCAATTAATTCATTTAAAAATGACAAGATGGTGGTTACACATACCTATTATTTACTATCTGTCCGGAAGAAAACGAATAGAAAATAAATATATTAAAATGACTGAAGATCTGTCATCGGATATACTACAGAAAAGAAAAAACGCACTGAAACATGAAGTCACAGATGAAAATAGTATGAATGCAGTTGATAGACTAATTTTAGAAGGTTTAGATGAAAAAGAAATAAAATTAGAGGTTTTCACTCTATTTACAACAAGTCAAGAGGCATCGGCTAAAATAGTCGCTGGTGTACTTCTATTTCTTGCGCATCTTCCCGAATGGCAGGAGAAAGTCTACGACGAAATCCTGGCTACGGTTGGCTTTACAGCTGAGGTTACTGATGAACACCTGAAGAACCTTCACTACTTGGATATGGTGTACAAGGAAGTTCTGCGCTATTTGGCCATAGGGGCCATGATACAGAGATCTGTCGAAAAAGAGATAACTATTAACAACGGTAAAATAACCCTTCCGGTCAAAACGTCATTGGTAATACCGATACACGAATTGCATCGCGATTCTCGGTACTGGGACGAACCGAATAAAGTGAAACCGGAGAGATTCATGCCGGAAAATGTAAAGAAACGCGACCCAAATGCCTTCGTACCATTCAGTTTGGGGCCCATGGATTGTCTGGGTAGAGTTTATGCGACAAAATTAATCAAAACAATTGTTGTCCAAGTCATCCGACAACTGAAGCTAGAAGCTGACGGAACGTTGGAAGAGCTGGAACTAGACATCGCGATATCGGTGAAGTTTGCAAAAGGATACAACATTAGGGCGAAAAAACGAAACAATGACGCAACAAGCGCGTGA

Protein sequence:

>DPOGS206994-PA
MLDLAKSVPGPPALPLVGNALLFMVNPKEQLKIVNQLLNKYGDYVKFWLGPDLNICVKNPADIRFLLTSNKVTQKGPVYEFFKSFVGHGILSGGKHWKAHRKIVMPSYNKKAVNLFSTVFNKEAEDLAKVLSQKDPHQTFNVYFDVVHCTTQSVCQTLMGLTKEESLNVARLREVMLETHNMYQLIHLKMTRWWLHIPIIYYLSGRKRIENKYIKMTEDLSSDILQKRKNALKHEVTDENSMNAVDRLILEGLDEKEIKLEVFTLFTTSQEASAKIVAGVLLFLAHLPEWQEKVYDEILATVGFTAEVTDEHLKNLHYLDMVYKEVLRYLAIGAMIQRSVEKEITINNGKITLPVKTSLVIPIHELHRDSRYWDEPNKVKPERFMPENVKKRDPNAFVPFSLGPMDCLGRVYATKLIKTIVVQVIRQLKLEADGTLEELELDIAISVKFAKGYNIRAKKRNNDATSA-