Monarch geneset OGS2.0

DPOGS210259
TranscriptDPOGS210259-TA1452 bp
ProteinDPOGS210259-PA483 aa
Genomic positionDPSCF300216 - 284688-291230
RNAseq coverage146x (Rank: top 54%)
Annotation
HeliconiusHMEL0109344e-12845.95% 
BombyxBGIBMGA002307-TA9e-7335.32% 
DrosophilaCyp4ac2-PB4e-4927.80% 
EBI UniRef50UniRef50_P299813e-5729.75%Cytochrome P450 4C1 n=6 Tax=Neoptera RepID=CP4C1_BLADI
NCBI RefSeqNP_001108341.11e-5528.16%cytochrome P450 CYP366A1 [Bombyx mori]
NCBI nr blastpgi|3214767716e-5830.77%hypothetical protein DAPPUDRAFT_312048 [Daphnia pulex]
NCBI nr blastxgi|3214767712e-5530.44%hypothetical protein DAPPUDRAFT_312048 [Daphnia pulex]
Group
Gene OntologyGO:00090553.3e-93electron carrier activity
GO:00200373.3e-93heme binding
GO:00167053.3e-93oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055063.3e-93iron ion binding
GO:00551143.3e-93oxidation-reduction process
KEGG pathway 
InterPro domain[15-483] IPR0011283.3e-93Cytochrome P450
[304-330] IPR0024012.3e-10Cytochrome P450, E-class, group I
Orthology groupMCL34828 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210259-TA
ATGATTATATATTTAGGACTTCTGTTTGGCTGTCTTGGTGTTTATTGGACATACGTGAAACTTTCGTCATATAACAATAAATTGATACCTAAAATACCAAAAAAATTTAATTTAAATAATTTCAGAGTGTGCTTTAAAGAATTATCTAAATGGCTTATATATGCCGGAGAATATAGTAATGATGTCGGTTGTGTATCAAAGTTCAACTATGGACCCGTTGTAGTATACGTTGTATCAAACCCCGAAGACGCCTCAACTGTATTGACTCGATGTACGTCCAAGTCGTTTGTGTACGACCTCATAAAACCGTACGTCGGTAGTCAACTGATCGCATCCGACTATCCAGTCTGGAAACGCAACCGACGTCTTCTGGACCTGGCTTTCAAGCAGAACATCTTGAATGGATACGTAGCCATGTTCAATAAACGAGCTGACGCTCTCGTGATAGATATGTCCAAGGAAATAAACAAAGAGGTCGATTTTACGCATCTGTTTAGTAGATATCTGTTGGGAACCGCTTGCTATACAACACTGGGCGTGAACGTGAACGACCAAGAGGTATCTATAGACAGCTATCTTAAGTCCGTAGATAAATTAATAAAAATCATGTTAAACAGATTTGTACAGCCCTGGCTTCTTTTAGATTTTGTGTTTAAATTTAGTGAATTGAAAAAAGAACAGGATGTAGCCCTGGAAATTGTGCAGAACTTCTCTGAAGAGGTTATACGAAAGAAAAAAATGCAATTCGCACTAAACAGTGACAACCCTGGAGATGTAGACGAAGACAACGCGGAGAGTCTTCTGGATATTCTAATTAAGAATAGAAATGAAGATGAAATTTCTGATAGGAATATGAGGGAGATAGTTGATAACGTGATATTGGCCGCTTACGATTCCAACGTGTATATGATGCTTTATATATTAGTCTGTATTGGAAGCTATCCTGAAGTGCAAAAAAAAGTCTACGATGAAGTAATCAGTGTAACAGGACAAACAGACAGGGATATTACCCACGAGGACCTGCCCAAGTTGGTTTATCTAGAGGCTGTGGTTAAAGAAGCGATCCGCCTCTATCCAGCCGGGCCTATTGTGGGGCGAGTCACGACATTTGATACCCAATTAAAGGAATATGTACTGCCGGCCGGCTGTCAGGTCATAGTTCACCTTGGCGCCATAAACCGTAACAAGGAGCACTGGGGACAAGACGCGGACGACTTCCGACCAGAGCGCTGGTTCGACTCCCTGCCGAAACATCCGGCTGCGTTCTCCAGCTTTTCTCCTGGAAGAAGAAGCTGTATTGGCAAATCCTATTCAATAATGTTGTTAAAAACTATGGCAGTTAAGATAATCAAGAAGTTTAATATAAGCAGTGATGATAAAAAATTGGATTTTGAATTCGGACTGTTTTTGAAACCTATCGCTGGACACCACATTAAATTAGAACTGAGATGA

Protein sequence:

>DPOGS210259-PA
MIIYLGLLFGCLGVYWTYVKLSSYNNKLIPKIPKKFNLNNFRVCFKELSKWLIYAGEYSNDVGCVSKFNYGPVVVYVVSNPEDASTVLTRCTSKSFVYDLIKPYVGSQLIASDYPVWKRNRRLLDLAFKQNILNGYVAMFNKRADALVIDMSKEINKEVDFTHLFSRYLLGTACYTTLGVNVNDQEVSIDSYLKSVDKLIKIMLNRFVQPWLLLDFVFKFSELKKEQDVALEIVQNFSEEVIRKKKMQFALNSDNPGDVDEDNAESLLDILIKNRNEDEISDRNMREIVDNVILAAYDSNVYMMLYILVCIGSYPEVQKKVYDEVISVTGQTDRDITHEDLPKLVYLEAVVKEAIRLYPAGPIVGRVTTFDTQLKEYVLPAGCQVIVHLGAINRNKEHWGQDADDFRPERWFDSLPKHPAAFSSFSPGRRSCIGKSYSIMLLKTMAVKIIKKFNISSDDKKLDFEFGLFLKPIAGHHIKLELR-