Monarch geneset OGS2.0

DPOGS210281
TranscriptDPOGS210281-TA1077 bp
ProteinDPOGS210281-PA358 aa
Genomic positionDPSCF300216 + 196416-203516
RNAseq coverage173x (Rank: top 50%)
Annotation
HeliconiusHMEL0109394e-12763.91% 
BombyxBGIBMGA000004-TA3e-7943.31% 
DrosophilaCyp4d2-PA1e-4130.06% 
EBI UniRef50UniRef50_Q4R1I76e-5133.24%Cytochrome P450 n=4 Tax=Papilionoidea RepID=Q4R1I7_9NEOP
NCBI RefSeqNP_001108341.13e-5033.42%cytochrome P450 CYP366A1 [Bombyx mori]
NCBI nr blastpgi|3214767727e-5135.46%hypothetical protein DAPPUDRAFT_192258 [Daphnia pulex]
NCBI nr blastxgi|675139587e-5033.33%cytochrome P450 [Papilio xuthus]
Group
Gene OntologyGO:00090557.4e-75electron carrier activity
GO:00200377.4e-75heme binding
GO:00167057.4e-75oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055067.4e-75iron ion binding
GO:00551147.4e-75oxidation-reduction process
KEGG pathway 
InterPro domain[17-357] IPR0011287.4e-75Cytochrome P450
[157-174] IPR0024013.2e-20Cytochrome P450, E-class, group I
Orthology groupMCL26720 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210281-TA
ATGACAACTTCTAATCCAAGCAAACTTGAGGAAGGCATCGAGAATTTTGAAGAGGGACGTAAGCTTGTTGATGGCATGCAAGCAGAAGTCGGGAGAGGGTGGTTCGACCAATCAAAGTACGTGAAACAGAACTTCATGGAAACTATTTGTCTAACGGCGTTAGACGACTGCGTTACACCAGAGGAGGCGAATGAGTATGTCGAGGCTTTCGAAAATTACCTTAACGGTAACATTCTTAGGTTCCAGACGTTTTGGCTTCATCCCGATATAACTTTCAAGTTCAGTAAATTGAAGAAAAAACTTGATGCAAGCATAAAAGTCCTTCACGAAATGTCGGATAAGGTACTCAAAAACAAAAGAGCTCTTAGAAAACTTAATGAAACGGAAAGTAGTGCAGAAAACAGTCCAAAGCTGAAGGTATTTATGGATTTACTTATGGACTTGGATGGTGGTGTGCTAACTGATCAGGAGATAAGAGACGAAATGAACACAATCATCATGGCGGGCCACGAGACATCAGCTAACGTTATAGTTTTCGCTCTCATATTAATTGGATCTTATCCAGAAGTTCAGGAGAAGCTTCATGAGGAATTACAAAGAGTGTTTGGTGATAGTGATAGGGATATAGAAAAACAGGATCTTTCACAGCTCATTTATATGGAAGCTGTTTTGAAAGAGACTATGCGTTTCTTCGTAATGGCGCCATTTGTTGGAAGACATATCGATCGGGAGGTCAAATTAAAAAACTGTACTCTTAAACCTGGTAACAATTGCCTGATCCTGTACTATGGGCTTCATCGTCACCCTATTTGGGGTCCAGATGTTAACGAATTCAAGCCCGAACGGTGGTTAGATCCAGCCACGCTACCGAAGAATCCAAATGCGTTTGGCGGATTTAGCATCGGGAAGAGAAATTGTATAGGTAAAACATACGCTATGATGTCCATGAAGTCGACACTATCCTACGTATTTCGACGGTTCAAAATGCAAGCCGACCACACGAAGCTCAAGTTCAAACTGGACGTTTTACTTAAACCGATAACCGGACACTACGTCACTATACAGAATAGATTATAA

Protein sequence:

>DPOGS210281-PA
MTTSNPSKLEEGIENFEEGRKLVDGMQAEVGRGWFDQSKYVKQNFMETICLTALDDCVTPEEANEYVEAFENYLNGNILRFQTFWLHPDITFKFSKLKKKLDASIKVLHEMSDKVLKNKRALRKLNETESSAENSPKLKVFMDLLMDLDGGVLTDQEIRDEMNTIIMAGHETSANVIVFALILIGSYPEVQEKLHEELQRVFGDSDRDIEKQDLSQLIYMEAVLKETMRFFVMAPFVGRHIDREVKLKNCTLKPGNNCLILYYGLHRHPIWGPDVNEFKPERWLDPATLPKNPNAFGGFSIGKRNCIGKTYAMMSMKSTLSYVFRRFKMQADHTKLKFKLDVLLKPITGHYVTIQNRL-