Monarch geneset OGS2.0

DPOGS213007
TranscriptDPOGS213007-TA1623 bp
ProteinDPOGS213007-PA540 aa
Genomic positionDPSCF300024 - 146134-148366
RNAseq coverage393x (Rank: top 31%)
Annotation
HeliconiusHMEL0050660.090.00% 
BombyxBGIBMGA006916-TA0.081.15% 
DrosophilaCyp18a1-PB0.062.60% 
EBI UniRef50UniRef50_A3RKJ40.081.15%Cytochrome P450 n=8 Tax=Endopterygota RepID=A3RKJ4_BOMMO
NCBI RefSeqNP_001077078.10.081.15%cytochrome P450 18a1 [Bombyx mori]
NCBI nr blastpgi|1342544700.081.15%cytochrome P450 18a1 [Bombyx mori]
NCBI nr blastxgi|1140495930.083.52%cytochrome P450 [Spodoptera littoralis]
Group
Gene OntologyGO:00090552e-121electron carrier activity
GO:00200372e-121heme binding
GO:00167052e-121oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055062e-121iron ion binding
GO:00551142e-121oxidation-reduction process
KEGG pathwaydre:5562803e-89 
 K07422 (CYP2U)maps-> Arachidonic acid metabolism
InterPro domain[46-519] IPR0011282e-121Cytochrome P450
[79-98] IPR0024012e-54Cytochrome P450, E-class, group I
Orthology groupMCL15054 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213007-TA
ATGATAACAATACTCTCTAACTCAAAACTGTTGTGGGGACTGTGGCAAGTGATGAGCTACTGTACTTCGCGTACGACGATGCCGCTATTGCTGACTGTTGGCTTGACATTTTTGGTAATGCGGTTAATAAATCTAGTGCGTGAAATCCGAAAGTTACCACCCGGCCCATGGAGTCCGCCGGTCGTCGGCTACCTACCCTTCCTTGGGGTTCGCCATAAAACATTTTTGGAGCTTGCAAGAAGTTATGGTGCCCTTTTCTCCGCTCGTCTTGGAAATCAACTAACTGTGGTGTTGAGTGACTATAAATTAATCAGAGAAGCGTTTCGACGGGAGGAGTTCACCGGTCGACCCAGCACTCCATTGATGCATATTTTGGATGGTCTAGGTATCGTCAATAGTGAGGGTCGCCTGTGGAAGAGCCAACGCCGGTTCTTGCACGAGAAACTCCGCGAATTCGGCATGACTTACATGGGAAATGGCAAGAAAATCATGGAAGCGAGAATTAAGAACGAAGTTCACGATCTGATGGATAACTTGCAATGCACAGAAGCAGCCCCAGTAGATCCCAACCCGCTTCTCGCTTTGGGCGTGTCTAACGTGATCTGTGGTATTACTATGTCAGTACGCTTTAGTCACGGTGATGCCCGATTTGCTAGACTAAACCATTTAATTGAAGAGGGCATGAGACTATTTGGAGAAATCCATTATGGAGAATACATTCCTCTATACAACTATCTACCAGGCAAAGCTCAAGTTCAAGAAAAAGTTGCAAAGAACCGCGAAGAAATGTTTGCATTTTATCAAACTCTAATCGATGAACACCGAAATACTCTCGACATCAATAATTCTAGAGACCTCATCGATGTTTATTTGATTGAAATTGAAAAAGCAAAACTTGAAGGCAGAGAAGGAGATTTATTTGACGGGAGAAATCATGAATTACAACTTAAACAAATTCTCGGTGATTTGTTTTCCGCTGGCATGGAAACTATTAAGTCTTCACTTTTGTGGATGATCGTATTCATGTTACGAAACCCTGATGTGAAGAGACGAGTCCAGGAGGAGTTAGATACGGTTATTGGTCGAGAGCGTCTTCCAACTATTGAAGACATGTCCAGTTTACCATACACAGAAACTACGATACTTGAAACTCTCCGCATGTCTAGCATTGTACCTTTGGCCACGACACATTCTCCCACAAAAGATGTTCACCTGAACGGGTACAGAATTCCTGCTGGTTCCCAAGTCGTTCCTCTTATCAATTGTGTACATATGGATCCAAACCTCTGGGACGAGCCCAACAAGTTCAATCCAAGTCGATTTATAGATGAAAATGGAAAAATTAAGCGACCCGAGTACTTTATGCCGTTTGGAGTCGGACGTCGTATGTGCCTAGGGGATGTTTTGGCAAGAATGGAAATGTTTATGTTCTTCGCAAGTTTGATGCATCAATTCGATGTTCAAACCGAGGCGGGTGCAGCGCCACCATCTCTAGAAGGAACGGTGGGTGCCACAATCTCGCCACAGCCTTTCCGCGTCAAGTTCGTGCCTCGTTCGCCACCAGCTCCGCCTGTTGTCCTGGTCCACGAGCATCCACACTTGCGCCATGTTGGTTCGCATTAA

Protein sequence:

>DPOGS213007-PA
MITILSNSKLLWGLWQVMSYCTSRTTMPLLLTVGLTFLVMRLINLVREIRKLPPGPWSPPVVGYLPFLGVRHKTFLELARSYGALFSARLGNQLTVVLSDYKLIREAFRREEFTGRPSTPLMHILDGLGIVNSEGRLWKSQRRFLHEKLREFGMTYMGNGKKIMEARIKNEVHDLMDNLQCTEAAPVDPNPLLALGVSNVICGITMSVRFSHGDARFARLNHLIEEGMRLFGEIHYGEYIPLYNYLPGKAQVQEKVAKNREEMFAFYQTLIDEHRNTLDINNSRDLIDVYLIEIEKAKLEGREGDLFDGRNHELQLKQILGDLFSAGMETIKSSLLWMIVFMLRNPDVKRRVQEELDTVIGRERLPTIEDMSSLPYTETTILETLRMSSIVPLATTHSPTKDVHLNGYRIPAGSQVVPLINCVHMDPNLWDEPNKFNPSRFIDENGKIKRPEYFMPFGVGRRMCLGDVLARMEMFMFFASLMHQFDVQTEAGAAPPSLEGTVGATISPQPFRVKFVPRSPPAPPVVLVHEHPHLRHVGSH-