Monarch geneset OGS2.0

DPOGS207389
TranscriptDPOGS207389-TA1458 bp
ProteinDPOGS207389-PA485 aa
Genomic positionDPSCF300267 + 100490-104751
RNAseq coverage43x (Rank: top 72%)
Annotation
HeliconiusHMEL0122450.070.02% 
BombyxBGIBMGA001162-TA3e-8332.58% 
DrosophilaCyp4c3-PA3e-8534.61% 
EBI UniRef50UniRef50_Q4R1I71e-9438.57%Cytochrome P450 n=4 Tax=Papilionoidea RepID=Q4R1I7_9NEOP
NCBI RefSeqXP_001602979.14e-10138.26%PREDICTED: similar to cytochrome P450 [Nasonia vitripennis]
NCBI nr blastpgi|675139584e-9438.57%cytochrome P450 [Papilio xuthus]
NCBI nr blastxgi|675139584e-9238.40%cytochrome P450 [Papilio xuthus]
Group
Gene OntologyGO:00090551.1e-109electron carrier activity
GO:00200371.1e-109heme binding
GO:00167051.1e-109oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055061.1e-109iron ion binding
GO:00551141.1e-109oxidation-reduction process
KEGG pathwaydme:Dmel_CG14382e-83 
 K00517 (E1.14.-.-)maps-> Naphthalene and anthracene degradation
    Stilbenoid, diarylheptanoid and gingerol biosynthesis
    Limonene and pinene degradation
    gamma-Hexachlorocyclohexane degradation
InterPro domain[10-485] IPR0011281.1e-109Cytochrome P450
[285-302] IPR0024011.5e-14Cytochrome P450, E-class, group I
Orthology groupMCL17654 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207389-TA
ATGGTGATCATACCGCTCCTAATTGTATTGTCCTGCATATATTATGTTTATTTTCGACGGATGAGAAAGCCGTTGTATGCATTGTCTGAGAGCTTGCCTGATACGGGATGTCTCCCGATACTTGGGCACACACACTGGTTTATTGGAGGACCGGAAAAGATTTTAAGTAACATACTGGAGCTAGCTCAATTGGTTGATGATTCTGGCGGTATTGGTAAAATTTGGATCGGCCCTTCATTATACATCGTGAGTTTCAATCCAGAAGATGTCCAAAAAATTCTAGAGGGCTCCCTTCAAAAGGATTCTTCGTATAGATTTCTTCAGGATTGGTTGGGAAATGGTCTCTTTGTGGCTCCAGTTGATTTATGGAAAATACATCGCAAAGTTCTCCTGTCTGTGTTTCACAATAGAATTATTGAAGATTACGTGGGTGTTTTCGGGGAACAAGGAAAAGTCTTGGTTGGACGACTAGAAGACCAAGTAGGCAAAAAGGAGTTTGACGTGTTCAGATACATAACTTCGTGTATGCTGGATATCGTATTTGAGACTGCTATGGGGGAGAAAATGAATGTTCAGTATAATCCTGACACTCCATACTTGCGGGCTCGTAACACCGTCATATCTATAATTGGAATGAGGCTTTTTAAAGCCTGGATGCAACCAAACGCCTTATTTAATCTAACATCTTATTCCAAAACTCAGAAAGAAAACATCGAGGCGACACACAAGTTCACAGATGAGGTAGTAAAGAAAAAGCGAATCTTATTCGAGGCCAGAGAAAAAATTGTAACACAAGGTCGAAGAGATCTGCTAGAATTGTTGCTTGATAGAGACACGAAGTTTACGGACGAGGAACTCAGAGAGCATATAGACTCTATTACTATCGCGGGCAATGATACCACAGCTCTAGTTATATGTTACGCACTATTGTTATTGGGACAACATTCTGAAGCACAGGATAAAGTTTACAATGAATTACGAGATATATTCGGTGATTCATTGAGATCGCCAACTAAAGAAGATTTAAATAAAATGGAATATTTGGAAAGGGTGATAAAAGAGACAATGAGGCTGTACACTGTTGTTCCAATCATCGGCAGAGAAACTCAAAAGGAGATAAAACTCTCGAAATGCACAGTTCCTGCGGGTGTGGGTTGTGCTGTGTTACTGTTCGTGATGCACAGATCGAAACGTATTTGGGGCCCGGATGCTGACACGTTCAACCCTGACAGATTCCTCCCAGAGAACAGCGCAAAACGTCATCCCTGCTCATACATACCGTTCAGTTACGGCAATAGAAATTGCATAGGGCGTCATTTTGGGATGCTGGCCATGAAGAGCATCCTTGCAAATATTATAAGAAGTTACAAAATAACGTCAAAGCGTTGCGAACGTCTGAAAATAGAAATATTGTTGTATCCGGTACCTGGCCACCTAATATCTATAGAGAAACGTTAA

Protein sequence:

>DPOGS207389-PA
MVIIPLLIVLSCIYYVYFRRMRKPLYALSESLPDTGCLPILGHTHWFIGGPEKILSNILELAQLVDDSGGIGKIWIGPSLYIVSFNPEDVQKILEGSLQKDSSYRFLQDWLGNGLFVAPVDLWKIHRKVLLSVFHNRIIEDYVGVFGEQGKVLVGRLEDQVGKKEFDVFRYITSCMLDIVFETAMGEKMNVQYNPDTPYLRARNTVISIIGMRLFKAWMQPNALFNLTSYSKTQKENIEATHKFTDEVVKKKRILFEAREKIVTQGRRDLLELLLDRDTKFTDEELREHIDSITIAGNDTTALVICYALLLLGQHSEAQDKVYNELRDIFGDSLRSPTKEDLNKMEYLERVIKETMRLYTVVPIIGRETQKEIKLSKCTVPAGVGCAVLLFVMHRSKRIWGPDADTFNPDRFLPENSAKRHPCSYIPFSYGNRNCIGRHFGMLAMKSILANIIRSYKITSKRCERLKIEILLYPVPGHLISIEKR-