Monarch geneset OGS2.0

DPOGS211791
TranscriptDPOGS211791-TA1452 bp
ProteinDPOGS211791-PA483 aa
Genomic positionDPSCF300107 + 382751-386503
RNAseq coverage7x (Rank: top 86%)
Annotation
HeliconiusHMEL0079410.073.98% 
BombyxBGIBMGA004097-TA0.073.66% 
DrosophilaCyp49a1-PD9e-12544.42% 
EBI UniRef50UniRef50_Q9V5L31e-12244.42%Probable cytochrome P450 49a1 n=17 Tax=Endopterygota RepID=C49A1_DROME
NCBI RefSeqXP_002068865.15e-12544.72%GK18006 [Drosophila willistoni]
NCBI nr blastpgi|1954422381e-12344.72%GK18006 [Drosophila willistoni]
NCBI nr blastxgi|1954422384e-11844.81%GK18006 [Drosophila willistoni]
Group
Gene OntologyGO:00090552.9e-73electron carrier activity
GO:00200372.9e-73heme binding
GO:00167052.9e-73oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055062.9e-73iron ion binding
GO:00551142.9e-73oxidation-reduction process
KEGG pathwaydme:Dmel_CG150776e-60 
 K00517 (E1.14.-.-)maps-> Naphthalene and anthracene degradation
    Stilbenoid, diarylheptanoid and gingerol biosynthesis
    Limonene and pinene degradation
    gamma-Hexachlorocyclohexane degradation
InterPro domain[10-478] IPR0011282.9e-73Cytochrome P450
[242-259] IPR0024013.1e-11Cytochrome P450, E-class, group I
Orthology groupMCL24985 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211791-TA
ATGGGTGTTGCGCGCAAGTTCGCTACTATACCGGGGCCCAGGCCCCTTCCTATACTTGGGAACTCATGGCGTTTTGCTATAGGACAGAGGCCATGGCGTACACGATCCTTGGACGCGACTTTGTGGAATCTGAGAGCCTTAGCCGGTATGGGGGGAGCAGCAAAAGTCGCCAAACTTTTCGGTCACCCTGACTTAATTTTTCCCTTTTGTGCTGATGAAACCGCCAAGATTTACAGACGTGAGGATACTATGCCACATAGAGCGGTTGCACCCTGCTTGAGACATTACAAGCAGGAATTAAGAAAAGAGTTTTTCGGAGATGAACCCGGTTTGATTGGAGTTCACGGCACTGCCTGGTCAACGTTCAGGTCTAAGGTTTCGAAGGCCCTTGCGGCACCTCAAGCGGCACAAGTCGCTGTACCATCGCTCGACTGCGTATCAAACGATTTTGTTCACAGAATGGAGAGTATTTTGGATCATAATAGAGAGCTGCCGTGTGATTTTTTAACTGAGCTTTATAAATGGGCACTCGAATCTGTCGGAGCTTGGGCTTTGGGAACAAGACTTGGATGTTTGAAAGATAATGATACTGATGCCATGGAAATTATAAACAATATCCACGGTTTTTTTCACAGCGTACCAGAATTGGAATTGAATCTCTGCTTGAAGAGATTGACGGATAAAGGGGTTTGTGCCCAAATTGCTTTGAATTCTGGTGAAAAGGTCGCCACTATTCTGGCTTTGGATTTGCTACTGGTTGGTGTGGATACAACAGCGGCAGCTGCGGCAAGTACTATGTACTTATTAGCGACTAATCCCAGGGCTCAAAGGAGATTACAAACTGAACTAGACATAAACATGCCGACTGATAGATCAATGAATCACAGGGATTTAAATAATCTACCATATTTAAAAGCTTGTATAAAGGAGGCTTTGCGTATAAAGCCTGTTATTCTTGGAAATGGACGCTGCATACAATCAGATGCTGTTATAGCTGGATATGAAGTTCCAAAAGGGTCCCATATAGTCTTCCCCCACTACGTCATGTCGAATGAGGAACGATATTTTCCGTCACCAAACGAATATATTCCCGAGCGTTGGTTACGAGATGACACCAATAAGGCAGGAACAGTTATACCAAATATTTCCAATGAGAAACACATAGAGGCAGCTAGATCGGTCTGTGAACACGCTGGAGTCGCATCTGTGGTGAAGAAACAAAGGGATATTGGGATACACCCGTTTGCTTCATTACCATTCGGTTTTGGAAGACGTATGTGTATCGGGAAGAGGTTCGCTGAAGCCGAACTACAGCTTCTAATCGCCAGGGCGTTCCAGAAGTATAATGTGTCCTGGTATCATGGTGAACTGACTTACAGTGTCACCCCCACGTATATACCGAACGAACCGCTGCGATTCAGATTGGATTCCAGGACAAAGAAATTAACATAG

Protein sequence:

>DPOGS211791-PA
MGVARKFATIPGPRPLPILGNSWRFAIGQRPWRTRSLDATLWNLRALAGMGGAAKVAKLFGHPDLIFPFCADETAKIYRREDTMPHRAVAPCLRHYKQELRKEFFGDEPGLIGVHGTAWSTFRSKVSKALAAPQAAQVAVPSLDCVSNDFVHRMESILDHNRELPCDFLTELYKWALESVGAWALGTRLGCLKDNDTDAMEIINNIHGFFHSVPELELNLCLKRLTDKGVCAQIALNSGEKVATILALDLLLVGVDTTAAAAASTMYLLATNPRAQRRLQTELDINMPTDRSMNHRDLNNLPYLKACIKEALRIKPVILGNGRCIQSDAVIAGYEVPKGSHIVFPHYVMSNEERYFPSPNEYIPERWLRDDTNKAGTVIPNISNEKHIEAARSVCEHAGVASVVKKQRDIGIHPFASLPFGFGRRMCIGKRFAEAELQLLIARAFQKYNVSWYHGELTYSVTPTYIPNEPLRFRLDSRTKKLT-