Monarch geneset OGS2.0

DPOGS202039
TranscriptDPOGS202039-TA1533 bp
ProteinDPOGS202039-PA510 aa
Genomic positionDPSCF300053 - 71549-74892
RNAseq coverage18x (Rank: top 80%)
Annotation
HeliconiusHMEL0117211e-14951.14% 
BombyxBGIBMGA001276-TA1e-14650.87% 
DrosophilaCyp12a5-PA2e-7734.14% 
EBI UniRef50UniRef50_D5L0N62e-15854.03%Cytochrome P450 333B10 n=3 Tax=Obtectomera RepID=D5L0N6_MANSE
NCBI RefSeqXP_001604810.12e-8935.53%PREDICTED: similar to cytochrome P450 [Nasonia vitripennis]
NCBI nr blastpgi|2914641098e-16655.60%cytochrome P450 333B11 [Manduca sexta]
NCBI nr blastxgi|2914641099e-16255.60%cytochrome P450 333B11 [Manduca sexta]
Group
Gene OntologyGO:00090551.5e-82electron carrier activity
GO:00200371.5e-82heme binding
GO:00167051.5e-82oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055061.5e-82iron ion binding
GO:00551141.5e-82oxidation-reduction process
KEGG pathwaydme:Dmel_CG60421e-74 
 K00517 (E1.14.-.-)maps-> Naphthalene and anthracene degradation
    Stilbenoid, diarylheptanoid and gingerol biosynthesis
    Limonene and pinene degradation
    gamma-Hexachlorocyclohexane degradation
InterPro domain[39-507] IPR0011281.5e-82Cytochrome P450
[316-333] IPR0024011.4e-13Cytochrome P450, E-class, group I
Orthology groupMCL10325 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202039-TA
ATGATCGCAATAAAGTATAACGGACCCATTATGAATAAATTATATATGAAGAATTTTATTTTTGCAACTGAAATAAATATAAGATTGGCTCATAGCTCATCGTGCAGAAAAATAAAATCATGGAAAGAAATACCAGGGCCGTCTTCATTGCCTATCATAGGACAGCTTCACCATTATTTTCCCGGCGGCTCGCTTTACCAATGCAATGGATTTGAATTTCAAGAAAAGTTATACAAAAACTACGGTCCTCTAGTAAGATTGAATACCGTATATGCTGGGAAACCCGCAATTTTTGTTTTTGATCCAGATAGCATGGCGCAGGTATTGCGTGGCGAAAATTGGCTGCCGATTCGTCCTGGTTTTGATGTATTGTATCACTATAGGAACTTTTATAACCAACCAAAAGGCGGTGCACCGGGACTCACTGGTTTAATAAGCGACCATGGTCAGAAATGGAAGCAACTGCGCTCTTTAGTTAACCCTATCATAATGCACCCAGATAACATTAAACTGTACGATACGCCCATAGGTGAAGTTGCTCAGGATGTAGTCCAAAGAATAAAGGATTTAAGGGATGAAGACGGAATGATTACAAAAAACTTCGATTATTTAATGTACCTTTGGGCTCTGGAATCTGTCGGTGTTGTGGCCTTAGGAAGTCGTTTGAATACCTTCAATGAAAACTTGGAATCGGATTCAGTTGTACGACGTCTGATAACACTTATTCATGAGTTTTTTGCAATATCTGAAAACTTAGATATCAAGCCAAGTCTATGGAGGTATTATCCAACTCCCGCATTTAAACGTGCTATGAAAGTATTTTGTGATATAGATAGTATTACAAGAAGTTTAGTACTGAAAGCAAAAGATGAATTAAGCCAAAGGGGTCATAGTGCCGATGATAAAAAGGGCGTCCTGGAAAAACTACTCGAAGTGGATGAAAAAATTGCCCTCATTATGGCCGGCGATTTACTGTTTACTGGCGTTGATACGGTTGGAAATACTATGAGTTGCACGTTGTACCTTCTTGCAAGCCATCCTGAAAAACAGAATACACTAAGACAGGAAGTTAATTCTGGAGACGAAAGGAAGTCTTATCTAAAGGCCTGCATAAAGGAGTCTTTAAGAGTAATGCCAGTTGCTGGTGGAAACATCAGACAGTGTACAAAGGAGTACAACCTTTTAGGATACGAAATACCGAAAGATATGTTTGTAGTATTTCCTCACCAGTACCTTTCGAAGATGGAAAGTCAGTATCCCAGAGCTAATGAATTTATTCCTGAAAGATGGTTGGTTGACAAGGATCACGCTCTGTATCACGGAAATGCACATCCGTTTGCATACAACCCTTTCGGATTTGGGGCAAGAATTTGTATAGGTCGTCGTATAGCGGAGTTAGAGTTAGAAAGCTTACTTTCAAAAATTATACAAAACTTCGAACTCGAATGGAGAGGTCCTCCACCGACCATGTACCAAAGTGCAATGAACTATTTCAAAGGACCCTTCAACTTTGTTTTTAAAGATATTAAATAA

Protein sequence:

>DPOGS202039-PA
MIAIKYNGPIMNKLYMKNFIFATEINIRLAHSSSCRKIKSWKEIPGPSSLPIIGQLHHYFPGGSLYQCNGFEFQEKLYKNYGPLVRLNTVYAGKPAIFVFDPDSMAQVLRGENWLPIRPGFDVLYHYRNFYNQPKGGAPGLTGLISDHGQKWKQLRSLVNPIIMHPDNIKLYDTPIGEVAQDVVQRIKDLRDEDGMITKNFDYLMYLWALESVGVVALGSRLNTFNENLESDSVVRRLITLIHEFFAISENLDIKPSLWRYYPTPAFKRAMKVFCDIDSITRSLVLKAKDELSQRGHSADDKKGVLEKLLEVDEKIALIMAGDLLFTGVDTVGNTMSCTLYLLASHPEKQNTLRQEVNSGDERKSYLKACIKESLRVMPVAGGNIRQCTKEYNLLGYEIPKDMFVVFPHQYLSKMESQYPRANEFIPERWLVDKDHALYHGNAHPFAYNPFGFGARICIGRRIAELELESLLSKIIQNFELEWRGPPPTMYQSAMNYFKGPFNFVFKDIK-