Monarch geneset OGS2.0

DPOGS206995
TranscriptDPOGS206995-TA2832 bp
ProteinDPOGS206995-PA943 aa
Genomic positionDPSCF300001 + 870382-881620
RNAseq coverage520x (Rank: top 24%)
Annotation
HeliconiusHMEL0021153e-17156.74% 
BombyxBGIBMGA009949-TA1e-6141.96% 
DrosophilaCyp4c3-PA7e-5827.77% 
EBI UniRef50UniRef50_D2A4W93e-9226.35%Putative uncharacterized protein GLEAN_15293 n=2 Tax=Tribolium castaneum RepID=D2A4W9_TRICA
NCBI RefSeqXP_001602395.15e-7432.53%PREDICTED: similar to cytochrome P450 [Nasonia vitripennis]
NCBI nr blastpgi|3330371732e-11542.49%cytochrome P450 [Bombyx mori]
NCBI nr blastxgi|3330371738e-11442.49%cytochrome P450 [Bombyx mori]
Group
Gene OntologyGO:00090553.6e-104electron carrier activity
GO:00200373.6e-104heme binding
GO:00167053.6e-104oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055063.6e-104iron ion binding
GO:00551143.6e-104oxidation-reduction process
KEGG pathwaydme:Dmel_CG14385e-56 
 K00517 (E1.14.-.-)maps-> Naphthalene and anthracene degradation
    Stilbenoid, diarylheptanoid and gingerol biosynthesis
    Limonene and pinene degradation
    gamma-Hexachlorocyclohexane degradation
InterPro domain[451-940] IPR0011283.6e-104Cytochrome P450
[35-54] IPR0024011.7e-18Cytochrome P450, E-class, group I
Orthology groupMCL16292 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206995-TA
ATGGCAGATAAAATACCAGGACCACCATCACTGCCCATATTAGGAAATGCTCTCAAATTTATGGTCCATAACAAAGAGCTCAAAATATTAATTAAGAACCTTATGAACCAGTATGGCGAAGTTGTTAAATTTTGGCTTGGAATGGATCTTAATATATTAGTCAGTAATCCCGATGACCTCAAGCTTCTCCTAACCAACAATAAAACCAGCGTAAAAGGTGAACAGTACAAATATTTTAAGAACTACATCGGCGCTGGGCTCTTGAGTGGGTCAGGTCCAGAATGGAGGAAACATCGCAAAATAGCAACACCAAACTTTGGCAAACAGGCAACCGGAAGTTATGTTCATGTATTTAACATGGAAGCAGATTTATTATTAGAAAATTTAAAAGAAACTAGTTCTGGAGATCAGATAGACGTTTACCAGTATATTGTTATGAGCACCAGTTATGCTGTCTGTCGCAAGTATTTTTATGTACAACAAACCTTGATGGGACTTACAAGGAAGGAGACTTTGGGATTACCACACCTTCAATATGTTATTGAGGAAAGTCCCCTAGTATATACGATAATATTCGAAAGAATGACGAAATGGTTTATGCAAATAGATCCAGTTTATTGGTTGACCAAATCATACCAAGTAGAAAAAGAATTCGTCTCTCGTACGAATGAACTAAGTGGTGCTATATTAAAATTACGTATTAACAAATTGACAGACATAGATGAGAACAAATTAAAATTACTGGACACCGAAAAGGATTCCTTCAATAACACAGAATTGAGTGTTGTGGACAGATTCATACTATCAAGAGAACTTGATGCTACGGATCTGAAAACTGAAATCTTTACTATCTTTACAGCTAGTCAAGAAGCAACAGCAAAAGTAGCGGCTTCGTTACTATTGATGTTGGCGTTTCATCCCGAATGTCAGAAAAAAGTTTACGCTGAAATTACAAGCGTGATAAGAAATAAAAATAAACATATAACTGAAGAGGATCTGAAGCAGATGCCGTATTTGGAAATGATTTTCAAAGAGGTTCTTCGGCTGTTTCCAACTGGTGTCATGCTTCAAAGGAAAATTAATAAAGACATTTCTATAAGTTCTTGTACATTACCAGCTGGTTCGTCGCTCGTGATACCGTTATATCATATGCACCGAGATTCAAGATTTTGGGAAAATCCAGAGTCTTTCGATCCAGAGAGATTTTCCGCTGAAAACATGAAGAAGCGCAATGCTTATTGCTATTTTCCTTTCAGTTTAGGCCCTATGGATTGTTTAGATTATCCAGGAATATTACTTGACATCCACATACAAGTACAAATGCTGTTGGCTGTAGTGTTGTTAGCATTGGTTGTTCTGGGTGTTTGGGTTCACTGGCGGTACAAAAACCGAAGGTTGTTGGAAATGTCTACGAAAATACCAGGACCACCTACTATTCCAATCTTAGGCAACGCTTTTTATTTCATGTGCCGCCCAGAAGAAATGATAAAGATTACTAAACAGTTGATCGACGAGTACGGCCTCGTTCTCAGATTCTGGCTTGGGACTGATTTAAATATTGTTGTTAGCAATCCAGATGACATAAAGGTTTTACTAACTAACAACAAGGTCAGCGTAAAAGGTCCTCAATATAAGTACATGTCAGATTTAATCGGAGTTGGAATTTTAAGTGGATCAGGGTCAAAATGGAGAAAGCATAGAAAATTGGTTACTCCAAACTATGGAAAACGAGCTGTCGAAAGTTATAATGAAGTTTGCTACAGGGAAACTAAAATACTTATTGATAATTTAAATAAAGTACACCAAGCTGGAGTACTAGATATATACAAACATATAGTTAAAACAACCAGCTACATTGTATGCCAATCGTTGATTGGTTTAACTAGAGAAGAAACCCTAAGGATTCCTTGTTTGCAAAATGTTATAGATGAAAGTCCGAGGTTATATGACTTCATATTTGATAGAATGACAAAATGGTATCTACAAATAGACCCCGTTTATTGGCTAACGGAATCATATAAAATACAAAAAAAAATCATGAGAGACATAAGTGAGATGAACATGTTTATAATAAACAACAGAAAAGAAGCATTGACTGACATAAATGAAGATTATTTAGAATTATTAAATTCAGAACAGGATTCCGTGAAAAATACAAAGCTATCGGTCATTGATAGATTGATACTGTCCCAAGAATTAAATCACAAGGAGCTGATAGAAGAAACGTTCACAATATTTACATCTAGTCAAGAGGCAACTGCAAAGATAACGTCTTTTTTGCTGCTCATGATGGCATATCATCCGAAATGTCAGGACGAATTGTATTCGGAAATATTAAACGTGATAGGTAATAATTACGGGCCAATAACTGATGACTATCTAAAGCATATGCCCTACTTGGAAAAGTGTGTCAAGGAAGTGTTAAGGCTATACCCTATTGGTGTCATGTTACAAAGGACTGTCAAGGAAGACGTAGAAATAAGTACATGCACACTGCCAGCAGGTTCCTCCCTTGTTGTTCCCATTTTCAATTTACATCGTGACCCAAGATTTTGGGAAGACCCTGAAGCTTTTGATCCAGAAAGATTTTCAACTGAGAATATGAAAAAACGGAACCCTTTCTGTTATATCCCATTTAGTTTAGGACCAATGGATTGTCTAGGAAGATTCGTTGCAGCAAAGTTTATTAAGACCATAGCAATAATGGTGCTGCATGAATTCCGACTATCCTCCGTGAACGATTATAAAGATCTCAACGTGGTTATGGCCATATCAGCTAAATCTGCAAATGGTTACCCAGTTATTTTAACACCACGAAAACAATGCAATTGA

Protein sequence:

>DPOGS206995-PA
MADKIPGPPSLPILGNALKFMVHNKELKILIKNLMNQYGEVVKFWLGMDLNILVSNPDDLKLLLTNNKTSVKGEQYKYFKNYIGAGLLSGSGPEWRKHRKIATPNFGKQATGSYVHVFNMEADLLLENLKETSSGDQIDVYQYIVMSTSYAVCRKYFYVQQTLMGLTRKETLGLPHLQYVIEESPLVYTIIFERMTKWFMQIDPVYWLTKSYQVEKEFVSRTNELSGAILKLRINKLTDIDENKLKLLDTEKDSFNNTELSVVDRFILSRELDATDLKTEIFTIFTASQEATAKVAASLLLMLAFHPECQKKVYAEITSVIRNKNKHITEEDLKQMPYLEMIFKEVLRLFPTGVMLQRKINKDISISSCTLPAGSSLVIPLYHMHRDSRFWENPESFDPERFSAENMKKRNAYCYFPFSLGPMDCLDYPGILLDIHIQVQMLLAVVLLALVVLGVWVHWRYKNRRLLEMSTKIPGPPTIPILGNAFYFMCRPEEMIKITKQLIDEYGLVLRFWLGTDLNIVVSNPDDIKVLLTNNKVSVKGPQYKYMSDLIGVGILSGSGSKWRKHRKLVTPNYGKRAVESYNEVCYRETKILIDNLNKVHQAGVLDIYKHIVKTTSYIVCQSLIGLTREETLRIPCLQNVIDESPRLYDFIFDRMTKWYLQIDPVYWLTESYKIQKKIMRDISEMNMFIINNRKEALTDINEDYLELLNSEQDSVKNTKLSVIDRLILSQELNHKELIEETFTIFTSSQEATAKITSFLLLMMAYHPKCQDELYSEILNVIGNNYGPITDDYLKHMPYLEKCVKEVLRLYPIGVMLQRTVKEDVEISTCTLPAGSSLVVPIFNLHRDPRFWEDPEAFDPERFSTENMKKRNPFCYIPFSLGPMDCLGRFVAAKFIKTIAIMVLHEFRLSSVNDYKDLNVVMAISAKSANGYPVILTPRKQCN-