Monarch geneset OGS2.0

DPOGS213243
TranscriptDPOGS213243-TA1374 bp
ProteinDPOGS213243-PA457 aa
Genomic positionDPSCF300124 - 323070-332918
RNAseq coverage453x (Rank: top 27%)
Annotation
HeliconiusHMEL0030580.094.31% 
BombyxBGIBMGA009522-TA0.091.09% 
DrosophilaCyp301a1-PA0.068.11% 
EBI UniRef50UniRef50_Q9V6D60.068.11%Probable cytochrome P450 301a1, mitochondrial n=32 Tax=Neoptera RepID=CP301_DROME
NCBI RefSeqXP_001605672.10.068.82%PREDICTED: similar to GA21183-PA [Nasonia vitripennis]
NCBI nr blastpgi|3838557360.070.43%PREDICTED: probable cytochrome P450 301a1, mitochondrial-like [Megachile rotundata]
NCBI nr blastxgi|3838557360.070.43%PREDICTED: probable cytochrome P450 301a1, mitochondrial-like [Megachile rotundata]
Group
Gene OntologyGO:00090551.4e-85electron carrier activity
GO:00200371.4e-85heme binding
GO:00167051.4e-85oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055061.4e-85iron ion binding
GO:00551141.4e-85oxidation-reduction process
KEGG pathwaydme:Dmel_CG60426e-66 
 K00517 (E1.14.-.-)maps-> Naphthalene and anthracene degradation
    Stilbenoid, diarylheptanoid and gingerol biosynthesis
    Limonene and pinene degradation
    gamma-Hexachlorocyclohexane degradation
InterPro domain[18-457] IPR0011281.4e-85Cytochrome P450
[258-275] IPR0024015.1e-10Cytochrome P450, E-class, group I
Orthology groupMCL15916 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213243-TA
ATGGTACCGGTGATCGGACAGTACGATATATCGGAATTCGCGAAAGTAACTAAGCTGTTTTTGGAGAAATACGGTAGGATTGTACGTTTGGGTGGATTGATAGGAAGACCAGATCTTCTGTTTGTGTATGATGCTGATGAAATCGAGAGGATGTATAGAAGAGAAGGTCCTACTCCATTCAGGCCTGCTATGCCGTGTCTCGTTAAATATAAGTCAGAAGTGAGAAAGGATTTCTTCGGGGAGTTACCTGGTGTTGTTGGAGTTCACGGCGAACAATGGCGGCGATTCCGCTCTAAGGTTCAACGACCTATACTCCAGCCCCAAACAGTGAAGAAGTATGTGACACCCATCGAGATGGTAACTGAGGATTTTATAAAATATATGGAGAAAGCGAGAGACGATAACAAAGATCTGCCGCATGAGTTTGACAATGACATACATCGGTGGTCCTTGGAATGTATCGGACGCGTAGCGTTGGACGTGCGACTTGGTTGCCTGTCACCGGATGCATCCAGCGAGGAACCGCAGCGTATTATAGATGCAGCCAAGTTCGCGTTGCGCAATGTTGCTGTACTGGAATTAAAGGCGCCCTACTGGAGATACATCCCGACCCCGCTGTGGACCAAATATGTCAATAACATGAACTTCTTCGTTGAATTATGCTCAAAATATATTAACGAAGCTCTAGAGCGTCTGAAGAGCAAGCAGGTGACATCTGAGAACGATCTGTCATTATTGGAGCGAGTGTTGCAAAGCGAAGGGGACCCCAAGATTGCCACAATAATGGCACTTGACTTAATTCTTGTCGGCATTGACACGATCTCAATGGCGGTATGTTCAATATTATACCAAGCGGCGACGAGATTGAAACAACAAGAGAAGATGGCTGAAGAAATAAGAAGAGTGTTGCCTGATCCGGATAAACCTCTCACTTACTCTGACTTGGATAAACTACATTACACCAAGGCTTTCGTTAGAGAAGTATTTAGAATGTACTCGACTGTTATCGGCAATGGAAGAACATTGCAAGAAGATGACGTCATATGTGGATATCACATTCCCAAGGGGGTACAAGTTGTGTTTCCAACGATCGTGACTGGCAATATGGAGCAATTCGTTTCCAACCCTCAGGAGTTCAGACCTGAACGTTGGTTGGAGGCCGATGGTCGATTGCATTCATTCGCTTCACTGCCTTACGGCTTCGGAGCCAGGATATGTTTGGGCAGACGGTTCGCTGATTTGGAGATACAGGTCCTTTTGGCTAAGTTGCTTCGCCGTTACCGTCTTGAGTACCACCACGAGCCGTTAGAATACGCCGTGACCTTCATGTACGCGCCCGACGGACCCTTAAGACTGAGGATGATCGAACGATAG

Protein sequence:

>DPOGS213243-PA
MVPVIGQYDISEFAKVTKLFLEKYGRIVRLGGLIGRPDLLFVYDADEIERMYRREGPTPFRPAMPCLVKYKSEVRKDFFGELPGVVGVHGEQWRRFRSKVQRPILQPQTVKKYVTPIEMVTEDFIKYMEKARDDNKDLPHEFDNDIHRWSLECIGRVALDVRLGCLSPDASSEEPQRIIDAAKFALRNVAVLELKAPYWRYIPTPLWTKYVNNMNFFVELCSKYINEALERLKSKQVTSENDLSLLERVLQSEGDPKIATIMALDLILVGIDTISMAVCSILYQAATRLKQQEKMAEEIRRVLPDPDKPLTYSDLDKLHYTKAFVREVFRMYSTVIGNGRTLQEDDVICGYHIPKGVQVVFPTIVTGNMEQFVSNPQEFRPERWLEADGRLHSFASLPYGFGARICLGRRFADLEIQVLLAKLLRRYRLEYHHEPLEYAVTFMYAPDGPLRLRMIER-