Monarch geneset OGS2.0

DPOGS206339
Transcript: DPOGS206339-TA (1500 bp)
Protein: DPOGS206339-PA (499 aa)
Genomic position: DPSCF300082 + 410221-422050
RNAseq coverage: 1534x (Rank: top 8%)
Annotation
  Heliconius: HMEL004608 | 1e-137 | 74.03%
  Bombyx: BGIBMGA005266-TA | 4e-75 | 54.84%
  Drosophila: Cyp49a1-PD | 8e-18 | 22.46%
  EBI UniRef50: UniRef50_D2JLK5 | 5e-35 | 24.55% | Cytochrome P450 CYP333B8 n=1 Tax=Zygaena filipendulae RepID=D2JLK5_9NEOP
  NCBI RefSeq: XP_001604810.1 | 2e-22 | 23.48% | PREDICTED: similar to cytochrome P450 [Nasonia vitripennis]
  NCBI nr blastp: gi|291464109 | 1e-34 | 24.12% | cytochrome P450 333B11 [Manduca sexta]
  NCBI nr blastx: gi|291464109 | 4e-32 | 23.52% | cytochrome P450 333B11 [Manduca sexta]
Group
  Gene Ontology:
    GO:0009055 | 4.2e-21 | electron carrier activity
    GO:0020037 | 4.2e-21 | heme binding
    GO:0016705 | 4.2e-21 | oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
    GO:0005506 | 4.2e-21 | iron ion binding
    GO:0055114 | 4.2e-21 | oxidation-reduction process
  KEGG pathway:
  InterPro domain: [46-495] IPR001128 | 4.2e-21 | Cytochrome P450
  Orthology group: MCL22093 | Lepidoptera specific

Nucleotide sequence:

>DPOGS206339-TA
ATGTCTAAAACTGTGCTCGCTAGACAATGCTTTTTCTCTCGTCCGAGACGACAGTTTTCAACGTCGTCTGTTAAAAGGACGTCGACATCGCCACAGCGTTACAACATGTCTGAACCTGCTGTGAGATCTGGAAATGTCAAAAAGTTCTCAGACATACCAGGACCTTTGGCTCTTCCCATCATGAGACATCACGCCCACGTTCTGCCGAGGATCGGGTGTTTCCATCACACCGTTGGCCTGGGTCTGCTGGAAGGTCTGCAACAGCGCTACGGGGACCTCGTCCGTTTCGCCAAGGCAACTCGCTCTCGGCCCGTACTATACGTCTTCGATCCCGAACTCATGAAAGAGGTGTATGATAGTAACATGACGGTATCGCCACAGTGGAGTCAGTCACCTCTCAACGAACACAGGAAGAACGCTGACACGAAATGCCCGATGCAGAGCGACGAAAGTAAAGCTGTATGGACGGCGCTGCGGACGCTGTTACAGGATGACTCGTTCCTTAGAAACTTTGATAAGGCCTTCGATGACATAGCTGTCGACGTTACACGGAGACTGGAAGGTCTGAGACACGCAGGGAACGCGCTCAACGAAGAACTACAAACAGAAATTTACCGCTGGGCGATAGAGATCATCGGCGTCTTGATTTTTGGGATGCGGTTAGGATGTCTGGATGGAAATGTTCATGTGCCCACAGAAGAGAACAGGAATCCAGAGAAGACTTCAATGGACGATCACATCAAAGATCTCTGTTCGCTTACAAAAAAAAGCGAACGAGATCTGTCTCCGGCCGAGAGGTTCGTGCGGTGCTCTCTAGACGTCGCCAACGAAAACTATTTGGTGCGAAGCGAAGATACTCTGAGACAGGAGTCGGAGACGTTCAACGAGGCCTTGAAGACCTTCGACAAACATTACAGCCTCACAGACCACTTTCTGGTTCGAGCGTTAGAGAACCTCAATAATGAGGAACTCAAAGCCGAACAAGTGCTTTTAAACAAACTGCGTCCGTTGGAGAGGCGAATATTGCCACTCGCTGCAGACGCGTTCCTGGCTGGCGTTGATCCAGTAATTGGCGCAGACGGCGATCAGCATGCTGTACGAGCTGTCGCTGCGGCCACGGGGGGGGTCGTGAGGCGCAGTGTCGGCGAGCTGAGCGTCGCCGGGTACGAGGTGCCCGAAGGGGTGGACATAGTGCTCGCTCATGGGGTTTCAAGTAAATTAGAGAAGGAATGGGGGAGAGCGAAGTCGTTTATACCGGAGCGCTGGCACAGCGACGGCTGGCGGCCGCTGAACGCGTCCCGGGCTCATCGCTGCGCCTCCATGCCGTTCGGCCAAACCTGCCCGGCGGCTGGAGTCGTGAACAAGATGCTCTCCAGCCTCGCGATACGGGTCCTGGACACATACAGGATCGAGTGGCACGGAACCGCGCCGCGCGTCACCGCGGCCGGCGTCAACAAGATACAGCGGCCATACTACTTCGTGCTGCAGAACGCCGGGTGA

Protein sequence:

>DPOGS206339-PA
MSKTVLARQCFFSRPRRQFSTSSVKRTSTSPQRYNMSEPAVRSGNVKKFSDIPGPLALPIMRHHAHVLPRIGCFHHTVGLGLLEGLQQRYGDLVRFAKATRSRPVLYVFDPELMKEVYDSNMTVSPQWSQSPLNEHRKNADTKCPMQSDESKAVWTALRTLLQDDSFLRNFDKAFDDIAVDVTRRLEGLRHAGNALNEELQTEIYRWAIEIIGVLIFGMRLGCLDGNVHVPTEENRNPEKTSMDDHIKDLCSLTKKSERDLSPAERFVRCSLDVANENYLVRSEDTLRQESETFNEALKTFDKHYSLTDHFLVRALENLNNEELKAEQVLLNKLRPLERRILPLAADAFLAGVDPVIGADGDQHAVRAVAAATGGVVRRSVGELSVAGYEVPEGVDIVLAHGVSSKLEKEWGRAKSFIPERWHSDGWRPLNASRAHRCASMPFGQTCPAAGVVNKMLSSLAIRVLDTYRIEWHGTAPRVTAAGVNKIQRPYYFVLQNAG-
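
The transcript and protein records above are related by translation: 499 codons plus one stop codon account for the 1500 bp transcript length. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1) applies to this gene. The function name `translate` and the `stop_symbol` parameter are hypothetical helpers introduced for illustration, and only the first ten codons and the last five codons of DPOGS206339-TA are used so that the example stays short. The stop codon is written as "-" to follow the convention of the protein record.

```python
from itertools import product

# Assumption: standard genetic code (NCBI translation table 1).
# Amino acids are listed for codons enumerated with bases in T, C, A, G order
# at each of the three positions; "*" marks a stop codon.
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons are emitted as `stop_symbol`. A codon containing anything
    other than A, C, G, or T raises KeyError.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First ten codons and last five codons of DPOGS206339-TA, copied from the record.
first_codons = "ATGTCTAAAACTGTGCTCGCTAGACAATGC"
last_codons = "CAGAACGCCGGGTGA"

print(translate(first_codons))  # MSKTVLARQC
print(translate(last_codons))   # QNAG-
```

Both outputs agree with the start (MSKTVLARQC) and end (QNAG-) of DPOGS206339-PA. Running the same function over the full nucleotide sequence would extend the comparison to the whole protein.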