Monarch geneset OGS2.0

DPOGS201061
TranscriptDPOGS201061-TA1497 bp
ProteinDPOGS201061-PA498 aa
Genomic positionDPSCF300497 + 28139-34711
RNAseq coverage2019x (Rank: top 6%)
Annotation
HeliconiusHMEL0087705e-17257.78% 
BombyxBGIBMGA014043-TA4e-9438.23% 
DrosophilaCyp4c3-PA2e-11041.67% 
EBI UniRef50UniRef50_D5L0M21e-16253.83%Cytochrome P450 4CG1 n=2 Tax=Obtectomera RepID=D5L0M2_MANSE
NCBI RefSeqNP_001073134.11e-12648.43%cytochrome P450 CYP4M9 [Bombyx mori]
NCBI nr blastpgi|2914640794e-16253.83%cytochrome P450 4CG1 [Manduca sexta]
NCBI nr blastxgi|2914640794e-15853.83%cytochrome P450 4CG1 [Manduca sexta]
Group
Gene OntologyGO:00090552.4e-129electron carrier activity
GO:00200372.4e-129heme binding
GO:00167052.4e-129oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055062.4e-129iron ion binding
GO:00551142.4e-129oxidation-reduction process
KEGG pathwaydme:Dmel_CG14381e-108 
 K00517 (E1.14.-.-)maps-> Naphthalene and anthracene degradation
    Stilbenoid, diarylheptanoid and gingerol biosynthesis
    Limonene and pinene degradation
    gamma-Hexachlorocyclohexane degradation
InterPro domain[12-497] IPR0011282.4e-129Cytochrome P450
[295-312] IPR0024014.5e-23Cytochrome P450, E-class, group I
Orthology groupMCL10085 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201061-TA
ATGTTATTTTTGTCTTTAATATTAGCAACAATCATTTTATTAATAGTGCTGCACGATTACATAAGTAGATCAAAAAGACTTATTAGAAAGATTCCTGGACCTAAGAGTTACCCTATTATTGGAAACACTATTCCTTATATTTTGTCACCTGAAAATCTTTTTCATTATCTCCGAACATTACACAACACCTATGGAGAGCTTAATCAAGTGCACGCTTTGTCCGTGCAAGCGGTAAATGTTTTCAGCCCTGAAGATATGGAGGTCATACTGTCCTCGACCAAACATAACGATAAACAGTTACCCTATACTTTTCTTAAGCCATGGCTAGGGGAGGGACTTCTAACGAGCAATGGCCTGAAGTGGCACCAAAGAAGAAAGCTATTAACGAAAGCATTCCATTTTAATATATTAAAAAAATATTCCGCGACCTTTACTGAACAAACACAAGAATTTATCAAAAAGGTACATGAAGAGACGAAAAAATCTAAAACTGATGTCTTGCCATTGATATGTTCGGCCACATTACATATTATGTGTGAAACCGCTATACCCGCAACAAGGAATGAGGGTATTCAAACAATAACTCAAAAATATTTTAAATCCATACATACGGTCGGCGAAGCTGTTGTTGAAAGAATGTGCAGAGTGTGGCTTTATTTTGATCCTTTCTTTAAACTGACAAAAACTGCAAAAGAACAAGAAACAGCGTTAAAGGAATTGCATACGTTCACCAATAAAATAATAGCCGACAGGAAAGAATTTGTAAAAAATTTTGATGTCAGTAAGTATATTGATAGCGATGAGTATGATAATTCAAAGGGGAAATTGACGATGTTAGATCTTCTTCTCGAAAATGAAAAAACTGGAAATATAGATTTGGAAAGCATAAGGGAAGAAGTGGACACGTTTATGTTTGAGGGCCACGACACTACAGCCATGGCGTTGTCCTACTTTATTATGGCAATAGCGAATGAACCAGCAATTCAACGGAAAATATATGAAGAAATGGAGCAAATATTTGGTGATTCTAAACGTTTAGCAACTATGGCCGATTTACATGAGATGAGATATTTGGAATGCTGTATAAAGGAATCACTACGACTGTATCCTAGTGTGCCATTCATAGCTCGAAACTTGACTCAGGAGACTGTATTAAGTGGATATACAGTCCCAGCAAATACTTTTGTGCATTTATTTATATACGATTTACATAGACGTCCCGATCTCTTCCCTGATCCTGAGAGATTTATTCCGGAAAGATTCTTGCCACAGAACTGTTTGAACAGGCATCCATACGCATACATCCCTTTTAGTGCTGGTTCCAGAAATTGTATAGGACAAAAGTTTGCGATGCTCGAAATGAAAACGGTTTTATCAAGTTTGATAAGACAATTCCACATAGAGCCTGTGACAAAACCTTCAGAACTTCGATTCAGGACAGACCTGGTGCTGCGCACAACCCATCCTATTTATGTGAAGTTTAAAAACAGGGAATAG

Protein sequence:

>DPOGS201061-PA
MLFLSLILATIILLIVLHDYISRSKRLIRKIPGPKSYPIIGNTIPYILSPENLFHYLRTLHNTYGELNQVHALSVQAVNVFSPEDMEVILSSTKHNDKQLPYTFLKPWLGEGLLTSNGLKWHQRRKLLTKAFHFNILKKYSATFTEQTQEFIKKVHEETKKSKTDVLPLICSATLHIMCETAIPATRNEGIQTITQKYFKSIHTVGEAVVERMCRVWLYFDPFFKLTKTAKEQETALKELHTFTNKIIADRKEFVKNFDVSKYIDSDEYDNSKGKLTMLDLLLENEKTGNIDLESIREEVDTFMFEGHDTTAMALSYFIMAIANEPAIQRKIYEEMEQIFGDSKRLATMADLHEMRYLECCIKESLRLYPSVPFIARNLTQETVLSGYTVPANTFVHLFIYDLHRRPDLFPDPERFIPERFLPQNCLNRHPYAYIPFSAGSRNCIGQKFAMLEMKTVLSSLIRQFHIEPVTKPSELRFRTDLVLRTTHPIYVKFKNRE-