Monarch geneset OGS2.0

DPOGS202397
TranscriptDPOGS202397-TA3930 bp
ProteinDPOGS202397-PA1309 aa
Genomic positionDPSCF300233 - 235146-244019
RNAseq coverage71x (Rank: top 66%)
Annotation
HeliconiusHMEL0044472e-16762.33% 
BombyxBGIBMGA003293-TA2e-9641.40% 
DrosophilaCyp6g1-PA4e-6634.47% 
EBI UniRef50UniRef50_D6X4T28e-16128.69%Putative uncharacterized protein n=4 Tax=Endopterygota RepID=D6X4T2_TRICA
NCBI RefSeqNP_001104827.13e-8238.07%cytochrome P450, family 337, subfamily a, polypeptide 1 [Bombyx mori]
NCBI nr blastpgi|2700012403e-16028.69%hypothetical protein TcasGA2_TC016235 [Tribolium castaneum]
NCBI nr blastxgi|2700012403e-15928.72%hypothetical protein TcasGA2_TC016235 [Tribolium castaneum]
Group
Gene OntologyGO:00090551.6e-96electron carrier activity
GO:00200371.6e-96heme binding
GO:00167051.6e-96oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055061.6e-96iron ion binding
GO:00551141.6e-96oxidation-reduction process
GO:00044971.2e-16monooxygenase activity
KEGG pathwaynvi:1001140232e-72 
 K07424 (CYP3A)maps-> Drug metabolism - cytochrome P450
    Drug metabolism - other enzymes
    Linoleic acid metabolism
    Steroid hormone biosynthesis
    Metabolism of xenobiotics by cytochrome P450
    gamma-Hexachlorocyclohexane degradation
    Retinol metabolism
InterPro domain[843-1309] IPR0011281.6e-96Cytochrome P450
[220-236] IPR0024031.2e-16Cytochrome P450, E-class, group IV
Orthology groupMCL34444 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202397-TA
ATGACACCTCTATTTACTGCATCGAAACTGAAAAGCATGTTCTATATTATCGATAAGAGTGCACATGATTTTTTAAAGTACTTAAGGGAAAACAAAGAAATTCAAAAAGGTGACGCGTTTAAGACTTTGTCTACATTTTGTAGTGCTGCCATCTGTGCTACAGTGTTTGGTGTAAATACAGAATCAATATTCGATTCACCATTTTTAAAGATTGCACGGGGAAGTTTCCGTTCAACTCTTATAACTAACATACAATTCCTTATGTCACATTTCGCCGAAAAATTCTGTCAAATATTAAATATAAAAGCTTTTAAGCAATTTGAAGACTTTTTTGTAGGAGCCATAAAGCAAGTTATTCGTCAGCGTGAGATAGAAAATATTAGACGACATGATTTTGCTGATATTTGCATAGCTTTAAAGAACAGTGGGAAACTTATTGACCCTGATACTGGTCTGGAACTAGAGACAACGGATGAGTTACTCGCTGCTCAAGCATTTTTCTTTTTTGTCGCTGGAGTAGAACCGATAGCTACAGCAATGTTTGCAACTTTTGTGGAATTGGGTAAAAACTCGGAAATACAAAAACGTGTTCAAGACGAAGTGGACGGCGTTTTCGAAGAACATAACGAATCATTGTCTTATGATATCATTTCTGAAATGGTTTATCTCGATATGGTCATCAAAGAAGCTATGAGATTGCATCCCGCTATTGGCTTTGTAACTAGGAAATGTGTTCAGAACACTATTTTACCAGTTGGTAACATAAAAGTAGACAAAGGGACAAAAATTTTCGTGCCGATATATGAATTACATCATGACCCGAAGTATTTTCCCGAACCAGAGTTGTTTAAACCAGATCGATTTTCTAGTGAAAATAAACATAACATCTTAGACATCACATATTTGCCATTCGGTAAAGGTAAAAGAATTTGTATTGGTATGAGATATGCTCATATGCAAGTCAAAACTGGTTTAGTTCATATCTTACGACATTTCAGTATTAAGACAAATATAAACAGAGATGGCATTAAGTATATGAAACATCAAATTCAAGTGAGGTTGGAAAATGTAGATGTTGACAAAAATAAGATAACAGGCCCCTTTTGGGAAGTTTTCACTAGTGGTGAACCGTTTTACCTACATTTGCAAAACATCTATAACCAATATCCTGATCAAGCTGCCGTGGGAATCGGAGGATTTCTCACTCCTAGTCTGTATGTAAGAGATCCTAAGAATGTGCAAGCCATACTGTCATCAGATTTTAACTCTTTCTACCATCGTGGTTTTGAAGTAAATGAAAATGATAAATTGGCTAACAACATTTTGTTTCTAAACGGAAGTAAATGGAAACTTATGCGACAGAGCATGACACCTCTGTTTACCTCACTTAAACTGAAAAATATGTTTTATATTATGGATAAAAGTGCCCAAGACTTCGTTCAATATATAAAAGAAAGTCCTGAAATAAGGAAAGGCAACACATTTCAAACATTGTCGACGTTTTGCAGTGCAGCCATCGGCGCCTCAGTTTTCGGTCTCACTACTGAATCGATTTTCGATTCACCTTTTTTAAAAATTGCACAAAAAATTTTCGAACCAACGTTGAAATCAAATTTACGATTCGCTATTTCCAATTTGTCCCAAACATTATTTCATCTATTTAAAATACAATTTTTTAAAGAATTCGAAGATTTCTTTATTGATGCCATCAAACGAGTGGTACGTCAACGTAAAGAAGAAAATGTGAAAAAGCATGATTTTGCTGATATTTGTGTGGCCTTACAAAAAAACGGGATGCTTAAGGATCCCGAAACAGGATTAGAATTAGAACCTACATATGCATTACTTGCAGCACAAGCATTTTTCTTTTTTAATGCTGGTGTGGAACCCGTTGCTGCAGCAATGTTTAGTGCGTTAATAGAAATCGGTAAAAACCCAGAAATACAAAAAAAGGTCCACGAAGAAATTGATGGTACTTTTGATAAAAATAACGGAAAATTAAATTTTGATATTATTGCTGAAATGGTTTATCTAGATATGGTCATCAATGAGGCGATGAGAATGTATCCACCTATTGGATTTCTATCCAGGCAATGTGTTGAAGATACTGTACTGCCATCTGGAAATATTGCTGTGGAAAAAGGTACAAAAATATTTGTACCAATATTTGAATTTCATCATGATCCAAAGTACTTTCCTAATCCAGAAGAATTTAACCCTGAACGATTTTCACGTGAAAACAAAAAAAAGATGTCAGATATTACTTATTTACCCTTTGGTAAGGGTAATAGGATTTGTATTGGTATGCGTTATGCTCATATGCAAACAAAAACTGGTTTAGTACACCTATTAAGAAACTTTAATGTCAAGACAAATATTGGTAAAGGAGGAATAAATAATGGGAACCCACTAAGATTACAATACCAAGAGAACTTCTTGATCGTAGACAATGATGCTGTTCTGTCGGATGACCCGACTGACCCTCGGCGCTCGTTGATACCCGGGTATAACAATAACTATTGGACGAAACGCGGTGTAGTTTTCTACAAAGGAAACCTGTTGAATACAATACTTTATGATTTTTTTACTGGAAAGAAATCATTGTTCATGCACCTACATGATATCTATAAAAAGTATCCAAACGAACCGGTCGTTGCGGTTGGACTATTCTTTATACCAGCACTATACGTGAAAGACCCCATCAATATTCAGTACATATTATCATCAGAATTCAAGTCTTTCTATCACAGAGGTATTGAAACAAATGAAAGTGATATATTGGCGAAAAACACATTGTTTCAAAATGGCAAAACATGGAAGCTTTTACGAGAAAACATGACGCCCCTTTTTACTTCATCTAAATTAAAAAACATGTTTTATATAATGGATAAAAGTGCACAAGATTTCGTGAAATACTTAAAGAACGACAGGGAACTAAGAAAATATAACGCCTATCAGAAATTATCGATGTTCTGCTGTGCCGCTATCTGTGGAACAGTATTCGGTATTGGTGCTAATTCAATTTTCGATTTACCGTTTTTAAAGATCGCGCAGAAAAGTTTCAGGTCAACCCTGAAAATCAATATGCGTTTTGCGTTATCCACTTTGTCCAGAACTTTATTTAATAAATTTAAAATACACTTCTTTAAAGAATATGAAGATTTCTTTATTGGTGCCATTAAACAGGTCGTACATCAACGTGAACAAGACAATATAAAAAAACATGATTTCGTTGACACTTGGTTAGCATTACAAAAAAGTGGAAAACTTAAGGACTCGGATAATGGTTTCGAATTAGAAACCACAAATGAATTACTCGCCGCTCAAGCGTTTTCGTTTTTTATAGCCGGAGTAGAACCTACTGCCACAGCTATGTTCGCTACTTTATTTGAATTAGCTAAAAATCCAGAGATACAACAAAAAGTACACGCAGAAATAGATCGTGTTTTTGAAAACCACAGCGGTGCATTAACATATGAAATGATTTCTAAAATGGTATACTTAGACATGGCTATAAAAGAAGCTATGAGATTACATCCATCTATTGCAACATTATCAAGAAGATGTGTTCAAAATACTGTTTTACCGAATGGAAATATAGTGGTAGAGAAAGATACAAAGATATTTATACCTGTATACGAACTTCATCACGATGAAAAATACTTTCCCGATCCAGAAGCATATAAGCCTGAGAGATTATCACGTGAAAACAAACATGAAATCATAGAATTTACTTATTTGCCTTTCGGTAAAGGAAACCGAACATGTATTGGTATGCAATACGCACACATGCAAATAAAAACTGGCTTAGTTCACATGCTACGACATTTCACTGTACACACCGATATCCAACAAGGAAAACAGAAATATTTGAAACACCTAATTCAATTAAGACTGGATCATAATGACATCAAATTTTTAACAAGATAA

Protein sequence:

>DPOGS202397-PA
MTPLFTASKLKSMFYIIDKSAHDFLKYLRENKEIQKGDAFKTLSTFCSAAICATVFGVNTESIFDSPFLKIARGSFRSTLITNIQFLMSHFAEKFCQILNIKAFKQFEDFFVGAIKQVIRQREIENIRRHDFADICIALKNSGKLIDPDTGLELETTDELLAAQAFFFFVAGVEPIATAMFATFVELGKNSEIQKRVQDEVDGVFEEHNESLSYDIISEMVYLDMVIKEAMRLHPAIGFVTRKCVQNTILPVGNIKVDKGTKIFVPIYELHHDPKYFPEPELFKPDRFSSENKHNILDITYLPFGKGKRICIGMRYAHMQVKTGLVHILRHFSIKTNINRDGIKYMKHQIQVRLENVDVDKNKITGPFWEVFTSGEPFYLHLQNIYNQYPDQAAVGIGGFLTPSLYVRDPKNVQAILSSDFNSFYHRGFEVNENDKLANNILFLNGSKWKLMRQSMTPLFTSLKLKNMFYIMDKSAQDFVQYIKESPEIRKGNTFQTLSTFCSAAIGASVFGLTTESIFDSPFLKIAQKIFEPTLKSNLRFAISNLSQTLFHLFKIQFFKEFEDFFIDAIKRVVRQRKEENVKKHDFADICVALQKNGMLKDPETGLELEPTYALLAAQAFFFFNAGVEPVAAAMFSALIEIGKNPEIQKKVHEEIDGTFDKNNGKLNFDIIAEMVYLDMVINEAMRMYPPIGFLSRQCVEDTVLPSGNIAVEKGTKIFVPIFEFHHDPKYFPNPEEFNPERFSRENKKKMSDITYLPFGKGNRICIGMRYAHMQTKTGLVHLLRNFNVKTNIGKGGINNGNPLRLQYQENFLIVDNDAVLSDDPTDPRRSLIPGYNNNYWTKRGVVFYKGNLLNTILYDFFTGKKSLFMHLHDIYKKYPNEPVVAVGLFFIPALYVKDPINIQYILSSEFKSFYHRGIETNESDILAKNTLFQNGKTWKLLRENMTPLFTSSKLKNMFYIMDKSAQDFVKYLKNDRELRKYNAYQKLSMFCCAAICGTVFGIGANSIFDLPFLKIAQKSFRSTLKINMRFALSTLSRTLFNKFKIHFFKEYEDFFIGAIKQVVHQREQDNIKKHDFVDTWLALQKSGKLKDSDNGFELETTNELLAAQAFSFFIAGVEPTATAMFATLFELAKNPEIQQKVHAEIDRVFENHSGALTYEMISKMVYLDMAIKEAMRLHPSIATLSRRCVQNTVLPNGNIVVEKDTKIFIPVYELHHDEKYFPDPEAYKPERLSRENKHEIIEFTYLPFGKGNRTCIGMQYAHMQIKTGLVHMLRHFTVHTDIQQGKQKYLKHLIQLRLDHNDIKFLTR-