Monarch geneset OGS2.0

DPOGS205608
TranscriptDPOGS205608-TA1506 bp
ProteinDPOGS205608-PA501 aa
Genomic positionDPSCF300167 + 220801-223020
RNAseq coverage2557x (Rank: top 5%)
Annotation
HeliconiusHMEL0174631e-15051.31% 
BombyxBGIBMGA003945-TA3e-6328.93% 
DrosophilaCyp6g2-PA2e-7432.42% 
EBI UniRef50UniRef50_D5L0N38e-8935.93%Cytochrome P450 332A5 n=4 Tax=Ditrysia RepID=D5L0N3_MANSE
NCBI RefSeqNP_001108340.12e-8836.07%cytochrome P450 CYP332A1 [Bombyx mori]
NCBI nr blastpgi|2914641013e-8835.93%cytochrome P450 332A5 [Manduca sexta]
NCBI nr blastxgi|1688234132e-8835.67%cytochrome P450 CYP332A1 [Bombyx mori]
Group
Gene OntologyGO:00090556.3e-107electron carrier activity
GO:00200376.3e-107heme binding
GO:00167056.3e-107oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055066.3e-107iron ion binding
GO:00551146.3e-107oxidation-reduction process
GO:00044973.3e-14monooxygenase activity
KEGG pathwaynvi:1001183634e-76 
 K07424 (CYP3A)maps-> Drug metabolism - cytochrome P450
    Drug metabolism - other enzymes
    Linoleic acid metabolism
    Steroid hormone biosynthesis
    Metabolism of xenobiotics by cytochrome P450
    gamma-Hexachlorocyclohexane degradation
    Retinol metabolism
InterPro domain[34-495] IPR0011286.3e-107Cytochrome P450
[348-364] IPR0024033.3e-14Cytochrome P450, E-class, group IV
Orthology groupMCL10231 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205608-TA
ATGTCGGTTCTGTTTTTGTTATTTTCTTTAATGACATGTTTTTTGGCGTATCTTACTTTGAGATGGAATAAAGTAAAGAATTATTGGGCCCAGCGCGGAGTACCACACTCCCCACCAAACCCATTCCTCGGCAGCCTTACCTTCATTCAAAGAAAAAATGTGGGTGTATGGATACGTGACATGTATGAACACTTCAAATCTCCGTACATAGGGATCTGGTTAATATGGAAACCAGCTTTAATTATAAACGACCCAGAAATCGCTCGGCGAATATTAGTTAAAGATAGTTCGATTTTTAGAGACAGGTATTTGAGTTCTGGGAGCAGCGATCCTATCGGAGCACTAAATATATTTACCATGAATGACCCCGATTGGTCAAATATACGACGTAAACTGACCAATCTGTTCACTGCAGCTAAACTTAGATTTGTTCAAAGTTTTGCTTTAGTAAAAGCTAAAGAACTCGTGCAGAGAGTGGATAGAGATCGCAATAAAGGTCTAGAACTCAAGACTCTCTTTGTTGATTACACGACAGATGTGATTGGAACGTTTGCCTTCGGACTTGAGAGTAACGCAACGCTCACTTCCGAGGGTCCCCTAAGAAAAGTTACAGATGATTTCATGAGGTTTAGCGTATATAGAGGGCTTTGTTGGTTTAGTATTTTTTTTTGGCCGGGTTTGGTCGATATATTTAGGTTTAGCCTCTTTCCAAGGGACACAACTGATTTCTTTAAAAAGATTTACCTAAATATAATGGACCAACGTCACAAACATCCAGACGGCAAACAATACAAGGATTTAGTAGATGCTCTTATAGAAATTAAAAAAGAGAGCGAAGAAAAAAATCAAAACTACCCCGATGACCTTTATCTAGCCCAAGCGGCCATTGTCCTCCTTGGAGGTTTTGACTCTACTGCCTCAGCGCTAACGTACATGACATATGAACTTGCCCATGATAGCGAGAGTCAGGAAAAATTATACAGAGAATTGAAGGAAGCCGAAAGAAATGGAGCAAATTTCGATGCGCAGACCTTGACAGAGTTGACATATCTCAACTGTGTCTTCAAAGAGGTTCTGCGAAAATATGCACCAATGGGTTGGCTTGACCGAATAGCAGCTACCGATTATAATATTGATGAGAACCTGACTATCGCAGCAGGAACAGTGATTTATGTGAATGCTATCGGTATGCACTATGATCCCAAATACTTTCCTGAACCTTACAAATTTAATCCTGATAGGTTTTTACCAGAAAACGAAAGTAACATTGAACCGTATACATTTATGCCGTTCGGAGATGGTCCGAGAGTATGCATAGGTCAAAGATTTGCTTATATGTCCGCCCGAACGGCTGCATCTCAGTTGTTCCTAAAATACAAGGTCCAACCTATTCCTGGTTCACCAAAACCTAAAGACGTGAAGATTGAATCGAAAGGATTGTTTTTAGGACCAGGAGAGCCAGTGCACGTTGAATTCATCCCGAGAACGGAGAACGGGCATGATTAA

Protein sequence:

>DPOGS205608-PA
MSVLFLLFSLMTCFLAYLTLRWNKVKNYWAQRGVPHSPPNPFLGSLTFIQRKNVGVWIRDMYEHFKSPYIGIWLIWKPALIINDPEIARRILVKDSSIFRDRYLSSGSSDPIGALNIFTMNDPDWSNIRRKLTNLFTAAKLRFVQSFALVKAKELVQRVDRDRNKGLELKTLFVDYTTDVIGTFAFGLESNATLTSEGPLRKVTDDFMRFSVYRGLCWFSIFFWPGLVDIFRFSLFPRDTTDFFKKIYLNIMDQRHKHPDGKQYKDLVDALIEIKKESEEKNQNYPDDLYLAQAAIVLLGGFDSTASALTYMTYELAHDSESQEKLYRELKEAERNGANFDAQTLTELTYLNCVFKEVLRKYAPMGWLDRIAATDYNIDENLTIAAGTVIYVNAIGMHYDPKYFPEPYKFNPDRFLPENESNIEPYTFMPFGDGPRVCIGQRFAYMSARTAASQLFLKYKVQPIPGSPKPKDVKIESKGLFLGPGEPVHVEFIPRTENGHD-