Monarch geneset OGS2.0

DPOGS204386
TranscriptDPOGS204386-TA1284 bp
ProteinDPOGS204386-PA427 aa
Genomic positionDPSCF300002 - 1569822-1572305
RNAseq coverage17x (Rank: top 81%)
Annotation
HeliconiusHMEL0130816e-16361.76% 
BombyxBGIBMGA013237-TA4e-5429.16% 
DrosophilaCyp6v1-PA2e-6029.83% 
EBI UniRef50UniRef50_D5L0N32e-15057.58%Cytochrome P450 332A5 n=4 Tax=Ditrysia RepID=D5L0N3_MANSE
NCBI RefSeqNP_001108340.19e-14157.75%cytochrome P450 CYP332A1 [Bombyx mori]
NCBI nr blastpgi|2914641017e-15057.58%cytochrome P450 332A5 [Manduca sexta]
NCBI nr blastxgi|2914641014e-14757.58%cytochrome P450 332A5 [Manduca sexta]
Group
Gene OntologyGO:00090554.1e-96electron carrier activity
GO:00200374.1e-96heme binding
GO:00167054.1e-96oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055064.1e-96iron ion binding
GO:00551144.1e-96oxidation-reduction process
KEGG pathwaytca:6590545e-65 
 K07424 (CYP3A)maps-> Drug metabolism - cytochrome P450
    Drug metabolism - other enzymes
    Linoleic acid metabolism
    Steroid hormone biosynthesis
    Metabolism of xenobiotics by cytochrome P450
    gamma-Hexachlorocyclohexane degradation
    Retinol metabolism
InterPro domain[2-424] IPR0011284.1e-96Cytochrome P450
[218-235] IPR0024012.4e-15Cytochrome P450, E-class, group I
Orthology groupMCL34542 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204386-TA
ATGAGGCCAGCATTGGTTGTGCAAACACCGCAACTAGCTCGAAAAGTTTTTAGCTCTGAATACGATAACTTTCAAGATAGACATTTGTACTGTCATGAGTCCGACCCTATGGGGTCGCTTAACATATTTACTGTTAAGAATCCATTGTGGAAAATGCTCAGATACCGTTTATCGCCAATGTTTACATCGCACCGTCTAAGAAAAATAACAGAATTGATGAATGCAAATGCAACTGAATTGGTTAAGAAAGTTCAAAGAGATTTTGTTGAAAAAAAGAAACATGTCAATTTAAAGGAAATATTTTCCATGTACACGTCAGACACAGTAGCGTATAGTGTTTTCGGAATAAGAGTTAGTGCATTAAGTGACCAAGACTCCCCATTGTGGCACATAACTAACCATATGGTCAAATGGACATTCTACCGCGGATTGGAGATTACTTTCATATTCTTTCTTCCTGCCCTTGGAGCATTAATGCGGCTTAAATTATTTTCGAAAGCGGCTAATGATTATATTAAACAACTATATTGGAAAATTGTCAAAGATCGACAGAATCATATAAAATACGACGATTCGGACCTTGTGAATCATTTACTAAAAATAAAGGAACAGCTGGAACTACCACCTGAAGCTGACGAGGACTATGCTGACAATATCTTGCTTGCACAAGTAGCAGTGTTTATATTAGGTTCTATTGAAACTTCTTCTTCTGTTTTGAGCTATTCTCTGCATGAACTAGCATATCATCCAGATGAGCAGGAAAAGCTGTATAGCGAACTTAATAACGCATTTTTGAGCAGTGAAAAGGACTATCTGAATTACAATGAATTATTGCAAGTTGACTATTTGACTGCGTGTATCCACGAAACAATGAGGAAATTTCCCCTTTTGCCTTTAATAGATAGATTGTGTGGACAGACGTGTGTACTTGAAGATGGTTTGAAAATAGAAAAAGGAGTCCCCGTGCTTGTGAATGTGGTCGCAATACATAACAACGAGAAATATTACCCTGAACCTGAAAAGTGGAAACCCGAGCGTTTTATGACAGGCAATAAACAAGATAACAGAGAATTCACCTTTCTGCCGTTTGGTGACGGACCAAGATTCTGTATCGGTAAAAGATACGGTATGATGCAAGTACGCGCAGCTCTTTCCCAGTTGATTTATAATTATAAGATAGAGCCTGTCGTCCCTTACAAAGTGAAACCAGACCCGCATTCTGTTATCTTAGCACCTCAAGATGGATTGAGCGTCAAATTTGTTCCTCGACGTACTGAAAAATAA

Protein sequence:

>DPOGS204386-PA
MRPALVVQTPQLARKVFSSEYDNFQDRHLYCHESDPMGSLNIFTVKNPLWKMLRYRLSPMFTSHRLRKITELMNANATELVKKVQRDFVEKKKHVNLKEIFSMYTSDTVAYSVFGIRVSALSDQDSPLWHITNHMVKWTFYRGLEITFIFFLPALGALMRLKLFSKAANDYIKQLYWKIVKDRQNHIKYDDSDLVNHLLKIKEQLELPPEADEDYADNILLAQVAVFILGSIETSSSVLSYSLHELAYHPDEQEKLYSELNNAFLSSEKDYLNYNELLQVDYLTACIHETMRKFPLLPLIDRLCGQTCVLEDGLKIEKGVPVLVNVVAIHNNEKYYPEPEKWKPERFMTGNKQDNREFTFLPFGDGPRFCIGKRYGMMQVRAALSQLIYNYKIEPVVPYKVKPDPHSVILAPQDGLSVKFVPRRTEK-