Monarch geneset OGS2.0

DPOGS201160
Transcript        DPOGS201160-TA    3213 bp
Protein           DPOGS201160-PA    1070 aa
Genomic position  DPSCF300065 + 539853-551370
RNAseq coverage   1292x (Rank: top 10%)
Annotation
Heliconius       HMEL013739          0.0     65.83%
Bombyx           BGIBMGA003945-TA    0.0     60.80%
Drosophila       Cyp9f2-PA           8e-113  41.54%
EBI UniRef50     UniRef50_Q7YZX4     0.0     62.43%  Cytochrome P450 n=18 Tax=Ditrysia RepID=Q7YZX4_SPOLT
NCBI RefSeq      NP_001077079.1      0.0     60.80%  cytochrome P450 9a20 [Bombyx mori]
NCBI nr blastp   gi|270011345        0.0     41.12%  hypothetical protein TcasGA2_TC005353 [Tribolium castaneum]
NCBI nr blastx   gi|62912508         0.0     63.74%  cytochrome P450 [Helicoverpa armigera]
Group
Gene Ontology    GO:0009055    1.9e-105  electron carrier activity
                 GO:0020037    1.9e-105  heme binding
                 GO:0016705    1.9e-105  oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
                 GO:0005506    1.9e-105  iron ion binding
                 GO:0055114    1.9e-105  oxidation-reduction process
KEGG pathway     aag:AaeL_AAEL006815    1e-119
                 K07424 (CYP3A) maps->
                     Drug metabolism - cytochrome P450
                     Drug metabolism - other enzymes
                     Linoleic acid metabolism
                     Steroid hormone biosynthesis
                     Metabolism of xenobiotics by cytochrome P450
                     gamma-Hexachlorocyclohexane degradation
                     Retinol metabolism
InterPro domain  [564-1065]  IPR001128  1.9e-105  Cytochrome P450
                 [718-736]   IPR002401  4.7e-16   Cytochrome P450, E-class, group I
Orthology group  MCL10032    Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201160-TA
ATGATAGTATTTCTATTATGGGTCGTCGTTCTGGTGGCCGCCCTGGCACTATACTACCGCCAGATATACTCGAGATTCTCTAAGTTTGGAGTTAAGCAGAATAATATTGTTCCTTTTTTTGGCAATATGTTGAAGATTACACTTGGCATGGAACATTTCGCCGAAAATATTGACTCAATGTATAAGAACTTTCCAGAAGAAAGGTTCATCGGCCGGTTTGAATTTTCAAAACCTGGTGTTTTAATTAGAGATTTGGAATTGATAAAGAAAGTAACTGTCAAGGACTTCGAACACTTTCTAGATCATCGAGGCTTCGTTGATGATAAAGTCGAGCCACTTTTTGCAAGAAACTTATTCTCCTTAAAAGGTCAAGAATGGAAAGACATGAGGTCGACATTGAGTCCGGCTTTCACGAGCTCCAAGATCAAGCAGATGGTGCCATTTATGGAAGAAGTGGGAGAGATGATGGTTAGAGCAGTCAAGAAAAATATTGAGAATTCTAAAGTCGGACAGATTGAAATCGACGTGAAAGACCTGACCTCTAGATATTCTAACGATGTGATCGCTTCCTGCGCCTTCGGACTCAAAGTGGACTCCCATACTACAGAAGATAATGAATTCTATGAGATGGGCAAAAAAGCCTCGACTTTTAAGTTTAAGCAAATGCTCCTCTTCTTTATGAATTCTGCTTTACCTTTTGTTGCTAGGCTTTTTAAACTAAAACTCTTCTCAAGAGAAACAACAGATTTTTTTGTGGATCTAGTACAAGATACAATGAAGGAAAGGGAAGACAAAAAAATCATTAGACCAGACATGATTCATCTGCTGATGGAAGCTAAGAAAGGTAAACTGACATACGATGTCGCTAATGGAAAAGCTGACTCCAATGTTGGATTTTCCACAGTTGAAGAATCAGACTTAGGCCAAAAACCTGTTACTAGAGTGTGGTCTGATAGCGACTTGATCGCTCAAGCAGTAATGTTCTTCATAACTGCATTTGAAACCATATCTACTGTGGCATCTTTCACGCTTTATGAGCTAGCCGTGAACCCAGACGTACAGGACAAACTTTTTCAGGAGATTAAGGAACATAATGCTAAATTTGGAGGAAAATTGGACTTCAATTCCATTCAGAACTTGACTTATTTGGATATGGTGGTCTCAGAAGTTTTAAGACTGTGGCCACCCGCCATCGTCTTGGACAGAGTATGTAACAAAGACTACAACTTGGGAAAACCGAACAGTAAAGCCACTGAAGACTTTATTGTTCGTAAAGGAGAGCTGGTAGGAGTTCCAGTAATGAGCATACATCGTGATCCACAATATTTCCAAAATCCTAATAAATTTGATCCTGAACGCTTTTCTGAAGAAAATAGACACCTTATTGATTTAATGACTTATATGCCGTTTGGATATGGTCCCAGGAATTGTATCGGTTCACGATTCGCGCTCTGCGAAGTGAAGGTACTCACCTATCAGATTATTTTAAACTTCGAGATATCGCCAGCCGAGAAGACTTGCATTCCGGCGAAGTCAAGAATGGAAAGACATGAGGTTGACCCTAAGTCCGGCTTCCACGAGCTCCAAGATCAAGCAGATGGTACCGTTTATATGATTCTTGAAATATGGATTGTCGTTTTATTTGTGGCCGCAGTGTTGTACTATCGTCAGGTGTACTCAAGATTTACTAAATATGGTGTCAAACAACAGACACCGTTCCCTTTATTTGGCAATATGTGGAGAGTGACCCTTCGACTGGAACATTTTACTGATACAATGAATTCAATGTATAAAGAATTTCCAGAAGAAAGGTTCATTGGACGATTCGAATTCATCAAACCCTGTCTGTTTGTAACAGACATTGAATTAATAAAAAAAATAACGGTTAAAGATTTTGAACATTTTCTGGATAGAAGCGGCTTCTCTGTTGAAGTAGACCCGTTGTTTTCAAGAAATTTATTCTCTTTGAAAGGTCAAGAATGGAAAGACATGAGGTCGAC
CCTGAGTCCGGCTTTCACGAGCTCCAAGATCAAACAGATGGTGCCGTTTATGGAAGAAGTGGGAGAGATGATGATACAAGCAGTGAAAAATAAAATTAAAAATTCAGGAAGTGAATGGATTGAAATCGACTGCAAGGACTTAACAACACGCTATACCAATGATGTGATCGCTTCTTGCGCTTTTGGCATAAAAGTGGACTCACATACTAACGTTGACAACGACTTCTATAAAATTGGAAAAATAGCTAGCACGTTCAAAACTAGAGATGTCATTTTGTTCCAATTGTACTCCGCAGCACCTTTCCTTGGAAAGTTCTTTAAACTGGAACTGCTCTCGACAAAAACAAGAAACTTTTTCATCGATCTTGTTTTAAGCACCATGAAGGAAAGGGAGACCCACAACATTATACGACCGGATATGATTCACTTACTAATGGAAGCCAAGAAAGGCAGACTGCATCATGAAAAAGATGTGATTAACGCTGACGATGGCTTTGCTACGGTGGAAGAATCCGAAGTAGGAAAGAACAATAAACATAAAAGTTGGTCGGACACCGACCTCATCGCCCAGGCAGTTCTTTTCTTCATCGCAGCCTTCGAAGTAGTTTCAACGGCGATGACCTTCACACTTACTGAACTAGCACTGCATCCTGACGTGAAAGAGAGGCTTGCGCTTGAGATAAAGGAACACAATGAGAAATATGGTGGCAAGATAGACCTACATTCTATACAAACATTAACTTATTTGGATATGGTTATCTCAGAAGTATTGAGGTTGTGGGTACCTACCATACTCATAGAAAGGATGTGCGTTAAAGACTATAACATGGGGAAATCTAACGGTGAAGCTAAGGAAGATTTTATTCTCCGCAAAGGTGAATCACTACATATTCCAACTTGGTGCATTCATCGTGATCCACAATTCTTTCCTAATCCTAATAAGTTCGACCCTGAACGGTTCTCTAAAGAAAATCGACATCTAATCCATCCCATTGCCTACATGCCTTTCGGACTTGGACCAAGAAACTGCATCGGTTCGCGCTTCGCTCTCTGCGAGCTAAAGCTTTTAGTGTATCAGATTCTTCTTAATTTTGATTTGTCACCAACCAAAAAGACATGTATTCCAGCAAGATTGAACAAGAGCTCTTTCCATCTTCAACTTGAGGGCGGACATGCGCTGAAATTCAGATCACGTTATAATTTAATATTGTGA

Protein sequence:

>DPOGS201160-PA
MIVFLLWVVVLVAALALYYRQIYSRFSKFGVKQNNIVPFFGNMLKITLGMEHFAENIDSMYKNFPEERFIGRFEFSKPGVLIRDLELIKKVTVKDFEHFLDHRGFVDDKVEPLFARNLFSLKGQEWKDMRSTLSPAFTSSKIKQMVPFMEEVGEMMVRAVKKNIENSKVGQIEIDVKDLTSRYSNDVIASCAFGLKVDSHTTEDNEFYEMGKKASTFKFKQMLLFFMNSALPFVARLFKLKLFSRETTDFFVDLVQDTMKEREDKKIIRPDMIHLLMEAKKGKLTYDVANGKADSNVGFSTVEESDLGQKPVTRVWSDSDLIAQAVMFFITAFETISTVASFTLYELAVNPDVQDKLFQEIKEHNAKFGGKLDFNSIQNLTYLDMVVSEVLRLWPPAIVLDRVCNKDYNLGKPNSKATEDFIVRKGELVGVPVMSIHRDPQYFQNPNKFDPERFSEENRHLIDLMTYMPFGYGPRNCIGSRFALCEVKVLTYQIILNFEISPAEKTCIPAKSRMERHEVDPKSGFHELQDQADGTVYMILEIWIVVLFVAAVLYYRQVYSRFTKYGVKQQTPFPLFGNMWRVTLRLEHFTDTMNSMYKEFPEERFIGRFEFIKPCLFVTDIELIKKITVKDFEHFLDRSGFSVEVDPLFSRNLFSLKGQEWKDMRSTLSPAFTSSKIKQMVPFMEEVGEMMIQAVKNKIKNSGSEWIEIDCKDLTTRYTNDVIASCAFGIKVDSHTNVDNDFYKIGKIASTFKTRDVILFQLYSAAPFLGKFFKLELLSTKTRNFFIDLVLSTMKERETHNIIRPDMIHLLMEAKKGRLHHEKDVINADDGFATVEESEVGKNNKHKSWSDTDLIAQAVLFFIAAFEVVSTAMTFTLTELALHPDVKERLALEIKEHNEKYGGKIDLHSIQTLTYLDMVISEVLRLWVPTILIERMCVKDYNMGKSNGEAKEDFILRKGESLHIPTWCIHRDPQFFPNPNKFDPERFSKENRHLIHPIAYMPFGLGPRNCIGSRFALCELKLLVYQILLNFDLSPTKKTCIPARLNKSSFHLQLEGGHALKFRSRYNLIL-
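
Length check (illustrative):

The record lists a 3213 bp transcript and a 1070 aa protein. The short Python sketch below checks that these figures agree if the transcript consists only of the coding sequence (three bases per residue plus one stop codon). It also computes the genomic span, assuming the coordinates 539853-551370 are 1-based and inclusive. That convention is an assumption and is not stated in the record. The helper names (fasta_length, cds_length_matches, genomic_span) are hypothetical and are not part of any MonarchBase tool.

```python
def fasta_length(text: str) -> int:
    """Count residues in a single-record FASTA string.

    Skips the '>' header line and blank lines, and drops a trailing
    '-' or '*' stop marker such as the one ending the protein above.
    """
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    body = "".join(ln for ln in lines if not ln.startswith(">"))
    return len(body.rstrip("-*"))


def cds_length_matches(transcript_bp: int, protein_aa: int) -> bool:
    """True when the transcript is exactly a coding sequence:
    three bases per residue plus one three-base stop codon."""
    return transcript_bp == 3 * protein_aa + 3


def genomic_span(start: int, end: int) -> int:
    """Span in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


# Figures copied from the record above.
TRANSCRIPT_BP = 3213
PROTEIN_AA = 1070

print(cds_length_matches(TRANSCRIPT_BP, PROTEIN_AA))  # prints True
print(genomic_span(539853, 551370))                   # prints 11518
```

Passing the full text of the nucleotide or protein block to fasta_length gives a measured length to compare with the listed one. If the figures hold, the transcript model has no untranslated regions, and the 11518 bp genomic span implies that most of the locus is intronic.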