Monarch geneset OGS2.0

DPOGS213244
TranscriptDPOGS213244-TA1635 bp
ProteinDPOGS213244-PA544 aa
Genomic positionDPSCF300124 - 304187-322649
RNAseq coverage61x (Rank: top 68%)
Annotation
HeliconiusHMEL0078560.076.29% 
BombyxBGIBMGA009523-TA0.082.05% 
DrosophilaCyp49a1-PD2e-13745.35% 
EBI UniRef50UniRef50_G6CJV50.0100.00%Cytochrome P450 301B1 n=2 Tax=Neoptera RepID=G6CJV5_DANPL
NCBI RefSeqNP_001164234.15e-16356.85%cytochrome P450 301B1 [Tribolium castaneum]
NCBI nr blastpgi|3838557343e-16557.62%PREDICTED: probable cytochrome P450 49a1-like [Megachile rotundata]
NCBI nr blastxgi|3838557343e-15957.83%PREDICTED: probable cytochrome P450 49a1-like [Megachile rotundata]
Group
Gene OntologyGO:00090551.2e-98electron carrier activity
GO:00200371.2e-98heme binding
GO:00167051.2e-98oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055061.2e-98iron ion binding
GO:00551141.2e-98oxidation-reduction process
KEGG pathwaydme:Dmel_CG60422e-77 
 K00517 (E1.14.-.-)maps-> Naphthalene and anthracene degradation
    Stilbenoid, diarylheptanoid and gingerol biosynthesis
    Limonene and pinene degradation
    gamma-Hexachlorocyclohexane degradation
InterPro domain[76-544] IPR0011281.2e-98Cytochrome P450
[346-363] IPR0024011.8e-16Cytochrome P450, E-class, group I
Orthology groupMCL11627 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213244-TA
ATGGCAAGTTCACGAGTAACAATGAGACATAGTCTCAACATCGTGAGATGGCCTCTACGTGTCTGCAAACGGAGACTGGCGTCACAGCAGCGAGCTGTTGCTACCGATGGCGAAACTGGTAAAATATCACATTTGCAAATATGCCCGGCGAAAGGTCACAGGGCGAGATCAACTCAGGCTGTGGCGCCCCTCGAGTCAATCGCGTCAGTAAAACCCTGGGAAGAAGTTCCAGGGCCTAGACCACTGCCACTATTAGGAAACACTTGGAGATTCATACCATATATTGGTGGGTATTCAGTAGAGCATGTCGACAAAGTGTGTCTGTCTTTGCGTCAGGAATATGGAGATTGCGTCAAAATGGAAGGTCTGCTTGGTAGACCGGATATGCTGTTCGTTTTTGATGCGAGCGAAGTTGAACGAGTCTTTAGAGGAGAGGATTCGGCCCCTCATAGACCATCAATGCCCTCGTTGAACTATTACAAACACACATTGAGAAAAGACTTCTTTGGTGCCGAGAAGGACTGTGCCGGAGTTATTGCGGTTCATGGAGATTCCTGGTCAGCTTTTCGGACTAAAGTATCCCGTGTCGCTCTCAGCGCTGGCGCCGCAGCTCAATACACCGACCAAGTTGGGGAGGTAGCTGATGCTTTTGTTAAAAGGCTACGAAAAATAAGAAATGAAAGGAAAGAAACGCCGGACGACTTTCTAAATGAAGTTCATAAATGGTCATTAGAATCGTTAGGACTTATAGCGTTAGACACGAGGTTAGGCTGTTTGGAACAGCACGAGGGTTCGGAGACTCAGCAGCTTATAGATGCTGTGAATACTTTCTTCTTGTGTGTTGGCGAGTTGGAACTCAAGGCACCATGGTGGAGACTCTACCCCACGGCCATGTTCAAGAGATATGTAGCGGCCTTAGACACCATACTTAGTGTTACTCAGTCTCATGTCAGTCGTGCCTTAAAGGAATGTCAAGCTAACCCAAATGGCAGTAAATCTCTTCTCCAAGACCTAGTATCAGCAGCCGGACCTCGTGTTGCAGCAGTAGCAGCACTTGACATGTTCCTTGTGGGCATTGATACGACGTCGAATGCTGTAGCCTCAACTCTATATCAACTCTCTCTAAGGCCTGACGTACAAGAGAAGTTATATAAAGAAATTTCAGGTGTATTACAAGGACGCCCTATAAGACCTGGAGATGTTAATAAAATGCCATATCTAAAGGCGTGCATCAAAGAGGTTTTAAGAATGTATCCAGTTGTTATTGGTAATGGCCGGCAATTGAGCAAGGACACAGTTATATGTGGCTATAATATTCCGAAAGGGACGCAAGTGATATTCCAACACTACGTTATGGGAAACAGCGATGACTATTTCACAAACGCTTCACAGTTTTGTCCTGAAAGGTGGTTACAGCGTTCAATATATAAACACCATCCATTCGCGTCCTTGCCGTTTGGATTCGGCAAAAGGATGTGCCTCGGTAGGAGATTCGCTGAACTTGAAATTAATATCATCATTTGTAAAATGGTTCAATCGTTTCAAATGGAATACCACCACGAGCCCCTTGAATACCACGTTCATCCCATGTATACTCCCAATGGACCTATACGTTTAAAACTTATTGACCGTTAA

Protein sequence:

>DPOGS213244-PA
MASSRVTMRHSLNIVRWPLRVCKRRLASQQRAVATDGETGKISHLQICPAKGHRARSTQAVAPLESIASVKPWEEVPGPRPLPLLGNTWRFIPYIGGYSVEHVDKVCLSLRQEYGDCVKMEGLLGRPDMLFVFDASEVERVFRGEDSAPHRPSMPSLNYYKHTLRKDFFGAEKDCAGVIAVHGDSWSAFRTKVSRVALSAGAAAQYTDQVGEVADAFVKRLRKIRNERKETPDDFLNEVHKWSLESLGLIALDTRLGCLEQHEGSETQQLIDAVNTFFLCVGELELKAPWWRLYPTAMFKRYVAALDTILSVTQSHVSRALKECQANPNGSKSLLQDLVSAAGPRVAAVAALDMFLVGIDTTSNAVASTLYQLSLRPDVQEKLYKEISGVLQGRPIRPGDVNKMPYLKACIKEVLRMYPVVIGNGRQLSKDTVICGYNIPKGTQVIFQHYVMGNSDDYFTNASQFCPERWLQRSIYKHHPFASLPFGFGKRMCLGRRFAELEINIIICKMVQSFQMEYHHEPLEYHVHPMYTPNGPIRLKLIDR-