Monarch geneset OGS2.0

DPOGS211931
TranscriptDPOGS211931-TA1632 bp
ProteinDPOGS211931-PA543 aa
Genomic positionDPSCF300011 + 339372-343017
RNAseq coverage5769x (Rank: top 2%)
Annotation
HeliconiusHMEL0167050.087.16% 
BombyxBGIBMGA001162-TA0.081.29% 
DrosophilaCyp4g15-PA0.065.19% 
EBI UniRef50UniRef50_Q17DS90.071.89%Cytochrome P450 n=52 Tax=Neoptera RepID=Q17DS9_AEDAE
NCBI RefSeqNP_001106223.10.081.12%cytochrome P450 CYP4G25 [Bombyx mori]
NCBI nr blastpgi|2914640890.082.55%cytochrome P450 4G4 [Manduca sexta]
NCBI nr blastxgi|2914640890.082.55%cytochrome P450 4G4 [Manduca sexta]
Group
Gene OntologyGO:00090557e-121electron carrier activity
GO:00200377e-121heme binding
GO:00167057e-121oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055067e-121iron ion binding
GO:00551147e-121oxidation-reduction process
KEGG pathwaydme:Dmel_CG14382e-110 
 K00517 (E1.14.-.-)maps-> Naphthalene and anthracene degradation
    Stilbenoid, diarylheptanoid and gingerol biosynthesis
    Limonene and pinene degradation
    gamma-Hexachlorocyclohexane degradation
InterPro domain[314-538] IPR0011287e-121Cytochrome P450
[333-350] IPR0024016.9e-21Cytochrome P450, E-class, group I
Orthology groupMCL10223 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211931-TA
ATGAGTTACGCTGCTGCTGAAACAGTGGCGGCAAGCAGCACTTGGGCAGCCACTAACCTGTTCTATGTGCTGCTAGTGCCAGCGCTGCTGCTATGGTACACATACTGGCGCATGTCCAGGAGGCATATGTATGAGCTTGCAGAAAAGATCGCCGGTCCACCAGGCTACCCCCTCATAGGCAATGCTCTGGAATTCACAGGGGGTTCCGATGAAATATTTAAAAATATAATGAAGAGGAGCTTCGAATTTAAAAATGAGACAGCCGTTAGAATTTGGATTGGACCCAGGCTCCTCGTGTTCTTGTATGACCCTCGGGATGTCGAGCTTATTCTTAGCAGCCACGTTCATATCGACAAAGCAGAAGAATACAGGTTCTTCAAGCCCTGGCTTGGAGATGGCCTTCTCATCAGTACCGGTCAAAAATGGCGTTCCCACCGAAAACTGATCGCTCCCACTTTCCACTTGAACGTTCTCAAGAGCTTCATCGACCTCTTCAATGCCAATTCACGAGCCGTGGTCAACAAGCTGAAGAAGGAAAGCGGGGAGTTCGATTGTCATGACTACATGAAAACTGCTATGGGAGTCAACAAGAACACTCAGGACAGTGGATTTGAATACGCCATGGCTGTCATGAAAATGTGTGACATCCTGCATTTAAGACACACAAAAATCTGGTTGAGACCCGACTTACTCTTCAATTTTACTCAGTATGCAAAGGTTCAAAACAAACTTCTCAGCATAATCCACGGGTTGACCACAAAGGTTATCAAACGGAAGAAGGAGGAATTCAAATCCGGAAAGAAGCCATCTATCCTTGAAACAGAAGTCACAACCAAAGACACCAAGACAACCTCTGTTGAGGGATTATCATTCGGCCAATCTGCTGGTCTCAAGGACGACTTGGATGTGGACGAAGATGTTGGACAGAAGAAACGTCTTGCTTTCTTAGATTTGCTCCTAGAAAGTGCTCAGGGTGGAATTGTCATCTCTGACACCGAAATCAAGGAACAAGTTGATACTATTATGTTTGAGGGTCACGACACTACAGCTGCTGGAAGCAGTTTCTTCTTATCGTTGATGGGTATCCATCAAGACATCCAAGCCAAGGTTGTTGAAGAATTAGATCAAATATTCGGTGACTCAGACCGCCCAGCTACCTTCCAGGACACGTTGGAAATGAAATATTTGGAAAGGTGTCTCATGGAAACACTACGTATGTTCCCACCAGTACCAATTATTGCTCGTCATCTAAATCAGGATATCACTCTACCCTCCAGTGGCAAAAAAGTACCAGCCGGTACCACCGTCGTCATCGGTACTTACAAACTCCACCGCAGTGAGAGCATTTACCCAAATCCGGATAAATTCGACCCTGACAACTTCCTTCCGGAACGTTCTGCTAATCGCCATTACTACGCGTTCGTACCCTTCTCAGCTGGACCAAGGAGTTGTGTTGGTCGTAAATACGCTATGTTGAAGCTAAAGATCATACTCTCGACCATCCTGAGGAATTTCCGCGTAATTTCGGACCTTAAAGAGGAAGACTTCAAACTTCAGGCCGACATTATACTGAAGCGAGCTGAAGGTTTCAAAGTTCGTCTTGAACCACGCAAGAGGCTAGTGAAAGCCTAA

Protein sequence:

>DPOGS211931-PA
MSYAAAETVAASSTWAATNLFYVLLVPALLLWYTYWRMSRRHMYELAEKIAGPPGYPLIGNALEFTGGSDEIFKNIMKRSFEFKNETAVRIWIGPRLLVFLYDPRDVELILSSHVHIDKAEEYRFFKPWLGDGLLISTGQKWRSHRKLIAPTFHLNVLKSFIDLFNANSRAVVNKLKKESGEFDCHDYMKTAMGVNKNTQDSGFEYAMAVMKMCDILHLRHTKIWLRPDLLFNFTQYAKVQNKLLSIIHGLTTKVIKRKKEEFKSGKKPSILETEVTTKDTKTTSVEGLSFGQSAGLKDDLDVDEDVGQKKRLAFLDLLLESAQGGIVISDTEIKEQVDTIMFEGHDTTAAGSSFFLSLMGIHQDIQAKVVEELDQIFGDSDRPATFQDTLEMKYLERCLMETLRMFPPVPIIARHLNQDITLPSSGKKVPAGTTVVIGTYKLHRSESIYPNPDKFDPDNFLPERSANRHYYAFVPFSAGPRSCVGRKYAMLKLKIILSTILRNFRVISDLKEEDFKLQADIILKRAEGFKVRLEPRKRLVKA-