Monarch geneset OGS2.0

DPOGS202332
Transcript: DPOGS202332-TA (1551 bp)
Protein: DPOGS202332-PA (516 aa)
Genomic position: DPSCF300032 + 633718-639544
RNAseq coverage: 110x (Rank: top 59%)
Annotation
Heliconius       HMEL006595        0.0      68.88%
Bombyx           BGIBMGA003944-TA  8e-69    29.66%
Drosophila       Cyp6a2-PA         7e-92    33.99%
EBI UniRef50     UniRef50_D2JLK0   1e-174   54.74%  Cytochrome P450 CYP6CT1 n=2 Tax=Ditrysia RepID=D2JLK0_9NEOP
NCBI RefSeq      XP_975562.1       3e-104   38.89%  PREDICTED: similar to cytochrome P450 [Tribolium castaneum]
NCBI nr blastp   gi|308316628      4e-174   54.74%  cytochrome P450 CYP6CT1 [Zygaena filipendulae]
NCBI nr blastx   gi|308316628      3e-170   54.74%  cytochrome P450 CYP6CT1 [Zygaena filipendulae]
Group
Gene Ontology
  GO:0009055  3.5e-103  electron carrier activity
  GO:0020037  3.5e-103  heme binding
  GO:0016705  3.5e-103  oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
  GO:0005506  3.5e-103  iron ion binding
  GO:0055114  3.5e-103  oxidation-reduction process
KEGG pathway
  dme:Dmel_CG9438  5e-90
  K00517 (E1.14.-.-) maps to:
    Naphthalene and anthracene degradation
    Stilbenoid, diarylheptanoid and gingerol biosynthesis
    Limonene and pinene degradation
    gamma-Hexachlorocyclohexane degradation
InterPro domain
  [37-510]   IPR001128  3.5e-103  Cytochrome P450
  [305-322]  IPR002401  1.6e-15   Cytochrome P450, E-class, group I
Orthology group: MCL10169 (Insect specific)

Nucleotide sequence:

>DPOGS202332-TA
ATGGCGATTATAAAAATTCAGGATTATGCTTACGAACTGTTAATATTACTATTGATGGCATTCACTGTGTTGTACGTGTGGTTCCAATATAAATTTACGTATTGGAGCAGTAAAGGAGTGTTCAGTCCTACCCCGGTATTCCTTTTCGGAAATATACAGGATGTTATAAAAAGGAAGACGCAGTTTTTCCAGCCGTACTGCGATAATTATTTCAAATATAAACATTTGCCATACATAGGGATGTATTGTTTTAATAAGCCCGTACTGAGTATACACGACGCCGAGTTGGCTAAGCACATACTGATAAAGGATTTCGAACATTTCCAATCACATGGAATATTTTCTGGTGGTGTCGGGGATCCTTTGGCTGGACATCTTTTCAATTTACACGGATCAGCGTGGAAGCTTCTGAGGAACAAAATAACATCTGCATTCTCGTCATGTAAATTGAAATGTATGTATCCGCTGGTGGAGAAAATATCTAAGGAGGCGCTCGGGTACGGCGACCTGCTGCACGCGAGGAGCGAATCCATAAATTTCTCGGAATTTTACGAAAAGTACACTATGGAAATTATCGGTAATGTCGGCTTCGGGGTGGAGTGTAACGGTTTTAAAAATTCAAATTCAGAATTTTATTTGCGCGGACACGAGTATTTCAATCCTAATTCGATGTATTGGACACTAATAAGGGCTTTGGCGTTCTTTATGCCAAACTTCTTTGATAAGCTGAAGATAAGACGAATCAACCCGGACATTATAAACTTTTTCGATAATTTAGTCAGAGAGACCGTCGAGTATAGACGCAAACATAGCTACAAACGGAACGACTTTCTCCAGACTCTGATAGATTTAAATAACGATTCCAGTAAATGTGAAGAACGCGAATCCCAAAAGGGAGTTTTTACATTAACAGACGTCACATCAAACACTATGTTGTATATGTTTGCGGGTTACGAGACCTCGGCCACAACTGGGCAGTTTGCGGCGTACGAACTGGCAAAAAACCCCCACATTCAGACTAAGGCTAGGGAAGAAATAAGAAGGGTCCTCGCCAAATATGACGGCGAATGCAGTTACGAGGCCCAGGGTGAAATGACTTATATGAATATGATTTTAGATGAGACGATGCGAATGTACCCGCCACTTCGATCGCTTTACCGTGGCTGTACTAAGGAATATAGAATACCCGACAGTGACGTCACAATCGAGGAAGGCACCCTAGTGCTTATACCGATACATGCAATCCAGATGGATCCAGAAATATTCCAAGATCCGGAGACCTTCGATCCGGAAAGATTCTCCCCCGACAGAAAGAAACTTATCCATCCCTGTCATTGGATGCCGTTTGGCGAAGGTCCCCGGAAATGTCTAGGTCTCCGTCAAGGATACATTCAGTCGAAACTGGCTCTAGTCAAGTTATTACACAAGTATGAACTCTTGTTGGATGACCGCACTGCCGTTCCTATGAAGATTAAGGCCACATCACTAGCTTGCGCCGCTGACGGCGGTGTGTGGATACGGCTTAAGAAATTAACGGACGCTGTAAACTAG

Protein sequence:

>DPOGS202332-PA
MAIIKIQDYAYELLILLLMAFTVLYVWFQYKFTYWSSKGVFSPTPVFLFGNIQDVIKRKTQFFQPYCDNYFKYKHLPYIGMYCFNKPVLSIHDAELAKHILIKDFEHFQSHGIFSGGVGDPLAGHLFNLHGSAWKLLRNKITSAFSSCKLKCMYPLVEKISKEALGYGDLLHARSESINFSEFYEKYTMEIIGNVGFGVECNGFKNSNSEFYLRGHEYFNPNSMYWTLIRALAFFMPNFFDKLKIRRINPDIINFFDNLVRETVEYRRKHSYKRNDFLQTLIDLNNDSSKCEERESQKGVFTLTDVTSNTMLYMFAGYETSATTGQFAAYELAKNPHIQTKAREEIRRVLAKYDGECSYEAQGEMTYMNMILDETMRMYPPLRSLYRGCTKEYRIPDSDVTIEEGTLVLIPIHAIQMDPEIFQDPETFDPERFSPDRKKLIHPCHWMPFGEGPRKCLGLRQGYIQSKLALVKLLHKYELLLDDRTAVPMKIKATSLACAADGGVWIRLKKLTDAVN-
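
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the standard genetic code applies to this nuclear gene, and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration. It translates the first and last codons of DPOGS202332-TA and confirms that a 516 aa protein plus one stop codon accounts for a 1551 bp coding sequence. The stop codon is written as `-` to match the convention used in the protein record.

```python
# Build the standard genetic code (NCBI translation table 1) in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 0, marking stops with stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 nt and last 15 nt of DPOGS202332-TA, copied from the record.
cds_start = "ATGGCGATTATAAAAATTCAGGATTATGCT"
cds_end = "GACGCTGTAAACTAG"

protein_start = translate(cds_start)
protein_end = translate(cds_end)

# Length bookkeeping: 516 residues plus one stop codon, three bases each.
expected_cds_length = 3 * (516 + 1)

print(protein_start, protein_end, expected_cds_length)
```

To check the whole record, the full nucleotide sequence can be passed to `translate` and the result compared with the DPOGS202332-PA sequence, including the trailing `-`.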