Monarch geneset OGS2.0

DPOGS205654
TranscriptDPOGS205654-TA1551 bp
ProteinDPOGS205654-PA516 aa
Genomic positionDPSCF300023 + 316318-321220
RNAseq coverage2233x (Rank: top 5%)
Annotation
HeliconiusHMEL0065880.074.67% 
BombyxBGIBMGA001004-TA0.069.29% 
DrosophilaCyp4g15-PA2e-15049.46% 
EBI UniRef50UniRef50_E2AH962e-16156.89%Cytochrome P450 4g1 n=2 Tax=Camponotus floridanus RepID=E2AH96_CAMFO
NCBI RefSeqNP_001106221.10.066.73%cytochrome P450 [Bombyx mori]
NCBI nr blastpgi|2914640910.071.20%cytochrome P450 4G49 [Manduca sexta]
NCBI nr blastxgi|2729795780.068.74%cytochrome P450 CYP4G48 [Zygaena filipendulae]
Group
Gene OntologyGO:00090551.4e-113electron carrier activity
GO:00200371.4e-113heme binding
GO:00167051.4e-113oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055061.4e-113iron ion binding
GO:00551141.4e-113oxidation-reduction process
KEGG pathwaydme:Dmel_CG14386e-91 
 K00517 (E1.14.-.-)maps-> Naphthalene and anthracene degradation
    Stilbenoid, diarylheptanoid and gingerol biosynthesis
    Limonene and pinene degradation
    gamma-Hexachlorocyclohexane degradation
InterPro domain[26-510] IPR0011281.4e-113Cytochrome P450
[79-98] IPR0024018.1e-21Cytochrome P450, E-class, group I
Orthology groupMCL10223 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205654-TA
ATGACGGTGGTCGAAGATATCCAGCCGCACCAATGGAACTCCACGCGTCTTTACTTCTATCCGTTGGTCATCCTCGCGACTAGTCTTTGGCTTCTATACAGATGGCAGCAGCAAACTAAAATGTATAAGCTGGGGAACAAGATCCCCGGTCCCATGGCCTTGCCGATTTTTGGAAACGCACTTTTGGCCGTAAATAAAAAACCAGAGCAACTTTTAAGCATTGCCCTTGAATATTATGCATATTATGGCTCTGTGGTGAGAGGATGGCTGGGAAATAACTTGATAATATTTTTGGCGGACCCAAATGATGTTGAAGTCATTTTAAACAGCAATGTCCATATTGACAAGGCATCCGAATATAAATTCTTCCAACCGTGGCTAGGAGAAGGGCTGCTCATAAGCTCAGGAGAAAAATGGAGGTCCCATCGCAAAATGATAGCCCCAACCTTCCACATTAATATCCTCAAATCTTTCGTGGGAGTTTTTAATCAAAACAGCAAGAACGTTGTGGATAAAATGCGAGGTGAAATGGGAAAAGTGTTCGATGTTCACGATTATATGAGCGGCGTCACTGTGGATATTCTTCTAGAAACCGCAATGGGAATCACCAAGGAAACTCAAGATCAATCTGGTTTTGACTACGCTATGGCAGTGATGAAGCAAGTCATCAAAAATAAAAAAGAACGTTATCTCCAAAATAAAGCCAAAGGCATCATACCACCAACGATCGACGAAATTTCCAAATCGGCTCCTAAAACTGAAAATTATAATGTATTAGCCAACGAAAAAACGCTCGCTGATACAGTGTTCAAGGGCTACAGAGATGATTTAGATTTCAATGACGAAAATGATGTCGGTGAGAAAAAACGTCTGGCTTTCCTGGACCTCATGATTGAATCAGCACAGAACGGTTCAAACAAGATCACAGATTTTGAAATCAAAGAGGAAGGCCATGACACCACCGCAGCTGGATCCAGTTTCGTGCTTTGTCTCCTGGGAATTCACCAGGACATCCAAGCCAGGGTTTACGACGAGTTGTATTCAATCTTTGGAGATTCTGACCGCCCCGCCACTTTCGAAGACACCCTCCAAATGAAATACTTGGAGCGCGTCATCTTTGAATCGTTGAGAATGTACCCACCTGTACCCATTATTGCCAGGAAAATTAACCGTGATGTTAAGATAGCAACAAATGACTACGTATTGCCAGCTGGATGCACTGTGGTCATCGGAACATATGGAATCCACAGGAACCCTAAATATTATGAAAACCCCGACGTTTTCAACCCCGATAACTTCCTTCCTGAGAAGACACAGAACAGACACTATTACAGCTATATACCATTCAGTGCTGGGCCCAGGAGTTGTGTTGGACGTAAGTACGCCATTTTAAAATTGAAAATTTTACTATCGACAATCCTTCGCAATTACAAAATGGTGTCCGACATAACTGAGGATAAATTTGTCCTCCAAGCTGACATCATTCTGAAAAGACACGATGGCTTTAGGGTCCAGATTGAACCAAGGAAACGTGTTCCATCCACAGCATAA

Protein sequence:

>DPOGS205654-PA
MTVVEDIQPHQWNSTRLYFYPLVILATSLWLLYRWQQQTKMYKLGNKIPGPMALPIFGNALLAVNKKPEQLLSIALEYYAYYGSVVRGWLGNNLIIFLADPNDVEVILNSNVHIDKASEYKFFQPWLGEGLLISSGEKWRSHRKMIAPTFHINILKSFVGVFNQNSKNVVDKMRGEMGKVFDVHDYMSGVTVDILLETAMGITKETQDQSGFDYAMAVMKQVIKNKKERYLQNKAKGIIPPTIDEISKSAPKTENYNVLANEKTLADTVFKGYRDDLDFNDENDVGEKKRLAFLDLMIESAQNGSNKITDFEIKEEGHDTTAAGSSFVLCLLGIHQDIQARVYDELYSIFGDSDRPATFEDTLQMKYLERVIFESLRMYPPVPIIARKINRDVKIATNDYVLPAGCTVVIGTYGIHRNPKYYENPDVFNPDNFLPEKTQNRHYYSYIPFSAGPRSCVGRKYAILKLKILLSTILRNYKMVSDITEDKFVLQADIILKRHDGFRVQIEPRKRVPSTA-