Monarch geneset OGS2.0

DPOGS201540
TranscriptDPOGS201540-TA2319 bp
ProteinDPOGS201540-PA772 aa
Genomic positionDPSCF300006 + 1655741-1668530
RNAseq coverage931x (Rank: top 14%)
Annotation
HeliconiusHMEL0090670.087.21% 
BombyxBGIBMGA002716-TA0.083.02% 
DrosophilaCG5009-PA6e-15454.30% 
EBI UniRef50UniRef50_Q7KML29e-15254.30%Probable peroxisomal acyl-coenzyme A oxidase 1 n=26 Tax=Neoptera RepID=ACOX1_DROME
NCBI RefSeqXP_001847631.10.062.34%acyl-CoA oxidase [Culex quinquefasciatus]
NCBI nr blastpgi|3320276650.064.99%Putative peroxisomal acyl-coenzyme A oxidase 1 [Acromyrmex echinatior]
NCBI nr blastxgi|3320276650.064.99%Putative peroxisomal acyl-coenzyme A oxidase 1 [Acromyrmex echinatior]
Group
Gene OntologyGO:00039975.1e-270acyl-CoA oxidase activity
GO:00066315.1e-270fatty acid metabolic process
GO:00506605.1e-270flavin adenine dinucleotide binding
GO:00057775.1e-270peroxisome
GO:00551145.1e-270oxidation-reduction process
GO:00166271.6e-54oxidoreductase activity, acting on the CH-CH group of donors
GO:00066353e-53fatty acid beta-oxidation
GO:00081522.9e-52metabolic process
GO:00039958.3e-44acyl-CoA dehydrogenase activity
KEGG pathwaycqu:CpipJ_CPIJ0061871e-180 
 K00232 (E1.3.3.6, ACOX1, ACOX3)maps-> Peroxisome
    Fatty acid metabolism
    Biosynthesis of unsaturated fatty acids
    PPAR signaling pathway
    alpha-Linolenic acid metabolism
InterPro domain[1-772] IPR0122585.1e-270Acyl-CoA oxidase
[1-737] IPR0235702.3e-203Acyl-CoA oxidase, peroxisomal
[397-559] IPR0090751.6e-54Acyl-CoA dehydrogenase/oxidase C-terminal
[589-770] IPR0026553e-53Acyl-CoA oxidase, C-terminal
[6-280] IPR0091002.9e-52Acyl-CoA dehydrogenase/oxidase
[147-277] IPR0060918.3e-44Acyl-CoA oxidase/dehydrogenase, central domain
[20-146] IPR0137864.1e-31Acyl-CoA dehydrogenase/oxidase, N-terminal
Orthology groupMCL10144 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201540-TA
ATGGTGTCTAATAAAATAAACGCCGATTTACAAAAAGAAAGGGATTCTTGTAGTTTTAATGTCACCGAATTAACAAACTTAATTGATGGAGGTATTGAGAAAACTGCCGAAAGGAAGAAAAGAGAGGAAATGGCTCTCAAAGAAGGTATACATTTGGACGAAGTTCCGTCTGTGTATTTGAGTCACAAGGAGAAATATGAGTTAGCGATCAAGAAAGCCTGCCTCCTGTTTAAAATGATAAGACGACTTCAGGAAGAGGAAAATGCTGGCATGGAAAATTATATGGCAGTGCTTGGAGGTAATTTAGGGTCGGCCATTCTTGGGGATGGATCCCCTCTCACCCTCCACTATGTAATGTTCATACCCACATTGATGGGACAGGGTACAGTGGAACAGCAGGCTTATTGGATCGGCAGAGCATTTAATCTTGACATCATTGGCACATACGCTCAGACTGAATTGGGTCATGGCACATTCGTCCGCGGTCTTGAGACCACAGCCACCTACGATCCTTCTACTAAGGAGTTCATTCTACACAGCCCCACACTCACCGCATACAAGTGGTGGCCAGGCGGACTGGCCCACACAGCCAACTACTGTATAGTTATGGCCCAACTGCACACGAAAGGCAAATGCCACGGCATGCACGCCTTCATAGTCCAATTGCGTGATGAAGAAACACACATGCCCTTACCGGGGATTAAAGTTGGGGAGATCGGCGCCAAGCTCGGTATGAACGGCACTAACAACGGATTCCTTGGCTTCGACCAAGTCAGAATACCCAGAGACTATATGCTAATGAAGAACGCTAAAGTTTTAGAGAATGGACAGAAACTTTCAGCTGTACATCAGTATACCTTGGTGCATCATAGTGTCTTGANTAAATTGGCCCACACAGCCAACTACTGTATAGTTATGGCCCAACTGCACACGAAAGGCAAATGCCACGGCATGCACGCCTTCATAGTCCAATTGCGTGATGAAGAAACACACATGCCCTTACCGGGGATTAAAGTTGGTGAGATCGGCGCCAAGCTCGGTATGAACGGCACTAACAACGGATTCCTTGGCTTCGACCAAGTCAGAATACCCAGAGACTATATGCTAATGAAGAACGCTAAAGTTTTAGAGGACGGTACATACGTGACTGCACCAAGCTCGAAGCTCGCGTACGGCACCATGATGTTCGTGCGAGTGATGTTGGTCAACGACATGTGTAACTACATGGCTAAAGCGGTCACCATAGCCACCAGATACAGCGCTGTGAGGAGACAGTCGCAGCCTAAACCCAATGAACCGGAACCCCAGATCTTGGATTACGTGACGCAGCAGCACAAGTTGATGATAGGTCTAGCGTCGGTCCACGCCTTCAGAACATGCGCTGATTGGCTCTGGCAGATGTACAACAACGTCACCGCTGAACTCGAAGCGGGGGATATGGAGAGACTTCCAGAGCTTCACGCGTTATCGTGCTGTTTGAAAGCGGTGAGTACATCAGATGCCGCTCAGTGCGTGGAGCGCTGCAGACTGGCGTGCGGCGGACACGGGTACATGCTGTCCTCCAACTTGCCGCTCACATACGGTCTGGTCACGGCCGCCTGCACTTACGAAGGAGAGAATACCGTCATGTTGCTGCAGACTGCCAGATACCTGGTGAAGGCGTGGCAGCAGGCGGCCGGTGGCCAGACTCTACCACCGACGGTGAGTTACCTTCGCGAGGTGGTCGCCGGTCGTCGGTCGCCACCATTCGACAACACCATAGATGGTATAATCGCTGGCTTCTACCGAGTCGCCGCTGGTAAAATCGGTGCCTGTGTGGCGCAAATAGAGAAACGTCAGAAAACAGGCATGCCATACGAGGACGCCTGGAATATGACATCCGTTCAACTGACTTCAGCATCGGAGGCCCACTGTCGTGCGATAATCTTGTCAACGTACTACAAGGAGATCGAGCGTCAGAGTAACTCGGTTTCCCCCGAGCTGTCGACAGTGCTACGGCAGTTGGTAGACCTGTATGTGGTGTATTGGGCGCTGCAGTGCATTGGAGACCTACTCAGGTTCACATCGATCTCTGAGCGTGACATCGACCAACTTCAGGCCTGGTACGAGGATCTCCTCACCAGGCTTAGGCCGAACGCGGTGGGACTCGTGGACGCCTTCGACATTAGGGACGAGATCCTCAACTCTGCGCTGGGTGCATACGATGGCAATGCCTACGAACGTCTTATGGCGGAAGCTATGAAGAGCCCTCTCAACAAGGAACCGGTCAATCAAAGCTTCCATCAGTACTTGAAACCATTGATGCAGGGAAAGCTATAG

Protein sequence:

>DPOGS201540-PA
MVSNKINADLQKERDSCSFNVTELTNLIDGGIEKTAERKKREEMALKEGIHLDEVPSVYLSHKEKYELAIKKACLLFKMIRRLQEEENAGMENYMAVLGGNLGSAILGDGSPLTLHYVMFIPTLMGQGTVEQQAYWIGRAFNLDIIGTYAQTELGHGTFVRGLETTATYDPSTKEFILHSPTLTAYKWWPGGLAHTANYCIVMAQLHTKGKCHGMHAFIVQLRDEETHMPLPGIKVGEIGAKLGMNGTNNGFLGFDQVRIPRDYMLMKNAKVLENGQKLSAVHQYTLVHHSVLXKLAHTANYCIVMAQLHTKGKCHGMHAFIVQLRDEETHMPLPGIKVGEIGAKLGMNGTNNGFLGFDQVRIPRDYMLMKNAKVLEDGTYVTAPSSKLAYGTMMFVRVMLVNDMCNYMAKAVTIATRYSAVRRQSQPKPNEPEPQILDYVTQQHKLMIGLASVHAFRTCADWLWQMYNNVTAELEAGDMERLPELHALSCCLKAVSTSDAAQCVERCRLACGGHGYMLSSNLPLTYGLVTAACTYEGENTVMLLQTARYLVKAWQQAAGGQTLPPTVSYLREVVAGRRSPPFDNTIDGIIAGFYRVAAGKIGACVAQIEKRQKTGMPYEDAWNMTSVQLTSASEAHCRAIILSTYYKEIERQSNSVSPELSTVLRQLVDLYVVYWALQCIGDLLRFTSISERDIDQLQAWYEDLLTRLRPNAVGLVDAFDIRDEILNSALGAYDGNAYERLMAEAMKSPLNKEPVNQSFHQYLKPLMQGKL-