Monarch geneset OGS2.0

DPOGS207080
TranscriptDPOGS207080-TA1848 bp
ProteinDPOGS207080-PA615 aa
Genomic positionDPSCF300001 + 2585262-2588330
RNAseq coverage596x (Rank: top 21%)
Annotation
HeliconiusHMEL0140514e-14246.29% 
BombyxBGIBMGA000068-TA6e-12440.43% 
DrosophilaCG9519-PA3e-8133.98% 
EBI UniRef50UniRef50_D0ABA65e-12239.08%Putative ecdysone oxidase n=2 Tax=Nymphalidae RepID=D0ABA6_9NEOP
NCBI RefSeqNP_001177919.11e-11038.29%ecdysone oxidase [Bombyx mori]
NCBI nr blastpgi|2613359212e-12139.08%putative ecdysone oxidase [Heliconius melpomene]
NCBI nr blastxgi|2613359218e-12038.87%putative ecdysone oxidase [Heliconius melpomene]
Group
Gene OntologyGO:00166149.9e-130oxidoreductase activity, acting on CH-OH group of donors
GO:00088129.9e-130choline dehydrogenase activity
GO:00506609.9e-130flavin adenine dinucleotide binding
GO:00551149.9e-130oxidation-reduction process
GO:00060669.9e-130alcohol metabolic process
KEGG pathwaydme:Dmel_CG95144e-76 
 K00108 (E1.1.99.1, betA, CHDH)maps-> Glycine, serine and threonine metabolism
InterPro domain[24-615] IPR0121329.9e-130Glucose-methanol-choline oxidoreductase
[76-373] IPR0001722.5e-65Glucose-methanol-choline oxidoreductase, N-terminal
[467-604] IPR0078671.5e-29Glucose-methanol-choline oxidoreductase, C-terminal
Orthology groupMCL19954 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207080-TA
ATGCAGGCAGCACCAGAGTGTTATATAAGGATGAGAGAGTATAACATTGGGTGCAGTTTGTTGTTACTGGTCGAGTCGAGAGGATTCTCATACGCAACGCAGCTATCGAGGGCGATGGGCAACTTTTATAGGATCAATGCCTTGATTTTATTGTCTGCTCTAGGACTTACCGCAAATAAATGGCCTCCTGATACCTTTATTCCAAATAACGGAGAATTCACTGCCGATTATGTAGTGGTAGGAGCTGGTACGGCAGGAAGCATAATTGGCTTTCGTCTAACAGAGGATCCTAATGTCGATGTCGTGATGGTTGAAGCTGGCGATGATCCCCCAACAGATGCGGAATTACCAGGGTTATTCTTTTCATTGCCAAAAACTAAAATTGATTGGAATTATACATCAGAAGACGATGGCTACAGTGCTCAGTATCATAGAAATAAATTTGTTGATTTACCATCGGGAAAAGTACTCGGTGGAAGCAGCAGCCTTCATCACTTCTATTACCTCAGGGGAGATGCCGCTGACTTTGAAGACTGGGTGAAAGCTAGTGGCAATGAATCGTGGTCTTTAGAAAACCTCTTACCTTATTTTAAGAAGAGTGAACGTCTCGAGGACAAGGACATAAGCGATTCAGAAACTGGTAATTTACATGGATACAGCGGAGAGGTCGGAATCACGAGACGTGTAACAGAATTGCCAGAAAAATATTTACAAGCATTCCAAGAAGTTGGACATCCAGTTGTTCTTGATATTAACGGCCATCATGTCAAAGGATTTACACAACCTTTGTTTTTTATTGCTGAAAAGAAGCGACAAAGTAGTGCCGAAGGTTATTTAACTAGAGCAAAGTCTCGAGATAATCTTCATCTAGTAAAGAATACAATAGCTAACAGAATTTTGTTTGATTCCAATAATAATGCTATCGGTGTTGAATGCGCTTCATTAGACGGAAGAGTGTTCAAAGTTTTCGCTCGAAAGGAAGTCGTCATATCTGCTGGGGCTTTCAATACGCCTAAATTGTTAAAACTATCGGGCATAGGTCCTCGAGCTGAACTCGAAAGTTTTGGCATTAAGGTTATTTCAGATCTACCAGTGGGAGAAAATTTACAAGACCATTTGGCTGTCGTTCTTGCTCATGGACTAGAAAAAACTAATGACACTCCATCGGCTCCAATTCTGAATGATTTTCCTCTAGACACTTTTGTAGGTTTAGAATCTATTGACCCAAATCAGGAAAAACCAGATTATCTGACGTTAAACCTAATTTGTAGAAATAATCCAGAGTGTTTGAGTCAACTTTGTTCCGTTGTGTTTGGTTTAAACCAAGACGTATGTAATCAGATAATGAAAGCTGGTGAAGGTAGAGAGATTTTAGTCTCTATACTTACTGTCTGTCGTCCAGTATCCACTGGAAGAGTTTTACTGAAGAGTTCAGACCCTAAAGACCCGCCTGTGATCTATACCGGTTTCCTTTCTAACAAAACTGATCTGGAAAACAGCGCTCGTTATATCGAAGACTTCATAAGAGTTGTAGAGTCAAAATACTTTAAGAGTGTCGGAGGAGAGACTTTACAACCACATTTACCGAATTGTTCGCACTTACAGTGGAACACGAGAGAATATTGGAAGTGTTATGTTCTCAACATGATGGACACTACATTCCACTACAGTAGTACATGTCCAATGGGTTCCGTATTAGATTCTCAATTGAGAGTGCGAGGTGTGGGGAGACTGCGAGTAGGCGATGCCAGTGCTATGCCGAATATAGTCTCAAGTAACATAAACGCTGCTGTCATGGTACTTGCTGAAAAGCTTGCTGACCTTCTTAAGGAGTCAGGTAAACAATGA

Protein sequence:

>DPOGS207080-PA
MQAAPECYIRMREYNIGCSLLLLVESRGFSYATQLSRAMGNFYRINALILLSALGLTANKWPPDTFIPNNGEFTADYVVVGAGTAGSIIGFRLTEDPNVDVVMVEAGDDPPTDAELPGLFFSLPKTKIDWNYTSEDDGYSAQYHRNKFVDLPSGKVLGGSSSLHHFYYLRGDAADFEDWVKASGNESWSLENLLPYFKKSERLEDKDISDSETGNLHGYSGEVGITRRVTELPEKYLQAFQEVGHPVVLDINGHHVKGFTQPLFFIAEKKRQSSAEGYLTRAKSRDNLHLVKNTIANRILFDSNNNAIGVECASLDGRVFKVFARKEVVISAGAFNTPKLLKLSGIGPRAELESFGIKVISDLPVGENLQDHLAVVLAHGLEKTNDTPSAPILNDFPLDTFVGLESIDPNQEKPDYLTLNLICRNNPECLSQLCSVVFGLNQDVCNQIMKAGEGREILVSILTVCRPVSTGRVLLKSSDPKDPPVIYTGFLSNKTDLENSARYIEDFIRVVESKYFKSVGGETLQPHLPNCSHLQWNTREYWKCYVLNMMDTTFHYSSTCPMGSVLDSQLRVRGVGRLRVGDASAMPNIVSSNINAAVMVLAEKLADLLKESGKQ-