Monarch geneset OGS2.0

DPOGS200298
Transcript        DPOGS200298-TA  1767 bp
Protein           DPOGS200298-PA  588 aa
Genomic position  DPSCF300026 - 396818-398923
RNAseq coverage   6x (Rank: top 87%)
Annotation
Heliconius      HMEL000057        3e-174  49.48%
Bombyx          BGIBMGA010448-TA  3e-127  42.54%
Drosophila      CG9512-PA         4e-88   35.16%
EBI UniRef50    UniRef50_D0ABA6   2e-172  49.31%  Putative ecdysone oxidase n=2 Tax=Nymphalidae RepID=D0ABA6_9NEOP
NCBI RefSeq     NP_001177919.1    2e-106  38.57%  ecdysone oxidase [Bombyx mori]
NCBI nr blastp  gi|261335921      7e-172  49.31%  putative ecdysone oxidase [Heliconius melpomene]
NCBI nr blastx  gi|261335921      1e-168  49.31%  putative ecdysone oxidase [Heliconius melpomene]
Group
Gene Ontology  GO:0016614  3e-140  oxidoreductase activity, acting on CH-OH group of donors
               GO:0008812  3e-140  choline dehydrogenase activity
               GO:0050660  3e-140  flavin adenine dinucleotide binding
               GO:0055114  3e-140  oxidation-reduction process
               GO:0006066  3e-140  alcohol metabolic process
KEGG pathway  dme:Dmel_CG9509  4e-83
              K00108 (E1.1.99.1, betA, CHDH)  maps-> Glycine, serine and threonine metabolism
InterPro domain  [2-588]    IPR012132  3e-140   Glucose-methanol-choline oxidoreductase
                 [43-342]   IPR000172  6.4e-68  Glucose-methanol-choline oxidoreductase, N-terminal
                 [438-575]  IPR007867  8.8e-29  Glucose-methanol-choline oxidoreductase, C-terminal
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200298-TA
ATGGACCCGGAAACAGCTGTATCAAATGCCGTAACTGTACAATTAGCATTGAAAGTGATTGCTCTCACATTGGATCTCACTGCTTATCTTTTTCCAAAACAATGTGATGTGAACGACGACGATACATTTGATTTTATTATAATTGGAGCTGGATCCGCTGGAAGTGTTATTGCTAATCGGCTCTCAGAAATTGAACACTTCAAAGTTCTTATTATAGAAGCTGGTGGTGATCCACCCTTGGAAGTTATGTATCCAGGGCTGTCATCGTATACATATCACTCTCGTTTAGATTGGAATATAACTTCACAATACGATGGCACAACGGCCCAATGCAGAAAAGACAAAGTCATACCTCAGTTTAGTGGTCGCGTGCTTGGAGGAAGTAGCTCCATCAATTGCATGTATTACGTGCGAGGCAATCCCTACGATTACAATCGCTGGGCGCAGTTGGTCAATGATGAAACTTGGAAATGGGAAAATGTCCTCCCTTACTTCATTAAAAGTGAAAGGATGCTTGACAAGGATGTCTTGAAATCTGCTACTGGGAAGCTTCATGGAAGACACGGTGAAATTGGCATCACAAGAAGTATAGATAACAACACAAGAAGATTTCTACGTTCCTTAAAGGAAGACGGTATCCCCGTTACTATGGATTACAATGCCAATAAAACGTTAGGCTATAGCGATGTATTTTTTACGATTGCAGATGGCTATCGTCAAAGTACAGGATACAGATTCTTAGGGCTCGCTAGAAATAGACCTAACTTATATATATTAAAAAATACCGTAGCAACAAAAATATTATTCAACGACGATAAAAGAGCTTACGCTGTAGAGGTAGTTACAGAGAATAAAATAAAAAAAGTGACTCTAAAAGCAACTAAAGAAATAATTGTATCAGCAGGCGCCTTGAAATCTCCACAATTGTTGATGCTGTCAGGTATTGGTCCTAAAGATCATTTACGGACGTTAAATATCGATGTTATTGCTAACTTACCGGTTGGAAAAAATCTCCAAGATCATCTTGCCATACCAATATTACATACATTGCAAAAAAATAAAAAAAAATCTTTTCCCAAACCTTTCAACCCACATGTTTATCCGTACTCAAACATAGTCGGCTTCGTTGCTCTAAACAAATCACAGTCATACCCGGATTATGAATCAACAATTAATATTATAGATGATGGAGCTAAGGACTTACTTCAACTTTACTCTTTTGTGTATCAGTATTCTGATAATGTAAGTGATAGTATTTATAATTATGCTAAAGAAAGCACGGTTATTGAGACATTAATAACAGACCTTCATCCGAAATCTCGCGGAGAAATTTTGTTACGTAGTGTCAATCCATTTGATCATCCCTTAGTTTACACAGGTTATTTATCGGAAGAAGAAGATCTCGATAACACAATAAGATACATCGAAGACTATCTCCGCCTAACACACACTTCTTACTTTAAGAAAAATAATGCGCAAATGATAAACATAGTTGGTAATATGTGTAAGGGTTTCAAATTTGGCAGTAAAGATTACTGGACGTGCTATATACAGTGTACATTGAATAACATGACCCATTACTCAGGGACATGTGCGCTTGGGTCCGTTGTAGACAGTCGATTACTAGTCCGAGGCGTCAAAGGTTTGAGAGTCACCGACACTAGTATAATGCCATACATAGTTAGTGGAAATACAAATGCCCCTACCATGATGCTTGGTGAAAAGGTCTCTGATTTCATTAAAGAAGTGCATGGTGCATTAAAATAA

Protein sequence:

>DPOGS200298-PA
MDPETAVSNAVTVQLALKVIALTLDLTAYLFPKQCDVNDDDTFDFIIIGAGSAGSVIANRLSEIEHFKVLIIEAGGDPPLEVMYPGLSSYTYHSRLDWNITSQYDGTTAQCRKDKVIPQFSGRVLGGSSSINCMYYVRGNPYDYNRWAQLVNDETWKWENVLPYFIKSERMLDKDVLKSATGKLHGRHGEIGITRSIDNNTRRFLRSLKEDGIPVTMDYNANKTLGYSDVFFTIADGYRQSTGYRFLGLARNRPNLYILKNTVATKILFNDDKRAYAVEVVTENKIKKVTLKATKEIIVSAGALKSPQLLMLSGIGPKDHLRTLNIDVIANLPVGKNLQDHLAIPILHTLQKNKKKSFPKPFNPHVYPYSNIVGFVALNKSQSYPDYESTINIIDDGAKDLLQLYSFVYQYSDNVSDSIYNYAKESTVIETLITDLHPKSRGEILLRSVNPFDHPLVYTGYLSEEEDLDNTIRYIEDYLRLTHTSYFKKNNAQMINIVGNMCKGFKFGSKDYWTCYIQCTLNNMTHYSGTCALGSVVDSRLLVRGVKGLRVTDTSIMPYIVSGNTNAPTMMLGEKVSDFIKEVHGALK-
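
The transcript and protein lengths listed at the top of this record (1767 bp and 588 aa) can be cross-checked with simple arithmetic. The sketch below assumes the transcript is a complete coding sequence that ends in a single stop codon; the trailing "-" in the protein sequence appears to mark that stop. The function name `cds_matches_protein` is hypothetical and is not part of any geneset tooling.

```python
def cds_matches_protein(cds_bp: int, protein_aa: int) -> bool:
    """Return True when a coding sequence of cds_bp bases is consistent
    with a protein of protein_aa residues followed by one stop codon.

    Assumption: the transcript contains only coding sequence (no UTRs).
    """
    # The length must be a whole number of codons.
    if cds_bp % 3 != 0:
        return False
    # Every residue uses one codon, plus one extra codon for the stop.
    return cds_bp // 3 == protein_aa + 1


# Lengths taken from this record: 1767 bp transcript, 588 aa protein.
print(cds_matches_protein(1767, 588))
```

For this record, 1767 / 3 = 589 codons, which is 588 residues plus the stop codon, so the two reported lengths agree.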