Monarch geneset OGS2.0

DPOGS200336
Transcript        DPOGS200336-TA  1797 bp
Protein           DPOGS200336-PA  598 aa
Genomic position  DPSCF300026 + 384279-391751
RNAseq coverage   23x (Rank: top 78%)
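
The lengths and coordinates listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the transcript length covers the coding sequence including the stop codon, and that the genomic coordinates are 1-based and inclusive; neither is stated in the record. The function names are hypothetical.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 1797
PROTEIN_AA = 598
GENOMIC_START, GENOMIC_END = 384279, 391751


def codon_count(transcript_bp: int) -> int:
    """Return the number of complete codons in a coding sequence.

    Raises ValueError if the length is not a multiple of 3.
    """
    if transcript_bp % 3 != 0:
        raise ValueError(f"{transcript_bp} bp is not a whole number of codons")
    return transcript_bp // 3


def genomic_span(start: int, end: int) -> int:
    """Length of a genomic interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


codons = codon_count(TRANSCRIPT_BP)
span = genomic_span(GENOMIC_START, GENOMIC_END)

# One codon is the stop codon, so codons - 1 should equal the protein length.
print(codons, codons - 1 == PROTEIN_AA)  # 599 True
# Genomic span versus transcript length.
print(span, span - TRANSCRIPT_BP)  # 7473 5676
```

Under these assumptions the 599 codons account for the 598 residues plus one stop codon, and the genomic interval is 5676 bp longer than the transcript, which would be consistent with intronic sequence.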
Annotation
Heliconius      HMEL000057        0.0     53.87%
Bombyx          BGIBMGA010448-TA  3e-120  43.44%
Drosophila      CG9512-PA         3e-81   32.17%
EBI UniRef50    UniRef50_D0ABA6   0.0     53.87%  Putative ecdysone oxidase n=2 Tax=Nymphalidae RepID=D0ABA6_9NEOP
NCBI RefSeq     NP_001177919.1    1e-107  39.43%  ecdysone oxidase [Bombyx mori]
NCBI nr blastp  gi|261335921      0.0     53.87%  putative ecdysone oxidase [Heliconius melpomene]
NCBI nr blastx  gi|261335921      0.0     53.87%  putative ecdysone oxidase [Heliconius melpomene]
Group
Gene Ontology    GO:0016614  1.5e-121  oxidoreductase activity, acting on CH-OH group of donors
                 GO:0008812  1.5e-121  choline dehydrogenase activity
                 GO:0050660  1.5e-121  flavin adenine dinucleotide binding
                 GO:0055114  1.5e-121  oxidation-reduction process
                 GO:0006066  1.5e-121  alcohol metabolic process
KEGG pathway     dme:Dmel_CG9509  1e-71
                 K00108 (E1.1.99.1, betA, CHDH) maps-> Glycine, serine and threonine metabolism
InterPro domain  [3-597]    IPR012132  1.5e-121  Glucose-methanol-choline oxidoreductase
                 [54-351]   IPR000172  1.2e-63   Glucose-methanol-choline oxidoreductase, N-terminal
                 [447-584]  IPR007867  3.2e-25   Glucose-methanol-choline oxidoreductase, C-terminal
Orthology group  MCL20933  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200336-TA
ATGGAAACACGGGAAATAATACGAAGGACTTTGGAAGTTCAATTTGCTTTTAGCACTATATTTCTTTACCTTGATTTTACAGGTTATTTATTTCCACCACAGGCCAACGTGAAGGGTTATTTATTTCCACCACAGGCCATCGTGAAGGACCGTAACGTCTATGATTTCATTGTGGTGGGAGGGGGAACAGCTGGCAGCGTTATAGCGTCGAGACTTACTGAAGTCAAGGAATTCAATGTTCTACTGATTGAAGCTGGTTCTGTCTCTCCACTCCAATGTTTAATTCCCGGATTGGTCCAGTATAACCCGAATTCCATAGTTGACTGGAACCATACTGCTCAAAACGATGGTTATGCAGCACAATGTCACAAAAATGGTGTGATGAGGCTTCCGCAAGGCAAATGTTTAGGAGGGACTAGTTGTTTTAATTACATGTTTTACAACCGTGGTAGCAAATACGACTACGATAGTTGGGCAGAAATCGCTAAAGACAGCACTTGGAATTGGGATAATGTGGTACCATATTTTATAAAAAGCGAAAATTTATTGGACAATGATATACTAAAGTCACCTGATGGAACACTTCATGGTACAAAAGGTTACATTAACGTGACAAGGGAATTAAGCGATAGAGCACTCGAATACCTTAAAGCCCTCGAAGAAGTCGGAGAGTCTTCAGTTGAAGACGTTAATGGGCAAGAATTTATTGGTTATACTCAGCCAATGTTAACATTATCAGGCGGCGTAAGGCAAAGTACGAGTGTTTGCTATATTACACCTGCCAAAGACCGCGAAAATTTGAAATTCATGAAGAATTCTCTAGTATCAAAAATTACAATTGATGAAAATGGGAGGGCGCGTGGTGTCGAAATAATAACAAAAGATAATAAAAAAATATCTGCGTACGCCAAAAATGAAATAATTGTTACTGCCGGAGTTATAAACAGCCCGAAGCTTCTTATGTTGTCAGGCATTGGTCCTAAAAGACACCTTAAGTCGTTAAACATAAAAGTTAATTCAGATTTACCAGTAGGCCGCAATTTGCAGGACCATAATTTAGTACCATTGTATATTGAAATGGAAGAATCTAAAGAGCCCGTTATACCTCGCAACCCTCACAAACATCCCTTTGATATGGTCACAGGTTTCGCATCTTTAAATAAGGATAAACCATATTATGCAGATTATCAAACACAAATATTTATAGTACCACACGGTTCACAAATGCCTGTACAGTATTTTACTAATGATTTTATGTACGAGGAAGATGTCTCCGAAAGGTTAAATGAAGGTAGCAACAGAGGAAATGCTGCCGTAGCCTTGATTGTCAACCTCCATCCAAAATCTAAAGGACAGATTCTTTTAAAAACCACAGACCCAAATGACAGCCCTCTAATTTATTCAGGAATCTTTTCTAATAGAAGAGACTTAGATAATACCGTTAAATATGTGAAAGATTTTGTAAAAGTTATGAATTCAGAACACTTTAAAAAAAATAACGCTAGTGTGGTGGACTTGTCAAATAAGCGTTGTGGACCTTTTGACCTTAATAGCACCGTCTTTTGGGAATGCTACAGCCGTTGTATGACAAACATTGCTTTTGATATGATTGGCACGTGTGCCATCAGCAAAGTTGTAGATAGTCAATTAAAAGTTATTGGGGTGGATGGATTGCGAGTGGCAGACGCGAGCGTCATACCCTTACCGATCGGTGCAAATCTGTATGCTCCGGTCGTGATGGTTGCTGAAAAAGTGTCAGATATGATAAAAAACGAGTACCAGAGCCAAAATAAATAA

Protein sequence:

>DPOGS200336-PA
METREIIRRTLEVQFAFSTIFLYLDFTGYLFPPQANVKGYLFPPQAIVKDRNVYDFIVVGGGTAGSVIASRLTEVKEFNVLLIEAGSVSPLQCLIPGLVQYNPNSIVDWNHTAQNDGYAAQCHKNGVMRLPQGKCLGGTSCFNYMFYNRGSKYDYDSWAEIAKDSTWNWDNVVPYFIKSENLLDNDILKSPDGTLHGTKGYINVTRELSDRALEYLKALEEVGESSVEDVNGQEFIGYTQPMLTLSGGVRQSTSVCYITPAKDRENLKFMKNSLVSKITIDENGRARGVEIITKDNKKISAYAKNEIIVTAGVINSPKLLMLSGIGPKRHLKSLNIKVNSDLPVGRNLQDHNLVPLYIEMEESKEPVIPRNPHKHPFDMVTGFASLNKDKPYYADYQTQIFIVPHGSQMPVQYFTNDFMYEEDVSERLNEGSNRGNAAVALIVNLHPKSKGQILLKTTDPNDSPLIYSGIFSNRRDLDNTVKYVKDFVKVMNSEHFKKNNASVVDLSNKRCGPFDLNSTVFWECYSRCMTNIAFDMIGTCAISKVVDSQLKVIGVDGLRVADASVIPLPIGANLYAPVVMVAEKVSDMIKNEYQSQNK-
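
The relationship between the two sequences above can be illustrated by translating short excerpts of the transcript. The following is a minimal sketch, not part of the original record. It assumes the standard genetic code and that the trailing "-" in the protein record marks the stop codon; the `translate` function is a hypothetical helper, and only the first 24 and last 12 nucleotides of DPOGS200336-TA are used.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the first base varying slowest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(nucleotides: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 0, codon by codon.

    Stop codons are written as `stop_symbol` to match the record's
    apparent convention. Trailing bases that do not fill a codon are ignored.
    """
    seq = nucleotides.upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino = CODON_TABLE[seq[i:i + 3]]
        residues.append(stop_symbol if amino == "*" else amino)
    return "".join(residues)


# First 24 nt and last 12 nt of the transcript shown above.
print(translate("ATGGAAACACGGGAAATAATACGA"))  # METREIIR
print(translate("CAAAATAAATAA"))              # QNK-
```

Both excerpts reproduce the corresponding ends of the DPOGS200336-PA record (METREIIR at the start, QNK- at the end), which suggests the listed transcript is the coding sequence read in frame from its first base.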