Monarch geneset OGS2.0

DPOGS213848
TranscriptDPOGS213848-TA1971 bp
ProteinDPOGS213848-PA656 aa
Genomic positionDPSCF300380 + 110016-112625
RNAseq coverage15x (Rank: top 82%)
Annotation
HeliconiusHMEL0042530.058.08% 
BombyxBGIBMGA013788-TA0.054.31% 
DrosophilaCG9518-PA8e-12842.25% 
EBI UniRef50UniRef50_E2BJK21e-15948.73%Glucose dehydrogenase [acceptor] n=9 Tax=Endopterygota RepID=E2BJK2_HARSA
NCBI RefSeqXP_001945176.17e-16249.65%PREDICTED: similar to alcohol dehydrogenase [Acyrthosiphon pisum]
NCBI nr blastpgi|3838604642e-16151.30%PREDICTED: glucose dehydrogenase [acceptor]-like [Megachile rotundata]
NCBI nr blastxgi|3838604641e-15751.30%PREDICTED: glucose dehydrogenase [acceptor]-like [Megachile rotundata]
Group
Gene OntologyGO:00166141.7e-152oxidoreductase activity, acting on CH-OH group of donors
GO:00088121.7e-152choline dehydrogenase activity
GO:00506601.7e-152flavin adenine dinucleotide binding
GO:00551141.7e-152oxidation-reduction process
GO:00060661.7e-152alcohol metabolic process
KEGG pathwaydme:Dmel_CG95187e-126 
 K00108 (E1.1.99.1, betA, CHDH)maps-> Glycine, serine and threonine metabolism
InterPro domain[56-653] IPR0121321.7e-152Glucose-methanol-choline oxidoreductase
[95-390] IPR0001721.6e-76Glucose-methanol-choline oxidoreductase, N-terminal
[496-640] IPR0078674e-31Glucose-methanol-choline oxidoreductase, C-terminal
Orthology groupMCL10891 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213848-TA
ATGATATCTAATGATACTGAAGATTCTACAACATTTTTCTTTACTGAAAATTGTGAAGGAGGTTTGGAAATAAACCAAACTAAGAATGCTATACTTGAAAATGGAATTGAACTCCTTCAAACTTCTGGTGAAATATTTCCTTACATCGACTCTCAAAACCAAACTGACAAACCTGAAGAACCAGAATTAATTGAAGAACCTCCGGATAAAAATATAAACTTAAACTCTAAACTAACACAAAAAGAGAAACTTATAAAAACGAGTGCAAAGAAATCTAACAAATATGACTTTATTATTGTCGGTGCTGGTTCAGCTGGCTGTGTTTTAGCCAATAGACTTTCAGAAGTAACGTCATGGCGGATACTTCTTTTAGAAGCTGGATCTGAAGAACCAGATATAACAATGATGCCAGCTGCCATAAGAGTTCTTAGCGGATCAAATATCGATTGGAATTATAATACTCAGCCGGAAGAACTGACTTGCAGGTCCATGACGAAACACTTATGCCAATGGCCCAGGGGTAAGACTTTAGGGGGTTCGAGTGCTATTAACTACATTATTTACATGAGAGGAAATAGACATGATTACGACCACTGGGCGGAAGTAGGAAATGAAGGCTGGAGCTACAATGAATTGCTGCCATATTTTAAGAAAATTGAAAACAGTGCTGATATTGAATCTCGTGACACACAGAACGGAGTAGGGGGACCGCTGAACGTAGAAAGATACACTTATGTTGATGCAAATACAATTATGCTAGTTAAAGCTTTAAATGAGTCCGGCCTTCCACTCATAGATTTAACCGGAGGAAATAGCGTCGGGACCAACATAGCTTCATCTACATCAAAAGATGGACGAAGAATGTCCACTAATGTAGCATATATTAAACCTATACGAGATATAAGATCAAATATTGATATAATTCTAAATGCATTTGTTACAAAATTAATTATAAATCCTAAAACAAAAAGGGCACTAGGTGTTACTTATGTTAAAAATGGGACTGCTTACAATGTTTTTGCCAAAAATGAGGTAATATTAAGTACTGGATCCCTGAATTCACCTAAACTTCTAATGTTATCTGGAGTCGGGCCTAGAGAACACATAGAAAATTTCCGCATACCAGTCGTTGCAGATTTGCAAGTGGGACATAATTTACAAGACCATACTACAGCGAATGGTTTCGTTCTCGCTCTGGCAAATAAAACATGGACTAACGTAAGCGACACGGTTTTATTTCAAGAAATACAAAATTACTACGAACAGGAACCCAAAAAAAGTGGACCGTTATCTACCACTAGTACTCTTAATAGTATTGGTTTTCTAAAAACTAAATATGCACGTGAAAATGCACCGGATATTCAATTCCATTTTGATGGTGTGAATGTCGAGGAGCTATACTCTGATCCTCCCGCTTATTTGGAAAGCAACGTACTTCCGATTTCTTATTATAATGGACTTTCACCAAAGGCAATATTATTGGTACCAAGAAGCAGAGGCATAGTTTTACTAAACGACACAGATCCCGTGAATGGGCCGCCCTTAATTTATCCCAGGTTTTTCACAGTCAAGGAAGATCTTGACGTACTCTTTGAAGGATTCCGTTATCTTATTGGTTTAGAAGAAACCAAATCATTTAAAGAAAATGGTGCACACTTTGTTAAAATTCCTGTAAAAAATTGCGAAGATTATATCTGGGGATCTTATAACTACTTTAAATGCTTACTTGTTGAATATACTGTAACTCTGTATCATCCTGTTGGCACTTGTAAAATGGGACCACCGTCCGATAAAGATGCTGTTGTCGATCCTAGATTACGGGTTTACGGTGTCAAGGGTTTGAGAGTAATAGACGCGTCTATAATGCCATTTATAGTTAGGGGAAATACGAATATACCAACTATAACTATAGCAGAAAAGGGAGCAGATATGATTAAAAAGGATTATTTAAAACGATGCAATTCTAACTGA

Protein sequence:

>DPOGS213848-PA
MISNDTEDSTTFFFTENCEGGLEINQTKNAILENGIELLQTSGEIFPYIDSQNQTDKPEEPELIEEPPDKNINLNSKLTQKEKLIKTSAKKSNKYDFIIVGAGSAGCVLANRLSEVTSWRILLLEAGSEEPDITMMPAAIRVLSGSNIDWNYNTQPEELTCRSMTKHLCQWPRGKTLGGSSAINYIIYMRGNRHDYDHWAEVGNEGWSYNELLPYFKKIENSADIESRDTQNGVGGPLNVERYTYVDANTIMLVKALNESGLPLIDLTGGNSVGTNIASSTSKDGRRMSTNVAYIKPIRDIRSNIDIILNAFVTKLIINPKTKRALGVTYVKNGTAYNVFAKNEVILSTGSLNSPKLLMLSGVGPREHIENFRIPVVADLQVGHNLQDHTTANGFVLALANKTWTNVSDTVLFQEIQNYYEQEPKKSGPLSTTSTLNSIGFLKTKYARENAPDIQFHFDGVNVEELYSDPPAYLESNVLPISYYNGLSPKAILLVPRSRGIVLLNDTDPVNGPPLIYPRFFTVKEDLDVLFEGFRYLIGLEETKSFKENGAHFVKIPVKNCEDYIWGSYNYFKCLLVEYTVTLYHPVGTCKMGPPSDKDAVVDPRLRVYGVKGLRVIDASIMPFIVRGNTNIPTITIAEKGADMIKKDYLKRCNSN-