Monarch geneset OGS2.0

DPOGS214077
Transcript: DPOGS214077-TA | 1989 bp
Protein: DPOGS214077-PA | 662 aa
Genomic position: DPSCF300634 | + | 3446-7355
RNAseq coverage: 6x (Rank: top 87%)
Annotation
Heliconius: HMEL004252 | 0.0 | 51.65%
Bombyx: BGIBMGA013788-TA | 1e-180 | 52.23%
Drosophila: CG9517-PA | 3e-126 | 40.57%
EBI UniRef50: UniRef50_UPI0000D56975 | 3e-155 | 41.40% | UPI0000D56975 related cluster n=1 Tax=unknown RepID=UPI0000D56975
NCBI RefSeq: XP_001945176.1 | 5e-165 | 44.65% | PREDICTED: similar to alcohol dehydrogenase [Acyrthosiphon pisum]
NCBI nr blastp: gi|328720713 | 6e-164 | 44.80% | PREDICTED: glucose dehydrogenase [acceptor]-like [Acyrthosiphon pisum]
NCBI nr blastx: gi|328720713 | 1e-160 | 44.80% | PREDICTED: glucose dehydrogenase [acceptor]-like [Acyrthosiphon pisum]
Group
Gene Ontology: GO:0016614 | 3.1e-156 | oxidoreductase activity, acting on CH-OH group of donors
    GO:0008812 | 3.1e-156 | choline dehydrogenase activity
    GO:0050660 | 3.1e-156 | flavin adenine dinucleotide binding
    GO:0055114 | 3.1e-156 | oxidation-reduction process
    GO:0006066 | 3.1e-156 | alcohol metabolic process
KEGG pathway: dme:Dmel_CG9518 | 7e-124
    K00108 (E1.1.99.1, betA, CHDH) maps-> Glycine, serine and threonine metabolism
InterPro domain: [51-662] | IPR012132 | 3.1e-156 | Glucose-methanol-choline oxidoreductase
    [104-400] | IPR000172 | 1.9e-75 | Glucose-methanol-choline oxidoreductase, N-terminal
    [507-651] | IPR007867 | 1.5e-32 | Glucose-methanol-choline oxidoreductase, C-terminal
Orthology group: MCL10891 | Insect specific

Nucleotide sequence:

>DPOGS214077-TA
ATGTCTACAATATGGCAACCACGGGACATATCTCAAATTTGCTCAGAGACGCAATCAAATTTAACGCAATGTTCTCCTGCCGGGTTTATGTTTTTGGCACTACTAGTTCGTCTTTTCGGTGGTGTGAATTATACTGTTAATGAGGTCAGTGATCGGACAAGGGTTAATATTGGATTGGACGAGATATCGCAAAGAATTGAAAACAATTCATTGCCAACAACTTATATCGAACAAAACCCACATAACTACGATGAGGAGAATAAATATAGTAGTGGAGAATATGAAGATGAGGCAAAAGAAAAAAATGAATATGATTTCATTATCGTTGGGGCTGGATCAGCAGGTTGTGTGCTAGCAAATAGATTATCAGAAGAAGAACAATGGCGGATACTCTTGATAGAAGCTGGATCCGAGGAACCGGATATAACAATGGTACCATCACTATATAAAGCATTAAAAGGATCTTCGTTGGATTGGAACTATAGCACACAACCGGAAGAAAAGAGTTGCAGATCCATGAAAGGACATATGTGTGATTTTACTCGTGGCAAAACCATGGGAGGTTCGAGCGCTGTAAACACCCTTGTGTATATGAGAGGCAATAGACGTGATTATGACCATTGGGAGGAAATAGGCAACTACGGATGGGGCTATGATAAGCTCTTGCCTTACTTTAGAAAATCTGAAAATAATAAAGCTGTTGAAGCACTTGATACGTATCTTCACGGAACAGGTGGACCCATCACAGTAGAGAGATATCCTTATTACGATGATAATAGTTTTATGCTTCTTGAATCTTTTAAGGAATCTAATGTCCCAGAAATAGATTTAACTGCAGAAGATAATATTGGTGTCAATATAGCTCTATCAACTTCTAAAGATGGAAGAAGAGTATCAGAAAATGTGGCTTACATCAAGCCTATTCGTGATATAAGAAAGAATCTTGATATTATAACAAACGCTTTTGTGACCAAATTAATTATAGACCACGAAACAAAAACGGTTTTAGGTGTTACGTATGAGAAAGGTGGCAAATCCTACAATGTTTATGCTAAAAAGGGAGTGATATCTAGTGGTGGAACTGTCAATTCCCCAAAATTATTGATGTTATCTGGTATTGGGCCTAGGGAGCATTTAGAGAGTTTGAATATATCTGTTGTTGCTGACTTATCTGTTGGTCATAATTTACAAGACCACGTTACTGCGAACGGTTTTATTATTTCTCTGTCAAATAAAACTGCCACCAACGTTAGTTCGGAGCAATTATTAGAAGAGGTGCAACGGTACCATGACCAGGAACCGAAAAAGTATGGACCATTGGCAACTACAAATGTTGCTGGTACTACTGCGTTTATAAAAACTATGTACTCTCTTGAAAATGCACCAGATATACAATTTATTTTTGAAGGTATTAATAATATTGCGGAGTTTTATTCTGATCCTCAAGCTTATTTAATGAGTGACAGTTTTACTGCTGCTTTTTATGATGGACTTTCTTGTAAACCTCTTTTAATAAAACCACGAAGTAGAGGTATTATTTTGCTTAACAATAACGATCCCGTTCACGGGAACCCTTTGATTTATCAGCGTTTCTTTACTGATAAGGAAGATATAGATGTTCTTATAGAAGGTTTTAAGTTTGCTTTAAGTTTAGAAGAAACTGAAGCATTTAAAAAAAATGGAGCGCGTTTTGTAAGAGTTCCTATAAAAAACTGTGAAAATCATGAGTGGGGATCCAATGATTATTTTGTATGTTTACTTACTGAGTATACTACTACTATTTATCATCCGGTTGGGACTTGTAAAATGGGGCCTTCGTCGGATAAAGACGCAGTTGTCGATCCTAGATTACGGGTGTACGGTGTCAAACGATTAAGAGTTGTTGATGCATCTGTAATGCCGTTTATACCAAGAGGCAATATAAATATACCAACAGTAACAATAGCAGAATATATATCGGATCTAATCAAATCTGAATATAAGCAATAA

Protein sequence:

>DPOGS214077-PA
MSTIWQPRDISQICSETQSNLTQCSPAGFMFLALLVRLFGGVNYTVNEVSDRTRVNIGLDEISQRIENNSLPTTYIEQNPHNYDEENKYSSGEYEDEAKEKNEYDFIIVGAGSAGCVLANRLSEEEQWRILLIEAGSEEPDITMVPSLYKALKGSSLDWNYSTQPEEKSCRSMKGHMCDFTRGKTMGGSSAVNTLVYMRGNRRDYDHWEEIGNYGWGYDKLLPYFRKSENNKAVEALDTYLHGTGGPITVERYPYYDDNSFMLLESFKESNVPEIDLTAEDNIGVNIALSTSKDGRRVSENVAYIKPIRDIRKNLDIITNAFVTKLIIDHETKTVLGVTYEKGGKSYNVYAKKGVISSGGTVNSPKLLMLSGIGPREHLESLNISVVADLSVGHNLQDHVTANGFIISLSNKTATNVSSEQLLEEVQRYHDQEPKKYGPLATTNVAGTTAFIKTMYSLENAPDIQFIFEGINNIAEFYSDPQAYLMSDSFTAAFYDGLSCKPLLIKPRSRGIILLNNNDPVHGNPLIYQRFFTDKEDIDVLIEGFKFALSLEETEAFKKNGARFVRVPIKNCENHEWGSNDYFVCLLTEYTTTIYHPVGTCKMGPSSDKDAVVDPRLRVYGVKRLRVVDASVMPFIPRGNINIPTVTIAEYISDLIKSEYKQ-