Monarch geneset OGS2.0

DPOGS210496
TranscriptDPOGS210496-TA1893 bp
ProteinDPOGS210496-PA630 aa
Genomic positionDPSCF300186 - 130507-132608
RNAseq coverage3x (Rank: top 90%)
Annotation
HeliconiusHMEL0161100.083.63% 
BombyxBGIBMGA012586-TA0.075.48% 
DrosophilaCG9518-PA6e-10739.17% 
EBI UniRef50UniRef50_E0VFX67e-12538.84%Glucose dehydrogenase, putative n=2 Tax=Pediculus humanus corporis RepID=E0VFX6_PEDHC
NCBI RefSeqXP_001945176.19e-13043.08%PREDICTED: similar to alcohol dehydrogenase [Acyrthosiphon pisum]
NCBI nr blastpgi|3504012582e-13144.76%PREDICTED: glucose dehydrogenase [acceptor]-like [Bombus impatiens]
NCBI nr blastxgi|3504012588e-12844.68%PREDICTED: glucose dehydrogenase [acceptor]-like [Bombus impatiens]
Group
Gene OntologyGO:00166143e-117oxidoreductase activity, acting on CH-OH group of donors
GO:00088123e-117choline dehydrogenase activity
GO:00506603e-117flavin adenine dinucleotide binding
GO:00551143e-117oxidation-reduction process
GO:00060663e-117alcohol metabolic process
KEGG pathwaydme:Dmel_CG95185e-105 
 K00108 (E1.1.99.1, betA, CHDH)maps-> Glycine, serine and threonine metabolism
InterPro domain[44-631] IPR0121323e-117Glucose-methanol-choline oxidoreductase
[90-390] IPR0001721.1e-67Glucose-methanol-choline oxidoreductase, N-terminal
[474-621] IPR0078678.2e-33Glucose-methanol-choline oxidoreductase, C-terminal
Orthology groupMCL25981 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210496-TA
ATGACCTCTAGCAACATTACAGAGGACCTTTGTAACATATGCGACGGCAAAATCGAATGTGCTCCCACCGCCATACTACTGATAGCGCTCGTCCACACTCTGTACGGACACATTGGACCTGAACCGGATTACTTCGGTAAGAAAAAACCGAGTCTGATGAGACAGGAGAACGACCAGCCGTCTCAAGGTTACATGAGGTTTGAACCGGTCCATAGACACAAGATACTCGGTGAGGATAGAAAAGACGACTTGGACTCGGCCAACAAATACGACTTCATAGTGGTTGGAGGTGGCACGGCTGGGTGTGTTGTGGCGAGTCGACTTTCAGAGAACAGGAAATGGAAGGTTGTCTTGCTGGTGGAAGCTGGTCCAGAGGAACCAAAAATGGCTCTCATACCTGGACTCACCAGCGAGTTTAAAGGCTCGGCTCTTGACTGGCAATATTCTATGAGACCAAAAAAGGGGTTCTGTCAAGAGCGTGATTTAAAAGGTTGCGAGGTCGTTCAAGGTCGAGTTCTCGGGGGAAGTTCAACAATCAACGACATGGCGTACATGAGAGGCAGTCCTGCCGACTATGATGAGTGGGCGCTCAATGGAAACGAAGGCTGGAGTTTTAGTCAAGTGCTGCCTTACTTCAAGTATTCGGAAGGAAATTACGACAAAGACATATCAAAAAACAAATTCTTCCACTCCACGCAGGGACCCCTAGATGTCGGACGATATCCTTTCGTAGACGACAATGTAGACGTTCTTCTCAGTGCTTTCAATGAACTCGGTTACAACTACACGGATATAAATGGAAGAAACCAGTTGGGTTTCATGAGAGTCCAGGCCATGTCCTATTTTGGTGAAAGAGTCAGTGCTTACACAGCGTTCATTGAACCCATCAGGAAATTGAGGACGAATATAGATATAGTATCAGAGGCTCTGGTAACCAAGATATTATTAGAAGAGAAGGAAGATAGTCTCAGAGCAGTAGGTATAGAATATTACAAAAATGGTACAAACGTAGTAGTAAAAGCGTTTAAAGAAATAATTTTGAGTGCTGGGGCAATCAATTCACCAAAAATTCTTATGCAATCAGGTATCGGTCCACGAGAATACTTGGAATATTTGGACATGAAGGTATATTATGATTTACCAGTGGGAGCAAACTTTCACGATCACTTGTCCGTGTGTTTGCCTGTCATTAAACTGACTAAGTCGTCTACGATTTCAAAATTCTCTGAAAAGTTGAAGGATATAACAACATACTATACAAATGGTCTCGGACCTCTCTCGTCTAACTTTCAAGTAATAGCGTTTTTCGAATCGAGTATATCCGATATTTTGGGGACGCCTGATATTGAATTTAGGTTTAGAGGTCACGACTCAAATATGTATTACGATAAAATAGACATTTGCACTTCATTAATAACACCAAAAAGTCGAGGACAAATTGTATTAAATGCAACGGATCCTGTATTCGGCAAACCTTTGATTTATCCGAATTTTCTCAAAGATCCTTCAGACGAGAAAAAAATATTAGAAGGCATCCAGGAAGTCGTAAAATTATTTGATACTGAAGTGTTTAAAGCTGCTGAGTTCGAATTTGATCCGCGACCGATATTAGACAACCATTGTCGTGAACACGACAGAGTCTCAGAAGAGTTTTGGTCGTGTATTATAAGACAGTTCTCAGCACCTCTTCATAACTATGTAGGGACATGTAAGATGGGTCCCTCAAAAGACCCCGAGTCTGTTGTCGATAATAGCTTACGAGTTTACGGAGTTTCTAATTTGAGAGTCGTAGACGCTTCAATAATTCCCAAAATTACCAGAGGTGCCACCGGTGCCCCGGTGATAATGATTGCTGAAAAAGCAAGTGACCTAATAAAGACTACTTGGTACTAA

Protein sequence:

>DPOGS210496-PA
MTSSNITEDLCNICDGKIECAPTAILLIALVHTLYGHIGPEPDYFGKKKPSLMRQENDQPSQGYMRFEPVHRHKILGEDRKDDLDSANKYDFIVVGGGTAGCVVASRLSENRKWKVVLLVEAGPEEPKMALIPGLTSEFKGSALDWQYSMRPKKGFCQERDLKGCEVVQGRVLGGSSTINDMAYMRGSPADYDEWALNGNEGWSFSQVLPYFKYSEGNYDKDISKNKFFHSTQGPLDVGRYPFVDDNVDVLLSAFNELGYNYTDINGRNQLGFMRVQAMSYFGERVSAYTAFIEPIRKLRTNIDIVSEALVTKILLEEKEDSLRAVGIEYYKNGTNVVVKAFKEIILSAGAINSPKILMQSGIGPREYLEYLDMKVYYDLPVGANFHDHLSVCLPVIKLTKSSTISKFSEKLKDITTYYTNGLGPLSSNFQVIAFFESSISDILGTPDIEFRFRGHDSNMYYDKIDICTSLITPKSRGQIVLNATDPVFGKPLIYPNFLKDPSDEKKILEGIQEVVKLFDTEVFKAAEFEFDPRPILDNHCREHDRVSEEFWSCIIRQFSAPLHNYVGTCKMGPSKDPESVVDNSLRVYGVSNLRVVDASIIPKITRGATGAPVIMIAEKASDLIKTTWY-