Monarch geneset OGS2.0

DPOGS213845
Transcript: DPOGS213845-TA, 1650 bp
Protein: DPOGS213845-PA, 549 aa
Genomic position: DPSCF300380 - 87016-89465
RNAseq coverage: 1x (Rank: top 95%)
Annotation
Heliconius       HMEL004252        0.0     65.85%
Bombyx           BGIBMGA013788-TA  0.0     59.20%
Drosophila       CG9518-PA         3e-127  41.67%
EBI UniRef50     UniRef50_E2BJK2   7e-152  48.43%  Glucose dehydrogenase [acceptor] n=9 Tax=Endopterygota RepID=E2BJK2_HARSA
NCBI RefSeq      XP_394222.2       9e-159  51.92%  PREDICTED: similar to CG9517-PB, isoform B [Apis mellifera]
NCBI nr blastp   gi|383860464      8e-161  52.93%  PREDICTED: glucose dehydrogenase [acceptor]-like [Megachile rotundata]
NCBI nr blastx   gi|383860464      2e-156  52.93%  PREDICTED: glucose dehydrogenase [acceptor]-like [Megachile rotundata]
Group
Gene Ontology    GO:0016614  1e-130  oxidoreductase activity, acting on CH-OH group of donors
                 GO:0008812  1e-130  choline dehydrogenase activity
                 GO:0050660  1e-130  flavin adenine dinucleotide binding
                 GO:0055114  1e-130  oxidation-reduction process
                 GO:0006066  1e-130  alcohol metabolic process
KEGG pathway     dme:Dmel_CG9518  2e-125
                 K00108 (E1.1.99.1, betA, CHDH) maps-> Glycine, serine and threonine metabolism
InterPro domain  [2-546]    IPR012132  1e-130   Glucose-methanol-choline oxidoreductase
                 [2-283]    IPR000172  4.5e-66  Glucose-methanol-choline oxidoreductase, N-terminal
                 [389-534]  IPR007867  2.6e-32  Glucose-methanol-choline oxidoreductase, C-terminal
Orthology group  MCL10891  Insect specific
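
The InterPro coordinates in this record can be turned into domain lengths and coverage of the 549 aa protein. The sketch below is illustrative only: it assumes the bracketed spans are 1-based, inclusive residue positions (the record does not state this), and the helper names `parse_span` and `span_length` are hypothetical.

```python
import re

PROTEIN_LENGTH = 549  # aa, taken from the Protein field of this record

# InterPro accessions and spans copied from this record.
DOMAINS = {
    "IPR012132": "[2-546]",
    "IPR000172": "[2-283]",
    "IPR007867": "[389-534]",
}


def parse_span(span: str) -> tuple[int, int]:
    """Parse a '[start-end]' span into integer coordinates."""
    match = re.fullmatch(r"\[(\d+)-(\d+)\]", span)
    if match is None:
        raise ValueError(f"unrecognised span: {span!r}")
    return int(match.group(1)), int(match.group(2))


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


for accession, span in DOMAINS.items():
    start, end = parse_span(span)
    length = span_length(start, end)
    share = length / PROTEIN_LENGTH
    print(f"{accession}: residues {start}-{end}, {length} aa, {share:.1%} of protein")
```

Under that assumption, both the N-terminal and C-terminal domain spans fall inside the span of the full Glucose-methanol-choline oxidoreductase entry (IPR012132).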

Nucleotide sequence:

>DPOGS213845-TA
ATGGCAAACAGACTTTCCGAAGTTAAAAAATGGCGGATACTATTATTAGAAGCTGGTCCTGAAGAACCAGATGTTTCAATGATTCCTGGAATTGTGAGAACGTTAGCAGGTTCGTCAATAGATTGGAATTACAGAACGCAGCCGGAACCATTAACCTGTAGATCCATAAGGGGAAAAACTTGTGCTTGGACTAGCGGAAAAACTATGGGTGGTTCAAGTTCAGTTAACTATTTAGTTTACATGAGAGGAAATAGGCGTGATTATGATCACTGGGCTGAATTAGGTAACCCTGGATGGAGCTATAAAGACCTATTGCCTTATTTTAAGAAGTCTGAAAACAATAGAGAAATTGAAGGTCGAGATCCATATTATCATGGAACAGGAGGGCCAATTACTGTAGAAAGATTTTCGTATTTAGACAGCAGCACGGTTATGCTAGTGAGAGCGTTCAATGAAACCGGTCTTCCAATTATTGATTTAAATAAAGAAAACAATATCGGTACTGATATAGCCTTATCTACTTCAAGAGATGGTCGAAGAGTTTCAACAAACGTAGCTTATATTAAACCAATACGCAAAGTGAGACCAAATATTGATATAATTGTTAATGCCTTTGTAAAACAGTTAATAATAAATCCTGCTACGAAAACGGTAAGAGGAGTCATTTACTTGAAAAATGGTATTACTTACAGGGTTTTTGCCAAAAAAGAAGTTATAGTAAGCAGCGGTGCTTTAAACTCTCCTAAATTGTTGATGTTGTCAGGTATAGGACCAAAAAAACATTTGGAAAGTTTAAATATACCAGTTATATCTAATTTATCTGTGGGACATAATTTGCAGGACCATGTGACTACGCACGGTCTCAGTATTTTATTAAACAATAAAACTTCAACGATGATCAGTGCAAAGGAATTATTCCAAAAAATACGAAAGTACTATGACGAAGATCCCAAGAAGGGTGGTCCATTATCAGCCACAAGCATTTTAAATAGTGTAGCATTTATAAAAACCAAATATGCAAATGAAGATGCCCCCGATATTCAATTTCACTTTGATGGTAGAAATGTTGAAGAGTTCTATTCAGATCCTCAAACATATATGGAAACGAATATTCTACCTGTTTCTTTCTACAACGGCCTTACAGCTAGACCGCTTCTGCTAATTCCAAAAAGTAGAGGAATTATTTTATTAAATAAAACCAATCCTGAATATGGGCCACCGTTAATTTATTCACGATTTTTTACAGTACAAGAGGATATAGACGTTATGATTGAGGGGTTGCGATACGCAATCAGTTTAGAAAAAACTGACGCTTTCAAAGAAAATGGTGCCCATTTTGTTAGAAAGCCAGTAAAAAACTGTGAATCTTATCTTTGGGGATCTTACGAATATTTAAAATGTTTACTGATCGAGTATACTACTACAATTTACCATCCTGTAGGGACATGTAAAATGGGTCCTCCAACAGATAAAGAAGCAGTTGTTGATAGCAGATTGAGAGTGTACGGTGTTAAGAGATTACGAGTTGTTGACGCTTCTATTATGCCATTTATTGTAAGAGGAAATACCAATATACCAACCGTAACAATAGCAGAACGTGCTTCTGATATGATTAAGGAAGATTACAGTGAGACAGTGGAAATATTATAA

Protein sequence:

>DPOGS213845-PA
MANRLSEVKKWRILLLEAGPEEPDVSMIPGIVRTLAGSSIDWNYRTQPEPLTCRSIRGKTCAWTSGKTMGGSSSVNYLVYMRGNRRDYDHWAELGNPGWSYKDLLPYFKKSENNREIEGRDPYYHGTGGPITVERFSYLDSSTVMLVRAFNETGLPIIDLNKENNIGTDIALSTSRDGRRVSTNVAYIKPIRKVRPNIDIIVNAFVKQLIINPATKTVRGVIYLKNGITYRVFAKKEVIVSSGALNSPKLLMLSGIGPKKHLESLNIPVISNLSVGHNLQDHVTTHGLSILLNNKTSTMISAKELFQKIRKYYDEDPKKGGPLSATSILNSVAFIKTKYANEDAPDIQFHFDGRNVEEFYSDPQTYMETNILPVSFYNGLTARPLLLIPKSRGIILLNKTNPEYGPPLIYSRFFTVQEDIDVMIEGLRYAISLEKTDAFKENGAHFVRKPVKNCESYLWGSYEYLKCLLIEYTTTIYHPVGTCKMGPPTDKEAVVDSRLRVYGVKRLRVVDASIMPFIVRGNTNIPTVTIAERASDMIKEDYSETVEIL-
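
The transcript and protein lengths in this record can be cross-checked: 1650 bp is 550 codons, which is 549 residues plus one stop codon. The sketch below is a minimal illustration, assuming the transcript is a complete coding sequence translated with the standard genetic code and that the trailing "-" in the protein FASTA marks the stop codon. The function names are hypothetical, and only the first and last few codons of the sequence are translated here.

```python
BASES = "TCAG"
# Standard genetic code, ordered by codon with bases in T, C, A, G order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def expected_protein_length(cds_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if has_stop_codon else codons


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a CDS codon by codon, writing stop codons as stop_symbol."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = (CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return "".join(stop_symbol if r == "*" else r for r in residues)


# Lengths from the Transcript and Protein fields of this record.
print(expected_protein_length(1650) == 549)

# First five and last four codons of DPOGS213845-TA.
print(translate("ATGGCAAACAGACTT"))
print(translate("GAAATATTATAA"))
```

The two translated fragments can be compared by eye with the start (MANRL) and end (EIL-) of the DPOGS213845-PA sequence above.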