Monarch geneset OGS2.0

DPOGS202485
TranscriptDPOGS202485-TA2310 bp
ProteinDPOGS202485-PA769 aa
Genomic positionDPSCF300463 - 39392-49120
RNAseq coverage22x (Rank: top 78%)
Annotation
HeliconiusHMEL0042520.064.23% 
BombyxBGIBMGA013789-TA0.066.60% 
DrosophilaCG9518-PA3e-12241.86% 
EBI UniRef50UniRef50_E2BJK27e-15150.84%Glucose dehydrogenase [acceptor] n=9 Tax=Endopterygota RepID=E2BJK2_HARSA
NCBI RefSeqXP_394222.28e-15651.96%PREDICTED: similar to CG9517-PB, isoform B [Apis mellifera]
NCBI nr blastpgi|664992252e-15451.96%PREDICTED: glucose dehydrogenase [acceptor] [Apis mellifera]
NCBI nr blastxgi|664992254e-15251.96%PREDICTED: glucose dehydrogenase [acceptor] [Apis mellifera]
Group
Gene OntologyGO:00166141.8e-60oxidoreductase activity, acting on CH-OH group of donors
GO:00506601.8e-60flavin adenine dinucleotide binding
GO:00551141.8e-60oxidation-reduction process
KEGG pathwaydme:Dmel_CG95183e-120 
 K00108 (E1.1.99.1, betA, CHDH)maps-> Glycine, serine and threonine metabolism
InterPro domain[232-504] IPR0001721.8e-60Glucose-methanol-choline oxidoreductase, N-terminal
[610-755] IPR0078673.5e-32Glucose-methanol-choline oxidoreductase, C-terminal
Orthology groupMCL10891 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202485-TA
ATGTATTTGTCTCTAGTTATGCGATTGTTTGGTGGAGTTAATTATAGCGTAAATTATCCCAGACGGAATCCATACTCGACTCTTCAATATACACATTCGTCGTCAATTGATAAAAATCCATTGCGCAATTCTCTGTCTGAATATTATCCCGGTCCATATCTGAACCCGGCGTATTCCCGATTAAATTACAATTCATATCCTCATGAAACACATTTTGATCGCAGTTTTAATTACAATCCTTTTCAACCATGGGACTCGATTGGTGGAGAGTCGAAAGTAGAAATAAAAAAAGAAAAGAACAAAGGAAAGGATGGAGGAAAGTCAAGAAGGAAAAGGAATGCTAAGGAATATGATTTTATTATTGTCGGGGCTGGATCGGCGGGCTGTGTGTTGGCGAATAGATTGTCAGAAGTCAAAAAATGGAGGGTGCTCTTGTTGGAAGCCGGTCCTGAAGAGCCAGACGTAACGATGGTGCCTTCATTAGCGACGATTCTAAGACAATCTTCTATTGACTGGCGGTATGAAACACAACCCGAACCCTTGACCTGTAGGTCTTATAGAAGCCGATCCTGTCCATGGACCAGAGGTAAAACAATGGGTGGTTCCAGTGCCATAAACTATTTAGTTTACATGAGAGGTAACAGATATGATTATGATAACTGGGCTAATTTAGGAAACCCTGGCTGGAGTTATAATGAGGTGCTCTTGTTGGAAGCCGGTCCTGAAGAGCCAGACGTAACGATGGTGCCTTCATTAGCGACGATTCTAAGACAATCTTCTATTGACTGGCGGTATGAAACACAACCCGAACCCTTGACCTGTAGGTCTTATAGAAGCCGATCCTGTCCATGGACCAGAGGTAAAACAATGGGTGGTTCCAGTGCCATAAACTATTTAGTTTACATGAGAGGTAACAGATATGATTATGATAACTGGGCTAATTTAGGAAACCCTGGCTGGAGTTATAATGAGTTGCTGCCTTACTTTAGAAAATCTGAAAACAACCGTGATGTTGAATCTTATGACAATTTCCTTCATGGAGTAGGGGGACCTATTACTGTGGAGAGATTTCCATATGTTGATATCAATACTGCTAAATTAGTAGCAGCATTTCAAGATAAAGGGCTTCCATTAATAGATTTGACGTCAGAAAATAACTTAGGTACAAATATAGGACTATCAACTTCCAGAGATGGGCGAAGGATGTCTATAAATGTGGCATATATCAAGCCCATACGTGATGTTAGACCAAACATTGATATAGTGGTCAACGCATTTGCTACTACGTTAATAATAGACCCCCAAACAAAAATGGTTCTTGGGGTTACATATATAAAGAATGGTGTTACATATAACGTTTTTGCAAAGAAAGAAGTAATAGTAAGTGCTGGGACAATAAATTCTCCAAAATTACTTATGCTTTCTGGGATCGGCCCAAAAGAGCATTTGCAAAGCTTGAATATACCAATAATATCGGAATTGGCAGTCGGCCAAAATTTACAAGATCACACAACCACTGACGGACTAACTATTGCTTTATCAAATAAGACATCTACCTTAGTGAGTACTGAGACACTCCTTAATGAAGTACAGAATTACCACCAACAGGACCCTAAAAAAGATGGACCTTTGGCGACAACTAATACTCTTAATGCCATTGCTTTTATCAAAACTAAATATGCGACTGTAAATGCACCAGATATACAATTTCATTTTGATGGAAGAAATGTTGAAGACTTTTACGCAGATCCTCAAACATATTTGGAAACCAACATTTGGCCTTTAGCTTTTTATAATGGTTTATCAGCAAGACCACTTCTGCTTACCCCCAAAAGTAGGGGAGTTATTTTACTAAACCATACTGATCCTATCTTTGGCACACCTTTAATATACCCACGCTTTTTCACAGTCAAGGAAGACTTAGATGCGTTAATCGAAGGATTACGTTTTGCTGTAAGTTTAGAGGAAACTGAAACATTTAAAAGCATTGGTGCACATTTTGTTAGAGTTCCTGTTAAGAATTGTGAAAATCATATTTGGGGTTCTTATAATTATTTTGCGTGTTTACTTATTGAGTATACTTCAACAATTTACCATCCAGTTGGTACTTGTAAGATGGGTCCCGCTTGGGACAAAGATGCTGTTGTTGACTCAAGATTGCGAGTGTATGGGGTTAAACGATTAAGAGTAATTGACGCATCCATAATGCCAGAAATAGTTAGAGGGAACACAAATATCCCAACTGTCACCATAGCAGAACGTGCATCAGATATGATAAAGGAAGAATATTTGACAAAACAACATTTATAA

Protein sequence:

>DPOGS202485-PA
MYLSLVMRLFGGVNYSVNYPRRNPYSTLQYTHSSSIDKNPLRNSLSEYYPGPYLNPAYSRLNYNSYPHETHFDRSFNYNPFQPWDSIGGESKVEIKKEKNKGKDGGKSRRKRNAKEYDFIIVGAGSAGCVLANRLSEVKKWRVLLLEAGPEEPDVTMVPSLATILRQSSIDWRYETQPEPLTCRSYRSRSCPWTRGKTMGGSSAINYLVYMRGNRYDYDNWANLGNPGWSYNEVLLLEAGPEEPDVTMVPSLATILRQSSIDWRYETQPEPLTCRSYRSRSCPWTRGKTMGGSSAINYLVYMRGNRYDYDNWANLGNPGWSYNELLPYFRKSENNRDVESYDNFLHGVGGPITVERFPYVDINTAKLVAAFQDKGLPLIDLTSENNLGTNIGLSTSRDGRRMSINVAYIKPIRDVRPNIDIVVNAFATTLIIDPQTKMVLGVTYIKNGVTYNVFAKKEVIVSAGTINSPKLLMLSGIGPKEHLQSLNIPIISELAVGQNLQDHTTTDGLTIALSNKTSTLVSTETLLNEVQNYHQQDPKKDGPLATTNTLNAIAFIKTKYATVNAPDIQFHFDGRNVEDFYADPQTYLETNIWPLAFYNGLSARPLLLTPKSRGVILLNHTDPIFGTPLIYPRFFTVKEDLDALIEGLRFAVSLEETETFKSIGAHFVRVPVKNCENHIWGSYNYFACLLIEYTSTIYHPVGTCKMGPAWDKDAVVDSRLRVYGVKRLRVIDASIMPEIVRGNTNIPTVTIAERASDMIKEEYLTKQHL-