Monarch geneset OGS2.0

DPOGS215011
TranscriptDPOGS215011-TA1440 bp
ProteinDPOGS215011-PA479 aa
Genomic positionDPSCF300256 + 178731-180170
RNAseq coverage459x (Rank: top 27%)
Annotation
HeliconiusHMEL0101740.094.36% 
BombyxBGIBMGA012188-TA0.091.23% 
Drosophilasgl-PA0.077.45% 
EBI UniRef50UniRef50_O023730.077.45%UDP-glucose 6-dehydrogenase n=157 Tax=cellular organisms RepID=UGDH_DROME
NCBI RefSeqXP_396801.20.079.87%PREDICTED: similar to sugarless CG10072-PA [Apis mellifera]
NCBI nr blastpgi|3800170730.080.13%PREDICTED: UDP-glucose 6-dehydrogenase-like [Apis florea]
NCBI nr blastxgi|3800170730.080.13%PREDICTED: UDP-glucose 6-dehydrogenase-like [Apis florea]
Group
Gene OntologyGO:00166169.9e-101oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as acceptor
GO:00512879.9e-101NAD binding
GO:00551149.9e-101oxidation-reduction process
GO:00054882e-90binding
GO:00164911.1e-29oxidoreductase activity
KEGG pathwayame:4133560.0 
 K00012 (E1.1.1.22, ugd)maps-> Starch and sucrose metabolism
    Ascorbate and aldarate metabolism
    Pentose and glucuronate interconversions
    Amino sugar and nucleotide sugar metabolism
InterPro domain[5-443] IPR0174769.9e-101Nucleotide sugar dehydrogenase
[244-465] IPR0160402e-90NAD(P)-binding domain
[5-189] IPR0017322e-58UDP-glucose/GDP-mannose dehydrogenase, N-terminal
[315-458] IPR0140271.5e-39UDP-glucose/GDP-mannose dehydrogenase, C-terminal
[213-309] IPR0140261.8e-32UDP-glucose/GDP-mannose dehydrogenase, dimerisation
[212-313] IPR0089271.1e-296-phosphogluconate dehydrogenase, C-terminal-like
[216-243] IPR0211571.2e-14Cytochrome c1, transmembrane anchor, C-terminal
Orthology groupMCL13194 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215011-TA
ATGGTGATCGAAAAGATTTGCTGTCTGGGTGCTGGCTACGTCGGAGGACCTACGTGCAGTGTTATTGCTCTCAAATGCCCTAATATTAAAGTGACCGTGTGTGACAAGAGCTTGGAGAGGATAAATCAATGGAACTCCGATAAATTACCTATTTATGAGCCCGGTTTGGACGAAGTTGTTAGGCAATGTCGCGGACGAAACTTGTTCTTCTCGACGGACATAGAATCGAGCATTTTGGAGGCTGATTTAATTTTCATCTCAGTGAACACACCGACTAAAACGATCGGAAACGGCAAAGGTAGGGCTGCGGATCTGAAGTATATAGAAGGTGCCGCCCGCATGATAGCCGATATAGCGACTAGTAATAAAATAGTAGTAGAAAAGAGCACGGTGCCAGTGAAAGCGGCAGAAATCATCATGAAGATTCTTCGCGCTAACACGAAACCCGGCGTCGAATATCAAATATTATCCAACCCGGAGTTCTTAGCTGAAGGTACAGCGATCGTGGACTTGGTAGAAGCTGAAAGGGTCCTAATAGGAGGAGAAGACACTCCTGAAGGCCAGAAGGCGGTGCAGCAATTGTGTTGGGTCTACGAGCATTGGATTCCTGCCAAAAACATACTCACAACTAATACTTGGAGTTCAGAGTTGTCGAAGCTGGCTGCAAATGCCTTTCTGGCTCAAAGGATATCCAGTATTAACTCACTGTCTGCCGTCTGTGAGGCTACAGGCGCTGATGTCTCTGAGGTAGCCAGAGCAGTCGGTAGAGATTCAAGAATCGGCCCAAAATTCCTCGAAGCCTCAATAGGCTTTGGAGGTAGTTGCTTCCAAAAGGATATCCTCAATTTAATTTATCTATCCGAGTGCTTAAATCTGCCGGAGGTGGCCGCCTACTGGCAGCAAGTTGTTAGCTTGAACGATTACCAGAAAACAAGGTTCACCCGCAAAGTGATTGAATCATTATTCAACACTGTTGCCGACAAGAAAATTGCTATACTTGGCTTTTCATTTAAAAAGAACACCGGAGACACCAGAGAGTCCCCAGCTATATATGTTTGTAAGACGTTATTGGATGAGGGGGCAAAACTGCACATTTATGACCCTAAAGTGGAACATGAACAAATCTTCTATGAGCTATCTCATCCGCTGGTCACAAATGAGCCGGAGATTGTTCGGAAGAACATCCAAATACACGAAACCGCTTACTCAGCTGTCGCTGGAGCTCATGCCCTCGTGCTCTGCACCGAGTGGGATGAATTCAAAACCTTAGATTATAAGAAAATATATGAAGTAATGATGAAGCCGGCCTATGTGTTTGATGGAAGAAAGATCCTCGATCACGAGGCCTTATTGAACATGGGATTCCATGTTCAGACGATCGGCAAGAGGCTGTCCAGGACCAGCAGCATCCGGGCTCAAGGCAGTCAGACCATGCCGTAA

Protein sequence:

>DPOGS215011-PA
MVIEKICCLGAGYVGGPTCSVIALKCPNIKVTVCDKSLERINQWNSDKLPIYEPGLDEVVRQCRGRNLFFSTDIESSILEADLIFISVNTPTKTIGNGKGRAADLKYIEGAARMIADIATSNKIVVEKSTVPVKAAEIIMKILRANTKPGVEYQILSNPEFLAEGTAIVDLVEAERVLIGGEDTPEGQKAVQQLCWVYEHWIPAKNILTTNTWSSELSKLAANAFLAQRISSINSLSAVCEATGADVSEVARAVGRDSRIGPKFLEASIGFGGSCFQKDILNLIYLSECLNLPEVAAYWQQVVSLNDYQKTRFTRKVIESLFNTVADKKIAILGFSFKKNTGDTRESPAIYVCKTLLDEGAKLHIYDPKVEHEQIFYELSHPLVTNEPEIVRKNIQIHETAYSAVAGAHALVLCTEWDEFKTLDYKKIYEVMMKPAYVFDGRKILDHEALLNMGFHVQTIGKRLSRTSSIRAQGSQTMP-