Monarch geneset OGS2.0

DPOGS215935
TranscriptDPOGS215935-TA1086 bp
ProteinDPOGS215935-PA361 aa
Genomic positionDPSCF300308 - 73841-78371
RNAseq coverage1029x (Rank: top 12%)
Annotation
HeliconiusHMEL0222532e-14393.28% 
BombyxBGIBMGA001865-TA3e-12487.18% 
DrosophilaGale-PB3e-11353.76% 
EBI UniRef50UniRef50_E0VSA58e-14669.16%UDP-glucose 4-epimerase, putative n=1 Tax=Pediculus humanus corporis RepID=E0VSA5_PEDHC
NCBI RefSeqXP_393006.19e-14970.55%PREDICTED: similar to Probable UDP-glucose 4-epimerase (Galactowaldenase) (UDP-galactose 4-epimerase) [Apis mellifera]
NCBI nr blastpgi|3071972962e-14871.51%UDP-glucose 4-epimerase [Harpegnathos saltator]
NCBI nr blastxgi|3071972967e-14671.51%UDP-glucose 4-epimerase [Harpegnathos saltator]
Group
Gene OntologyGO:00060121.4e-168galactose metabolic process
GO:00039781.4e-168UDP-glucose 4-epimerase activity
GO:00054885.1e-69binding
GO:00442374.6e-55cellular metabolic process
GO:00038244.6e-55catalytic activity
GO:00506624.6e-55coenzyme binding
GO:00059751.1e-05carbohydrate metabolic process
GO:00168571.1e-05racemase and epimerase activity, acting on carbohydrates and derivatives
KEGG pathwayame:4094993e-148 
 K01784 (galE, GALE)maps-> Galactose metabolism
    Amino sugar and nucleotide sugar metabolism
InterPro domain[5-352] IPR0058861.4e-168UDP-glucose 4-epimerase
[5-192] IPR0160405.1e-69NAD(P)-binding domain
[7-272] IPR0015094.6e-55NAD-dependent epimerase/dehydratase
Orthology groupMCL17525 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215935-TA
ATGCCACGCTTCAAGACGATCATGGTAACGGGAGGGGCTGGTTACATTGGAAGCCACTGTGTGGTAGATCTGCTTGAGGCGGGTTATGAAGTAGTTGCCATAGACAATTTCGCTAATGCCGTCGGCGACGAAGAAGGTTCTCCAGCATTACAAAGAGCTGAGGAGATAACTGGCAAGCAGATCACGTTCTACAAAGCCGACCTGTTAGACAAGCAGCAGATAAACAATATATTTGATAAGCATATAGTGGATTGTGTTATCCACTTTGCCGCCTTAAAGGCGGTTGGCGAGTCAATGCAACAGCCCCTACTCTACTACCAGAACAACCTCTTAGGAATGCTAAATTTATTAGAGGTCATGCGCTCTCACAACTGCTACCAAATGGTGTTTTCGTCTTCCTGTACCGTGTACGGCGAGCCGGATCAGCTCCCCATAACGGAGACACATCACACTGGCAACATTACCAACGTGTACGGCCGGACTAAATACTTCATAGAGGAAATGCTTAAGGACCTCAGTGCGGCCGATGAGAAATGGAACATTATCTCTCTAAGGTATTTCAATCCCGTGGGCGCTCATCCTTCCGGTCTAATCGGTGAAGATCCTACCAAAGAGTTCACAAATCTAATGCCTTTTATGGCGCAGGTGGCACTTGGAAAGAAACCTGTGCTCACTATTTTTGGCAATGATTACAATACCCCTGACGGAACTGGTATTCGGGATTACATCCACGTTATGGATTTGGCGGGAGGTCATGTGGCCGCTCTCAATCTCCTGAGTGAAAATCACGTGCGACTCAAGGTATTTAATCTAGGCACTGGTAAGGGCGTATCAGTGAAAGAGCTGGTGAATGTATTCGAACGTGTGACCGGAACAAACATTCCAGTAAAATATGTATCGAGGCGGCTTGGAGATATAACGGCTATGTGGGCGGACGCCACGCTCGCTAAAAACGAACTAGGATGGACGACCAAACGTACCGTTGAGGAAATGTGTACAGATTTTTGGAGATGGCAAACCATGAATCCTGATGGCTATCCCAAGAAGAATAAAACAACCGTCATTGTAGTCAATGGAAAAAGTTAA

Protein sequence:

>DPOGS215935-PA
MPRFKTIMVTGGAGYIGSHCVVDLLEAGYEVVAIDNFANAVGDEEGSPALQRAEEITGKQITFYKADLLDKQQINNIFDKHIVDCVIHFAALKAVGESMQQPLLYYQNNLLGMLNLLEVMRSHNCYQMVFSSSCTVYGEPDQLPITETHHTGNITNVYGRTKYFIEEMLKDLSAADEKWNIISLRYFNPVGAHPSGLIGEDPTKEFTNLMPFMAQVALGKKPVLTIFGNDYNTPDGTGIRDYIHVMDLAGGHVAALNLLSENHVRLKVFNLGTGKGVSVKELVNVFERVTGTNIPVKYVSRRLGDITAMWADATLAKNELGWTTKRTVEEMCTDFWRWQTMNPDGYPKKNKTTVIVVNGKS-