Monarch geneset OGS2.0

DPOGS210580
TranscriptDPOGS210580-TA1113 bp
ProteinDPOGS210580-PA370 aa
Genomic positionDPSCF300168 - 583775-589407
RNAseq coverage1288x (Rank: top 10%)
Annotation
HeliconiusHMEL0082960.082.31% 
BombyxBGIBMGA013631-TA8e-17980.16% 
DrosophilaCG5854-PA3e-9847.01% 
EBI UniRef50UniRef50_G6DIV30.0100.00%UDP-galactose 4-epimerase n=10 Tax=Endopterygota RepID=G6DIV3_DANPL
NCBI RefSeqNP_001040224.17e-17680.22%UDP-galactose 4-epimerase [Bombyx mori]
NCBI nr blastpgi|1140521661e-17480.22%UDP-galactose 4-epimerase [Bombyx mori]
NCBI nr blastxgi|1140521664e-16980.22%UDP-galactose 4-epimerase [Bombyx mori]
Group
Gene OntologyGO:00054888e-27binding
GO:00442372e-18cellular metabolic process
GO:00038242e-18catalytic activity
GO:00506622e-18coenzyme binding
KEGG pathwaytag:Tagg_12116e-09 
 K01784 (galE, GALE)maps-> Galactose metabolism
    Amino sugar and nucleotide sugar metabolism
InterPro domain[218-265] IPR0160408e-27NAD(P)-binding domain
[13-246] IPR0015092e-18NAD-dependent epimerase/dehydratase
Orthology groupMCL16019 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210580-TA
ATGTCGGATTCCACCGGTGATAATTCAAAATCTCGAGTTATAGTTTTAGGAGGGTGCGGCTTCATCGGTCGGAATCTGGTGGATTATTTAATTAGCAATGACTTAGTTAGTCACTTGCGTGTAGTGGATAAAACTCCTCCACAATTGGCCTTCCTGAATCCTACACACTCGAAGGCCTTCGAGGACCCCCGCGTTGAATACAAAAGCGCAAACCTCATAAATCCAGTATCATGCGCCAGTGCCCTCGAACCGTCCGAGCAACCCTGGTCCTTGGTGGTGAACTGCGCGGGCGAGACTCGCTTTGGTCAGACGGAAGCGGTCTACGCCGAGGGCATCCACAACCTCAGCGTCACAGTGGCCAAGCAATGTGCCCTCATGAAACTACGGCTCATTGAAATATCCAGCGGATGCATGTACAGCAGCGACAAGCCGCAGAAAGAGGACTGTCCCGTGGAACCGTGGACTGTGGAGGGTAGGATGAAGGCCAGAGTTGAAGAGGAGTTGAAGAACATGGATCTAGACTACACCATCATAAGACCAGCCATAGTATACGGAGTAGGAGATAGGAGGAGTCTCACGCCCCGTCTTCTATATGGTGGTATATACAAACACCTGGGAGAAACCATGAAGCTGTTATGGACAGCGGACCTCAAGATGAACACGGTCCACGTGTTAGACGTCTGTCGGGCCGTGTGGACCCTCGCCAGGAGAAACGACGCGATCAGACAGACATACAACCTGGTGGATGACGCCAACAGCACTCAAGGCAATCTCGCAGAGATCGTCTCGGAGATATTCAATATAAACCACGATTACTACGGAACTGCGATATCCACATTGGCTAAGAACGACATAGCCTCAGTAGCTGAAGAGGCGAACGACAAACACCTTACCGCGTGGGCGGATATCTGCCGGAAGTATTCGTTGGAGCACAGTCCCCTGGAACCGAGCGCTGGAGCTGAGCTATTACTGAACAAACAGCTGTGTCTGGACGGAAGCAAACTGAAGGAAATCATGACAATGGACGTGCCCGCGCCCACGGCCTCCGCCCTACTTGAGGTGCTGCAAGATTATGCCTCAATGAATCTATTCCCAAAGGAGCTTCTGATGTGA

Protein sequence:

>DPOGS210580-PA
MSDSTGDNSKSRVIVLGGCGFIGRNLVDYLISNDLVSHLRVVDKTPPQLAFLNPTHSKAFEDPRVEYKSANLINPVSCASALEPSEQPWSLVVNCAGETRFGQTEAVYAEGIHNLSVTVAKQCALMKLRLIEISSGCMYSSDKPQKEDCPVEPWTVEGRMKARVEEELKNMDLDYTIIRPAIVYGVGDRRSLTPRLLYGGIYKHLGETMKLLWTADLKMNTVHVLDVCRAVWTLARRNDAIRQTYNLVDDANSTQGNLAEIVSEIFNINHDYYGTAISTLAKNDIASVAEEANDKHLTAWADICRKYSLEHSPLEPSAGAELLLNKQLCLDGSKLKEIMTMDVPAPTASALLEVLQDYASMNLFPKELLM-