Monarch geneset OGS2.0

DPOGS202718
TranscriptDPOGS202718-TA1050 bp
ProteinDPOGS202718-PA349 aa
Genomic positionDPSCF300272 + 54509-58989
RNAseq coverage369x (Rank: top 32%)
Annotation
HeliconiusHMEL0041144e-14773.64% 
BombyxBGIBMGA001383-TA1e-15271.14% 
DrosophilaGale-PB7e-12761.49% 
EBI UniRef50UniRef50_Q143764e-11959.71%UDP-glucose 4-epimerase n=1732 Tax=root RepID=GALE_HUMAN
NCBI RefSeqXP_320278.21e-12963.69%AGAP012261-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|583937412e-12863.69%AGAP012261-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1951262223e-12663.58%GI13017 [Drosophila mojavensis]
Group
Gene OntologyGO:00060121.1e-162galactose metabolic process
GO:00039781.1e-162UDP-glucose 4-epimerase activity
GO:00054883.5e-68binding
GO:00442371.8e-52cellular metabolic process
GO:00038241.8e-52catalytic activity
GO:00506621.8e-52coenzyme binding
GO:00059751.5e-05carbohydrate metabolic process
GO:00168571.5e-05racemase and epimerase activity, acting on carbohydrates and derivatives
KEGG pathwayaga:AgaP_AGAP0122613e-129 
 K01784 (galE, GALE)maps-> Galactose metabolism
    Amino sugar and nucleotide sugar metabolism
InterPro domain[1-349] IPR0058861.1e-162UDP-glucose 4-epimerase
[1-192] IPR0160403.5e-68NAD(P)-binding domain
[3-271] IPR0015091.8e-52NAD-dependent epimerase/dehydratase
Orthology groupMCL14256 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202718-TA
ATGTCGATATTAGTGACCGGCGGGGCGGGTTACGTCGGCTCTCACACCGTGATCGCATTGCTGGAAGAAAAAGACGTTGATATCATAGTCGTTGACAACCTCAGCAACGCTTACAGAGGTGATGGCGTCAAGAAACCGGAGCCGTTGTTGAGGATAGAGGAGATGACGGACAGACACATACATTTCTACGACATCGACATCAGAGATAGGAACGGACTGGACAAAATATTTGATAATCATAAAATAGACTGCGTGATACACTTTGCCGCGTTAAAAGCGGTGGGGGAGTCGGTCCAGAAGCCGCTGGAGTATTACCAGGCGAACATTAGCGGGACCTGCACCTTGCTGGAGTCAATGCGAGCCCACAACGTCTACAAGCTGGTGTACAGCTCATCCTGCACAGTCTACGGAGACCCGCAGAGGTTACCCCTGGATGAGAACCACCCTACAGGTGGGGGTATCACCAACCCTTACGGCAAGACCAAATACTTCTGTGAGGAGATAATGAAAGATCTGTGCAGCAGCGACCAGAACTGGAAGATCGTATCTCTCCGTTACTTCAATCCCGTGGGAGCTCATATCAGCGGCAGGATAGGTGAGGATCCGACAGGACCTCCCAACAACCTGATGCCCTATATATCACAGGTGGCGGTGGGCCGTCTCCAGGAGTTGCAGGTGTTCGGCTCAGATTATCCGACTGTGGACGGGACGGGTGTGAGGGACTATGTACATGTAGTGGACCTCGCTGACGGACACCTGCGAGCAGCTAGACTGTTCCACGACCAGAGCTTCAGAGGATTCCATGCTGTCAACCTGGGCACAGGTCAAGGCCAGTCCGTCCTCCAACTGGTGTCATCCTTCGAGCGTGTGAGCGGGCGGCGGGTTCCTTATCGAGTGGTGGGACGACGAGCTGGAGACGTCGCTGCCAACTACGCTGACGTGACCCTGGCCCGGACACTCCTCGGCTGGAAAGCCACCAGGAGCCTGGACCAGATGTGCGAGGACACCTGGAGGTGGCAGGAGAAGAACCCTCGGGGGTACAGGGACTGA

Protein sequence:

>DPOGS202718-PA
MSILVTGGAGYVGSHTVIALLEEKDVDIIVVDNLSNAYRGDGVKKPEPLLRIEEMTDRHIHFYDIDIRDRNGLDKIFDNHKIDCVIHFAALKAVGESVQKPLEYYQANISGTCTLLESMRAHNVYKLVYSSSCTVYGDPQRLPLDENHPTGGGITNPYGKTKYFCEEIMKDLCSSDQNWKIVSLRYFNPVGAHISGRIGEDPTGPPNNLMPYISQVAVGRLQELQVFGSDYPTVDGTGVRDYVHVVDLADGHLRAARLFHDQSFRGFHAVNLGTGQGQSVLQLVSSFERVSGRRVPYRVVGRRAGDVAANYADVTLARTLLGWKATRSLDQMCEDTWRWQEKNPRGYRD-