Monarch geneset OGS2.0

DPOGS215969
TranscriptDPOGS215969-TA966 bp
ProteinDPOGS215969-PA321 aa
Genomic positionDPSCF300078 - 673326-676581
RNAseq coverage2460x (Rank: top 5%)
Annotation
HeliconiusHMEL0180443e-17390.34% 
BombyxBGIBMGA000926-TA2e-16686.29% 
DrosophilaTal-PA2e-12265.52% 
EBI UniRef50UniRef50_E3X0U73e-12760.43%Transaldolase n=4 Tax=Coelomata RepID=E3X0U7_ANODA
NCBI RefSeqNP_001040544.11e-16385.98%transaldolase [Bombyx mori]
NCBI nr blastpgi|1140526132e-16285.98%transaldolase [Bombyx mori]
NCBI nr blastxgi|1140526133e-15485.98%transaldolase [Bombyx mori]
Group
Gene OntologyGO:00059752.2e-186carbohydrate metabolic process
GO:00060982.7e-152pentose-phosphate shunt
GO:00048012.7e-152sedoheptulose-7-phosphate:D-glyceraldehyde-3-phosphate glyceronetransferase activity
GO:00057372.7e-152cytoplasm
GO:00081529.3e-126metabolic process
GO:00038249.3e-126catalytic activity
KEGG pathwayaag:AaeL_AAEL0093895e-144 
 K00616 (E2.2.1.2, talA, talB)maps-> Pentose phosphate pathway
InterPro domain[1-315] IPR0015852.2e-186Transaldolase
[3-316] IPR0047302.7e-152Transaldolase 1
[1-316] IPR0137859.3e-126Aldolase-type TIM barrel
Orthology groupMCL14372 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215969-TA
ATGAGTGCCTTAGACCAATTAAAACAATTTTCAACTGTTGTGGCCGATACCGGTGACTTCGAAGCTATGAAGGCTTACAAGCCTACGGATGCCACTACGAATCCAAGTTTAATTCTCTCTGCAGCTGGTATGGAACAATATCAACATTTGCTGGACAAAGCTATCAAATATGGAATGGACTGTGGGGGAAATCTAGAAGAACAATTATCTGAGGTCATGGACATGTTAAGCGTTTTGTTTGGTTGTGAAATACTAAAAATTATACCTGGACGTGTTTCTGTTGAAGTAGATGCGAGATTGTCATTCGATAAGGATGCTAGTATGGCTAAAGCAATAAAATTGATTGGCATGTTTGCTGAGCAAGGTATCAAGAAAGAGAGGATTTTGATCAAACTTGCATCAACATGGGAGGGTATCCAAGCTGCACGGGAATTGGAGAAAAAACATGGTATTCATTGTAATCTGACCCTCCTGTTCTCAATGTGCCAAGCAATTGCATGTGCTGAGGCCAATGTGACATTGATATCACCATTTGTCGGAAGAATATTGGATTGGTATGTGGAACACACAAAGTTGACATACGAACCCAAGGATGATCCGGGAGTGTTGTCTGTATCCCGTGTGTACAACTACTACAAGAAGTTTGGCTACAAGACACAGGTTATGGGGGCCTCATTCCGTAACACTGGAGAGATAAAAGAGCTGGCTGGCTGCGATTTGCTCACCATCAGTCCAAAACTGCTTCAGGAGCTTGCCAACAGTGAACAGCCTCTTAAGAAGGTTCTTGATCCAAAAACAGCGGCTCAGTGTGACATTAAGAAAATATCTTTAACCGAGGCACAGTTCCGCTGGCAACTGAACGAAGACCAAATGGCCACTGACAAACTTTCCGACGGCATCAGAAAATTTGCAGCGGATGGCAGGAAGCTGGAATCTCTCATTAAATCATTACTCTCTAAGAAGTAA

Protein sequence:

>DPOGS215969-PA
MSALDQLKQFSTVVADTGDFEAMKAYKPTDATTNPSLILSAAGMEQYQHLLDKAIKYGMDCGGNLEEQLSEVMDMLSVLFGCEILKIIPGRVSVEVDARLSFDKDASMAKAIKLIGMFAEQGIKKERILIKLASTWEGIQAARELEKKHGIHCNLTLLFSMCQAIACAEANVTLISPFVGRILDWYVEHTKLTYEPKDDPGVLSVSRVYNYYKKFGYKTQVMGASFRNTGEIKELAGCDLLTISPKLLQELANSEQPLKKVLDPKTAAQCDIKKISLTEAQFRWQLNEDQMATDKLSDGIRKFAADGRKLESLIKSLLSKK-