Monarch geneset OGS2.0

DPOGS203290
Transcript        DPOGS203290-TA       1458 bp
Protein           DPOGS203290-PA       485 aa
Genomic position  DPSCF300003 - 1562546-1565950
RNAseq coverage   290x (Rank: top 38%)
Annotation
Heliconius        HMEL006390           7e-115    74.48%
Bombyx            BGIBMGA012238-TA     0.0       71.22%
Drosophila        CG10467-PA           5e-78     41.41%
EBI UniRef50      UniRef50_E3WQS3      2e-98     45.57%   Putative uncharacterized protein n=2 Tax=cellular organisms RepID=E3WQS3_ANODA
NCBI RefSeq       XP_001654671.1       1e-99     46.12%   aldose-1-epimerase [Aedes aegypti]
NCBI nr blastp    gi|157126610         2e-98     46.12%   aldose-1-epimerase [Aedes aegypti]
NCBI nr blastx    gi|157126610         3e-99     45.86%   aldose-1-epimerase [Aedes aegypti]
Group
Gene Ontology     GO:0030246           1.5e-104           carbohydrate binding
                  GO:0005975           1.5e-104           carbohydrate metabolic process
                  GO:0003824           1.5e-104           catalytic activity
                  GO:0016853           7.2e-103           isomerase activity
                  GO:0019318           7.2e-103           hexose metabolic process
KEGG pathway      aag:AaeL_AAEL010590  3e-99
                  K01785 (E5.1.3.3, galM) maps-> Glycolysis / Gluconeogenesis
InterPro domain   [426-478] IPR014718  1.5e-104           Glycoside hydrolase-type carbohydrate-binding, subgroup
                  [89-481] IPR015443   7.2e-103           Aldose 1-epimerase, subgroup
                  [89-478] IPR011013   9.7e-93            Glycoside hydrolase-type carbohydrate-binding
                  [91-376] IPR008183   1.8e-58            Aldose 1-epimerase
Orthology group   MCL25936             Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203290-TA
ATGGCTGAAGCAACCGATGATAATGCGAGTGGAGATACTGTTTCAGTAGTCAAACCTGAAGTTTCCGAAGCACCAGTGACTATTCCACCACCGAAACCAGATGTGGAACTGATAGTAGACGGTTTCGGTTTCATGCCAAAGGATTTAAAGTTGGATTCAAAATCCCTCTCTACAGATTTAAAAAAATCTTCTAAAGTATTAAAAGATATTAAATCCCCCGTAAGCGAACATAGTCCATGTGCCTCTAGCACAAATATCGATATCGTTAGAAGGTATACTTGGAGGACAAAAAACAGGATGACAGTACAGGTTATAACCTATGGCGCAACCATCACATCGATTCAAGTTCCTGACAAGAGAGGTGTTCCGGATGATGTGGTTGCAGGTTTTGATACCTTAGAAGAATATTTCCAACCAAGAAATCCATACTTCGGTGCAACTATAGGGCGCTATGCAAACTATATACAAGATGGGACAATGGTGGTGAGACCTTCCGGCAGGATGTACATGCTCTCTACCAACAAAGGACACAATCATTATCACGGAGGATATGTTGGATTTGATAAGGTTAACTGGCGATCTTATGTAACGGGCAACAAAGTTATCATGAGTCACGTCTCGGAAAGATTCCACGAAGGTTATCCTGGCACTGTGATGGCACAAATTTCTTTTGAAGTGACTTGCGATAATACAATCAAAATTGAAATGAAATGTACGACAAGTGAACCAACAATAGTGAATTTAAGTAACACAAATTACTTTAACCTGGCTGGACATCATTCCGGCCCAGATCTAATGTATGATCATATTATTACAATCAACGCTGATAAATATACAGCTGTGGATGACGACGGACTTGTGACAGGGGAAAAGAACGTTGTTGGCGGAACACCATATGACTTCAGAGTACCGCGGTCATTGCGGGTGATGCTGCCGAAAATTCCAATGGGCGGCTATGATATCAATTTCTGCGTAACGCAAGGCACGGAACAAGATCTTACATTCCAAGCAAGAGCCCTTCACCCGCTCACAGGAAGAGTGTTAGAAATATATAGCAATCAACCTGGAATGCAATTCTATACAGGAAACCTTTTGCCTGATCCCGACAAAATTGTTGAAGAACCGGGTGAAACTGAAGAGGAAGGCAAATCTGAATCTGAATCTGAATCTGACAGTGGGAGCGAAAGCGAGTCGAACAGTAGTGAGGAGAGTGCAAGAGGTGAAGAGGCGATAGAAGAGGCGAAGGAAGAGGAGGGCAAGAGAAGTTATGGATACGTACCATTAATGGGTAAACATGGCACGTTTTACCGGAAACACGGACTATTTTGTATGATGCCGCAAAACTATCCGGATGCTGTCAATCATAAAAACTTCCCGAATTCCGTTTTAAATCCAGGAGAAACTTACATTCACAAAATACAGTACAAGTTTGGTATTTTATTAGGAAAATATGTATGA

Protein sequence:

>DPOGS203290-PA
MAEATDDNASGDTVSVVKPEVSEAPVTIPPPKPDVELIVDGFGFMPKDLKLDSKSLSTDLKKSSKVLKDIKSPVSEHSPCASSTNIDIVRRYTWRTKNRMTVQVITYGATITSIQVPDKRGVPDDVVAGFDTLEEYFQPRNPYFGATIGRYANYIQDGTMVVRPSGRMYMLSTNKGHNHYHGGYVGFDKVNWRSYVTGNKVIMSHVSERFHEGYPGTVMAQISFEVTCDNTIKIEMKCTTSEPTIVNLSNTNYFNLAGHHSGPDLMYDHIITINADKYTAVDDDGLVTGEKNVVGGTPYDFRVPRSLRVMLPKIPMGGYDINFCVTQGTEQDLTFQARALHPLTGRVLEIYSNQPGMQFYTGNLLPDPDKIVEEPGETEEEGKSESESESDSGSESESNSSEESARGEEAIEEAKEEEGKRSYGYVPLMGKHGTFYRKHGLFCMMPQNYPDAVNHKNFPNSVLNPGETYIHKIQYKFGILLGKYV-
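
The transcript and protein records above can be checked against each other. The following is a minimal sketch, assuming the standard genetic code applies and that DPOGS203290-TA is a complete coding sequence read in frame 0. The helper names `translate` and `expected_protein_length` are illustrative and are not part of the geneset or any library. The stop codon is rendered as "-" to match the protein record.

```python
# Standard genetic code, codons enumerated in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


def expected_protein_length(cds_length_bp):
    """Residue count for a complete CDS whose last codon is a stop."""
    return cds_length_bp // 3 - 1


# First 24 and last 21 nucleotides of DPOGS203290-TA, copied from the record.
first_codons = "ATGGCTGAAGCAACCGATGATAAT"
last_codons = "TTATTAGGAAAATATGTATGA"

print(translate(first_codons))        # compare with the start of DPOGS203290-PA
print(translate(last_codons))         # compare with the end of DPOGS203290-PA
print(expected_protein_length(1458))  # compare with the listed protein length
```

Passing the full nucleotide sequence to `translate` should reproduce the full protein record if the assumptions hold.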