Monarch geneset OGS2.0

DPOGS212528
TranscriptDPOGS212528-TA1029 bp
ProteinDPOGS212528-PA342 aa
Genomic positionDPSCF300222 + 661662-666329
RNAseq coverage65x (Rank: top 67%)
Annotation
HeliconiusHMEL0093451e-13464.99% 
BombyxBGIBMGA009800-TA5e-12762.94% 
DrosophilaCG12766-PA6e-7141.67% 
EBI UniRef50UniRef50_B8ZX071e-12659.94%Putative aldo-ketose reductase 1 n=1 Tax=Papilio dardanus RepID=B8ZX07_PAPDA
NCBI RefSeqXP_969383.13e-7240.56%PREDICTED: similar to aldo-keto reductase [Tribolium castaneum]
NCBI nr blastpgi|2196860823e-12659.94%putative aldo-ketose reductase 1 [Papilio dardanus]
NCBI nr blastxgi|2196860821e-12359.94%putative aldo-ketose reductase 1 [Papilio dardanus]
Group
Gene OntologyGO:00551142.9e-44oxidation-reduction process
GO:00164912.9e-44oxidoreductase activity
KEGG pathwayaag:AaeL_AAEL0040889e-70 
 K00011 (E1.1.1.21, AKR1)maps-> Galactose metabolism
    Glycerolipid metabolism
    Pentose and glucuronate interconversions
    Fructose and mannose metabolism
    Pyruvate metabolism
InterPro domain[8-318] IPR0013952.2e-126Aldo/keto reductase
[10-318] IPR0232103e-100NADP-dependent oxidoreductase domain
[52-76] IPR0204712.9e-44Aldo/keto reductase subgroup
Orthology groupMCL19842 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212528-TA
ATGTCCACCGATCTGAAGGTAAACATACCGTCATTCACACTCAACAATGGTTTGAAGATACCTGCTGTTGGCTACGGCACGTGGGTCAATGTTGATAAAAATAACCGCCTCTTATTGGAAGATAAGCCGAAATTACTAGAGGCTATGTCACACGCCATTGACGTGGGCTATAGACACTTGGACACGGCACACCTTTATCGGACAGAACCGGAAGTTGGCTGGGTAGTGCGGAAGAAGATACAGGAAGGGGTGATCACCAGGGAGCAAATGTTCATCACTACAAAGGTATGGCAGCACAATCACAGGCCGGAAGATGTCGAGGCCTCAGTCCGAGGATCCCTTCAGAGGATGGGTCTGGATTATGTTGATCTCGTGCTTATGCACTGGCCTATGGCTATATCGGTCGAGGGTGCGGATGAAAAAATAGATTACATTGAAACCTGGCGCGGGTTCGAGTCCGTTTTAGACAAAGGTCTGGCAAAATCTATAGGAGTCTCAAACTTCAATGCTCAACAGATAGAGAGGTTGTTGAGGAACTGTCGAGTCAAACCGGTGGTGAACCAAGTTGAGTTAAACATAAACCTGGCCCAAAAAGATTTGGTGTCTTATTGTCAGAAGAATGGCATCCAGTGTGTCGCGTACACGCCCTTCGGTAGTATGATGCCGAGTAGAGAAGATCCGAATAATGAAGGAACGAAGGTCGATGACCCCCGGCTGACATCCATCGCCAAAAAACATGGAAAAACAGTTGGACAGATAGCACTCAGACATTTGTATGAACGCGGTTTGGTCGCTATACCAAAAACGATAACGAAGTCTCGCGTCGTGGAAAACGCTTCCATCTTCGACTTCCAGCTCGATGCAAGCGACGTTGAGACGCTAAACGCTCTCGATAACGGCTATCGAACAGTCCGTCCGTTATTCTGGCAAGAATATGAATACTATCCGTTTGATCGTGTCGACGCCCCGATACCGAAGATACCGGAGCAATACTTGAAGTGGGAAGATGGCGGGAAAATTGACATCTGA

Protein sequence:

>DPOGS212528-PA
MSTDLKVNIPSFTLNNGLKIPAVGYGTWVNVDKNNRLLLEDKPKLLEAMSHAIDVGYRHLDTAHLYRTEPEVGWVVRKKIQEGVITREQMFITTKVWQHNHRPEDVEASVRGSLQRMGLDYVDLVLMHWPMAISVEGADEKIDYIETWRGFESVLDKGLAKSIGVSNFNAQQIERLLRNCRVKPVVNQVELNINLAQKDLVSYCQKNGIQCVAYTPFGSMMPSREDPNNEGTKVDDPRLTSIAKKHGKTVGQIALRHLYERGLVAIPKTITKSRVVENASIFDFQLDASDVETLNALDNGYRTVRPLFWQEYEYYPFDRVDAPIPKIPEQYLKWEDGGKIDI-