Monarch geneset OGS2.0

DPOGS211672
TranscriptDPOGS211672-TA885 bp
ProteinDPOGS211672-PA294 aa
Genomic positionDPSCF300151 + 316521-323714
RNAseq coverage1299x (Rank: top 10%)
Annotation
HeliconiusHMEL0158874e-10060.87% 
BombyxBGIBMGA001367-TA6e-10461.67% 
DrosophilaCG12766-PA7e-6744.25% 
EBI UniRef50UniRef50_D2SNM41e-10562.91%Aldo-keto reductase n=7 Tax=Obtectomera RepID=D2SNM4_HELVI
NCBI RefSeqXP_969383.14e-7746.28%PREDICTED: similar to aldo-keto reductase [Tribolium castaneum]
NCBI nr blastpgi|2609078274e-10562.91%aldo-keto reductase [Heliothis virescens]
NCBI nr blastxgi|2609078279e-10362.91%aldo-keto reductase [Heliothis virescens]
Group
Gene OntologyGO:00551141.6e-45oxidation-reduction process
GO:00164911.6e-45oxidoreductase activity
KEGG pathwayaag:AaeL_AAEL0040881e-68 
 K00011 (E1.1.1.21, AKR1)maps-> Galactose metabolism
    Glycerolipid metabolism
    Pentose and glucuronate interconversions
    Fructose and mannose metabolism
    Pyruvate metabolism
InterPro domain[8-294] IPR0013951.1e-121Aldo/keto reductase
[8-294] IPR0232103.8e-98NADP-dependent oxidoreductase domain
[29-53] IPR0204711.6e-45Aldo/keto reductase subgroup
Orthology groupMCL23351 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211672-TA
ATGCTCTGGCTGTACCTGGTGCTGGGTGCTACATCGGTCCTCGGGAAGGGTTCACCGGAAGAGGTTCAACAGGCAGTGGAAGCAGCCATAGATGCCGGCTACAGACACATCGACACCGCTCACATTTATAATACAGAGAAACAGGTCGGCAAGGGATTGAAGAAGAAAATAGAAGAGGGAGTAGTTAAGAGAGAGGACATGTTCATAACGACTAAGTTGTGGAGTGACGCTCATCCGCGCGATGCTGTGATACCAACTCTGAACGAGTCCCTCAACCATCTGGGAATGGATTATGTGGACTTATACCTCATCCACTGGCCAGTCGCCACTTTCAGCAATGGTTCCATTCAAGACGTAGACTATCTAGACACCTGGAAGGGTCTTATGGAAGCAAAGAACTTAGGTCTAACCAGGTCTATCGGAGTGTCCAACTTCAATATAGAACAACTTAAAAGGTTGATTGATAGCTCCGGAGTTACGCCAGCAGTTCTTCAGATTGAGGTCAACCTTAACATCCAACAACCTGAACTGCTGGAATTCTGCAAGGCCCACAACATCGTCGTCATGGGCTACACTCCCTTCGGATCCATCTTCCCCCAGAAGGCCGCTGAGAGCGCTCCTCCACCGAGGGTTGATGACGAGCAGCTCGTGCATATAGCTAAGAAATACAACAAGACTGTCCCTCAAGTGGTGCTCAGATACCTGTTCGAATTAGGAGTGGTGCCGATCCCTAAATCCGTGAAGAAGAATCGAGTGGAGGAGAACATTGACATCTTCGACTTTGAGCTGACTCCAGAAGAGAGGAACCTCCTCAAGAGTTACGACGCCAACTATAAGATTGTGAACGTTGCTTTGTGGAAGGACTCTCCGTACTATCCGTTTTGA

Protein sequence:

>DPOGS211672-PA
MLWLYLVLGATSVLGKGSPEEVQQAVEAAIDAGYRHIDTAHIYNTEKQVGKGLKKKIEEGVVKREDMFITTKLWSDAHPRDAVIPTLNESLNHLGMDYVDLYLIHWPVATFSNGSIQDVDYLDTWKGLMEAKNLGLTRSIGVSNFNIEQLKRLIDSSGVTPAVLQIEVNLNIQQPELLEFCKAHNIVVMGYTPFGSIFPQKAAESAPPPRVDDEQLVHIAKKYNKTVPQVVLRYLFELGVVPIPKSVKKNRVEENIDIFDFELTPEERNLLKSYDANYKIVNVALWKDSPYYPF-