Monarch geneset OGS2.0

DPOGS211673
TranscriptDPOGS211673-TA984 bp
ProteinDPOGS211673-PA327 aa
Genomic positionDPSCF300151 + 326278-330849
RNAseq coverage3480x (Rank: top 4%)
Annotation
HeliconiusHMEL0158873e-9056.18% 
BombyxBGIBMGA001367-TA6e-10960.13% 
DrosophilaCG10638-PA7e-6643.17% 
EBI UniRef50UniRef50_D2SNM45e-10858.02%Aldo-keto reductase n=7 Tax=Obtectomera RepID=D2SNM4_HELVI
NCBI RefSeqXP_969383.13e-7544.34%PREDICTED: similar to aldo-keto reductase [Tribolium castaneum]
NCBI nr blastpgi|2609078272e-10758.02%aldo-keto reductase [Heliothis virescens]
NCBI nr blastxgi|2609078274e-10560.32%aldo-keto reductase [Heliothis virescens]
Group
Gene OntologyGO:00551146.6e-40oxidation-reduction process
GO:00164916.6e-40oxidoreductase activity
KEGG pathwayaag:AaeL_AAEL0040882e-67 
 K00011 (E1.1.1.21, AKR1)maps-> Galactose metabolism
    Glycerolipid metabolism
    Pentose and glucuronate interconversions
    Fructose and mannose metabolism
    Pyruvate metabolism
InterPro domain[23-327] IPR0013951.5e-127Aldo/keto reductase
[25-327] IPR0232105.2e-100NADP-dependent oxidoreductase domain
[62-86] IPR0204716.6e-40Aldo/keto reductase subgroup
Orthology groupMCL23351 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211673-TA
ATGCTGTGGCTGTACCTGGTGCTGTGTGTCACATCGGTCCTCGGGAAGAACACTCCATCTGTCGCAGTAGCGCCGACAGTGAGGCTCAGAGATGGGAACCTCATGCCGCGCCTGGGTCTCGGCACCTGGCTTGGGTTCATCAAAATGGGTCCTGAGGAGGAAGTCCGTGGTGCTGTGGAAGCAGCCATCGACGCGGGTTATAGGCTCATCGACACGGCCAACATTTACATCACAGAGGAACAAGTGGCGCTGGGAATGAAGAAAAAGTTAGATGAAGGTGTTGTTAAAAGAGAGGAGATGTTCATCACCACTAAGCTGTGGAACGACGCCCATCGTTACGAAGCTGTGCTACCAGCGCTGAAGAACTCTCTCCAGAAGCTAGAGCTGGAGTATGTTGACCTCTACCTCATACACTTCCCTATTGCCATCGATTCTAATGGATCCGTCGTCGACGTAGACTACTTAGAGACCTGGCGCGGCATGATAGAGGTCCAAAGGCAAGGTCTCACGAACTCCATCGGAGTTTCTAACTTTAACATCTCGCAGCTGCAAAGGTTGATTGAACAAACGGGCGTCACCCCAGCAGTTCTTCAGGTTGAGGTTAACCTTAACATCCAACAACCTGAACTGCTGGAATTCTGCAAGGCCCACAACATCGTCGTCATGGGCTACACTCCCTTCGGATCCATCTTCCCCCAGAAGGCCGCTGAGAGCGCTCCTCCCCCGAGGGTTGATGACGAGGAGCTCGTCCATATAGCTAAGAAATACAACAAGACCGTCCCTCAAGTGGTGCTCAGATACCTGTTCGAATTAGGAGTGGTGCCGATCCCTAAATCCGTGAAGAAGAATCGAGTGGAGGAGAACATTGACATCTTCGACTTTGAGCTGACCCCAGAAGAGAGGAACCTCCTCAAGAGTTACGACGCCAACTATAAGATTGTGAACGTTGCTTTGTGGAAGGACTCTCCGTACTATCCGTTTTGA

Protein sequence:

>DPOGS211673-PA
MLWLYLVLCVTSVLGKNTPSVAVAPTVRLRDGNLMPRLGLGTWLGFIKMGPEEEVRGAVEAAIDAGYRLIDTANIYITEEQVALGMKKKLDEGVVKREEMFITTKLWNDAHRYEAVLPALKNSLQKLELEYVDLYLIHFPIAIDSNGSVVDVDYLETWRGMIEVQRQGLTNSIGVSNFNISQLQRLIEQTGVTPAVLQVEVNLNIQQPELLEFCKAHNIVVMGYTPFGSIFPQKAAESAPPPRVDDEELVHIAKKYNKTVPQVVLRYLFELGVVPIPKSVKKNRVEENIDIFDFELTPEERNLLKSYDANYKIVNVALWKDSPYYPF-