Monarch geneset OGS2.0

DPOGS212529
TranscriptDPOGS212529-TA885 bp
ProteinDPOGS212529-PA294 aa
Genomic positionDPSCF300222 + 666990-669828
RNAseq coverage25x (Rank: top 77%)
Annotation
HeliconiusHMEL0093456e-11865.31% 
BombyxBGIBMGA009800-TA4e-11363.64% 
DrosophilaCG10863-PA3e-6844.84% 
EBI UniRef50UniRef50_B8ZX074e-11160.20%Putative aldo-ketose reductase 1 n=1 Tax=Papilio dardanus RepID=B8ZX07_PAPDA
NCBI RefSeqXP_001957092.16e-6945.00%GF10249 [Drosophila ananassae]
NCBI nr blastpgi|2196860821e-11060.20%putative aldo-ketose reductase 1 [Papilio dardanus]
NCBI nr blastxgi|2196860822e-10860.20%putative aldo-ketose reductase 1 [Papilio dardanus]
Group
Gene OntologyGO:00551148.6e-44oxidation-reduction process
GO:00164918.6e-44oxidoreductase activity
KEGG pathwaydme:Dmel_CG108633e-66 
 K00011 (E1.1.1.21, AKR1)maps-> Galactose metabolism
    Glycerolipid metabolism
    Pentose and glucuronate interconversions
    Fructose and mannose metabolism
    Pyruvate metabolism
InterPro domain[3-270] IPR0013953e-113Aldo/keto reductase
[2-270] IPR0232101.4e-90NADP-dependent oxidoreductase domain
[4-28] IPR0204718.6e-44Aldo/keto reductase subgroup
Orthology groupMCL19842 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212529-TA
ATGTCACACGCCATTGACGTGGGCTATAGACACTTGGACACGGCACACCTTTACCGGACAGAACCGGAAGTTGGCTGGGTAGTGCGGAAGAAGATACAGGAAGGGGTGATCACCAGGGAGCAAATGTTCATCACTACAAAGGTATGGCAGCACAATCACAGGCCGGAAGATGTCGAGGCCTCGGTCCGAGGATCCCTTCAGAGGATGGGTCTGGATTATGTTGATCTTGTGCTTATGCACTGGCCTATGGCTATATCGGTCGAGGGTGCGGATGAAAAAATTGATTACATTGAAACCTGGCGCGGGTTCGAGTCCGTTCTAGACAAAGGTCTGGCAAAATCTATAGGCATCTCAAACTTCAATGCGCAACAGATAGAGAGGTTGTTGAGGAACTGTCGAGTCAAACCGGTGGTGAACCAAGTTGAGTTAAACATAAACCTGGCCCAAAAAGATTTGGTGTCTTATTGCCAGAAGAATGACATCCAGTGTGTCGCGTACACGCCCTTCGGTAGTATGATGCCGAGTAGAGAAGATCCGAATAATGAAGGAACGAAGGTCGATGACCCCCGGCTGACATCCATCGCCAAAAAACATGGAAAAACTGTTGGACAGATAGCACTCAGACATTTGTATGAACGCGGTTTGGTCGCTATACCAAAAACGATAACGAAGTCTCGCGTCGTGGAAAACGCTTCCATCTTCGACTTCCAGCTCGATGCAAGCGACGTTGAGACGCTAAACGCTCTCGATAACGGCTATCGAACAGTTCGTCCGTTATTCTGGCAAGAATATGAATACTATCCGTTTGATCGTGTCGACGCCCCGATACCGAAGATACCGGAGCAATACTTGAAGTGGGAAGATGGCGGGAAAATTGACATCTGA

Protein sequence:

>DPOGS212529-PA
MSHAIDVGYRHLDTAHLYRTEPEVGWVVRKKIQEGVITREQMFITTKVWQHNHRPEDVEASVRGSLQRMGLDYVDLVLMHWPMAISVEGADEKIDYIETWRGFESVLDKGLAKSIGISNFNAQQIERLLRNCRVKPVVNQVELNINLAQKDLVSYCQKNDIQCVAYTPFGSMMPSREDPNNEGTKVDDPRLTSIAKKHGKTVGQIALRHLYERGLVAIPKTITKSRVVENASIFDFQLDASDVETLNALDNGYRTVRPLFWQEYEYYPFDRVDAPIPKIPEQYLKWEDGGKIDI-