Monarch geneset OGS2.0

DPOGS203587
TranscriptDPOGS203587-TA1170 bp
ProteinDPOGS203587-PA389 aa
Genomic positionDPSCF300063 - 1048486-1053814
RNAseq coverage2585x (Rank: top 5%)
Annotation
HeliconiusHMEL0158736e-12859.79% 
BombyxBGIBMGA001348-TA2e-8545.48% 
DrosophilaCG10638-PA3e-6436.76% 
EBI UniRef50UniRef50_Q9Y0203e-10651.34%3-dehydroecdysone 3beta-reductase n=2 Tax=Obtectomera RepID=Q9Y020_SPOLI
NCBI RefSeqXP_968650.15e-7137.99%PREDICTED: similar to aldo-keto reductase [Tribolium castaneum]
NCBI nr blastpgi|47539121e-10551.34%3-dehydroecdysone 3beta-reductase [Spodoptera littoralis]
NCBI nr blastxgi|47539124e-8948.13%3-dehydroecdysone 3beta-reductase [Spodoptera littoralis]
Group
Gene OntologyGO:00551145.5e-16oxidation-reduction process
GO:00164915.5e-16oxidoreductase activity
KEGG pathwaytca:6579433e-65 
 K00011 (E1.1.1.21, AKR1)maps-> Galactose metabolism
    Glycerolipid metabolism
    Pentose and glucuronate interconversions
    Fructose and mannose metabolism
    Pyruvate metabolism
InterPro domain[13-376] IPR0013953.6e-121Aldo/keto reductase
[185-377] IPR0232101.4e-97NADP-dependent oxidoreductase domain
[56-80] IPR0204715.5e-16Aldo/keto reductase subgroup
Orthology groupMCL34508 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203587-TA
ATGATAAAGCTGACAGAGATGACCGTCGGCCAAACGAATGTTCCGAAATACAAACTCAACAACGGAAGAGAGATGCCCGCCATTGCTCTGGGGACTTACTTAGGGCACGACAAGAGCGGTATGGTGAGGTCGGTCAACAAACAGCTGAGGGACGTGGTGATGCAAGCCATAGACGTCGGATACAGACACTTCGACACAGCGGAGATATACGGCACTGAGGAGGAGCTGGGGGAGGGAGTCCGGAGGAAGATGGAGGAGGGAGCCGTCCGCAGGGAGGAACTGTTCATTACCGACAAGCTATGGAACACTCACCACAAGCGGGAGCAGGTGGTGCCGTCTCTGAGGGAGTCTCTCTGTAAGATGGGACTGAGTTACATAGACCTGTTCCTCATGCACTGGCCCATGGGACTTCATCTTAGAGACTGGTTTCCAAGAGAGCGACAGGAACAAACACTGGACGGCATCTGTACACGCGTGTTAGTAGTCAGGGAAATCTTTTTCACATGTACCCGCAACACCGACGTGCTTCAACAGGTGCTCATGGCCCTGAGATTCCGTCTTCAGACCGCCATCACATCGGAGGACTACACGCACTCGGACGTGGACTTCATGGAGACGTGGCTGGGCCTGGAGGACGCGGTGCGGCTCGGCCTCGCGAGGAACATCGGCGTGTCCAACTTCAACAAGCAACAGCTGGAGAGGATCCTCCGCGAGGGCAGCATCCGACCAGCGGCGCTGCAGATAGAGGTCCATCCTCAAATAATACAAACGGAGCTGGTGCAGCTCGCTCAGCGTCACCAGCTCGTGGTGATGGGGTACAGTCCGTTCGGGTCGCTGGTGACCAGGTACGGCATCAAGTTCCCGGGACCCACCATCGACGACCCCACACTCGTCGCCATAGCACGCAGACACCACAAGACTACACCGCAAGTGGTACTGAGGTGGCTGGTCGACAGGAACGTCGTGCCGGTCACCAAGACCGTCAACCCCTCGCGGCTGAAGGAGAACATCGACATATTCGACTTTGAATTGAGCGCCAGAGAAATCGAAATCATCAACAAGTTTGATGAAAAAACCCGCTACACCCTCCCGTCGTTTTGGCAAACACACGCTTACTACCCGTTTGAAAAGGTCGACAATCCTTCAGCAGACCCTTTCATCAAACATTAG

Protein sequence:

>DPOGS203587-PA
MIKLTEMTVGQTNVPKYKLNNGREMPAIALGTYLGHDKSGMVRSVNKQLRDVVMQAIDVGYRHFDTAEIYGTEEELGEGVRRKMEEGAVRREELFITDKLWNTHHKREQVVPSLRESLCKMGLSYIDLFLMHWPMGLHLRDWFPRERQEQTLDGICTRVLVVREIFFTCTRNTDVLQQVLMALRFRLQTAITSEDYTHSDVDFMETWLGLEDAVRLGLARNIGVSNFNKQQLERILREGSIRPAALQIEVHPQIIQTELVQLAQRHQLVVMGYSPFGSLVTRYGIKFPGPTIDDPTLVAIARRHHKTTPQVVLRWLVDRNVVPVTKTVNPSRLKENIDIFDFELSAREIEIINKFDEKTRYTLPSFWQTHAYYPFEKVDNPSADPFIKH-