Monarch geneset OGS2.0

DPOGS207020
TranscriptDPOGS207020-TA969 bp
ProteinDPOGS207020-PA322 aa
Genomic positionDPSCF300001 + 1383729-1388973
RNAseq coverage1560x (Rank: top 8%)
Annotation
HeliconiusHMEL0158738e-8848.00% 
BombyxBGIBMGA012831-TA4e-13671.90% 
DrosophilaCG10638-PA9e-7142.19% 
EBI UniRef50UniRef50_Q8WRT06e-9055.25%3-dehydrecdysone 3b-reductase n=3 Tax=Obtectomera RepID=Q8WRT0_TRINI
NCBI RefSeqXP_969383.16e-8648.28%PREDICTED: similar to aldo-keto reductase [Tribolium castaneum]
NCBI nr blastpgi|3796981802e-13371.90%aldo-keto reductase 2E [Bombyx mori]
NCBI nr blastxgi|3796981805e-12971.90%aldo-keto reductase 2E [Bombyx mori]
Group
Gene OntologyGO:00551143.6e-49oxidation-reduction process
GO:00164913.6e-49oxidoreductase activity
KEGG pathwayaag:AaeL_AAEL0040882e-77 
 K00011 (E1.1.1.21, AKR1)maps-> Galactose metabolism
    Glycerolipid metabolism
    Pentose and glucuronate interconversions
    Fructose and mannose metabolism
    Pyruvate metabolism
InterPro domain[18-322] IPR0013957.4e-136Aldo/keto reductase
[20-322] IPR0232102.2e-102NADP-dependent oxidoreductase domain
[58-82] IPR0204713.6e-49Aldo/keto reductase subgroup
Orthology groupMCL30961 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207020-TA
ATGAGATCTGTAATAGTTTTACTGTTTGGAATTGCTGTAACAACAGCAGCAAAGGCTCCATGTGTTGAATTAAACAATGGCGTCTTCATGCCGGTCGTTGCCCTCGGCACAGGACGCGGGACAGCAAGCGAATCAGCACCACTGGATGAAGTGCGTCAGTCGGTGTATTGGGCCATCGAAGCTGGTTACAGGCATGTAGACACTGCAGCCATTTATGGAGATGAGCAGCAAGTTGGTGAAGGAGTGGCTCAAGCTATAGCCAATGGTCTGGTTACCAGGGAGGAGATGTTTATCACAACTAAGCTCTGGAACAATAGACACCGCAGAGACCAAGTCGTACCCGCTCTCAAGGAATCGCTTAGCAAACTGGGTCTGGACTATGTTGACCTCTATCTTATTCATTTCCCCATCGCTGAAGACGACGAGGGTTCAGTCCTAAATACAGATTATCTGGAAACATGGAAAGGAATGGAAGATGCCAAGGACTTGGGCCTGGCAAGGTCCATAGGTGTTTCAAACTTTAACGCGTCCCAAATATCCAGACTCGTCTCCAACTCCAGAATATGGCCAGTCGTTAATGAAGTTGAGGTCAACCCGAGTCTAACCCAGGAGCCCTTGGTAAAACATTGCCAGAGTCTTGGCATTGTAGTGATGGCGTACAGTCCGTTCGGATTCTTGGTGTCCCGGAACAGGCCGGACGCACCACCACCAAGGGTGGATGACCCCGCACTCGTCCAAATGGCGCGGAAATATAACAAGACCACGGGTCAGATCGTTCTGAGATATCTGATCGAACGTGGTCTGGTCCCTATTCCTAAGTCCACAAACCAGAAGAGGCTCGCTCAGAACATCGACCTCTTCGATTTTAGTCTCACCAAGGAGGAAATCGCATTAATTAGTAGATTCAACAAAAATGTCCGCGTCATTGACTTTGCTGATTATAAGGGACATCCATACTTCCCCTTCTGA

Protein sequence:

>DPOGS207020-PA
MRSVIVLLFGIAVTTAAKAPCVELNNGVFMPVVALGTGRGTASESAPLDEVRQSVYWAIEAGYRHVDTAAIYGDEQQVGEGVAQAIANGLVTREEMFITTKLWNNRHRRDQVVPALKESLSKLGLDYVDLYLIHFPIAEDDEGSVLNTDYLETWKGMEDAKDLGLARSIGVSNFNASQISRLVSNSRIWPVVNEVEVNPSLTQEPLVKHCQSLGIVVMAYSPFGFLVSRNRPDAPPPRVDDPALVQMARKYNKTTGQIVLRYLIERGLVPIPKSTNQKRLAQNIDLFDFSLTKEEIALISRFNKNVRVIDFADYKGHPYFPF-