Monarch geneset OGS2.0

DPOGS203596
Transcript: DPOGS203596-TA; 1035 bp
Protein: DPOGS203596-PA; 344 aa
Genomic position: DPSCF300063 - 736908-738795
RNAseq coverage: 24x (Rank: top 77%)
Annotation
Heliconius: HMEL015873; 3e-79; 44.92%
Bombyx: BGIBMGA012831-TA; 4e-83; 50.00%
Drosophila: CG12766-PA; 6e-55; 37.22%
EBI UniRef50: UniRef50_G6D320; 0.0; 100.00%; Aldo-keto reductase n=5 Tax=Obtectomera RepID=G6D320_DANPL
NCBI RefSeq: XP_974493.1; 8e-65; 41.85%; PREDICTED: similar to aldo-keto reductase [Tribolium castaneum]
NCBI nr blastp: gi|364023565; 5e-90; 71.43%; seminal fluid protein CSSFP008 [Chilo suppressalis]
NCBI nr blastx: gi|364023565; 3e-88; 71.43%; seminal fluid protein CSSFP008 [Chilo suppressalis]
Group
Gene Ontology: GO:0055114; 1.2e-43; oxidation-reduction process
    GO:0016491; 1.2e-43; oxidoreductase activity
KEGG pathway: aag:AaeL_AAEL004088; 2e-61
    K00011 (E1.1.1.21, AKR1) maps-> Galactose metabolism
        Glycerolipid metabolism
        Pentose and glucuronate interconversions
        Fructose and mannose metabolism
        Pyruvate metabolism
InterPro domain: [19-331] IPR001395; 3.6e-115; Aldo/keto reductase
    [23-326] IPR023210; 1.8e-91; NADP-dependent oxidoreductase domain
    [63-87] IPR020471; 1.2e-43; Aldo/keto reductase subgroup
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203596-TA
ATGGACGCGCCTCTTATTATTGCTATAATTCTATTTTTTTCTGGTTTGGCGAACGCGTCTCTGTCATATAATCTTAACGATGGTAATGTGATACCAGCAATAGCACTTGGTACAAGTTTGGGACACTTGGCTGATGGTACCAGAGTGTTGTCAGTGAACCACTCGCTTGCTCGAGCCGTGCAAGGAGCACTGACGGCGGGTTACAGACACATTGACACGGCCTCATTGTACCGAGTTGAGGACGAAGTTGGGCTTGGAATACGTTGGTATTTAAATGACACCACTAAAAGACAGAATATTTACGTCACCACCAAGTTATGGAACGATGCCCACGCTCGGGACGAGGTGGTACCAGCCATCAGACGATCTCTACAGGATTTGCAGCTGGAATATGTCGACCTGTATCTAATGCATTTCCCTATGGCGTACACGAAAGACGGAAAGATAAGCGACACCGACTACTTGGAAACGTGGAAAGGATTAGAGGACGCCAAAAAATTAAATCTAACCCGGTCAATTGGCGTGTCCAACTTCAACTTAACACAGATGAAACGATTGTGGAATGACTCAGAAATCAAGCCAGCTGTGCTACAAATTGAAGTCAATCCAACAATAACCCAAGATGAAATAATAGACTGGTGTGATGAACACGCTGTCATCGTTATGGCATACAGTCCTTTCGGCGCCATTTTGGGTCGCAAGAAAAACTCTCCATTACGTGCAGATGACCCTTTATTAATAAGCTTAGCCCAAAAATACAACAAAACTGTTCCACAAATCTTATTACGATATTTGTTAGATAGACATCTAGTAGTCATCCCTCGATCAACAAACTACAGCCGAATCAAAGAGAACTTTAATATAACAGACTTCTCACTTGCGCCAGAAGAGGTGAAACTATTGTCGAGTTTCAATAGAGAGTACAGGTTAAGAACGCAGGTCAAATGGTATCCCCACCCGCACTTCCCCTTCCAGAAGAAAAATCTCACGGAATCTGAAATACAGTACATAGTTGAACACAGTAAAGAAGATTAG

Protein sequence:

>DPOGS203596-PA
MDAPLIIAIILFFSGLANASLSYNLNDGNVIPAIALGTSLGHLADGTRVLSVNHSLARAVQGALTAGYRHIDTASLYRVEDEVGLGIRWYLNDTTKRQNIYVTTKLWNDAHARDEVVPAIRRSLQDLQLEYVDLYLMHFPMAYTKDGKISDTDYLETWKGLEDAKKLNLTRSIGVSNFNLTQMKRLWNDSEIKPAVLQIEVNPTITQDEIIDWCDEHAVIVMAYSPFGAILGRKKNSPLRADDPLLISLAQKYNKTVPQILLRYLLDRHLVVIPRSTNYSRIKENFNITDFSLAPEEVKLLSSFNREYRLRTQVKWYPHPHFPFQKKNLTESEIQYIVEHSKED-
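
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the standard genetic code and a coding sequence that starts in frame at the first base. The helper names `translate` and `lengths_consistent` are hypothetical. The stop codon is printed as `*` here, whereas the record marks it with a trailing `-`.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position. Assumed, not stated in the record.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def lengths_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """True if the transcript is exactly the protein's codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First and last codons copied from the DPOGS203596-TA sequence above.
print(translate("ATGGACGCGCCTCTTATT"))  # MDAPLI
print(translate("AAAGAAGATTAG"))        # KED*
print(lengths_consistent(1035, 344))    # True
```

The first six residues and the final `KED` agree with DPOGS203596-PA, and 1035 bp corresponds to 344 codons plus one stop codon.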