Monarch geneset OGS2.0

DPOGS211662
Transcript          DPOGS211662-TA    1434 bp
Protein             DPOGS211662-PA    477 aa
Genomic position    DPSCF300151 - 302506-310554
RNAseq coverage     416x (Rank: top 29%)
Annotation
Source            Hit                 E-value   Identity   Description
Heliconius        HMEL015887          3e-110    67.36%
Bombyx            BGIBMGA001351-TA    4e-108    60.00%
Drosophila        CG10638-PA          6e-62     43.89%
EBI UniRef50      UniRef50_D2SNM4     2e-97     55.52%     Aldo-keto reductase n=7 Tax=Obtectomera RepID=D2SNM4_HELVI
NCBI RefSeq       XP_974493.1         5e-71     44.84%     PREDICTED: similar to aldo-keto reductase [Tribolium castaneum]
NCBI nr blastp    gi|364023563        3e-105    58.15%     seminal fluid protein CSSFP007 [Chilo suppressalis]
NCBI nr blastx    gi|364023563        6e-103    59.28%     seminal fluid protein CSSFP007 [Chilo suppressalis]
Group
Gene Ontology      GO:0055114    1.6e-44    oxidation-reduction process
                   GO:0016491    1.6e-44    oxidoreductase activity
KEGG pathway       tca:657943    7e-66
                   K00011 (E1.1.1.21, AKR1) maps->
                       Galactose metabolism
                       Glycerolipid metabolism
                       Pentose and glucuronate interconversions
                       Fructose and mannose metabolism
                       Pyruvate metabolism
InterPro domain    [176-477]    IPR001395    1.4e-126    Aldo/keto reductase
                   [177-476]    IPR023210    9.2e-100    NADP-dependent oxidoreductase domain
                   [215-239]    IPR020471    1.6e-44     Aldo/keto reductase subgroup
Orthology group    MCL26752    Lepidoptera specific

Nucleotide sequence:

>DPOGS211662-TA
ATGCCGAGTTTCAAACTTAACGATGGCTACGATATACCCGCTATAGCACTTGGCACATCGCCGGGTTACAACAGTAAGCAAAAACCGAACGAGATGGAAGTAGCAGTGAAGTGGGCCTTGGACGCGGGATACCGACATTTTGATACAGCAGCTATTTACAAGGTAGAAGATCAGGTCGGCCGTGCTCTGAAGAAATATCCGAGAGAAGAAGTCTTCGTTACAACTAAGCTGTGGAACACGAAGCATGCTATAACCGACGTGGTACCCGCCCTTCAAGAGTCGCTGAAGAACCTTCAGCTCAAGTACGTGGACCTGTACCTCATACACTGGCCCGTATCACAGCATGAGAATCTGACAGCATCAGACGTTGACTATCTGGACTCCTGGCGAGGAATGATGAATGCTAAGAAATTGGGATTAACGAGATCCATCGGCGTCTCTAACTTTAACAAAGAACAGATTCAGAGGATTATCGACAGCGGGCTGGAGGTGCCAGCTGTTAACCAGGTCGAGGATGCGGAGGTGCAGATGCCGAGGTTCAAACTTAACGATGGGTACGATATACCCGCTATAGCACTTGGAACATGGCTGGGTCACAACAGTAAGCCAAAACCGAACGAGGTCGAATTAGCAGTGAAGTGGGCCTTGGACGCCGGATACCGACATATTGATACAGCAGCTATATACAAGGTAGAAGATCAGGTCGGCCGTGCTCTGAAGAAATATCCCAGAGAAGAAGTCTTCGTAACATCCAAGCTGTGGAACACGAAGCATGCTATAAACGACGTGGTACCCGCCCTTCAAGAGTCGCTGAAGAACCTTCAGCTCAAATACGTGAACCTGTACCTCATACACTGGCCCGTAGCACTGCATGAGAATCTGACAGCATCAGACGTTGACTATCTGGACTCCTGGCGAGGAATGATGAATGCTAAGAAATTGGGATTAACGAGATCCATCGGCGTCTCTAACTTTAACAAAGAACAGATTCAGAGGATTATCGACAGCGGGCTGGAGGTGCCAGCTGTTAACCAGGTCGAGATTAACCTGAACCTGCAGCAACCAGATCTACTGGAATATGCCAAGTCAAAGAACATCACAATTTGTGGCTATACGCCATTCGGTTCGTTGTTCTACAGTAAGGGCAGCGAGGACGCGCCGCCCCCGAGGCTCAATGACCCCGTCCTCACCAAAATGGCTGAGAAATATAATAAGACTGTCCCTCAAATTGCTCTCAGATATTTGCACGAACTGGGCGTCATTCCTCTACCAAAGTCCGTCACGAGGAACCGCATCGAGCAAAATATAGACATCTTCAACTTCTCCCTGACGGATGAAGAGAAAAACCTTCTAAAGGGATTTGATAGAAACTACAGAACCCTCCCGCAGTACAAGTGGAAGGACTTTCCCTATTATCCCTTCGAAAAAAACTAG

Protein sequence:

>DPOGS211662-PA
MPSFKLNDGYDIPAIALGTSPGYNSKQKPNEMEVAVKWALDAGYRHFDTAAIYKVEDQVGRALKKYPREEVFVTTKLWNTKHAITDVVPALQESLKNLQLKYVDLYLIHWPVSQHENLTASDVDYLDSWRGMMNAKKLGLTRSIGVSNFNKEQIQRIIDSGLEVPAVNQVEDAEVQMPRFKLNDGYDIPAIALGTWLGHNSKPKPNEVELAVKWALDAGYRHIDTAAIYKVEDQVGRALKKYPREEVFVTSKLWNTKHAINDVVPALQESLKNLQLKYVNLYLIHWPVALHENLTASDVDYLDSWRGMMNAKKLGLTRSIGVSNFNKEQIQRIIDSGLEVPAVNQVEINLNLQQPDLLEYAKSKNITICGYTPFGSLFYSKGSEDAPPPRLNDPVLTKMAEKYNKTVPQIALRYLHELGVIPLPKSVTRNRIEQNIDIFNFSLTDEEKNLLKGFDRNYRTLPQYKWKDFPYYPFEKN-
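
The two sequences above can be checked against each other. The following is a minimal Python sketch, not part of the record itself: it assumes the standard genetic code applies, that the transcript is a complete coding sequence in frame from its first base, and that the trailing "-" in the protein sequence marks the stop codon. The names `translate` and `lengths_consistent` are hypothetical helpers introduced only for this illustration.

```python
# Standard genetic code, built in TCAG codon order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon, starting at the first base.

    Stop codons are written as `stop_symbol` (assumed here to be "-",
    matching the last character of the protein sequence above).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript is exactly protein_aa codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First nine codons and last five codons of DPOGS211662-TA.
print(translate("ATGCCGAGTTTCAAACTTAACGATGGC"))
print(translate("TTCGAAAAAAACTAG"))
print(lengths_consistent(1434, 477))
```

Passing the full DPOGS211662-TA string to `translate` and comparing the result with DPOGS211662-PA extends the same check to the whole record; the length test uses the 1434 bp and 477 aa figures listed at the top of the entry.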