Monarch geneset OGS2.0

DPOGS204194
TranscriptDPOGS204194-TA1419 bp
ProteinDPOGS204194-PA472 aa
Genomic positionDPSCF300034 + 516959-524986
RNAseq coverage963x (Rank: top 13%)
Annotation
HeliconiusHMEL0099093e-7373.86% 
BombyxBGIBMGA005155-TA2e-6667.43% 
DrosophilaCG2767-PB7e-3744.02% 
EBI UniRef50UniRef50_D2SNR92e-5272.22%Aldo-keto reductase (Fragment) n=1 Tax=Heliothis virescens RepID=D2SNR9_HELVI
NCBI RefSeqXP_394676.23e-4147.34%PREDICTED: similar to CG2767-PA [Apis mellifera]
NCBI nr blastpgi|2609079187e-5272.22%aldo-keto reductase [Heliothis virescens]
NCBI nr blastxgi|2609079184e-5072.22%aldo-keto reductase [Heliothis virescens]
Group
Gene OntologyGO:00551142.6e-14oxidation-reduction process
GO:00164912.6e-14oxidoreductase activity
KEGG pathwaydme:Dmel_CG27675e-35 
 K00002 (E1.1.1.2, adh)maps-> Glycolysis / Gluconeogenesis
    Glycerolipid metabolism
    Caprolactam degradation
InterPro domain[44-472] IPR0013953.5e-70Aldo/keto reductase
[44-455] IPR0232108.3e-61NADP-dependent oxidoreductase domain
[59-83] IPR0204712.6e-14Aldo/keto reductase subgroup
Orthology groupMCL30310 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204194-TA
ATGTTTATCAGTCACCGCAATAAATGGGTATCATGCACCAATGAACTCGTTGTTAACCACTGTTGTGACGTCGATTCGATGATACAAAACACAAACAAATGGGATACAGTGAACAAAAAAGATGTTATAACTTTAAGGGCTCCAGCAGATATTCTCGTAAAATCAGTCGAGACGGCCTTAGAGGCTGGTTACAGACATTTTGATTCCGCTCGTGCTTATGAAGACGAGGCTGCTTTGGGTAAAGCCCTTAATCAATGGATAGGAGGGGACGTAAACAAAAGGAAGGAATTATTTGTTGCAACAAAACTACCGCCTGGTGGTAACCGTCCAGATCTCGTCGAGCAGTATTTCGAAGACTCCCGTCAAGATTTAGGATTGGATTACATAGACCTTTATCTATTACACACGCCATTCGGATTTAAACATGTCCCTGGTGATTTGCATCCGAAAAACCCTGACGGCAGTATGATGGTCGACCATACAACAGATTTAATAGCTGTCTGGAAGAACACTACCCAGACATATCTCAGTAGACCGTTAGAAGAGAGGTCTTCGATTCGTTTACAACACTCCATATGTTCAAAACTTGGACCTAAACTTCTATGTCCATGTTGTTGTGGAGGTGATGATAAGATGGAAAGCTGCGATATGCCATATAATCGACAGACTACCTCGGATTCTTATGAAACTTATTCTACACTAAAAACAACACCCACTCCATCTTTACCAATTGATCCTAAACAAAAGTGTGAATCTGACATCGTCGTAAGTGAAAACCCGCTGATAACACCAAATGAAACTATTGAAAAGCTTTTAGAAGAAACTATTAATAAACACAAACAAACACTTTGTGAAACCAAACCACCGGCTTCTTTAAAAAAGGCAGTACTAAAACTAAAAGAGTCTGGAAGAGTCCGCCATGTTGGTGTATCCAACGTAAATGAGGAACAACTAATAAGATTGACAGCTGTAGAAAAGCCGGCATGTTTACAAGTCGAGGTTCACGTTTTATTCCAACAAAAGTCGTTGATAGCAGCCGCAAACCGACTCGGAATACCTGTAGTGGCATATTCTCCTTTAGGATCGAGGGCTTTAACTAATATGCTAGCTGCGAAAACAGGACGTAATTACCCGCATCTATTAGAATTACCCCTGGTATTAGAGCTTTCAAAGAAATATTCACGGACGCCAGCACAAATATTACTAAGATTCTTACTTCAACATGGCGTTGGCGCCATACCTAAAAGTACAGATCCAACAAGAATAAAACAGAACATATCTCTTTGGGATTTCGAATTGAATGACACGGAAATGCAAGATTTGCATAACTTAGATCGTGGAGAAGAAGGACGTATATGTGACTTTGGATTCTTCATCGGCGTCCAAACACACCCTGAATTTCCTTTTAAAAAGAATTAA

Protein sequence:

>DPOGS204194-PA
MFISHRNKWVSCTNELVVNHCCDVDSMIQNTNKWDTVNKKDVITLRAPADILVKSVETALEAGYRHFDSARAYEDEAALGKALNQWIGGDVNKRKELFVATKLPPGGNRPDLVEQYFEDSRQDLGLDYIDLYLLHTPFGFKHVPGDLHPKNPDGSMMVDHTTDLIAVWKNTTQTYLSRPLEERSSIRLQHSICSKLGPKLLCPCCCGGDDKMESCDMPYNRQTTSDSYETYSTLKTTPTPSLPIDPKQKCESDIVVSENPLITPNETIEKLLEETINKHKQTLCETKPPASLKKAVLKLKESGRVRHVGVSNVNEEQLIRLTAVEKPACLQVEVHVLFQQKSLIAAANRLGIPVVAYSPLGSRALTNMLAAKTGRNYPHLLELPLVLELSKKYSRTPAQILLRFLLQHGVGAIPKSTDPTRIKQNISLWDFELNDTEMQDLHNLDRGEEGRICDFGFFIGVQTHPEFPFKKN-