Monarch geneset OGS2.0

DPOGS212711
TranscriptDPOGS212711-TA918 bp
ProteinDPOGS212711-PA305 aa
Genomic positionDPSCF300012 - 648345-649933
RNAseq coverage447x (Rank: top 27%)
Annotation
HeliconiusHMEL0155445e-9066.67% 
BombyxBGIBMGA013254-TA5e-9569.51% 
DrosophilaCG10638-PA5e-7348.12% 
EBI UniRef50UniRef50_G9F9F43e-12569.18%Seminal fluid protein CSSFP004 isoform 1 n=2 Tax=Obtectomera RepID=G9F9F4_9NEOP
NCBI RefSeqXP_974493.13e-8047.88%PREDICTED: similar to aldo-keto reductase [Tribolium castaneum]
NCBI nr blastpgi|3640235559e-12569.18%seminal fluid protein CSSFP004 isoform 1 [Chilo suppressalis]
NCBI nr blastxgi|3640235552e-12269.18%seminal fluid protein CSSFP004 isoform 1 [Chilo suppressalis]
Group
Gene OntologyGO:00551142.4e-47oxidation-reduction process
GO:00164912.4e-47oxidoreductase activity
KEGG pathwayaag:AaeL_AAEL0040882e-76 
 K00011 (E1.1.1.21, AKR1)maps-> Galactose metabolism
    Glycerolipid metabolism
    Pentose and glucuronate interconversions
    Fructose and mannose metabolism
    Pyruvate metabolism
InterPro domain[2-305] IPR0013951.1e-130Aldo/keto reductase
[8-304] IPR0232102.8e-105NADP-dependent oxidoreductase domain
[40-64] IPR0204712.4e-47Aldo/keto reductase subgroup
Orthology groupMCL26050 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212711-TA
ATGTCTCCTAAAGTTATGAATGCAAGGCTTAACAATGGGAAAGAGATTCCGATGGTGGGTCTAGGTACATATACGCGTCAATTTGATCCGGAGTTAGTTAAACAAGCTGTGGAATGGGGAATAGATTTTGGTTATAGACACATAGATACTGCATCATTTTATAAGAATGAAGAGCTGCTAGGGGAGGTAATAGCCAATAAAATTAAACAAGGCTGTGTTAAAAGGGAAGATTTATTTGTAACAACAAAGCTATGGAGCGACAGTCACTCTGAAGAGGATGTGATACCCGCATTGAAGGAATCATTGAGAAAATTAAAACTTGGTTATATCGATTTATACTTAATACATTGGCCTGTATCTATAAGTGAAAATGGAGAGGATGTAGCCATAGATTATCTTAATACCTGGAAAAGTATGGAGCAGGCTGTAAATCTTGGTCTTGCCAAGTCAATAGGGGTGTCAAATTTTAATGAAGAACAACTGGAAAGGTTGTACAACCATGCTAATATAAAACCGACAGTTAATCAAGTTGAGATAAGCCCAACATTGACCCAACATAAGTTAGTAGATTTCTGCAAGAAACTATCTGTGATACCGATTGCGTACACACCTTTGGGACTCTTGTCCGGGGCGAGGCCGGAATTTATTGGCAAGGATGTCATCAAAACGGATCCTAAATTAGAAAAGATAGCAGAAAAGTATGGAAAAACTAAAGCTCAAGTGGTTTTAAGATATTTGATCCAGCGCGGTATCCCGGTGATACCGAAATCCTTTACCAAATCCAGGATAGAGGAGAACTTGAACATTTTTGATTTCGAGCTGACCAATGATGAAATGTCAACCATAGACGGCTACAATCTAGATCATCGTTGTGTGCCTTCATTGCGATTCAAATCCTGTACTTATTATCCATTCTGA

Protein sequence:

>DPOGS212711-PA
MSPKVMNARLNNGKEIPMVGLGTYTRQFDPELVKQAVEWGIDFGYRHIDTASFYKNEELLGEVIANKIKQGCVKREDLFVTTKLWSDSHSEEDVIPALKESLRKLKLGYIDLYLIHWPVSISENGEDVAIDYLNTWKSMEQAVNLGLAKSIGVSNFNEEQLERLYNHANIKPTVNQVEISPTLTQHKLVDFCKKLSVIPIAYTPLGLLSGARPEFIGKDVIKTDPKLEKIAEKYGKTKAQVVLRYLIQRGIPVIPKSFTKSRIEENLNIFDFELTNDEMSTIDGYNLDHRCVPSLRFKSCTYYPF-