Monarch geneset OGS2.0

DPOGS201372
TranscriptDPOGS201372-TA1116 bp
ProteinDPOGS201372-PA371 aa
Genomic positionDPSCF300083 - 93799-95935
RNAseq coverage9x (Rank: top 85%)
Annotation
HeliconiusHMEL0025637e-7245.30% 
BombyxBGIBMGA000695-TA1e-6645.14% 
DrosophilaCG6083-PA2e-4136.76% 
EBI UniRef50UniRef50_G9F9F69e-8752.33%Seminal fluid protein CSSFP005 n=3 Tax=Obtectomera RepID=G9F9F6_9NEOP
NCBI RefSeqXP_002426538.12e-6544.72%aldo-keto reductase, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3640235593e-8652.33%seminal fluid protein CSSFP005 [Chilo suppressalis]
NCBI nr blastxgi|3640235592e-8452.33%seminal fluid protein CSSFP005 [Chilo suppressalis]
Group
Gene OntologyGO:00551143e-45oxidation-reduction process
GO:00164913e-45oxidoreductase activity
KEGG pathwaylsa:LSA02381e-48 
 K00100 (E1.1.1.-)maps-> Linoleic acid metabolism
    Bisphenol A degradation
    Fructose and mannose metabolism
    Butanoate metabolism
    Tetrachloroethene degradation
InterPro domain[82-370] IPR0013951.5e-102Aldo/keto reductase
[85-362] IPR0232107.7e-93NADP-dependent oxidoreductase domain
[117-141] IPR0204713e-45Aldo/keto reductase subgroup
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201372-TA
ATGGATCAGATCCGGTACTTAGCACAAATACCAAAGTGCATAAATGCTCTATGCAACGTGGGTGGTTTAGAGGTGAGAGTTGCTGCCACCCAAACAGTCTCCCTACATCCACATTTTGCCTCCACCGATTCAATTTGGTATCAAAATTGGAGCAACTTGTCAGTTGTCTCGTTGCACCAGCCAAGTGTAGATTCTGGTGTTCTCAGCGAAGAAATATTTAAGATGTCGGAGGATCACAACTATCGTGATAAATCATTTACGTTAAGAGATGGAATTAAGGTGCCTGCGATTGGATTGGGCACCTATCGTGTCAACAACTACGATATCGTGTATGAGACTATGAACAATGCACTGGCTCGCGGCTACAGGCTTTTTGATACTGCCGCTATTTATGGAAACGAAGAGCATTTAGGAAGAGCCTTACGGGAGCTCCTTCCGCAACATAATCTTCAAAGAGAAGATGTTTTTATAACCACCAAGTTGGCGCCATCTGTTAATAACAAGTCCGTTAGGCGGGCTTTTAATCGTTCGCTTACCAATCTGGGATTAGATTACCTGGATATGTACTTAATAAATTTCCCGGGATATGCGAAACAAAATCCAAAAGGCGCTTTAAATAAAAAGAGGCGCGAGGAGACTTGGCTTGCGATTGTTAAGTTATACGATGAAGGTAAGGTCAATGCAGTTGGTGTATCCAACTTCACTCTGAAGCACCTCCGTGAATTGACCGATGTCATGGCTATTGGACCTATGGTCAACCAGATCGAATATCACCCGTACTACGTGGATACCGAGTTGATGCAATATTGCCTTCAAAACAACATTCTAGTTCAAGCTTATAACTCGTTTGGCGGACTGTCGTTGAGGAACAATGATCTAATGGAGGATCCGGTCGTCAAAAAGATTGCTAACAAACACGAAGTTACTAATAGCCAAGTTCTACTGGCCTGGGCCCTGCAGCGTGGAGTTGCTGTGATTCCCAAATCTGTCACTCCAGAACATATGGAAGAGAACATTATGATCAACTTAAAACTAACAGAACGCCAGATGCTATCCCTCGATGCTCTGGCTGTCAAAAATAAGAAGTACTCGTGGGATCCCATTCATATTGCATGA

Protein sequence:

>DPOGS201372-PA
MDQIRYLAQIPKCINALCNVGGLEVRVAATQTVSLHPHFASTDSIWYQNWSNLSVVSLHQPSVDSGVLSEEIFKMSEDHNYRDKSFTLRDGIKVPAIGLGTYRVNNYDIVYETMNNALARGYRLFDTAAIYGNEEHLGRALRELLPQHNLQREDVFITTKLAPSVNNKSVRRAFNRSLTNLGLDYLDMYLINFPGYAKQNPKGALNKKRREETWLAIVKLYDEGKVNAVGVSNFTLKHLRELTDVMAIGPMVNQIEYHPYYVDTELMQYCLQNNILVQAYNSFGGLSLRNNDLMEDPVVKKIANKHEVTNSQVLLAWALQRGVAVIPKSVTPEHMEENIMINLKLTERQMLSLDALAVKNKKYSWDPIHIA-