Monarch geneset OGS2.0

DPOGS207021
TranscriptDPOGS207021-TA1905 bp
ProteinDPOGS207021-PA634 aa
Genomic positionDPSCF300001 + 1395749-1401304
RNAseq coverage74x (Rank: top 65%)
Annotation
HeliconiusHMEL0106291e-8738.70% 
BombyxBGIBMGA012831-TA2e-6546.49% 
DrosophilaCG10638-PA4e-5136.05% 
EBI UniRef50UniRef50_Q8WRT09e-6144.37%3-dehydrecdysone 3b-reductase n=3 Tax=Obtectomera RepID=Q8WRT0_TRINI
NCBI RefSeqXP_624401.21e-6040.89%PREDICTED: similar to CG10638-PA, isoform A isoform 1 [Apis mellifera]
NCBI nr blastpgi|3640235678e-6648.58%seminal fluid protein CSSFP009 [Chilo suppressalis]
NCBI nr blastxgi|3640235673e-6248.58%seminal fluid protein CSSFP009 [Chilo suppressalis]
Group
Gene OntologyGO:00551144.8e-41oxidation-reduction process
GO:00164914.8e-41oxidoreductase activity
KEGG pathwayhsa:16457e-51 
 K00089 (E1.1.1.213)maps-> Steroid hormone biosynthesis
 K00212 (E1.3.1.20)maps-> Metabolism of xenobiotics by cytochrome P450
InterPro domain[323-628] IPR0013952.5e-105Aldo/keto reductase
[326-621] IPR0232104.8e-84NADP-dependent oxidoreductase domain
[359-383] IPR0204714.8e-41Aldo/keto reductase subgroup
Orthology groupMCL34690 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207021-TA
ATGCCGCCCATCGCATTTGGTACTGCAGCTCCGATAAGCGATTTGGATGACGTTGTTTCATCTGTAATAACCGCAATAGAAACAGGATTCAGACATATTGATACAGCTCCGTTATATTTCAATGAGGCGCAAATAGGGGCAGCTATCTCGAACGTCACGAAGCGGGGTTTAGTGCTGAGGAGAGATCTATTTATTACTACAAAGCTAGACGCTTATTCAAACCGAAGTGAAATTATTCCAGCTATTAAAGGCAGCTTACAAAGGCTACAGTTAAGCTATGTAGATTTATATTTGATCCATACATCAGAAAATGTGCCCACAGGAACACCAATAGACTTTCTCGACATTTGGAAAGGCATGGAAGAAGTGAAAATGATGGGCTTGGCAAGGTCCATTGGTCTATCTAATTTTGATAGCAAAAAAATCAATACGATCCTCGCACATGGCAGAATAAGGCCGTCTGTTAATCAAATAGAGGTTAATCCTACCTTTGCAAATCTTGATTTGGTATCGTACTGTCAGAACGAAGGCATAGCTGTTATGGCTTATTCTCCATTCGGCTTGCTTGTACCGCGGCCTTATAAAAATACAACCAATGATCTCACGTTTGATGATAATACTTTTATGAAATTATCTCGAAAATACTATAAGGTGCCCAGCCAAGTCGTGTTACGTTATTTGATAGACCGAGGGACAGTTCCCATACCGAAGTCTTTTAACAAGGAGCACATCAAGTCAAATTTCAACGTTTTAAATTTTAAGCTTACCCAAAAAGAAGTGTACGAAATTAATGAATTGGATAGAGATATAAGGTTGTACAATTTTGATAATACGAGTATAGAGGACCTATATGAATATTATTTTGGAACCAGTTCCGCAGAAGTTTGGTCACGGGCCGCTAATATGAACGAGCTTCCACAACCACAAACAACTAACGACACAGGTGATACAATTGTTTTCGACATTATAAAAATGAACATTCTCCTGAATGATGGCTACACAATGCCACCAATTGCTTTCGGTACTTTCGGAAAGATTAAGGACGTGAAGACCATTACAAAAACGGTGGTTGAAGCGATAGAATCGGGATACCGACATTTCGATACAGCTCCCTTGTATTTCAATGAGGTGCAAGTAGGGGAGGGTATTGTAGACGCCATAGAGCGTGGTCTAGTAGACAGAAAAGATCTATTTATCACAACCAAGCTTGATATTTATTCAAATCGAAGTGAAGTCGTTCCTGCTCTCAAAAGAAGCCTGCAGAGGTTACAATTGGCTTACGTTGATCTTTATCTTATCCACATACCATTCGATGTGCTTACAGGCAAAGAAATAAACTGTACTGATATATGGGAAGGCATGGAGGAAGCGAAACTATTAGGACTGACAAACTCAATTGGAATTTCGAATTTTAATCATTCACAAATAGATAAAATACTTGAAGTGTGTAATATAAAACCTGCTGTTATTCAAGTGGAGGTAAGCCCTACGTTTACAAACATTGCCCTGGTGGACTACTGTCAGAGTCACCAAATACACGTGACTGCTTTTTCACCATTCGGGTTTTTAGCACCGCGACCTTTTAGAAATTACACCCCCACCACAGATTTTGCTAACACCACGTTGGTGACCATAGCTAAGAAGCACAACAAAACCCCCAGTCAAATTGTGCTACGTTATCTGATAGATCGTGGAATCACACCGATACCGGCGTCTTCTAACAAAGATTACATGCAATTAAATTTTAATGTATTAGACTTTAGTCTGACACAAAGTGAAGTAGTCAGTATTAATAATTTAAATGTAAGCGAGGCAGTTTACGATTTTGATAACTTGGATAACTTGTACCAATACTTTTTTGATACTAATATGGAAGAAGTTTTCAAAATCGTGAATGATATGTAA

Protein sequence:

>DPOGS207021-PA
MPPIAFGTAAPISDLDDVVSSVITAIETGFRHIDTAPLYFNEAQIGAAISNVTKRGLVLRRDLFITTKLDAYSNRSEIIPAIKGSLQRLQLSYVDLYLIHTSENVPTGTPIDFLDIWKGMEEVKMMGLARSIGLSNFDSKKINTILAHGRIRPSVNQIEVNPTFANLDLVSYCQNEGIAVMAYSPFGLLVPRPYKNTTNDLTFDDNTFMKLSRKYYKVPSQVVLRYLIDRGTVPIPKSFNKEHIKSNFNVLNFKLTQKEVYEINELDRDIRLYNFDNTSIEDLYEYYFGTSSAEVWSRAANMNELPQPQTTNDTGDTIVFDIIKMNILLNDGYTMPPIAFGTFGKIKDVKTITKTVVEAIESGYRHFDTAPLYFNEVQVGEGIVDAIERGLVDRKDLFITTKLDIYSNRSEVVPALKRSLQRLQLAYVDLYLIHIPFDVLTGKEINCTDIWEGMEEAKLLGLTNSIGISNFNHSQIDKILEVCNIKPAVIQVEVSPTFTNIALVDYCQSHQIHVTAFSPFGFLAPRPFRNYTPTTDFANTTLVTIAKKHNKTPSQIVLRYLIDRGITPIPASSNKDYMQLNFNVLDFSLTQSEVVSINNLNVSEAVYDFDNLDNLYQYFFDTNMEEVFKIVNDM-