Monarch geneset OGS2.0

DPOGS206943
Transcript: DPOGS206943-TA (942 bp)
Protein: DPOGS206943-PA (313 aa)
Genomic position: DPSCF300001 - 529387-545646
RNAseq coverage: 522x (Rank: top 24%)
Annotation
Source            Hit                 E-value   Identity   Description
Heliconius        HMEL011074          1e-130    68.47%
Bombyx            BGIBMGA012888-TA    3e-105    71.26%
Drosophila        CG1444-PA           2e-73     44.15%
EBI UniRef50      UniRef50_Q7PHB8     2e-82     50.68%     AGAP004532-PA n=2 Tax=Anopheles gambiae RepID=Q7PHB8_ANOGA
NCBI RefSeq       XP_971720.1         2e-86     50.98%     PREDICTED: similar to steroid dehydrogenase [Tribolium castaneum]
NCBI nr blastp    gi|91094297         3e-85     50.98%     PREDICTED: similar to steroid dehydrogenase [Tribolium castaneum]
NCBI nr blastx    gi|91094297         2e-82     50.98%     PREDICTED: similar to steroid dehydrogenase [Tribolium castaneum]
Group
Gene Ontology
  GO:0005488    6.2e-51    binding
  GO:0008152    1.9e-23    metabolic process
  GO:0016491    1.9e-23    oxidoreductase activity
KEGG pathway
  ecb:100050075    8e-73
  K00044 (E1.1.1.62, HSD17B) maps-> Steroid hormone biosynthesis
  K10251 (KAR) maps-> Biosynthesis of unsaturated fatty acids
InterPro domain
  [46-257]    IPR016040    6.2e-51    NAD(P)-binding domain
  [49-218]    IPR002198    1.9e-23    Short-chain dehydrogenase/reductase SDR
  [50-67]     IPR002347    2.1e-18    Glucose/ribitol dehydrogenase
Orthology group
  MCL14150    Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206943-TA
ATGGCTCAATTGGGGTATTTAGAAAAATTCGGTATTGTTTTCCTAATTGGTTTGCTCTATTATGTAGTTAAGAGTGTGGTCAATGTGTTTTACACTTATATCATTGGACCGGTGGTGAATAAGGTAGACTTTAAGTCTAAAGGAAAGTGGGCATTAGTCACTGGCAGCACCGATGGAATTGGCAAAGCCTATGCCAGGGAGCTCGCATCCCGTGGCTGTGATATAGTTCTAGTCAGCCGTTCTTATGATAAGCTCATGGAGACGGCAAATGAAATTGAAAAAGACTTCAAAGTGGAAACAAGGATCGTTGTTGCTGACTTCAGCGATGCTGATATATATGAAATGATCTCAAAGGAGGTTGCTGACCTTGAGATTGGTACTCTGGTCAACAATGTGGGTGTCTCCTATAAATATCCGGAGTATTTCCTGGAAATCGCTGATTGGGAGAAGACTATCAGTACAATGATAAAAGCTAACGTTGTCTCTGTAACTCGTATGACTGGTATCGTGATGCCTGGTATGGTAGAACGTGGTAAGGGCGTGGTCATCAATATTGGATCCGGATCTTCCATAATACCCAGCCCCCTTCTCACGGTATACGCGTCTACTAAGGCCTATGTGGAAAAATTCTCTGAAGGTCTTGAAATGGAGTATAGCAAGAGAGGCATTATCGTGCAATGTGTTCTACCCGGTTTAGTTTGCTCCAATATGTCTGGAATACGCCGAAGCACTTTGATAGCACCAACAGCGAAGACATTTGTCAAGTCTGCCATCAGCTTAGTCGGAACTACCTCCAAAACGACAGGATATTTCCCTCACACTCTCTTCTTCTGTGTGGTTAATTCTATCCACAGTGTCGCATCGCGTTTCAGTGTGTGGTTGGTGACCCGCAGCATGGAAAATACCCGCAGGAAGGCGCTGAAAAAGTACAAAAAAGAATAA

Protein sequence:

>DPOGS206943-PA
MAQLGYLEKFGIVFLIGLLYYVVKSVVNVFYTYIIGPVVNKVDFKSKGKWALVTGSTDGIGKAYARELASRGCDIVLVSRSYDKLMETANEIEKDFKVETRIVVADFSDADIYEMISKEVADLEIGTLVNNVGVSYKYPEYFLEIADWEKTISTMIKANVVSVTRMTGIVMPGMVERGKGVVINIGSGSSIIPSPLLTVYASTKAYVEKFSEGLEMEYSKRGIIVQCVLPGLVCSNMSGIRRSTLIAPTAKTFVKSAISLVGTTSKTTGYFPHTLFFCVVNSIHSVASRFSVWLVTRSMENTRRKALKKYKKE-
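
The transcript and protein lengths in this record fit together: 942 bp is 314 codons, that is 313 residues plus one stop codon. The sketch below checks that arithmetic and translates the first and last codons of DPOGS206943-TA with the standard genetic code. The `translate` helper and the variable names are illustrative and not part of the record. Only the two short fragments copied from the sequence above are translated, and reading the trailing `-` in the protein sequence as the stop codon is an assumption.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Lengths as listed in the record.
TRANSCRIPT_BP = 942
PROTEIN_AA = 313

codons = TRANSCRIPT_BP // 3           # number of codons in the transcript
residues_without_stop = codons - 1    # the final codon is the stop

# First 18 and last 27 nucleotides of DPOGS206943-TA, copied from the record.
cds_head = "ATGGCTCAATTGGGGTAT"
cds_tail = "GCGCTGAAAAAGTACAAAAAAGAATAA"

print(codons, residues_without_stop == PROTEIN_AA)
print(translate(cds_head))   # compare with the start of DPOGS206943-PA
print(translate(cds_tail))   # compare with the end of DPOGS206943-PA
```

The same `translate` function can be applied to the full nucleotide sequence to compare every residue against DPOGS206943-PA.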