Monarch geneset OGS2.0

DPOGS212758
Transcript        DPOGS212758-TA   1416 bp
Protein           DPOGS212758-PA   471 aa
Genomic position  DPSCF300012 + 750500-752739
RNAseq coverage   32x (Rank: top 75%)
Annotation
Heliconius       HMEL007059         1e-130   52.33%
Bombyx           BGIBMGA013160-TA   8e-75    57.54%
Drosophila       CG3699-PA          1e-45    41.60%
EBI UniRef50     UniRef50_D2SNW1    1e-61    51.39%   Hydroxybutyrate dehydrogenase n=7 Tax=Obtectomera RepID=D2SNW1_HELVI
NCBI RefSeq      XP_001652982.1     3e-49    45.45%   short chain type dehydrogenase [Aedes aegypti]
NCBI nr blastp   gi|379699046       3e-70    57.14%   3-dehydroecdysone 3alpha-reductase [Bombyx mori]
NCBI nr blastx   gi|379699046       5e-68    57.14%   3-dehydroecdysone 3alpha-reductase [Bombyx mori]
Group
Gene Ontology    GO:0005488   2.5e-75   binding
                 GO:0008152   1.9e-26   metabolic process
                 GO:0016491   1.9e-26   oxidoreductase activity
KEGG pathway     npu:Npun_F2211   1e-33
                 K00034 (gdh) maps-> Pentose phosphate pathway
InterPro domain  [223-466]   IPR016040   2.5e-75   NAD(P)-binding domain
                 [229-246]   IPR002347   3e-38     Glucose/ribitol dehydrogenase
                 [6-169]     IPR002198   1.9e-26   Short-chain dehydrogenase/reductase SDR
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212758-TA
ATGAGTTTCAACAATAAAGTAGTGTTGGTGACCGGAGCCAGCTCTGGTATCGGAGCTGCTGCGGCGGAGGCGTTCAGTTCAGAGGGAGCTCAGGTTGTGATGGTTGGACGCAACGAGACCAAGCTGTCGGCCGTGGCTGCGAAATGCAACAAACCTTTCGTCATACGAGCTGATGTCGCTAACGATGATGACGCGAGAAGAATTATAGATGAAACTATTGAACGTTTCGGCCGGATAGACATCTTAATAAATAACGCTGGTGTATCAGGAGTGGCTAAATTGATATCCGGTAAAATTATTGATATCTATGATAAAATTATGCCCATTAACCTTCGCGCAGTTGTGTTATTGACGTCGCTGGCAACACCGCATCTAATAAAGACGAAGGGCAATATAGTTAATATATCCAGCGTAGCAGCGAGACTCATGGATTCAAGGAGCGGTACGGCTATGTACAATGTTTCGAAGGCAGCTTTAAATCACTTCAGTTTATGCGCTGCTGAGGAGCTTTCTCGGTATGGTGTCAGAGTCAACACTATCAGCCCAGGTCCAGTAGTAACCGATATCTTGGTAAACACAAATGCAAAGGCGACGTGGGATGATTTTAAAAAAATCACTTGCCTGGACAGGGTTTCTGATCCCGTGGAAATAGTAGATTTAATAATCATGAGTTTCAAGAATAAAGTAGTGTTGGTGACCGGAGCCAGCTCTGGTATCGGAGCTGCTGCGGCGGAGGCGTTCAGTTCAGAGGGAGCTCAGGTTGTGATGGTTGGACGCAACGAGACCAAGCTGTCGGCCGTGGCTGCGAAATGCAACAAAGCGTTCTTCATACGAGCTGATGTCGCTAACGATGATGACGCACGATTTATCTTGGATGAGACGATTAATCGTTTTGGGAAATTGGACATTTTAATCAATAACGCCGGAGTTATAATCCCAGAAAGCATTCTTAACGTGAATGTATTAAAATCCTACGACACAACGATGGGAATAAATTTGCGAGCTATCGTACATCTAACGTCTTTGGCTGCGTCACATTTGATTGAAACGAAAGGTAGCATCATAAACATATCCAGTGTTGGCGGGCAGATGGTGCCGCGACCTGGTTCAGGGTTCTCCATGTACTATGTATCGAAAGCCGCTTTGAATCATTTCGGTCTTTGTATAGCAGCTGAATTAGCGCCTTATGGTGTCAGAGTGAACACGATCAGTCCCGGCCCAGTGAGGACCAACATTTTTAACGATCTCAACATCGACGTAGACAACTTCTGTAACAGCACGGCGCTACAAAGAGTGTCCGAACCAAAGGAGGTAGCGGACTTGATTTTATTCCTTTCAAGTGATAAGGCTAAAGCAATAACCGGCTCGAACTACGTCATCGACAACGGTATTTTTATTTGCCGTGCCATGAAGTAA

Protein sequence:

>DPOGS212758-PA
MSFNNKVVLVTGASSGIGAAAAEAFSSEGAQVVMVGRNETKLSAVAAKCNKPFVIRADVANDDDARRIIDETIERFGRIDILINNAGVSGVAKLISGKIIDIYDKIMPINLRAVVLLTSLATPHLIKTKGNIVNISSVAARLMDSRSGTAMYNVSKAALNHFSLCAAEELSRYGVRVNTISPGPVVTDILVNTNAKATWDDFKKITCLDRVSDPVEIVDLIIMSFKNKVVLVTGASSGIGAAAAEAFSSEGAQVVMVGRNETKLSAVAAKCNKAFFIRADVANDDDARFILDETINRFGKLDILINNAGVIIPESILNVNVLKSYDTTMGINLRAIVHLTSLAASHLIETKGSIINISSVGGQMVPRPGSGFSMYYVSKAALNHFGLCIAAELAPYGVRVNTISPGPVRTNIFNDLNIDVDNFCNSTALQRVSEPKEVADLILFLSSDKAKAITGSNYVIDNGIFICRAMK-