Monarch geneset OGS2.0

DPOGS215254
TranscriptDPOGS215254-TA1104 bp
ProteinDPOGS215254-PA367 aa
Genomic positionDPSCF300047 - 260839-263915
RNAseq coverage1x (Rank: top 95%)
Annotation
HeliconiusHMEL0086064e-8858.89% 
BombyxBGIBMGA013160-TA1e-7856.52% 
DrosophilaCG31548-PA1e-4542.41% 
EBI UniRef50UniRef50_D2SNW13e-6653.15%Hydroxybutyrate dehydrogenase n=7 Tax=Obtectomera RepID=D2SNW1_HELVI
NCBI RefSeqXP_974115.12e-4942.91%PREDICTED: similar to 3-hydroxybutyrate dehydrogenase type 2 [Tribolium castaneum]
NCBI nr blastpgi|3796990467e-7354.55%3-dehydroecdysone 3alpha-reductase [Bombyx mori]
NCBI nr blastxgi|3796990464e-7154.37%3-dehydroecdysone 3alpha-reductase [Bombyx mori]
Group
Gene OntologyGO:00054882.6e-77binding
GO:00081522e-27metabolic process
GO:00164912e-27oxidoreductase activity
KEGG pathwaycyc:PCC7424_28482e-35 
 K00059 (fabG)maps-> Biosynthesis of unsaturated fatty acids
    Fatty acid biosynthesis
InterPro domain[119-364] IPR0160402.6e-77NAD(P)-binding domain
[125-142] IPR0023471.8e-40Glucose/ribitol dehydrogenase
[124-289] IPR0021982e-27Short-chain dehydrogenase/reductase SDR
Orthology groupMCL17140 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215254-TA
ATGAATTTCCAAGATAAAGTAGTCATTGTAACTGGAGCCAGTTCAGGAATTGGTGCAGCGATAGCAGTGGGTTTTAGCTCCGAGGGTGCACGTGTGGTGATAGTTGGTCGCAATGAGAGTAAACTAGCCTCAGTAGCAGCTCGCTGCAACGAACCACTTGTGGTGAAAGCTGACGTTGGAAACGATGAAGATGCAAAACGTATAATCGATCGAACGATTGATCATTTTGGACGAATTGACATTTTAATTAACAATGCTGGTATCGGAGTATGGGGAACGCTCGTTTCAGGCAAACTTGTTGAGTCGTATGACACTGTTATGCGGGTCAACCTTCGAGCTGTGACTAACATCAATATGAATTTCCAAGATAAAGTAGTCATTGTAACTGGAGCCAGTTCAGGAATTGGTGCAGCGATAGCAGTGGGTTTTAGCTCCGAGGGTGCACGTGTGGTGATAGTTGGTCGCAATGAGAGTAAACTAGCCTCAGTAGCAGCTCGCTGCAACGAACCACTTGTGGTGAAAGCTGACGTTGGAAACGATGAAGATGCAAAACGTATAATCGATCGAACGATTGATCATTTTGGACGAATTGACATTTTAATTAACAATGCTGGTATCGGAGTATGGGGAACACTCGTTTCAGGCAAACTTGTTGAGTCGTATGACACTGTTATGCGGGTCAACCTTCGAGCTGTGGTAAATTTGACATCACTAGCTACTCCTTATTTAATTCAGACTAAGGGTAATGTTATCAACATATCCAGCATCGGAAGCTTAATTCCAGCAATTGGAAGTTCTGGTTTTTCAATGTATGCTGTGTCGAAAGCAGCTATAAATCATTTCGGTGCATGTGCTGCTGCAGAATTAGCAGAGTATGGAGTAAGAGTTAATACTGTAAGCCCTGGTCCAGTTGTTACTGATATTTTGGAAACTTCTAAATCTCCCATAACATGGGATGATTTCAAAAAAATGACTGCACTAGATAGAGTGTCTCAACCAGAAGAGATAGCAGATTTAGTAATGTTTCTCGCGAGTGATAAAGCGAAGGCTATTACCGGATCTAATCATGTCAGTGACAATGGTCTGTTTGTTAAACGTTATTAA

Protein sequence:

>DPOGS215254-PA
MNFQDKVVIVTGASSGIGAAIAVGFSSEGARVVIVGRNESKLASVAARCNEPLVVKADVGNDEDAKRIIDRTIDHFGRIDILINNAGIGVWGTLVSGKLVESYDTVMRVNLRAVTNINMNFQDKVVIVTGASSGIGAAIAVGFSSEGARVVIVGRNESKLASVAARCNEPLVVKADVGNDEDAKRIIDRTIDHFGRIDILINNAGIGVWGTLVSGKLVESYDTVMRVNLRAVVNLTSLATPYLIQTKGNVINISSIGSLIPAIGSSGFSMYAVSKAAINHFGACAAAELAEYGVRVNTVSPGPVVTDILETSKSPITWDDFKKMTALDRVSQPEEIADLVMFLASDKAKAITGSNHVSDNGLFVKRY-