Monarch geneset OGS2.0

DPOGS209267
TranscriptDPOGS209267-TA909 bp
ProteinDPOGS209267-PA302 aa
Genomic positionDPSCF300111 + 602248-604745
RNAseq coverage197x (Rank: top 47%)
Annotation
HeliconiusHMEL0082652e-10861.13% 
BombyxBGIBMGA007047-TA1e-10657.10% 
Drosophilasro-PA7e-6041.45% 
EBI UniRef50UniRef50_D4Q9I12e-10457.10%Short-chain dehydrogenase/reductase n=2 Tax=Obtectomera RepID=D4Q9I1_BOMMO
NCBI RefSeqNP_001171333.15e-10557.10%short-chain dehydrogenase/reductase [Bombyx mori]
NCBI nr blastpgi|2954240919e-10457.10%short-chain dehydrogenase/reductase [Bombyx mori]
NCBI nr blastxgi|2954240916e-9957.10%short-chain dehydrogenase/reductase [Bombyx mori]
Group
Gene OntologyGO:00054883.5e-43binding
GO:00081521.9e-18metabolic process
GO:00164911.9e-18oxidoreductase activity
KEGG pathwaydme:Dmel_CG120686e-58 
 K00019 (E1.1.1.30, bdh)maps-> Butanoate metabolism
    Synthesis and degradation of ketone bodies
InterPro domain[5-263] IPR0160403.5e-43NAD(P)-binding domain
[5-177] IPR0021981.9e-18Short-chain dehydrogenase/reductase SDR
[6-23] IPR0023473.6e-14Glucose/ribitol dehydrogenase
Orthology groupMCL15661 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209267-TA
ATGTCGTGGACTAAGGTGGTAGCGATAACTGGTTGCGATAGTGGCCTGGGTTGGAGTATAGCTGCGCGTTCAGCGCGAGAGGGTTTCATTACTGTGGCTGGCATGTATAACGGCATTAACACAGAAGCAGCTAAGTCTCTGAACCGTCTTCGCGCTCATCCCCACAAGCTGGATATCACCGACGCCAGTAGTGTACTAAGTTTTTACGATTATGTCAAGAAAATTTTGCATAATAACAATAACTATGAATTATACGCAATTGTTAACAACGCGGGAGTTATGACTATTGGAGATTATGAGTGGCAAACACCAAAAATTATAGAAGATACAATTAACATCAACTTACTTGGAACTATGAAATTCACTTCAGCTTTCTTGCCAGATTTACGCAGGAACGCATTAAAGAATAAAAACAACCCTCGTATAATCAACGTAGCAAGTCATTGTGGCCTTCAACCATTACCTGGTTTCGGGCCGTACAGCGCAAGTAAAGCTGGTTTACTCGCCTGGAGTAAAGCGTTACGTCTTGAACACATGAACATGGGGTTAAAAGTTGTTTCATTCATACCAGGTGGTTTCGTTGGTGCCAGTAATCTTATGACGAATCAGTATTCAAACGCAAATGCTATGGTGGAACATCTGACCGAAGAACAAAAATCGCTTTACGAAACAAAAATTCGTAGATTAAATGATTATTTAAAACTTGCTTCGAATAATTCAAGATTTGATTCCTTAAAAGATGAAAATATAATTGAAACATTTATGATGGCCCTCACTGATGAAAATCCTAAGACAATGTACAAAGTTGAATCGTGGCGTTACAAACTTTATTATAATTTGTTTAAGTTTCCTCTGCCGGATAAATCTTACAGGTGGTTGATTAATAAATTTCTGGACTTCCCAAAATAA

Protein sequence:

>DPOGS209267-PA
MSWTKVVAITGCDSGLGWSIAARSAREGFITVAGMYNGINTEAAKSLNRLRAHPHKLDITDASSVLSFYDYVKKILHNNNNYELYAIVNNAGVMTIGDYEWQTPKIIEDTININLLGTMKFTSAFLPDLRRNALKNKNNPRIINVASHCGLQPLPGFGPYSASKAGLLAWSKALRLEHMNMGLKVVSFIPGGFVGASNLMTNQYSNANAMVEHLTEEQKSLYETKIRRLNDYLKLASNNSRFDSLKDENIIETFMMALTDENPKTMYKVESWRYKLYYNLFKFPLPDKSYRWLINKFLDFPK-