DPGLEAN18566 in OGS1.0

New model in OGS2.0DPOGS201210 
Genomic Positionscaffold5083:- 523-7720
See gene structure
CDS Length618
Paired RNAseq reads  1343
Single RNAseq reads  10761
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001367 (2e-17)
Best Drosophila hit  CG12766 (2e-23)
Best Human hitPREDICTED: aldo-keto reductase family 1 member C2-like isoform 4 (2e-26)
Best NR hit (blastp)  aldo-keto reductase [Heliothis virescens] (4e-42)
Best NR hit (blastx)  aldo-keto reductase [Heliothis virescens] (8e-34)
GeneOntology terms

















  
GO:0047115 trans-1,2-dihydrobenzene-1,2-diol dehydrogenase activity
GO:0047718 indanol dehydrogenase activity
GO:0055114 oxidation reduction
GO:0015721 bile acid and bile salt transport
GO:0030299 intestinal cholesterol absorption
GO:0031406 carboxylic acid binding
GO:0042632 cholesterol homeostasis
GO:0005829 cytosol
GO:0006805 xenobiotic metabolic process
GO:0046683 response to organophosphorus
GO:0016491 oxidoreductase activity
GO:0004033 aldo-keto reductase activity
GO:0008206 bile acid metabolic process
GO:0051260 protein homooligomerization
GO:0032052 bile acid binding
GO:0007586 digestion
GO:0005737 cytoplasm
GO:0047042 3-alpha-hydroxysteroid dehydrogenase (B-specific) activity
GO:0047006 20-alpha-hydroxysteroid dehydrogenase activity
InterPro families


  
IPR023210 NADP-dependent oxidoreductase domain
IPR020471 Aldo/keto reductase subgroup
IPR001395 Aldo/keto reductase
IPR018170 Aldo/keto reductase, conserved site
Orthology groupND

Nucleotide sequence:

ATGCTCTGGCTGTACCTGGTGCTGTGTGTCACATCGGCCCTCGGGAAGAACATTCATTCT
TCTGGAGTCGCGCCGATAGTCAAGCTGAACGATGGATATGCAATGCCGCGCTTGGGATTG
GGAACGTGGCTTGGCATTCTAACGACGGGTTCACCGGAAGAGGTTCAACAGGCAGTGGAA
GCAGCCATAGATGCTGGCTACAGACACATCGACACCGCTCACATTTATAATACAGAGAAA
CAGGTCGGCAAAGGATTGAAGAAGAAAATAGAAGAGGGGGTAGTTAAGAGAGAGGACATG
TTCATAACGACTAAGTTGTGGAGTGACGCTCATCCGCGCGATGCTGTGATACCAACTCTG
AACGAGTCCCTCAACCATCTGGGAATGGATTATGTGGACTTATACCTCATCCACTGGCCA
GTCGCCACTTTCTTCGAATTAGGAGTGGTGCCGATCCCTAAATCCGTGAAGAAGAATCGA
GTGGAGGAGAACATTGACATCTTCGACTTTGAGCTGACTCCAGAAGAGAGGAACCTCCTC
AAGAGTTACGACGCCAACTATAGGACTATTGACCCAAAGTTTTGGAAGAAGTCACCCTAT
TATCCATTTAATAATTAA

Protein sequence:

MLWLYLVLCVTSALGKNIHSSGVAPIVKLNDGYAMPRLGLGTWLGILTTGSPEEVQQAVE
AAIDAGYRHIDTAHIYNTEKQVGKGLKKKIEEGVVKREDMFITTKLWSDAHPRDAVIPTL
NESLNHLGMDYVDLYLIHWPVATFFELGVVPIPKSVKKNRVEENIDIFDFELTPEERNLL
KSYDANYRTIDPKFWKKSPYYPFNN