New model in OGS2.0 | DPOGS201210  |
---|---|
Genomic Position | scaffold5083:- 523-7720 |
See gene structure | |
CDS Length | 618 |
Paired RNAseq reads   | 1343 |
Single RNAseq reads   | 10761 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA001367 (2e-17) |
Best Drosophila hit   | CG12766 (2e-23) |
Best Human hit | PREDICTED: aldo-keto reductase family 1 member C2-like isoform 4 (2e-26) |
Best NR hit (blastp)   | aldo-keto reductase [Heliothis virescens] (4e-42) |
Best NR hit (blastx)   | aldo-keto reductase [Heliothis virescens] (8e-34) |
GeneOntology terms    | GO:0047115 trans-1,2-dihydrobenzene-1,2-diol dehydrogenase activity GO:0047718 indanol dehydrogenase activity GO:0055114 oxidation reduction GO:0015721 bile acid and bile salt transport GO:0030299 intestinal cholesterol absorption GO:0031406 carboxylic acid binding GO:0042632 cholesterol homeostasis GO:0005829 cytosol GO:0006805 xenobiotic metabolic process GO:0046683 response to organophosphorus GO:0016491 oxidoreductase activity GO:0004033 aldo-keto reductase activity GO:0008206 bile acid metabolic process GO:0051260 protein homooligomerization GO:0032052 bile acid binding GO:0007586 digestion GO:0005737 cytoplasm GO:0047042 3-alpha-hydroxysteroid dehydrogenase (B-specific) activity GO:0047006 20-alpha-hydroxysteroid dehydrogenase activity |
InterPro families    | IPR023210 NADP-dependent oxidoreductase domain IPR020471 Aldo/keto reductase subgroup IPR001395 Aldo/keto reductase IPR018170 Aldo/keto reductase, conserved site |
Orthology group | ND |
Nucleotide sequence:
ATGCTCTGGCTGTACCTGGTGCTGTGTGTCACATCGGCCCTCGGGAAGAACATTCATTCT
TCTGGAGTCGCGCCGATAGTCAAGCTGAACGATGGATATGCAATGCCGCGCTTGGGATTG
GGAACGTGGCTTGGCATTCTAACGACGGGTTCACCGGAAGAGGTTCAACAGGCAGTGGAA
GCAGCCATAGATGCTGGCTACAGACACATCGACACCGCTCACATTTATAATACAGAGAAA
CAGGTCGGCAAAGGATTGAAGAAGAAAATAGAAGAGGGGGTAGTTAAGAGAGAGGACATG
TTCATAACGACTAAGTTGTGGAGTGACGCTCATCCGCGCGATGCTGTGATACCAACTCTG
AACGAGTCCCTCAACCATCTGGGAATGGATTATGTGGACTTATACCTCATCCACTGGCCA
GTCGCCACTTTCTTCGAATTAGGAGTGGTGCCGATCCCTAAATCCGTGAAGAAGAATCGA
GTGGAGGAGAACATTGACATCTTCGACTTTGAGCTGACTCCAGAAGAGAGGAACCTCCTC
AAGAGTTACGACGCCAACTATAGGACTATTGACCCAAAGTTTTGGAAGAAGTCACCCTAT
TATCCATTTAATAATTAA
Protein sequence:
MLWLYLVLCVTSALGKNIHSSGVAPIVKLNDGYAMPRLGLGTWLGILTTGSPEEVQQAVE
AAIDAGYRHIDTAHIYNTEKQVGKGLKKKIEEGVVKREDMFITTKLWSDAHPRDAVIPTL
NESLNHLGMDYVDLYLIHWPVATFFELGVVPIPKSVKKNRVEENIDIFDFELTPEERNLL
KSYDANYRTIDPKFWKKSPYYPFNN