DPGLEAN15568 in OGS1.0

New model in OGS2.0  DPOGS207021
Genomic Position  scaffold1:+ 994968-1000693
CDS Length  1890
Paired RNAseq reads  159
Single RNAseq reads  400
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA012831 (5e-43)
Best Drosophila hit  CG9436 (2e-42)
Best Human hit  PREDICTED: aldo-keto reductase family 1 member C2-like isoform 1 (3e-44)
Best NR hit (blastp)  similar to CG10638-PA [Papilio xuthus] (2e-62)
Best NR hit (blastx)  PREDICTED: similar to GA10458-PA [Nasonia vitripennis] (1e-60)
GeneOntology terms
GO:0047115 trans-1,2-dihydrobenzene-1,2-diol dehydrogenase activity
GO:0047718 indanol dehydrogenase activity
GO:0055114 oxidation reduction
GO:0015721 bile acid and bile salt transport
GO:0030299 intestinal cholesterol absorption
GO:0031406 carboxylic acid binding
GO:0042632 cholesterol homeostasis
GO:0005829 cytosol
GO:0006805 xenobiotic metabolic process
GO:0046683 response to organophosphorus
GO:0016491 oxidoreductase activity
GO:0004033 aldo-keto reductase activity
GO:0008206 bile acid metabolic process
GO:0051260 protein homooligomerization
GO:0032052 bile acid binding
GO:0007586 digestion
GO:0005737 cytoplasm
GO:0047042 3-alpha-hydroxysteroid dehydrogenase (B-specific) activity
GO:0047006 20-alpha-hydroxysteroid dehydrogenase activity
InterPro families
IPR023210 NADP-dependent oxidoreductase domain
IPR018170 Aldo/keto reductase, conserved site
IPR020471 Aldo/keto reductase subgroup
IPR001395 Aldo/keto reductase
Orthology group  MCL25413

Nucleotide sequence:

ATGCATAACACTATAATTATCTTTGTCCTCTTGTGTACTTTCCAAATTGTTCTTGGGACA
TATTGGAAGAATATTTTATTGAATGATGGCGCAATCATGCCGCCCATCGCATTTGGTACT
GCAGCTCCGATAAGCGATTTGGATGACGTTGTTTCATCTGTAATAACCGCAATAGAAACA
GGATTCAGACATATTGATACAGCTCCGTTATATTTCAATGAGGCGCAAATAGGGGCAGCT
ATCTCGAACGTCACGAAGCGGGGTTTAGTGCTGAGGAGAGATCTATTTATTACTACAAAG
CTAGACGCTTATTCAAACCGAAGTGAAATTATTCCAGCTATTAAAGGCAGCTTACAAAGG
CTACAGTTAAGCTATGTAGATTTATATTTGATCCATACATCAGAAAATGTGCCCACAGGA
ACACCAATAGACTTTCTCGACATTTGGAAAGGCATGGAAGAAGTGAAAATGATGGGCTTG
GCAAGGTCCATTGGTCTATCTAATTTTGATAGCAAAAAAATCAATACGATCCTCGCACAT
GGCAGAATAAGGCCGTCTGTTAATCAAATAGAGGTTAATCCTACCTTTGCAAATCTTGAT
TTGGTATCGTACTGTCAGAACGAAGGCATAGCTGTTATGGCTTATTCTCCATTCGGCTTG
CTTGTACCGCGGCCTTATAAAAATACAACCAATGATCTCACGTTTGATGATAATACTTTT
ATGAAATTATCTCGAAAATACTATAAGGTGCCCAGCCAAGTCGTGTTACGTTATTTGATA
GACCGAGGGACAGTTCCCATACCGAAGTCTTTTAACAAGGAGCACATCAAGTCAAATTTC
AACGTTTTAAATTTTAAGCTTACCCAAAAAGAAGTGTACGAAATTAATGAATTGGATAGA
GATATAAGGTTGTACAATTTTGATAATACGAGTATAGAGGACCTATATGAATATTATTTT
GGAACCAGTTCCGCAGAAGTTTGGTCACGGGCCGCTAATATGAACGAGCTTCCACAACCA
CAAACAACTAACGACACAGGTGATACAATTGTTTTCGACATTATAAAAATGAACATTCTC
CTGAATGATGGCTACACAATGCCACCAATTGCTTTCGGTACTTTCGGAAAGATTAAGGAC
GTGAAGACCATTACAAAAACGGTGGTTGAAGCGATAGAATCGGGATACCGACATTTCGAT
ACAGCTCCCTTGTATTTCAATGAGGTGCAAGTAGGGGAGGGTATTGTAGACGCCATAGAG
CGTGGTCTAGTAGACAGAAAAGATCTATTTATCACAACCAAGCTTACAGGCAAAGAAATA
AACTGTACTGATATATGGGAAGGCATGGAGGAAGCGAAACTATTAGGACTGACAAACTCA
ATTGGAATTTCGAATTTTAATCATTCACAAATAGATAAAATACTTGAAGTGTGTAATATA
AAACCTGCTGTTATTCAAGTGGAGGTAAGCCCTACGTTTACAAACATTGCCCTGGTGGAC
TACTGTCAGAGTCACCAAATACACGTGACTGCTTTTTCACCATTCGGGTTTTTAGCACCG
CGACCTTTTAGAAATTACACCCCCACCACAGATTTTGCTAACACCACGTTGGTGACCATA
GCTAAGAAGCACAACAAAACCCCCAGTCAAATTGTGCTACGTTATCTGATAGATCGTGGA
ATCACACCGATACCGGCGTCTTCTAACAAAGATTACATGCAATTAAATTTTAATGTATTA
GACTTTAGTCTGACACAAAGTGAAGTAGTCAGTATTAATAATTTAAATGTAAGCGAGGCA
GTTTACGATTTTGATAACTTGGATAACTTGTACCAATACTTTTTTGATACTAATATGGAA
GAAGTTTTCAAAATCGTGAATGATATGTAA

Protein sequence:

MHNTIIIFVLLCTFQIVLGTYWKNILLNDGAIMPPIAFGTAAPISDLDDVVSSVITAIET
GFRHIDTAPLYFNEAQIGAAISNVTKRGLVLRRDLFITTKLDAYSNRSEIIPAIKGSLQR
LQLSYVDLYLIHTSENVPTGTPIDFLDIWKGMEEVKMMGLARSIGLSNFDSKKINTILAH
GRIRPSVNQIEVNPTFANLDLVSYCQNEGIAVMAYSPFGLLVPRPYKNTTNDLTFDDNTF
MKLSRKYYKVPSQVVLRYLIDRGTVPIPKSFNKEHIKSNFNVLNFKLTQKEVYEINELDR
DIRLYNFDNTSIEDLYEYYFGTSSAEVWSRAANMNELPQPQTTNDTGDTIVFDIIKMNIL
LNDGYTMPPIAFGTFGKIKDVKTITKTVVEAIESGYRHFDTAPLYFNEVQVGEGIVDAIE
RGLVDRKDLFITTKLTGKEINCTDIWEGMEEAKLLGLTNSIGISNFNHSQIDKILEVCNI
KPAVIQVEVSPTFTNIALVDYCQSHQIHVTAFSPFGFLAPRPFRNYTPTTDFANTTLVTI
AKKHNKTPSQIVLRYLIDRGITPIPASSNKDYMQLNFNVLDFSLTQSEVVSINNLNVSEA
VYDFDNLDNLYQYFFDTNMEEVFKIVNDM
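
Consistency check (sketch):

The record's numeric fields can be cross-checked against each other. The short Python sketch below is illustrative only: the function names are hypothetical, the protein length of 629 is a count of the residues listed above, and the genomic coordinates are assumed to be 1-based and inclusive. It checks that a complete CDS of the stated length, ending in a single stop codon (the listing ends in TAA), encodes as many residues as the protein sequence contains, and it computes the genomic span for comparison with the CDS length.

```python
# Values copied from the record above.
CDS_LENGTH = 1890                              # "CDS Length" field, nucleotides
PROTEIN_LENGTH = 629                           # residues counted in the protein listing
GENOMIC_START, GENOMIC_END = 994968, 1000693   # scaffold1, plus strand


def expected_protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS that ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length // 3 - 1  # drop the stop codon


def genomic_span(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    print("expected protein length:", expected_protein_length(CDS_LENGTH))
    print("listed protein length:  ", PROTEIN_LENGTH)
    print("genomic span (bp):      ", genomic_span(GENOMIC_START, GENOMIC_END))
```

A genomic span larger than the CDS length would be consistent with a gene model that contains introns, although the record itself does not list the exon structure here.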