New model in OGS2.0 | DPOGS207021  |
---|---|
Genomic Position | scaffold1:+ 994968-1000693 |
See gene structure | |
CDS Length | 1890 |
Paired RNAseq reads   | 159 |
Single RNAseq reads   | 400 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012831 (5e-43) |
Best Drosophila hit   | CG9436 (2e-42) |
Best Human hit | PREDICTED: aldo-keto reductase family 1 member C2-like isoform 1 (3e-44) |
Best NR hit (blastp)   | similar to CG10638-PA [Papilio xuthus] (2e-62) |
Best NR hit (blastx)   | PREDICTED: similar to GA10458-PA [Nasonia vitripennis] (1e-60) |
GeneOntology terms    | GO:0047115 trans-1,2-dihydrobenzene-1,2-diol dehydrogenase activity GO:0047718 indanol dehydrogenase activity GO:0055114 oxidation reduction GO:0015721 bile acid and bile salt transport GO:0030299 intestinal cholesterol absorption GO:0031406 carboxylic acid binding GO:0042632 cholesterol homeostasis GO:0005829 cytosol GO:0006805 xenobiotic metabolic process GO:0046683 response to organophosphorus GO:0016491 oxidoreductase activity GO:0004033 aldo-keto reductase activity GO:0008206 bile acid metabolic process GO:0051260 protein homooligomerization GO:0032052 bile acid binding GO:0007586 digestion GO:0005737 cytoplasm GO:0047042 3-alpha-hydroxysteroid dehydrogenase (B-specific) activity GO:0047006 20-alpha-hydroxysteroid dehydrogenase activity |
InterPro families    | IPR023210 NADP-dependent oxidoreductase domain IPR018170 Aldo/keto reductase, conserved site IPR020471 Aldo/keto reductase subgroup IPR001395 Aldo/keto reductase |
Orthology group | MCL25413 |
Nucleotide sequence:
ATGCATAACACTATAATTATCTTTGTCCTCTTGTGTACTTTCCAAATTGTTCTTGGGACA
TATTGGAAGAATATTTTATTGAATGATGGCGCAATCATGCCGCCCATCGCATTTGGTACT
GCAGCTCCGATAAGCGATTTGGATGACGTTGTTTCATCTGTAATAACCGCAATAGAAACA
GGATTCAGACATATTGATACAGCTCCGTTATATTTCAATGAGGCGCAAATAGGGGCAGCT
ATCTCGAACGTCACGAAGCGGGGTTTAGTGCTGAGGAGAGATCTATTTATTACTACAAAG
CTAGACGCTTATTCAAACCGAAGTGAAATTATTCCAGCTATTAAAGGCAGCTTACAAAGG
CTACAGTTAAGCTATGTAGATTTATATTTGATCCATACATCAGAAAATGTGCCCACAGGA
ACACCAATAGACTTTCTCGACATTTGGAAAGGCATGGAAGAAGTGAAAATGATGGGCTTG
GCAAGGTCCATTGGTCTATCTAATTTTGATAGCAAAAAAATCAATACGATCCTCGCACAT
GGCAGAATAAGGCCGTCTGTTAATCAAATAGAGGTTAATCCTACCTTTGCAAATCTTGAT
TTGGTATCGTACTGTCAGAACGAAGGCATAGCTGTTATGGCTTATTCTCCATTCGGCTTG
CTTGTACCGCGGCCTTATAAAAATACAACCAATGATCTCACGTTTGATGATAATACTTTT
ATGAAATTATCTCGAAAATACTATAAGGTGCCCAGCCAAGTCGTGTTACGTTATTTGATA
GACCGAGGGACAGTTCCCATACCGAAGTCTTTTAACAAGGAGCACATCAAGTCAAATTTC
AACGTTTTAAATTTTAAGCTTACCCAAAAAGAAGTGTACGAAATTAATGAATTGGATAGA
GATATAAGGTTGTACAATTTTGATAATACGAGTATAGAGGACCTATATGAATATTATTTT
GGAACCAGTTCCGCAGAAGTTTGGTCACGGGCCGCTAATATGAACGAGCTTCCACAACCA
CAAACAACTAACGACACAGGTGATACAATTGTTTTCGACATTATAAAAATGAACATTCTC
CTGAATGATGGCTACACAATGCCACCAATTGCTTTCGGTACTTTCGGAAAGATTAAGGAC
GTGAAGACCATTACAAAAACGGTGGTTGAAGCGATAGAATCGGGATACCGACATTTCGAT
ACAGCTCCCTTGTATTTCAATGAGGTGCAAGTAGGGGAGGGTATTGTAGACGCCATAGAG
CGTGGTCTAGTAGACAGAAAAGATCTATTTATCACAACCAAGCTTACAGGCAAAGAAATA
AACTGTACTGATATATGGGAAGGCATGGAGGAAGCGAAACTATTAGGACTGACAAACTCA
ATTGGAATTTCGAATTTTAATCATTCACAAATAGATAAAATACTTGAAGTGTGTAATATA
AAACCTGCTGTTATTCAAGTGGAGGTAAGCCCTACGTTTACAAACATTGCCCTGGTGGAC
TACTGTCAGAGTCACCAAATACACGTGACTGCTTTTTCACCATTCGGGTTTTTAGCACCG
CGACCTTTTAGAAATTACACCCCCACCACAGATTTTGCTAACACCACGTTGGTGACCATA
GCTAAGAAGCACAACAAAACCCCCAGTCAAATTGTGCTACGTTATCTGATAGATCGTGGA
ATCACACCGATACCGGCGTCTTCTAACAAAGATTACATGCAATTAAATTTTAATGTATTA
GACTTTAGTCTGACACAAAGTGAAGTAGTCAGTATTAATAATTTAAATGTAAGCGAGGCA
GTTTACGATTTTGATAACTTGGATAACTTGTACCAATACTTTTTTGATACTAATATGGAA
GAAGTTTTCAAAATCGTGAATGATATGTAA
Protein sequence:
MHNTIIIFVLLCTFQIVLGTYWKNILLNDGAIMPPIAFGTAAPISDLDDVVSSVITAIET
GFRHIDTAPLYFNEAQIGAAISNVTKRGLVLRRDLFITTKLDAYSNRSEIIPAIKGSLQR
LQLSYVDLYLIHTSENVPTGTPIDFLDIWKGMEEVKMMGLARSIGLSNFDSKKINTILAH
GRIRPSVNQIEVNPTFANLDLVSYCQNEGIAVMAYSPFGLLVPRPYKNTTNDLTFDDNTF
MKLSRKYYKVPSQVVLRYLIDRGTVPIPKSFNKEHIKSNFNVLNFKLTQKEVYEINELDR
DIRLYNFDNTSIEDLYEYYFGTSSAEVWSRAANMNELPQPQTTNDTGDTIVFDIIKMNIL
LNDGYTMPPIAFGTFGKIKDVKTITKTVVEAIESGYRHFDTAPLYFNEVQVGEGIVDAIE
RGLVDRKDLFITTKLTGKEINCTDIWEGMEEAKLLGLTNSIGISNFNHSQIDKILEVCNI
KPAVIQVEVSPTFTNIALVDYCQSHQIHVTAFSPFGFLAPRPFRNYTPTTDFANTTLVTI
AKKHNKTPSQIVLRYLIDRGITPIPASSNKDYMQLNFNVLDFSLTQSEVVSINNLNVSEA
VYDFDNLDNLYQYFFDTNMEEVFKIVNDM