DPGLEAN00749 in OGS1.0

New model in OGS2.0DPOGS203596 
Genomic Positionscaffold2625:- 17550-19437
See gene structure
CDS Length1035
Paired RNAseq reads  30
Single RNAseq reads  87
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001351 (6e-77)
Best Drosophila hit  CG12766 (3e-50)
Best Human hit3-oxo-5-beta-steroid 4-dehydrogenase isoform 1 (4e-51)
Best NR hit (blastp)  aldo-keto reductase [Heliothis virescens] (2e-78)
Best NR hit (blastx)  similar to CG10638-PA [Papilio xuthus] (8e-76)
GeneOntology terms








  
GO:0005737 cytoplasm
GO:0006629 lipid metabolic process
GO:0030573 bile acid catabolic process
GO:0055114 oxidation reduction
GO:0008202 steroid metabolic process
GO:0005575 cellular_component
GO:0016491 oxidoreductase activity
GO:0006699 bile acid biosynthetic process
GO:0047568 3-oxo-5-beta-steroid 4-dehydrogenase activity
GO:0047787 delta4-3-oxosteroid 5beta-reductase activity
InterPro families


  
IPR001395 Aldo/keto reductase
IPR023210 NADP-dependent oxidoreductase domain
IPR020471 Aldo/keto reductase subgroup
IPR018170 Aldo/keto reductase, conserved site
Orthology groupND

Nucleotide sequence:

ATGGACGCGCCTCTTATTATTGCTATAATTCTATTTTTTTCTGGTTTGGCGAACGCGTCT
CTGTCATATAATCTTAACGATGGTAATGTGATACCAGCAATAGCACTTGGTACAAGTTTG
GGACACTTGGCTGATGGTACCAGAGTGTTGTCAGTGAACCACTCGCTTGCTCGAGCCGTG
CAAGGAGCACTGACGGCGGGTTACAGACACATTGACACGGCCTCATTGTACCGAGTTGAG
GACGAAGTTGGGCTTGGAATACGTTGGTATTTAAATGACACCACTAAAAGACAGAATATT
TACGTCACCACCAAGTTATGGAACGATGCCCACGCTCGGGACGAGGTGGTACCAGCCATC
AGACGATCTCTACAGGATTTGCAGCTGGAATATGTCGACCTGTATCTAATGCATTTCCCT
ATGGCGTACACGAAAGACGGAAAGATAAGCGACACCGACTACTTGGAAACGTGGAAAGGA
TTAGAGGACGCCAAAAAATTAAATCTAACCCGGTCAATTGGCGTGTCCAACTTCAACTTA
ACACAGATGAAACGATTGTGGAATGACTCAGAAATCAAGCCAGCTGTGCTACAAATTGAA
GTCAATCCAACAATAACCCAAGATGAAATAATAGACTGGTGTGATGAACACGCTGTCATC
GTTATGGCATACAGTCCTTTCGGCGCCATTTTGGGTCGCAAGAAAAACTCTCCATTACGT
GCAGATGACCCTTTATTAATAAGCTTAGCCCAAAAATACAACAAAACTGTTCCACAAATC
TTATTACGATATTTGTTAGATAGACATCTAGTAGTCATCCCTCGATCAACAAACTACAGC
CGAATCAAAGAGAACTTTAATATAACAGACTTCTCACTTGCGCCAGAAGAGGTGAAACTA
TTGTCGAGTTTCAATAGAGAGTACAGGTTAAGAACGCAGGTCAAATGGTATCCCCACCCG
CACTTCCCCTTCCAGAAGAAAAATCTCACGGAATCTGAAATACAGTACATAGTTGAACAC
AGTAAAGAAGATTAG

Protein sequence:

MDAPLIIAIILFFSGLANASLSYNLNDGNVIPAIALGTSLGHLADGTRVLSVNHSLARAV
QGALTAGYRHIDTASLYRVEDEVGLGIRWYLNDTTKRQNIYVTTKLWNDAHARDEVVPAI
RRSLQDLQLEYVDLYLMHFPMAYTKDGKISDTDYLETWKGLEDAKKLNLTRSIGVSNFNL
TQMKRLWNDSEIKPAVLQIEVNPTITQDEIIDWCDEHAVIVMAYSPFGAILGRKKNSPLR
ADDPLLISLAQKYNKTVPQILLRYLLDRHLVVIPRSTNYSRIKENFNITDFSLAPEEVKL
LSSFNREYRLRTQVKWYPHPHFPFQKKNLTESEIQYIVEHSKED