New model in OGS2.0 | DPOGS203596  |
---|---|
Genomic Position | scaffold2625:- 17550-19437 |
See gene structure | |
CDS Length | 1035 |
Paired RNAseq reads   | 30 |
Single RNAseq reads   | 87 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA001351 (6e-77) |
Best Drosophila hit   | CG12766 (3e-50) |
Best Human hit | 3-oxo-5-beta-steroid 4-dehydrogenase isoform 1 (4e-51) |
Best NR hit (blastp)   | aldo-keto reductase [Heliothis virescens] (2e-78) |
Best NR hit (blastx)   | similar to CG10638-PA [Papilio xuthus] (8e-76) |
GeneOntology terms    | GO:0005737 cytoplasm GO:0006629 lipid metabolic process GO:0030573 bile acid catabolic process GO:0055114 oxidation reduction GO:0008202 steroid metabolic process GO:0005575 cellular_component GO:0016491 oxidoreductase activity GO:0006699 bile acid biosynthetic process GO:0047568 3-oxo-5-beta-steroid 4-dehydrogenase activity GO:0047787 delta4-3-oxosteroid 5beta-reductase activity |
InterPro families    | IPR001395 Aldo/keto reductase IPR023210 NADP-dependent oxidoreductase domain IPR020471 Aldo/keto reductase subgroup IPR018170 Aldo/keto reductase, conserved site |
Orthology group | ND |
Nucleotide sequence:
ATGGACGCGCCTCTTATTATTGCTATAATTCTATTTTTTTCTGGTTTGGCGAACGCGTCT
CTGTCATATAATCTTAACGATGGTAATGTGATACCAGCAATAGCACTTGGTACAAGTTTG
GGACACTTGGCTGATGGTACCAGAGTGTTGTCAGTGAACCACTCGCTTGCTCGAGCCGTG
CAAGGAGCACTGACGGCGGGTTACAGACACATTGACACGGCCTCATTGTACCGAGTTGAG
GACGAAGTTGGGCTTGGAATACGTTGGTATTTAAATGACACCACTAAAAGACAGAATATT
TACGTCACCACCAAGTTATGGAACGATGCCCACGCTCGGGACGAGGTGGTACCAGCCATC
AGACGATCTCTACAGGATTTGCAGCTGGAATATGTCGACCTGTATCTAATGCATTTCCCT
ATGGCGTACACGAAAGACGGAAAGATAAGCGACACCGACTACTTGGAAACGTGGAAAGGA
TTAGAGGACGCCAAAAAATTAAATCTAACCCGGTCAATTGGCGTGTCCAACTTCAACTTA
ACACAGATGAAACGATTGTGGAATGACTCAGAAATCAAGCCAGCTGTGCTACAAATTGAA
GTCAATCCAACAATAACCCAAGATGAAATAATAGACTGGTGTGATGAACACGCTGTCATC
GTTATGGCATACAGTCCTTTCGGCGCCATTTTGGGTCGCAAGAAAAACTCTCCATTACGT
GCAGATGACCCTTTATTAATAAGCTTAGCCCAAAAATACAACAAAACTGTTCCACAAATC
TTATTACGATATTTGTTAGATAGACATCTAGTAGTCATCCCTCGATCAACAAACTACAGC
CGAATCAAAGAGAACTTTAATATAACAGACTTCTCACTTGCGCCAGAAGAGGTGAAACTA
TTGTCGAGTTTCAATAGAGAGTACAGGTTAAGAACGCAGGTCAAATGGTATCCCCACCCG
CACTTCCCCTTCCAGAAGAAAAATCTCACGGAATCTGAAATACAGTACATAGTTGAACAC
AGTAAAGAAGATTAG
Protein sequence:
MDAPLIIAIILFFSGLANASLSYNLNDGNVIPAIALGTSLGHLADGTRVLSVNHSLARAV
QGALTAGYRHIDTASLYRVEDEVGLGIRWYLNDTTKRQNIYVTTKLWNDAHARDEVVPAI
RRSLQDLQLEYVDLYLMHFPMAYTKDGKISDTDYLETWKGLEDAKKLNLTRSIGVSNFNL
TQMKRLWNDSEIKPAVLQIEVNPTITQDEIIDWCDEHAVIVMAYSPFGAILGRKKNSPLR
ADDPLLISLAQKYNKTVPQILLRYLLDRHLVVIPRSTNYSRIKENFNITDFSLAPEEVKL
LSSFNREYRLRTQVKWYPHPHFPFQKKNLTESEIQYIVEHSKED