DPGLEAN22094 in OGS1.0

New model in OGS2.0DPOGS208887 
Genomic Positionscaffold1599:+ 85488-95064
See gene structure
CDS Length1452
Paired RNAseq reads  6246
Single RNAseq reads  15907
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA002457 (2e-164)
Best Drosophila hit  aldehyde dehydrogenase (1e-119)
Best Human hitaldehyde dehydrogenase, mitochondrial precursor (5e-121)
Best NR hit (blastp)  aldehyde dehydrogenase [Culex quinquefasciatus] (1e-162)
Best NR hit (blastx)  mitochondrial aldehyde dehydrogenase [Bombyx mori] (4e-145)
GeneOntology terms









  
GO:0004028 3-chloroallyl aldehyde dehydrogenase activity
GO:0004029 aldehyde dehydrogenase (NAD) activity
GO:0005737 cytoplasm
GO:0005829 cytosol
GO:0016491 oxidoreductase activity
GO:0018479 benzaldehyde dehydrogenase (NAD+) activity
GO:0035106 operant conditioning
GO:0042802 identical protein binding
GO:0051289 protein homotetramerization
GO:0055114 oxidation reduction
GO:0042573 retinoic acid metabolic process
InterPro families



  
IPR016160 Aldehyde dehydrogenase, conserved site
IPR016161 Aldehyde/histidinol dehydrogenase
IPR015590 Aldehyde dehydrogenase domain
IPR016162 Aldehyde dehydrogenase, N-terminal
IPR016163 Aldehyde dehydrogenase, C-terminal
Orthology groupMCL31187

Nucleotide sequence:

ATGGCTCCGCAAATTAAATATACGAAAATTTTTATCAACAATTCCTGGGTAGACTCGGTC
AGTGGAAAGACATTCCAAACTATAAATCCTCACGATGGATCAGTCAATGCCGAGGTCGCT
GAAGATGTGGATGCAGCTGTCGGAGCAGCTAAAAGTGCATTCCACCGCAACTCTGAATGG
CGTCTGATGGACCCGTCGGAAAGAGTGAAGCTTTTGAACAAATGGGCTGATCTCGTAAAT
CGGGATATAGATTACCTTATAAAATTGGAAACATTAGATAACGGTATCGTGGTACAAACC
AATCAAAGATTTATGTCAGTGGCTGTTAATGCTATACGTTACAACGCCAGTTGGGCTGAT
AAGATTCAAGGAACTACGATACCCGTGGACGGTGAAGCGTTTTCCTACACACTGAAGCAA
CCAGTTGGTGTATGCGCTATAATCATACCATGGAATGCGCCGGTCTTGTTTTTCTGCAGT
AAAGTATCAGCGGCTTTAGCTGCAGGCTGCACCGTAGTAGTGAAGCCGGCAGAACAGACT
CCTTTAACAGCGCTGGCGCTGGCTTCTCTGGTCGCGGAGGCTGGGATTCCACCAGGTGTT
GTGAATGTGGTGCCTGGGTATGGGGAGACAGCAGGAGCGGCTCTAACACATCACCCTGAT
GTCGCACATATATCGTTCACGGGATCTTTACAGGTGGGTAAGATAATCCAACAGGCGGCA
GGCGCCAACAATCTCAAGCGTGTCCAACTTGAGCTAGGCGGGAAAAGTCCTCTCGTTGTT
ATGAACGATGCAGACTTGGATGCTGCGGTGCAGTTTGCTGCTCTCGGGGTTTTTACCAAT
CAAGGACAAATGTGTATAGCTGCTTCCCGTCTTTTTGTGCAATCAGGAATTTACGACGAA
TTTGTTAAAAGAGCTTCCGAATTTGCAAAGAGTCTTGTTGTTGGTAAACCACTAGACCTC
AAAACACAGCACGGTCCTCAGATTGATGAAAACTTAATGAATAGGGTGTTAGGTTACATC
GAAAAAGGAGTATCCGAAGGTGCAAAGCTTTTGACTGGCGGAAAAAGAATTGGAAAAACT
GGTTATTATGTTGAGCCTACCGTCTTTTCTGATGTCACGGATGATATGACCATCGCTGTA
GAAGAAATTTTCGGTCCGGTCCAAAACATCTTAAAGTTCGAAACATTTGAAGAAGTTATT
GAACGTGCTAACGCTACCAACTATGGTTTGGCGGCTGGGATATTTACAAGCTCTGTCGAA
ACTGCTCTACAGTTTAGCAAACATATTGAAGCAGGAATTGTTTGGGTGAATACTTATTTA
CATTTTGGAAGTCAGCTACCATTCGGTGGTTTCAAGGACTCCGGGATTGGCAGAGAAAAT
GGACCCAACGGAGTGGAAGCTTACTTGGAACTCAAAACAGTAATAATGAAACTTTCGAAG
AAGTTGCAATAA

Protein sequence:

MAPQIKYTKIFINNSWVDSVSGKTFQTINPHDGSVNAEVAEDVDAAVGAAKSAFHRNSEW
RLMDPSERVKLLNKWADLVNRDIDYLIKLETLDNGIVVQTNQRFMSVAVNAIRYNASWAD
KIQGTTIPVDGEAFSYTLKQPVGVCAIIIPWNAPVLFFCSKVSAALAAGCTVVVKPAEQT
PLTALALASLVAEAGIPPGVVNVVPGYGETAGAALTHHPDVAHISFTGSLQVGKIIQQAA
GANNLKRVQLELGGKSPLVVMNDADLDAAVQFAALGVFTNQGQMCIAASRLFVQSGIYDE
FVKRASEFAKSLVVGKPLDLKTQHGPQIDENLMNRVLGYIEKGVSEGAKLLTGGKRIGKT
GYYVEPTVFSDVTDDMTIAVEEIFGPVQNILKFETFEEVIERANATNYGLAAGIFTSSVE
TALQFSKHIEAGIVWVNTYLHFGSQLPFGGFKDSGIGRENGPNGVEAYLELKTVIMKLSK
KLQ