New model in OGS2.0 | DPOGS208887  |
---|---|
Genomic Position | scaffold1599:+ 85488-95064 |
See gene structure | |
CDS Length | 1452 |
Paired RNAseq reads   | 6246 |
Single RNAseq reads   | 15907 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA002457 (2e-164) |
Best Drosophila hit   | aldehyde dehydrogenase (1e-119) |
Best Human hit | aldehyde dehydrogenase, mitochondrial precursor (5e-121) |
Best NR hit (blastp)   | aldehyde dehydrogenase [Culex quinquefasciatus] (1e-162) |
Best NR hit (blastx)   | mitochondrial aldehyde dehydrogenase [Bombyx mori] (4e-145) |
GeneOntology terms    | GO:0004028 3-chloroallyl aldehyde dehydrogenase activity GO:0004029 aldehyde dehydrogenase (NAD) activity GO:0005737 cytoplasm GO:0005829 cytosol GO:0016491 oxidoreductase activity GO:0018479 benzaldehyde dehydrogenase (NAD+) activity GO:0035106 operant conditioning GO:0042802 identical protein binding GO:0051289 protein homotetramerization GO:0055114 oxidation reduction GO:0042573 retinoic acid metabolic process |
InterPro families    | IPR016160 Aldehyde dehydrogenase, conserved site IPR016161 Aldehyde/histidinol dehydrogenase IPR015590 Aldehyde dehydrogenase domain IPR016162 Aldehyde dehydrogenase, N-terminal IPR016163 Aldehyde dehydrogenase, C-terminal |
Orthology group | MCL31187 |
Nucleotide sequence:
ATGGCTCCGCAAATTAAATATACGAAAATTTTTATCAACAATTCCTGGGTAGACTCGGTC
AGTGGAAAGACATTCCAAACTATAAATCCTCACGATGGATCAGTCAATGCCGAGGTCGCT
GAAGATGTGGATGCAGCTGTCGGAGCAGCTAAAAGTGCATTCCACCGCAACTCTGAATGG
CGTCTGATGGACCCGTCGGAAAGAGTGAAGCTTTTGAACAAATGGGCTGATCTCGTAAAT
CGGGATATAGATTACCTTATAAAATTGGAAACATTAGATAACGGTATCGTGGTACAAACC
AATCAAAGATTTATGTCAGTGGCTGTTAATGCTATACGTTACAACGCCAGTTGGGCTGAT
AAGATTCAAGGAACTACGATACCCGTGGACGGTGAAGCGTTTTCCTACACACTGAAGCAA
CCAGTTGGTGTATGCGCTATAATCATACCATGGAATGCGCCGGTCTTGTTTTTCTGCAGT
AAAGTATCAGCGGCTTTAGCTGCAGGCTGCACCGTAGTAGTGAAGCCGGCAGAACAGACT
CCTTTAACAGCGCTGGCGCTGGCTTCTCTGGTCGCGGAGGCTGGGATTCCACCAGGTGTT
GTGAATGTGGTGCCTGGGTATGGGGAGACAGCAGGAGCGGCTCTAACACATCACCCTGAT
GTCGCACATATATCGTTCACGGGATCTTTACAGGTGGGTAAGATAATCCAACAGGCGGCA
GGCGCCAACAATCTCAAGCGTGTCCAACTTGAGCTAGGCGGGAAAAGTCCTCTCGTTGTT
ATGAACGATGCAGACTTGGATGCTGCGGTGCAGTTTGCTGCTCTCGGGGTTTTTACCAAT
CAAGGACAAATGTGTATAGCTGCTTCCCGTCTTTTTGTGCAATCAGGAATTTACGACGAA
TTTGTTAAAAGAGCTTCCGAATTTGCAAAGAGTCTTGTTGTTGGTAAACCACTAGACCTC
AAAACACAGCACGGTCCTCAGATTGATGAAAACTTAATGAATAGGGTGTTAGGTTACATC
GAAAAAGGAGTATCCGAAGGTGCAAAGCTTTTGACTGGCGGAAAAAGAATTGGAAAAACT
GGTTATTATGTTGAGCCTACCGTCTTTTCTGATGTCACGGATGATATGACCATCGCTGTA
GAAGAAATTTTCGGTCCGGTCCAAAACATCTTAAAGTTCGAAACATTTGAAGAAGTTATT
GAACGTGCTAACGCTACCAACTATGGTTTGGCGGCTGGGATATTTACAAGCTCTGTCGAA
ACTGCTCTACAGTTTAGCAAACATATTGAAGCAGGAATTGTTTGGGTGAATACTTATTTA
CATTTTGGAAGTCAGCTACCATTCGGTGGTTTCAAGGACTCCGGGATTGGCAGAGAAAAT
GGACCCAACGGAGTGGAAGCTTACTTGGAACTCAAAACAGTAATAATGAAACTTTCGAAG
AAGTTGCAATAA
Protein sequence:
MAPQIKYTKIFINNSWVDSVSGKTFQTINPHDGSVNAEVAEDVDAAVGAAKSAFHRNSEW
RLMDPSERVKLLNKWADLVNRDIDYLIKLETLDNGIVVQTNQRFMSVAVNAIRYNASWAD
KIQGTTIPVDGEAFSYTLKQPVGVCAIIIPWNAPVLFFCSKVSAALAAGCTVVVKPAEQT
PLTALALASLVAEAGIPPGVVNVVPGYGETAGAALTHHPDVAHISFTGSLQVGKIIQQAA
GANNLKRVQLELGGKSPLVVMNDADLDAAVQFAALGVFTNQGQMCIAASRLFVQSGIYDE
FVKRASEFAKSLVVGKPLDLKTQHGPQIDENLMNRVLGYIEKGVSEGAKLLTGGKRIGKT
GYYVEPTVFSDVTDDMTIAVEEIFGPVQNILKFETFEEVIERANATNYGLAAGIFTSSVE
TALQFSKHIEAGIVWVNTYLHFGSQLPFGGFKDSGIGRENGPNGVEAYLELKTVIMKLSK
KLQ