DPGLEAN12841 in OGS1.0

New model in OGS2.0  DPOGS215931
Genomic Position  scaffold2568:- 15694-30994
CDS Length  1737
Paired RNAseq reads  3011
Single RNAseq reads  8898
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA001966 (0.0)
Best Drosophila hit  aldehyde dehydrogenase type III, isoform N (6e-141)
Best Human hit  aldehyde dehydrogenase, dimeric NADP-preferring (6e-101)
Best NR hit (blastp)  aldehyde dehydrogenase isoform 2 [Bombyx mori] (0.0)
Best NR hit (blastx)  aldehyde dehydrogenase isoform 2 [Bombyx mori] (0.0)
Gene Ontology terms
GO:0016620 oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor
GO:0055114 oxidation reduction
GO:0006081 cellular aldehyde metabolic process
GO:0004030 aldehyde dehydrogenase [NAD(P)+] activity
GO:0005811 lipid particle
InterPro families
IPR016161 Aldehyde/histidinol dehydrogenase
IPR016162 Aldehyde dehydrogenase, N-terminal
IPR016163 Aldehyde dehydrogenase, C-terminal
IPR016160 Aldehyde dehydrogenase, conserved site
IPR015590 Aldehyde dehydrogenase domain
IPR012394 Aldehyde dehydrogenase NAD(P)-dependent
Orthology group  MCL10482

Nucleotide sequence:

ATGACTGTCGGAACACTATCAAGCAATAAACCCAAAGCCGTCAACATGTCTGAGGTCGTA
AATAAAGCAAGGGATACGTTCGACAGCGGCGTCACCAAGCCCATTGAATGGAGAAGGAAG
CAACTGAAAAATCTACTGAGAATGTACGAGGAAAACAGAAACGCGATGGTAGAGGCTCTC
GTTAAAGACTTAAGAAGAAGCAAAATGGAAGCTATTCTCCTAGAAGTCGACTATCTCATT
AATGACATAAGAAATACTATTTACAACTTGGATAACTGGGTCGCTCCTGTGAAGCCCCCA
AAGGGTTTAGTGAATATGCTGGATGATGTAGTCATCTACAACGACCCCTACGGCGTTGTT
CTCATCATCGGTGCCTGGAACTATCCTCTCCAACTGCTACTGCTGCCACTAGCTGGTGCT
ATAGCTGCGGGAAATGCTGTCATCCTCAAGCCCAGTGAGCTGGCGGAGGCCAGCGCTAAG
TTCATGGTGGAAACCTTGCCTAAATATGTGGATAGTGACGCAATAATTTTAGTGGAAGGA
GGTCCGGAGGAAACCTCCGAATTATTGAAACAAAGATTCGACTACATCTTCTACACAGGC
GGGACTAACGTTGGCAGAATAGTTTATGCAGCAGCTACCAAAAACTTGACTCCTGTCACA
TTGGAACTGGGAGGGAAGAGCCCTGTGTACATAGATAACACAGTGGATATAGAAGTAACA
GCGAAGCGTATCCTCTGGGGTAAGTTCATCAACGCCGGTCAGACCTGTATAGCCCCGGAC
TACATCCTGTGCTCGAGGACCGTTCAGGACAAGTTTGTGGATGCAGCCAAGAATGTTCTG
CGGGAGTTTTATGGGGAAGATCCTCAGAAATCACCGGATCTCTGCAGAATCATTAATAAC
AGACACTTCAGTCGTCTGCAAGCATTGATTGATGCTAGCAAGGACAAAGTCGCTATTGGC
GGCCGATACGACTCGCAGGACAAATACATTGCTCCAACGTTACTAGCGAATGTCACTGCC
AGTGACGTCATCATGAAGGACGAGATATTTGGACCTATCCTGCCCATTGTGCCTGTGGAG
AACGCCTATGAAGCCATAAAATTCATTAACGAAAGGGAACATCCGTTAGTTCTATACGTG
TTCAGTGTCCAGAGCAACATCCAACAGCTGTTCACACAGCAGACGCGTTCAGGCAGTCTG
TGTATCAATGACACTATAATGTTTTATGGCGTACAGGTGATGGTATTTGTAAATAGTTAT
GTATATAACGTTATGTTGTATGTAAACGATAATGTGGTGGTGGACGTCTGTAGGGAAAAG
CCGTTGGTGTTGTACGCGTTCACTACAGACGAGGAACTCGCTAAACGGATAGCGGAGAAC
ACGAGCAGCGGCGGCATGTGCATTAATGATACTGTCATGCAAATGGGAGTTGATACATTG
CCATTTGGCGGTGTCGGTAGTAGTGGCATGGGAGCCTACCACGGTAAGGCCTCATTTGAC
ACCTTCACACATAAAAAGAGCTGCTTAATAAGGAACTTCGCTGCTATTGGTGAAAGACTT
GGATCAGGCCGCTACCCTCCCTACACGGACGGTAAGCTGAGCTTCATTACAACCCTGATG
AGAAAACGCAACGGACCCTCCCTCAAATACCTCCCACACCTGATTGCCTTTGCTCTTGGA
GCCGGTGTGGCATACGGAATAGCCACTTGGCAGAAGATGTCGTCGGAGCACCTATAG

Protein sequence:

MTVGTLSSNKPKAVNMSEVVNKARDTFDSGVTKPIEWRRKQLKNLLRMYEENRNAMVEAL
VKDLRRSKMEAILLEVDYLINDIRNTIYNLDNWVAPVKPPKGLVNMLDDVVIYNDPYGVV
LIIGAWNYPLQLLLLPLAGAIAAGNAVILKPSELAEASAKFMVETLPKYVDSDAIILVEG
GPEETSELLKQRFDYIFYTGGTNVGRIVYAAATKNLTPVTLELGGKSPVYIDNTVDIEVT
AKRILWGKFINAGQTCIAPDYILCSRTVQDKFVDAAKNVLREFYGEDPQKSPDLCRIINN
RHFSRLQALIDASKDKVAIGGRYDSQDKYIAPTLLANVTASDVIMKDEIFGPILPIVPVE
NAYEAIKFINEREHPLVLYVFSVQSNIQQLFTQQTRSGSLCINDTIMFYGVQVMVFVNSY
VYNVMLYVNDNVVVDVCREKPLVLYAFTTDEELAKRIAENTSSGGMCINDTVMQMGVDTL
PFGGVGSSGMGAYHGKASFDTFTHKKSCLIRNFAAIGERLGSGRYPPYTDGKLSFITTLM
RKRNGPSLKYLPHLIAFALGAGVAYGIATWQKMSSEHL
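
The nucleotide and protein sequences above can be cross-checked against each other. The listed CDS length of 1737 nt corresponds to 579 codons, that is, 578 amino acids plus a stop codon, which matches the length of the protein sequence. The following is a minimal sketch of such a check, assuming the standard nuclear genetic code; the function name `translate_cds` and the variable names are illustrative and not part of the database record. Only the first five and the last four codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed), in the usual compact TCAG ordering:
# the third base varies fastest, then the second, then the first.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds):
    """Translate a CDS to protein; hypothetical helper, not from the record."""
    # Remove line breaks and spaces so a pasted multi-line sequence works.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    # Drop a single trailing stop codon, if present.
    return protein[:-1] if protein.endswith("*") else protein


# First five codons and last four codons of the nucleotide sequence above.
cds_start = translate_cds("ATGACTGTCGGAACA")
cds_end = translate_cds("GAGCACCTATAG")

# Length bookkeeping: 578 residues plus one stop codon, three bases each.
expected_cds_length = 3 * (578 + 1)
```

Passing the full nucleotide sequence to `translate_cds` in place of the two fragments would give the complete translation for comparison with the protein sequence.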