DPGLEAN06375 in OGS1.0

New model in OGS2.0DPOGS202918 
Genomic Positionscaffold273:+ 201384-206634
See gene structure
CDS Length1992
Paired RNAseq reads  158
Single RNAseq reads  334
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004155 (2e-20)
Best Drosophila hit  ND
Best Human hitND
Best NR hit (blastp)  Aldehyde Dehydrogenase [Truepera radiovictrix DSM 17093] (4e-09)
Best NR hit (blastx)  TPA: endonuclease-reverse transcriptase [Schistosoma mansoni] (1e-12)
GeneOntology terms  ND
InterPro families
  
IPR016161 Aldehyde/histidinol dehydrogenase
IPR016163 Aldehyde dehydrogenase, C-terminal
Orthology groupMCL22683

Nucleotide sequence:

ATGTTACGGGTCCCTTCGGAGCATGTGGCCGATCCACTTCCACTTGCGACGTTTGATTTG
CAGGTCAATCTGTATCTCACCGCAGCGTTCCCACAGATTCACGTTGGAGATTTTCTCGGG
CCAGTAAATACCGAGAAAACGGCGAAGACACCATTTGACGAAGACCTGAGGCCGATGCGA
GATGTCTTCAGTAACTTCCACGTCTCGCATCCCTATAACAGCACGGTATTGACATTGGAC
TGGAATATTTTGAGTTTGACTCTCCTGCGGATTGATAATGTGTTACGACAGAAAACAGAC
CGTGTTGCTGGGACTTTATCGGCGGTGATTGAGAGAAACATGGAGGGTATTAAGTTCGAC
GAAGCAAATGTTAGCACAAAACTAGCGGAGGATTATTTGAAGAAAAATCCAATATTTAAA
TCAAAAGGATTCGCCGACAGTGACAGATTCACGTTAAACGATCACAAAAACACAATATAC
TATACTAATTATACGAGAAACCTCGGGGCTGTGGTGTTGACTAAAGATGTAAGTGACCTT
CCATTAAAATTGATGAACCTGGCCAAGAAAATTGAGGAGAATGTCGAACTTTTCTGCCAA
TTGGAACTGTTACTGCGAAAGATCCCCTTAGAAGATACAGAGCGCATAGATTTGAAACTC
ATGACACAAAACTTTATGTATCACAGTTCGTTAAAAAGCTCCAAGGAAGCGACGAGCACA
GCTCGCCAGATTACGTTCGAGTCGGCCACGCCGCTGGTGGCGCTGGGTTTCATCAGCCAG
GTGCTGATGGCTCACATGCCGAAGGTCGTCATTGAATGCACCAAGGAAACAGCTCCCGTC
ACCACCCTCTTCATTGAACTTTGTCACCAAGTCGGATTGGAGGGAAAGGTACTTCTAGCG
ATTTTACCTGAAGTTATAGAAGGCGGTTCCTCCAAAGTGACGATGGATTTTACGAAAATG
GGTGAATTTCTGAACGGCTGCGTGGGAGTTGTGAGCGGGAAAAGTGACATCGACTCCGCC
GTTGAGTACTTCTTGGACGCGTCCAGTCGATATCCCTGGGCGTTGAGGAAGATTTTTGTT
CAAGAAAATGCTGTGGAAAGATTCACCACCACCATGACCTGGAAGGAGGAGAAGCAGATG
GAAGTAACATCAGCCGGGAAAAGCAAAAGCGTTACGGACCGCGAATTATCATACTTCTTT
GGGGAAAAAACGTTTCTGATGAAGCCGGGTCGAGATAGCACTGAACATGACAATAATGTT
GTCATCCTTGAGGCTTTTAGGACGGTGAAGGAACTGATAGGTCTGCTTGCGAACGAGAAG
CCGTTCGCGCTCTCCATCTGGTGCAGTGATATTTCAGAGACGAATGAGCTCGCCCACAAC
GTAGACGCCAGCATCGTGTGGGTCAACGACTTCGCCAACTTCGAAGGACCACCTCGCTCT
TCGCAAGCCTTCTTTTCCCTCATCGATATTTACTTCAGTTCTCAGCATATCGAACAATTT
CCTGAAATGGCTGAATTGACGAAACTCAAAGAATCGTGGTTGGAACTCAGCGTTGAGCAA
AGAAGAGCTGTCGTAAGAGATGCATTGGCAAAAATAGATAGGAACCAGTCCAAGAAAATG
TTAGATGTCCTTGATGATATGACGACGGAAATGGAAAGTTTCGTTTACCTCACCAAGAAT
ATGATAGCGACTGGCATTGAACTACAACCGCAAGCGATGATGCCGAGCGCGATGTACGAC
CACGGACTGGAATCGTCGATAATGTCTTACATTATAAAGGGTGGTGCCATCCTGCTACAC
ATACCTCCGCGTGATCTGCCAGTAAAAAGCAAAGATATTTCTTACATGTTCTATGATAAC
TTACGGAACATGGCCGGTCCCGTACTATTTCTAGAGAAAGAGTATAAAATCGGTGACGTG
TCTGTTATAAGGCATCCAATTAAAAGATATAAAGTCATTTGGACCAATTTCGGAACGATA
TTCGCTAATTAA

Protein sequence:

MLRVPSEHVADPLPLATFDLQVNLYLTAAFPQIHVGDFLGPVNTEKTAKTPFDEDLRPMR
DVFSNFHVSHPYNSTVLTLDWNILSLTLLRIDNVLRQKTDRVAGTLSAVIERNMEGIKFD
EANVSTKLAEDYLKKNPIFKSKGFADSDRFTLNDHKNTIYYTNYTRNLGAVVLTKDVSDL
PLKLMNLAKKIEENVELFCQLELLLRKIPLEDTERIDLKLMTQNFMYHSSLKSSKEATST
ARQITFESATPLVALGFISQVLMAHMPKVVIECTKETAPVTTLFIELCHQVGLEGKVLLA
ILPEVIEGGSSKVTMDFTKMGEFLNGCVGVVSGKSDIDSAVEYFLDASSRYPWALRKIFV
QENAVERFTTTMTWKEEKQMEVTSAGKSKSVTDRELSYFFGEKTFLMKPGRDSTEHDNNV
VILEAFRTVKELIGLLANEKPFALSIWCSDISETNELAHNVDASIVWVNDFANFEGPPRS
SQAFFSLIDIYFSSQHIEQFPEMAELTKLKESWLELSVEQRRAVVRDALAKIDRNQSKKM
LDVLDDMTTEMESFVYLTKNMIATGIELQPQAMMPSAMYDHGLESSIMSYIIKGGAILLH
IPPRDLPVKSKDISYMFYDNLRNMAGPVLFLEKEYKIGDVSVIRHPIKRYKVIWTNFGTI
FAN