DPGLEAN19908 in OGS1.0

New model in OGS2.0DPOGS209764 
Genomic Positionscaffold92:+ 47760-51928
See gene structure
CDS Length1401
Paired RNAseq reads  48
Single RNAseq reads  140
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005403 (4e-127)
Best Drosophila hit  CG7998 (9e-39)
Best Human hitmalate dehydrogenase, mitochondrial precursor (1e-31)
Best NR hit (blastp)  malate dehydrogenase, putative [Pediculus humanus corporis] (7e-55)
Best NR hit (blastx)  malate dehydrogenase, putative [Pediculus humanus corporis] (2e-45)
GeneOntology terms






  
GO:0030060 L-malate dehydrogenase activity
GO:0005759 mitochondrial matrix
GO:0006099 tricarboxylic acid cycle
GO:0055114 oxidation reduction
GO:0005488 binding
GO:0006108 malate metabolic process
GO:0006096 glycolysis
GO:0005811 lipid particle
InterPro families



  
IPR010097 Malate dehydrogenase, type 1
IPR022383 Lactate/malate dehydrogenase, C-terminal
IPR001236 Lactate/malate dehydrogenase, N-terminal
IPR015955 Lactate dehydrogenase/glycoside hydrolase, family 4, C-terminal
IPR016040 NAD(P)-binding domain
Orthology groupMCL39954

Nucleotide sequence:

ATGACTCTCCCAGATCAGAGTAATTTAGTAAGATGGGGTAAAGATACCGGAAAGACTTGC
TATATCTGTGGGAAGGCAGTTGAAACTGCTAAGCATTTGTTAGTGGGATGTAAGGTACTC
CTGGATAGCGGTCAATACTCGCATCGTCACGATAGGGTTCTGGAAATCATACGTAAAGGT
CAGCCACTCGCTCTTCTTCTAAAACAATGTCCATTATTGGATGAGATAGCACTTTATGAT
ATTTGTGCAACTTGTGGCTACGGCATGGAGTTGAGTCACGTTGATACAAAATGTAAGGTC
TCTTCTTTCTCTGGCAGACATATGCTTTGCGATGCTCTCAAGGGCTCAAGAGTTGTTGTA
ATAGTCGCTCGAAACGAATGCGATTCATTTGAAAATAGTGCCCCAATTGTCACAGAGATT
GCATTACAAATTTGTAACACATGTCCCCAGGCATTTACAATCGTAGCCACGGAACCAGTG
GAGAGTATGGTTCCGTTAGTCAGCGAGATACAAAGACTACGTTCGCAATACAATCCAAGA
TTTCTACTTGGATGTGTAGAGCTGAACTGTGTGCGAGCTAATACGGTCTTGGCAGATTTT
CTTAGAGTACCGCCAGAGTCAGTTAGAGTTCCGGTGGTAGGAGGCGCTACTCCAGAGACC
ATGGTTCCCGTACTCTCCGCAGCTGTACATCCTTGCACACTGTCGCAGGAACAGACGGAA
TGTGCTACCTCATGTATAATGAGCGGCAACGAAGCTGTATGTGCTGCTAAAGGTTGCGCG
ACAGCAACTGCATGTCTTTCGGGAGCCTTTGCTGTGGCTCGTACTACGATCAATGTGGTG
AAAGGTTTACAGGGTAGGAAGAACGTTGTGCAGTGTGCTTATGTAGACAGTCTCGGAACA
TGTGCTCCGGGATGTCAGTTTTTTGCTAGTGAGGTTATCCTCGGACCAGCTGGTGTAGAA
AAGAATTTAGGTATACCAGAGCTTTCTAAATTTGAAAACTGTCTCTTATGCCACTGTCTA
CCGTATGTCCGTAATGAAATCGCTCGTGCAATTTGGCTCGTGTACACGATGTGCCAGCAG
TGCTGCTGTTATGGATGCACTGTTCATCCCAGCACATGCTACACTCCGCCCATAGTTCCC
TGCGTACCACCAACCAACTGGACCTGTGACTGTCCCGACGCTTGCCGAGATGAATACCTC
GCCTCCATCTGTCGTGAGATGACCTGCATGTGCGGTAGTACAGCGCTGTGCTGGAGGCCG
CGGGAAGCCGACTATGATGCCAAACGAGCCTCGAACCTTACACATCAAATGCCGTTGAGG
AGCGCCGCCTGTAGTGTTTGTAACGTGCCACGAAGCGTGCGAATTCAACAGGCCTTACGA
GAAAAGAAGGGAGATTTTTAA

Protein sequence:

MTLPDQSNLVRWGKDTGKTCYICGKAVETAKHLLVGCKVLLDSGQYSHRHDRVLEIIRKG
QPLALLLKQCPLLDEIALYDICATCGYGMELSHVDTKCKVSSFSGRHMLCDALKGSRVVV
IVARNECDSFENSAPIVTEIALQICNTCPQAFTIVATEPVESMVPLVSEIQRLRSQYNPR
FLLGCVELNCVRANTVLADFLRVPPESVRVPVVGGATPETMVPVLSAAVHPCTLSQEQTE
CATSCIMSGNEAVCAAKGCATATACLSGAFAVARTTINVVKGLQGRKNVVQCAYVDSLGT
CAPGCQFFASEVILGPAGVEKNLGIPELSKFENCLLCHCLPYVRNEIARAIWLVYTMCQQ
CCCYGCTVHPSTCYTPPIVPCVPPTNWTCDCPDACRDEYLASICREMTCMCGSTALCWRP
READYDAKRASNLTHQMPLRSAACSVCNVPRSVRIQQALREKKGDF