DPGLEAN19146 in OGS1.0

New model in OGS2.0  DPOGS206635
Genomic Position  scaffold174:- 309712-316655
CDS Length  1569
Paired RNAseq reads  2695
Single RNAseq reads  7280
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA011482 (7e-39)
Best Drosophila hit  CG17896, isoform B (0.0)
Best Human hit  methylmalonate-semialdehyde dehydrogenase [acylating], mitochondrial precursor (5e-170)
Best NR hit (blastp)  AGAP002499-PA [Anopheles gambiae str. PEST] (0.0)
Best NR hit (blastx)  AGAP002499-PA [Anopheles gambiae str. PEST] (0.0)
Gene Ontology terms
GO:0004491 methylmalonate-semialdehyde dehydrogenase (acylating) activity
GO:0006573 valine metabolic process
GO:0018478 malonate-semialdehyde dehydrogenase (acetylating) activity
GO:0019859 thymine metabolic process
InterPro families
IPR016160 Aldehyde dehydrogenase, conserved site
IPR015590 Aldehyde dehydrogenase domain
IPR010061 Methylmalonate-semialdehyde dehydrogenase
IPR016161 Aldehyde/histidinol dehydrogenase
IPR016162 Aldehyde dehydrogenase, N-terminal
IPR016163 Aldehyde dehydrogenase, C-terminal
Orthology group  MCL16115

Nucleotide sequence:

ATGGCGCTGAATCTCTTGAAACTTATTAAATCAGAATCTCATATATTGTTACGAAGACAC
TATAGCAGTTCAGCACCGTCAACTAAGTTATACATAGATGGACAATATGTGGAATCAAAA
ACTAGCAACTGGATTGAACTCACCAATCCCGCAACTAATGAAGTTATCGGCAGGGTACCA
GAGGCGACTCAAGAGGAATTGAATTTGGCACTGGAAGCTGCTAAGAAGGCATACAAATCA
TGGAGTCAGAGCACTGTATTAACTCGTCAACAACTCATGTTGAAGTTTGCTCGTCTTCTA
AGAGAAAATCAAAGTAAATTAGCAGCTAAAATAACAGAGGAGCAAGGAAAAACTATAGCT
GATGCTGAGGGAGATGTACTTAGAGGAATTCAATCTGTGGAGCACTGTTGCAGTATAACA
AGTTTGCAGCTCGGTGATTGTATACAGAACATAGCTAAAGATATGGACACACATAGCTAT
AAAGTACCACTTGGAGTCACCGCTGGAGTAGCAGCATTCAACTTCCCAGTAATGATACCT
TTATGGATGTTCCCACCCGCATTAGTGACTGGTAACACTTGTATCATCAAACCATCGGAG
CAGGACCCCGGTGCCACCCTTATGATGATGGAACTGCTGCAGGAGGCCGGAGCTCCCGCT
GGAGTGGTTAATGTTGTTCACGGAACTCATGACCCTGTGAACTTCATATGTGATCACCCT
GACATCAAAGCTGTGTCATTCGTAGGGGGTGATGCAGCTGGGAAACATATCTACAGCAGG
GCTTCGGCTGCCGGCAAGCGTGTTCAGAGCAATATGGGTGCCAAAAACCATGGGGTCATA
ATGCCGGATGCTAACAAAGAGCACACATTGAACCAATTGGCTGGAGCTGCGTTCGGAGCG
GCCGGACAAAGGTGTATGGCGCTCAGCACGGCCGTGTTTGTGGGTGAGGCCAAAGAATGG
ATACCAGATTTGGTGAAACGAGCTGAAGCTCTCAAAGTTAATGCCGGTCATGTACCTGGC
ACTGATGTTGGTCCGGTCATCTCTGTTAGAGCAAAAGAGAGGATTCATAGGCTTGTTGAA
TCTGGAGCAAAAGAGGGCGCTAAAATCGTGCTTGACGGTAGAGGGGTCAAGGTTCAAGGC
TTCGAGAAAGGAAACTTCGTCGGTCCGACCATTCTCACTCACGTACAACCAAACATGGAA
TGCTACAGAGAAGAAATCTTCGGTCCTGTATTAATTTGTCTCTTTGTTGACACCTTGGAC
GAAGCTATTGAAATGATCAATTCAAATCCCTATGGTAACGGAACAGCCATCTTCACAACC
AACGGGGCGACCGCAAGGAAATTTTCTTCACAAATCGATGTTGGCCAAGTCGGAATAAAC
GTTCCCATACCAGTGCCATTGTCTATGTTCTCATTCAGCGGTAGCAGAGGTAGCTTTTTG
GGTACAAATCATTTCTGTGGCAAACAAGGTATCGACTTTTACACCGAATTAAAAACCGTT
GTATCATTCTGGAGACAGAGTGACGTATCTCACGCCAAGGCCGCCGTCTCTATGCCAACT
CAGCAATAA

Protein sequence:

MALNLLKLIKSESHILLRRHYSSSAPSTKLYIDGQYVESKTSNWIELTNPATNEVIGRVP
EATQEELNLALEAAKKAYKSWSQSTVLTRQQLMLKFARLLRENQSKLAAKITEEQGKTIA
DAEGDVLRGIQSVEHCCSITSLQLGDCIQNIAKDMDTHSYKVPLGVTAGVAAFNFPVMIP
LWMFPPALVTGNTCIIKPSEQDPGATLMMMELLQEAGAPAGVVNVVHGTHDPVNFICDHP
DIKAVSFVGGDAAGKHIYSRASAAGKRVQSNMGAKNHGVIMPDANKEHTLNQLAGAAFGA
AGQRCMALSTAVFVGEAKEWIPDLVKRAEALKVNAGHVPGTDVGPVISVRAKERIHRLVE
SGAKEGAKIVLDGRGVKVQGFEKGNFVGPTILTHVQPNMECYREEIFGPVLICLFVDTLD
EAIEMINSNPYGNGTAIFTTNGATARKFSSQIDVGQVGINVPIPVPLSMFSFSGSRGSFL
GTNHFCGKQGIDFYTELKTVVSFWRQSDVSHAKAAVSMPTQQ
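
Consistency check (illustrative sketch):

The record lists a CDS length of 1569, which corresponds to 523 codons, that is, a protein plus one stop codon. The Python sketch below shows one way such a nucleotide/protein pair could be sanity-checked. The function name `check_cds` and the toy input are hypothetical and are not part of the database; the check assumes the standard stop codons and does not translate the sequence.

```python
# Standard-code stop codons (assumption: nuclear genes, standard genetic code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_cds(cds, protein):
    """Return a list of inconsistencies between a CDS and its protein.

    An empty list means the basic checks passed. Whitespace and line
    breaks in either sequence are ignored, so the 60-column blocks shown
    in the record can be pasted in directly.
    """
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()
    problems = []
    if len(cds) % 3 != 0:
        problems.append("CDS length is not a multiple of 3")
    if not cds.startswith("ATG"):
        problems.append("CDS does not start with ATG")
    if cds[-3:] not in STOP_CODONS:
        problems.append("CDS does not end with a stop codon")
    # One codon per residue, plus the terminal stop codon.
    if len(cds) // 3 - 1 != len(protein):
        problems.append("codon count (minus stop) differs from protein length")
    return problems


# Toy input only: the first three codons of the CDS above plus a stop codon.
print(check_cds("ATGGCGCTGTAA", "MAL"))
```

Passing the full nucleotide and protein sequences from this record to the same function would apply the identical checks to the real data.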