New model in OGS2.0 | DPOGS206635  |
---|---|
Genomic Position | scaffold174:- 309712-316655 |
See gene structure | |
CDS Length | 1569 |
Paired RNAseq reads   | 2695 |
Single RNAseq reads   | 7280 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA011482 (7e-39) |
Best Drosophila hit   | CG17896, isoform B (0.0) |
Best Human hit | methylmalonate-semialdehyde dehydrogenase [acylating], mitochondrial precursor (5e-170) |
Best NR hit (blastp)   | AGAP002499-PA [Anopheles gambiae str. PEST] (0.0) |
Best NR hit (blastx)   | AGAP002499-PA [Anopheles gambiae str. PEST] (0.0) |
GeneOntology terms    | GO:0004491 methylmalonate-semialdehyde dehydrogenase (acylating) activity GO:0006573 valine metabolic process GO:0018478 malonate-semialdehyde dehydrogenase (acetylating) activity GO:0019859 thymine metabolic process |
InterPro families    | IPR016160 Aldehyde dehydrogenase, conserved site IPR015590 Aldehyde dehydrogenase domain IPR010061 Methylmalonate-semialdehyde dehydrogenase IPR016161 Aldehyde/histidinol dehydrogenase IPR016162 Aldehyde dehydrogenase, N-terminal IPR016163 Aldehyde dehydrogenase, C-terminal |
Orthology group | MCL16115 |
Nucleotide sequence:
ATGGCGCTGAATCTCTTGAAACTTATTAAATCAGAATCTCATATATTGTTACGAAGACAC
TATAGCAGTTCAGCACCGTCAACTAAGTTATACATAGATGGACAATATGTGGAATCAAAA
ACTAGCAACTGGATTGAACTCACCAATCCCGCAACTAATGAAGTTATCGGCAGGGTACCA
GAGGCGACTCAAGAGGAATTGAATTTGGCACTGGAAGCTGCTAAGAAGGCATACAAATCA
TGGAGTCAGAGCACTGTATTAACTCGTCAACAACTCATGTTGAAGTTTGCTCGTCTTCTA
AGAGAAAATCAAAGTAAATTAGCAGCTAAAATAACAGAGGAGCAAGGAAAAACTATAGCT
GATGCTGAGGGAGATGTACTTAGAGGAATTCAATCTGTGGAGCACTGTTGCAGTATAACA
AGTTTGCAGCTCGGTGATTGTATACAGAACATAGCTAAAGATATGGACACACATAGCTAT
AAAGTACCACTTGGAGTCACCGCTGGAGTAGCAGCATTCAACTTCCCAGTAATGATACCT
TTATGGATGTTCCCACCCGCATTAGTGACTGGTAACACTTGTATCATCAAACCATCGGAG
CAGGACCCCGGTGCCACCCTTATGATGATGGAACTGCTGCAGGAGGCCGGAGCTCCCGCT
GGAGTGGTTAATGTTGTTCACGGAACTCATGACCCTGTGAACTTCATATGTGATCACCCT
GACATCAAAGCTGTGTCATTCGTAGGGGGTGATGCAGCTGGGAAACATATCTACAGCAGG
GCTTCGGCTGCCGGCAAGCGTGTTCAGAGCAATATGGGTGCCAAAAACCATGGGGTCATA
ATGCCGGATGCTAACAAAGAGCACACATTGAACCAATTGGCTGGAGCTGCGTTCGGAGCG
GCCGGACAAAGGTGTATGGCGCTCAGCACGGCCGTGTTTGTGGGTGAGGCCAAAGAATGG
ATACCAGATTTGGTGAAACGAGCTGAAGCTCTCAAAGTTAATGCCGGTCATGTACCTGGC
ACTGATGTTGGTCCGGTCATCTCTGTTAGAGCAAAAGAGAGGATTCATAGGCTTGTTGAA
TCTGGAGCAAAAGAGGGCGCTAAAATCGTGCTTGACGGTAGAGGGGTCAAGGTTCAAGGC
TTCGAGAAAGGAAACTTCGTCGGTCCGACCATTCTCACTCACGTACAACCAAACATGGAA
TGCTACAGAGAAGAAATCTTCGGTCCTGTATTAATTTGTCTCTTTGTTGACACCTTGGAC
GAAGCTATTGAAATGATCAATTCAAATCCCTATGGTAACGGAACAGCCATCTTCACAACC
AACGGGGCGACCGCAAGGAAATTTTCTTCACAAATCGATGTTGGCCAAGTCGGAATAAAC
GTTCCCATACCAGTGCCATTGTCTATGTTCTCATTCAGCGGTAGCAGAGGTAGCTTTTTG
GGTACAAATCATTTCTGTGGCAAACAAGGTATCGACTTTTACACCGAATTAAAAACCGTT
GTATCATTCTGGAGACAGAGTGACGTATCTCACGCCAAGGCCGCCGTCTCTATGCCAACT
CAGCAATAA
Protein sequence:
MALNLLKLIKSESHILLRRHYSSSAPSTKLYIDGQYVESKTSNWIELTNPATNEVIGRVP
EATQEELNLALEAAKKAYKSWSQSTVLTRQQLMLKFARLLRENQSKLAAKITEEQGKTIA
DAEGDVLRGIQSVEHCCSITSLQLGDCIQNIAKDMDTHSYKVPLGVTAGVAAFNFPVMIP
LWMFPPALVTGNTCIIKPSEQDPGATLMMMELLQEAGAPAGVVNVVHGTHDPVNFICDHP
DIKAVSFVGGDAAGKHIYSRASAAGKRVQSNMGAKNHGVIMPDANKEHTLNQLAGAAFGA
AGQRCMALSTAVFVGEAKEWIPDLVKRAEALKVNAGHVPGTDVGPVISVRAKERIHRLVE
SGAKEGAKIVLDGRGVKVQGFEKGNFVGPTILTHVQPNMECYREEIFGPVLICLFVDTLD
EAIEMINSNPYGNGTAIFTTNGATARKFSSQIDVGQVGINVPIPVPLSMFSFSGSRGSFL
GTNHFCGKQGIDFYTELKTVVSFWRQSDVSHAKAAVSMPTQQ