DPGLEAN00018 in OGS1.0

New model in OGS2.0DPOGS211220 
Genomic Positionscaffold93:- 164323-166992
See gene structure
CDS Length1431
Paired RNAseq reads  742
Single RNAseq reads  2260
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003199 (0.0)
Best Drosophila hit  dopa decarboxylase, isoform C (0.0)
Best Human hitaromatic-L-amino-acid decarboxylase (5e-168)
Best NR hit (blastp)  dopa-decarboxylase [Antheraea pernyi] (0.0)
Best NR hit (blastx)  dopa-decarboxylase [Antheraea pernyi] (0.0)
GeneOntology terms








  
GO:0006585 dopamine biosynthetic process from tyrosine
GO:0006587 serotonin biosynthetic process from tryptophan
GO:0004058 aromatic-L-amino-acid decarboxylase activity
GO:0007611 learning or memory
GO:0006584 catecholamine metabolic process
GO:0007619 courtship behavior
GO:0008062 eclosion rhythm
GO:0048066 developmental pigmentation
GO:0030170 pyridoxal phosphate binding
GO:0040007 growth
InterPro families




  
IPR021115 Pyridoxal-phosphate binding site
IPR015424 Pyridoxal phosphate-dependent transferase, major domain
IPR010977 Aromatic-L-amino-acid decarboxylase
IPR002129 Pyridoxal phosphate-dependent decarboxylase
IPR015421 Pyridoxal phosphate-dependent transferase, major region, subdomain 1
IPR015422 Pyridoxal phosphate-dependent transferase, major region, subdomain 2
Orthology groupMCL13834

Nucleotide sequence:

ATGGAAGCTAAGGAGTTCAAGGATTTCGCGAAGGCAATGGCTGATTACATCGCCGAGTAC
TTGGAAAATATACGCGACAGGCAAGTGGTGCCCTCAGTGAAGCCAGGGTATTTAAGACCC
TTGGTACCGGAGCAGGCTCCAGAGAAGCCTGAACCCTGGACGGCCGTAATGGCTGATATT
GAGAGAGTGGTTATGTCAGGAGTTACCCACTGGCATTCCCCACGTTTCCATGCCTATTTC
CCCACTGCCAACTCCTATCCATCGATCGTTGCAGATATGTTAAGCGGAGCCATCGCCTGC
ATTGGTTTCACCTGGATCGCAAGTCCAGCTTGCACCGAGCTTGAGGTGGTGATGCTGGAT
TGGCTAGGCCAGATGCTGGGTCTGCCAGAAGAATTCTTGGCTCGCTCTGGTGGCGAGGGC
GGTGGCGTCATTCAGGGTACAGCCAGTGAAGCCACATTGGTCGCTCTGTTAGGAGCTAAG
GCCCGAGCGATGCAGAGGACTAAGGAACAGCATCCAGACTGGACGGAAGTTGAAATCCTG
TCCAAACTTGTTGGATATTGTAATAAACAAGCTCATTCGTCTGTCGAGCGAGCTGGTCTC
CTCGGTGGAGTAAAGCTCCGCAGCCTGAAGCACGATGACAAGAGACGCCTGCGCGGAGAT
ACCTTGAAAGAGGCTATCGACGAAGATATCAAGAATGGATTGATACCGTTTTATGTCGTC
GCAACCTTAGGCACAACATCATCGTGCGCTTTCGACGCTCTGGACGAGATAGGCGACGTC
TGCAAGTCGCATGATGTGTGGCTTCATGTGGACGCAGCCTACGCCGGCTCCGCGTTCATC
TGTCCAGAGTACCGCCACCTCATGAAGGGAGTCGAAAAGGCTGATTCGTTCAACTTCAAC
CCTCACAAGTGGATGCTGGTTAACTTCGACTGTTCCGCCATGTGGCTGAAACAGCCGCGT
TGGATCGTTGACGCCTTCAACGTCGATCCTTTATACTTGAAACACGACATGCAAGGATCA
GCGCCGGACTACCGTCACTGGCAGATACCTCTCGGAAGACGCTTCCGATCCCTTAAACTA
TGGTTCGTGTTGAGACTGTATGGAGTTGAGAACATTCAGAACTTCATCCGTAAACATATT
GGACTGGCTCACCTTTTCGAAAAACTCTGTCTTGATGACGAAAGATTCGAACTTTTCGAA
GAGGTCACTATGGGCTTAGTTTGCTTCAGACTCAAAGGTGATAATGAAACTAATGAGGCT
CTCTTGAGACGTATTAATGGACGCGGGAAGATTCATCTTGTACCTTCAAAAGTGGATGAC
GTTTATTTCCTAAGATTTGCTGTTTGCTCGCGTTTCACTGAAGAAAGTGATATTCAAAGC
TCGTGGGAAGAAATAAAGACATCGGCTGATGAAGTCCTAGCAGAAAAATAG

Protein sequence:

MEAKEFKDFAKAMADYIAEYLENIRDRQVVPSVKPGYLRPLVPEQAPEKPEPWTAVMADI
ERVVMSGVTHWHSPRFHAYFPTANSYPSIVADMLSGAIACIGFTWIASPACTELEVVMLD
WLGQMLGLPEEFLARSGGEGGGVIQGTASEATLVALLGAKARAMQRTKEQHPDWTEVEIL
SKLVGYCNKQAHSSVERAGLLGGVKLRSLKHDDKRRLRGDTLKEAIDEDIKNGLIPFYVV
ATLGTTSSCAFDALDEIGDVCKSHDVWLHVDAAYAGSAFICPEYRHLMKGVEKADSFNFN
PHKWMLVNFDCSAMWLKQPRWIVDAFNVDPLYLKHDMQGSAPDYRHWQIPLGRRFRSLKL
WFVLRLYGVENIQNFIRKHIGLAHLFEKLCLDDERFELFEEVTMGLVCFRLKGDNETNEA
LLRRINGRGKIHLVPSKVDDVYFLRFAVCSRFTEESDIQSSWEEIKTSADEVLAEK