DPGLEAN12003 in OGS1.0

New model in OGS2.0  DPOGS205057
Genomic Position  scaffold2402:+ 48981-55971
CDS Length  1851
Paired RNAseq reads  181
Single RNAseq reads  458
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA006804 (0.0)
Best Drosophila hit  tyrosine decarboxylase 2 (0.0)
Best Human hit  aromatic-L-amino-acid decarboxylase (1e-144)
Best NR hit (blastp)  aromatic amino acid decarboxylase [Culex quinquefasciatus] (0.0)
Best NR hit (blastx)  aromatic amino acid decarboxylase [Culex quinquefasciatus] (0.0)
GeneOntology terms
GO:0004058 aromatic-L-amino-acid decarboxylase activity
GO:0006519 cellular amino acid and derivative metabolic process
GO:0030170 pyridoxal phosphate binding
GO:0019752 carboxylic acid metabolic process
GO:0018991 oviposition
GO:0004837 tyrosine decarboxylase activity
GO:0007626 locomotory behavior
GO:0048148 behavioral response to cocaine
InterPro families
IPR002129 Pyridoxal phosphate-dependent decarboxylase
IPR015421 Pyridoxal phosphate-dependent transferase, major region, subdomain 1
IPR015422 Pyridoxal phosphate-dependent transferase, major region, subdomain 2
IPR015424 Pyridoxal phosphate-dependent transferase, major domain
IPR010977 Aromatic-L-amino-acid decarboxylase
Orthology group  MCL10577

Nucleotide sequence:

ATGGACGTCGAAGAGTTCCGTGTTCGGGGTAAAGAAATGGTTGACTACATCTGTACCTAT
ATGACGACCCTGTCGAAGCGGAGGGTGACTCCATCGGTTGAGCCCGGTTACCTCCGCACG
GAACTGCCGACGGAGGCTCCTTTCCTTCCTGAAAATTGGAATGATGTGATGGAAGATGTG
GAAAATAAGATTATGCCGGGCGTCACACATTGGCAGCATCCTCGGTTTCATGCATATTTC
CCATCGGGCAATGGCTACCCCTCAATACTTGGTGACATGCTCTCCGCAGGCATCGGCTGT
ATCGGATTTTCATGGGCTGCGAGTCCAGCTTGCACGGAATTAGAAATTATAATGTTGGAT
TGGATGGGAAAAGCTATAGGGTTGCCTCCCGCTTTTCTGCAACTTGAGGAAGGAAGCAAG
GGCGGTGGCGTTATTCAGGGATCAGCCAGCGAGTGTGTACTTGTGTGCATGTTAGCCGCA
AGAGCTGCAGGGATCAAGCGATTGAAGCATCAATTCCCGACCGTCGATGAGGGGCTGTTA
CTTTCAAAGTTAATTGCTTATTGTTCCAAAGAAGCACACTCTTGTGTTGAGAAAGCTGCT
ATGATAAGTTTCGTTAAACTGCGTATTCTACAGCCGGACGAACACGGTTCACTTAGAGGG
GATACATTAAAAGAAGCAATGGAAGAAGATGAAGAAGCGGGACTAGTCCCATTTTTCGTT
TCAGCAACGCTAGGGACAACAGGGACGTGTGCATTTGATAATTTGTCCGAAATTGGACCG
GTAGTTCGGAAATTTCCTAGCGTTTGGCTGCATGTAGACGCTGCGTATGCTGGCAGCTCA
TTTATCTGCCCTGAACATAAATATCATCTGGCAGGAATTGAATATGCTGACTCATTTAAT
ACTAATTCAAATAAAATGATGCTCACCAACTTTGATTGTTCTTTAATGTGGGTCACAAAC
AGATATCTATTGACATCTGCTTTAGTCGTCGATCCGTTGTATTTACAACATTGTTATGAC
GGTACCGCAATCGATTACCGCCACTGGGGAATACCGCTCAGCCGTCGCTTCAGATCACTG
AAGTTGTGGTTCATGTTGAGGAGTTATGGAATCAGTGGCCTGCAGAAATATATACGAAGA
CATTGCGAACTCGCTAAGTATTTCGAACAACTTGTTAAAAAGGACAAGAGATTCGAAGTA
TGCAACCAAGTTAAGTTGGGATTAGTATGCTTTCGATTGGTAGGGAGTCGCGACGAAAAT
GAGGAACAAGTTGATGAGTTGAATAAGAAACTGCTTACTAACATCAATGCTTCTGGAAAG
CTCCACATGGTGCCCACTTCTTTTCGTGATCGATACGTGATTCGTTTCTGTGTTGTGCAC
CAACACGCTAGCCGTGAAGATATTGAATATGCTTGGGATACCATAACTGACTTCGCAGAA
GAATTATACGAAGGTCCCGATAAGGAAAGGGATTTGAATGAGGAAAGGGCACGTAAGCAT
CTGCAAGCTCTCGCTCATAAGCGTTCGTTCTTCGTGCGCATGGTGAGCGACCCGAAGATC
TACAATCCTGCCATTAACAAGACCCCGCCGCCAATCCCAACTAGCCCAACTACCCCACCA
GCACCCGCCGCCCCCGATACACCATCGGAAACCGATCCGATGACACCGAAACAATCGTCG
TGGATAAGTTGGCCACTTGCTTTCTTCTTCCAAAGCGTTGACCATGATTCTAATGATTTG
CCTTTGAGGTTCCGGCATTTAGATACGATGGTACGTCTAAAGAGCCCACAGATACGCAGA
GGTTCATCGCCAGGCGTGTCCCCCGAGCGTCGGCCCTCCCCTGCAAACTGA

Protein sequence:

MDVEEFRVRGKEMVDYICTYMTTLSKRRVTPSVEPGYLRTELPTEAPFLPENWNDVMEDV
ENKIMPGVTHWQHPRFHAYFPSGNGYPSILGDMLSAGIGCIGFSWAASPACTELEIIMLD
WMGKAIGLPPAFLQLEEGSKGGGVIQGSASECVLVCMLAARAAGIKRLKHQFPTVDEGLL
LSKLIAYCSKEAHSCVEKAAMISFVKLRILQPDEHGSLRGDTLKEAMEEDEEAGLVPFFV
SATLGTTGTCAFDNLSEIGPVVRKFPSVWLHVDAAYAGSSFICPEHKYHLAGIEYADSFN
TNSNKMMLTNFDCSLMWVTNRYLLTSALVVDPLYLQHCYDGTAIDYRHWGIPLSRRFRSL
KLWFMLRSYGISGLQKYIRRHCELAKYFEQLVKKDKRFEVCNQVKLGLVCFRLVGSRDEN
EEQVDELNKKLLTNINASGKLHMVPTSFRDRYVIRFCVVHQHASREDIEYAWDTITDFAE
ELYEGPDKERDLNEERARKHLQALAHKRSFFVRMVSDPKIYNPAINKTPPPIPTSPTTPP
APAAPDTPSETDPMTPKQSSWISWPLAFFFQSVDHDSNDLPLRFRHLDTMVRLKSPQIRR
GSSPGVSPERRPSPAN
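
Consistency check (sketch):

The nucleotide and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The sketch below is a minimal illustration, not part of the record itself: the function name `translate` and the table layout are illustrative choices, and it assumes the CDS is given in frame, starting at the ATG, with no ambiguity codes. A CDS of 1851 nt corresponds to 617 codons, that is, 616 residues plus one stop codon, which agrees with the 616-residue protein listed.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS; stop codons are rendered as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGACGTCGAAGAGTTCCGTGTTCGGGGTAAAGAAATGGTTGACTACATCTGTACCTAT"
last_line = "GGTTCATCGCCAGGCGTGTCCCCCGAGCGTCGGCCCTCCCCTGCAAACTGA"

print(translate(first_line))  # MDVEEFRVRGKEMVDYICTY
print(translate(last_line))   # GSSPGVSPERRPSPAN*
```

Passing the full 1851-nt sequence to `translate` and dropping the trailing `*` should give a string that can be compared directly with the protein sequence listed above.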