New model in OGS2.0 | DPOGS205057  |
---|---|
Genomic Position | scaffold2402:+ 48981-55971 |
See gene structure | |
CDS Length | 1851 |
Paired RNAseq reads   | 181 |
Single RNAseq reads   | 458 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA006804 (0.0) |
Best Drosophila hit   | tyrosine decarboxylase 2 (0.0) |
Best Human hit | aromatic-L-amino-acid decarboxylase (1e-144) |
Best NR hit (blastp)   | aromatic amino acid decarboxylase [Culex quinquefasciatus] (0.0) |
Best NR hit (blastx)   | aromatic amino acid decarboxylase [Culex quinquefasciatus] (0.0) |
GeneOntology terms    | GO:0004058 aromatic-L-amino-acid decarboxylase activity GO:0006519 cellular amino acid and derivative metabolic process GO:0030170 pyridoxal phosphate binding GO:0019752 carboxylic acid metabolic process GO:0018991 oviposition GO:0004837 tyrosine decarboxylase activity GO:0007626 locomotory behavior GO:0048148 behavioral response to cocaine |
InterPro families    | IPR002129 Pyridoxal phosphate-dependent decarboxylase IPR015421 Pyridoxal phosphate-dependent transferase, major region, subdomain 1 IPR015422 Pyridoxal phosphate-dependent transferase, major region, subdomain 2 IPR015424 Pyridoxal phosphate-dependent transferase, major domain IPR010977 Aromatic-L-amino-acid decarboxylase |
Orthology group | MCL10577 |
Nucleotide sequence:
ATGGACGTCGAAGAGTTCCGTGTTCGGGGTAAAGAAATGGTTGACTACATCTGTACCTAT
ATGACGACCCTGTCGAAGCGGAGGGTGACTCCATCGGTTGAGCCCGGTTACCTCCGCACG
GAACTGCCGACGGAGGCTCCTTTCCTTCCTGAAAATTGGAATGATGTGATGGAAGATGTG
GAAAATAAGATTATGCCGGGCGTCACACATTGGCAGCATCCTCGGTTTCATGCATATTTC
CCATCGGGCAATGGCTACCCCTCAATACTTGGTGACATGCTCTCCGCAGGCATCGGCTGT
ATCGGATTTTCATGGGCTGCGAGTCCAGCTTGCACGGAATTAGAAATTATAATGTTGGAT
TGGATGGGAAAAGCTATAGGGTTGCCTCCCGCTTTTCTGCAACTTGAGGAAGGAAGCAAG
GGCGGTGGCGTTATTCAGGGATCAGCCAGCGAGTGTGTACTTGTGTGCATGTTAGCCGCA
AGAGCTGCAGGGATCAAGCGATTGAAGCATCAATTCCCGACCGTCGATGAGGGGCTGTTA
CTTTCAAAGTTAATTGCTTATTGTTCCAAAGAAGCACACTCTTGTGTTGAGAAAGCTGCT
ATGATAAGTTTCGTTAAACTGCGTATTCTACAGCCGGACGAACACGGTTCACTTAGAGGG
GATACATTAAAAGAAGCAATGGAAGAAGATGAAGAAGCGGGACTAGTCCCATTTTTCGTT
TCAGCAACGCTAGGGACAACAGGGACGTGTGCATTTGATAATTTGTCCGAAATTGGACCG
GTAGTTCGGAAATTTCCTAGCGTTTGGCTGCATGTAGACGCTGCGTATGCTGGCAGCTCA
TTTATCTGCCCTGAACATAAATATCATCTGGCAGGAATTGAATATGCTGACTCATTTAAT
ACTAATTCAAATAAAATGATGCTCACCAACTTTGATTGTTCTTTAATGTGGGTCACAAAC
AGATATCTATTGACATCTGCTTTAGTCGTCGATCCGTTGTATTTACAACATTGTTATGAC
GGTACCGCAATCGATTACCGCCACTGGGGAATACCGCTCAGCCGTCGCTTCAGATCACTG
AAGTTGTGGTTCATGTTGAGGAGTTATGGAATCAGTGGCCTGCAGAAATATATACGAAGA
CATTGCGAACTCGCTAAGTATTTCGAACAACTTGTTAAAAAGGACAAGAGATTCGAAGTA
TGCAACCAAGTTAAGTTGGGATTAGTATGCTTTCGATTGGTAGGGAGTCGCGACGAAAAT
GAGGAACAAGTTGATGAGTTGAATAAGAAACTGCTTACTAACATCAATGCTTCTGGAAAG
CTCCACATGGTGCCCACTTCTTTTCGTGATCGATACGTGATTCGTTTCTGTGTTGTGCAC
CAACACGCTAGCCGTGAAGATATTGAATATGCTTGGGATACCATAACTGACTTCGCAGAA
GAATTATACGAAGGTCCCGATAAGGAAAGGGATTTGAATGAGGAAAGGGCACGTAAGCAT
CTGCAAGCTCTCGCTCATAAGCGTTCGTTCTTCGTGCGCATGGTGAGCGACCCGAAGATC
TACAATCCTGCCATTAACAAGACCCCGCCGCCAATCCCAACTAGCCCAACTACCCCACCA
GCACCCGCCGCCCCCGATACACCATCGGAAACCGATCCGATGACACCGAAACAATCGTCG
TGGATAAGTTGGCCACTTGCTTTCTTCTTCCAAAGCGTTGACCATGATTCTAATGATTTG
CCTTTGAGGTTCCGGCATTTAGATACGATGGTACGTCTAAAGAGCCCACAGATACGCAGA
GGTTCATCGCCAGGCGTGTCCCCCGAGCGTCGGCCCTCCCCTGCAAACTGA
Protein sequence:
MDVEEFRVRGKEMVDYICTYMTTLSKRRVTPSVEPGYLRTELPTEAPFLPENWNDVMEDV
ENKIMPGVTHWQHPRFHAYFPSGNGYPSILGDMLSAGIGCIGFSWAASPACTELEIIMLD
WMGKAIGLPPAFLQLEEGSKGGGVIQGSASECVLVCMLAARAAGIKRLKHQFPTVDEGLL
LSKLIAYCSKEAHSCVEKAAMISFVKLRILQPDEHGSLRGDTLKEAMEEDEEAGLVPFFV
SATLGTTGTCAFDNLSEIGPVVRKFPSVWLHVDAAYAGSSFICPEHKYHLAGIEYADSFN
TNSNKMMLTNFDCSLMWVTNRYLLTSALVVDPLYLQHCYDGTAIDYRHWGIPLSRRFRSL
KLWFMLRSYGISGLQKYIRRHCELAKYFEQLVKKDKRFEVCNQVKLGLVCFRLVGSRDEN
EEQVDELNKKLLTNINASGKLHMVPTSFRDRYVIRFCVVHQHASREDIEYAWDTITDFAE
ELYEGPDKERDLNEERARKHLQALAHKRSFFVRMVSDPKIYNPAINKTPPPIPTSPTTPP
APAAPDTPSETDPMTPKQSSWISWPLAFFFQSVDHDSNDLPLRFRHLDTMVRLKSPQIRR
GSSPGVSPERRPSPAN