DPGLEAN17985 in OGS1.0

New model in OGS2.0DPOGS206235 
Genomic Positionscaffold1155:- 16126-23837
See gene structure
CDS Length2262
Paired RNAseq reads  118
Single RNAseq reads  431
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA009691 (0.0)
Best Drosophila hit  histidine decarboxylase (0.0)
Best Human hithistidine decarboxylase (3e-148)
Best NR hit (blastp)  PREDICTED: similar to ENSANGP00000017218 [Nasonia vitripennis] (0.0)
Best NR hit (blastx)  PREDICTED: similar to ENSANGP00000017218 [Nasonia vitripennis] (0.0)
GeneOntology terms




  
GO:0004398 histidine decarboxylase activity
GO:0042051 compound eye photoreceptor development
GO:0006519 cellular amino acid and derivative metabolic process
GO:0019752 carboxylic acid metabolic process
GO:0030170 pyridoxal phosphate binding
GO:0043052 thermotaxis
InterPro families




  
IPR002129 Pyridoxal phosphate-dependent decarboxylase
IPR021115 Pyridoxal-phosphate binding site
IPR010977 Aromatic-L-amino-acid decarboxylase
IPR015421 Pyridoxal phosphate-dependent transferase, major region, subdomain 1
IPR015422 Pyridoxal phosphate-dependent transferase, major region, subdomain 2
IPR015424 Pyridoxal phosphate-dependent transferase, major domain
Orthology groupMCL15375

Nucleotide sequence:

ATGGCGGATTACCTGGAGAACATCAGAGACCACAAGGTGTACCCTGGCGTACAACCAGGG
TACCTCCACAAACGACTCCCGGACCACGCGCCGGAGATGCCGGAGAAATGGGACGACATC
TTCAAGGACGTCGAAGATCACATCATGCCCGGGATCGTCCACTGGCAGAGTCCCCACATG
CACGCCTACTTCCCAGCGTTGACCTCTTACCCATCAATCATGGGAGAAATGCTTTCCAGT
GCTATGAACGTTCTCTGCTTCACCTGGGCCTCGTCGCCAGCTGGTACAGAATTGGAGACG
ATCGCGATGAACTGGCTCGGGAAGCTCCTCGGTCTGCCGGATTGCTTCCTCAACGAAAAG
AACGACAGCCAAGGAGGAGGTGTGATACAGACTACAGCAAGCGAGGCAACCCTGGTGAGT
CTGCTGGCTGCTCGCACCAGGGCTCTGATGGAACTATCGGCTCTCAACCCTGACATGCAG
TCTTCTGAACTGCTCGGACATTTGATAGCGTACTGCTCGGACCAAGCACATTCCTCAGTG
GAGAAGGCTGGACTCATTGGTCTGGTGCGGATGCGTTACATAGAGTCGGATGAGCACCAG
TGCATGAGAGGTGACAAGTTAGAGGAAGCCATCATCAACGACAAAGCCAAGGGACTGGTC
CCGTTTTGGGTTTGCGCCACTCTCGGTACGACAGGGTCCGTAGCCTTCGACGATCTCCGG
GAGATAGGCCCGGTGTGTGACCAGCACTCCATCTGGCTGCACGTAGATGCTGCATATGCT
GGGAGCGCTTTCATATGCCCCGAATACAGACACTGGCTGGATGGGATCGAGCTGGTGGAT
TCCTTCGCCTTCAATCCATCCAAGTGGCTGATGGTCAACTTCGACTGCACCGGCATGTGG
GTCAGGGACAGTAACGCTCTGCACAGAACCTTCAACGTGAACCCTATTTATCTAAGACAC
GAAAATTCAGGTACGTCACTAAATCTTGAGACTTACGGGTGTCGGTCCTCGCGGTGTCGG
AGAGCTTTAAAGCTGTGGTTCGTGTTGAGGAACTATGGAGTGAGCGGCCTCCAGAAACAC
ATAAGAGAGAGCGTCCGTCTAGCTCAGAAGTTTGAGGCCTTGGTGCTAGCTGATCAACGT
TTTGAAATACCACAACCACGGAATCTGGGCATGGTTGCTTTCCGTCTCAAGGGAGACAAC
ACCCTCACCGAGTACCTTCTGAAGCGCCTGAACGCCCGCGGCTACCTACACGCCGTGCCG
GCATGTTTCAAGGGCGTCTACGTCATCAGGTTCACCGTCACCTCCCAAAGAACCACCAAC
CAGGACATACTCGACGACTGGACAGAGATTAAGACGGTGGCGTCCGAAATACTGAAGGAA
ATGTTTGGATCAGAAAACGGAAACATCGTGGTGTCCAAGAAACCGAGGATTTCTTTGAAA
GAAACTCGCGAACTGAACGCTACGTTTGGGACGAGCCTGTTGCTAGCCAACAGTCCGATG
AGTCCAAAGATCGTAAACGGCACTCACGCAGCGATATGCGATTATGAGTCACTGCTCTCA
TCCTGCGCTCAGACCTTCGCTGAGCTGAAAATGGAACCCAAAGATAGTCCTGAAATGCGG
CGTCGCGTCCGCGGTATGAAGGCGTGCGGGAAGAAGTTCTCGCTGGACTCCTACATGGAC
ATGCTCCAGGAGCTGGTGGTGGCGTCCCTGCCGCAGTGCTCGGAGGAGAAGGAGGAGACT
CCCAACGGATCCAGCCCCGCCGACCGTTCCATCTCATCACCTGTGGTATCCAGCACTACC
GTAAAACCAGTCGCCTGCACGGACACCAACCAATTACTAGTACCAATGACTCCATCCAGA
CAGTTCAGGTCAAAATCGGTCGACGAAACCGATTTGAAGCTAGACGACGCGGTCATCTCC
GTAGACATCAAAAACAACGAGATCACGCTCACGCCGACCGATTCTAAAAGCATTCTCGAC
GCTCGCGACGTATCCGAGCTCAAAATCGGCGATCGGATCTCGAGGGCATTTGACCTCATG
GACACTAACAATATGGAATGTAAGGAAGCGGGCGAAGCCAAGTTGACCATCAAAGGACCC
GGGAGCTACATTAAGCAGATCATACAGCAGTTCAGCGAGGGACCCTTCGACGCGGAGGAC
TGCAAACCTGACCCAGGACGAGCTGTCGCCACACAGAGCCTCAAACAGCGGGCTGACGCG
TTTTGCAAGAAATGCCTTCACTACAAAGGCGTCAACAAGTAA

Protein sequence:

MADYLENIRDHKVYPGVQPGYLHKRLPDHAPEMPEKWDDIFKDVEDHIMPGIVHWQSPHM
HAYFPALTSYPSIMGEMLSSAMNVLCFTWASSPAGTELETIAMNWLGKLLGLPDCFLNEK
NDSQGGGVIQTTASEATLVSLLAARTRALMELSALNPDMQSSELLGHLIAYCSDQAHSSV
EKAGLIGLVRMRYIESDEHQCMRGDKLEEAIINDKAKGLVPFWVCATLGTTGSVAFDDLR
EIGPVCDQHSIWLHVDAAYAGSAFICPEYRHWLDGIELVDSFAFNPSKWLMVNFDCTGMW
VRDSNALHRTFNVNPIYLRHENSGTSLNLETYGCRSSRCRRALKLWFVLRNYGVSGLQKH
IRESVRLAQKFEALVLADQRFEIPQPRNLGMVAFRLKGDNTLTEYLLKRLNARGYLHAVP
ACFKGVYVIRFTVTSQRTTNQDILDDWTEIKTVASEILKEMFGSENGNIVVSKKPRISLK
ETRELNATFGTSLLLANSPMSPKIVNGTHAAICDYESLLSSCAQTFAELKMEPKDSPEMR
RRVRGMKACGKKFSLDSYMDMLQELVVASLPQCSEEKEETPNGSSPADRSISSPVVSSTT
VKPVACTDTNQLLVPMTPSRQFRSKSVDETDLKLDDAVISVDIKNNEITLTPTDSKSILD
ARDVSELKIGDRISRAFDLMDTNNMECKEAGEAKLTIKGPGSYIKQIIQQFSEGPFDAED
CKPDPGRAVATQSLKQRADAFCKKCLHYKGVNK