DPGLEAN00284 in OGS1.0

New model in OGS2.0DPOGS208187 
Genomic Positionscaffold896:+ 41108-47490
See gene structure
CDS Length1452
Paired RNAseq reads  1314
Single RNAseq reads  2661
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA010122 (0.0)
Best Drosophila hit  CG5618, isoform A (1e-117)
Best Human hitcysteine sulfinic acid decarboxylase (2e-107)
Best NR hit (blastp)  Cysteine sulfinic acid decarboxylase, putative [Pediculus humanus corporis] (3e-139)
Best NR hit (blastx)  PREDICTED: similar to AGAP002425-PA [Tribolium castaneum] (9e-141)
GeneOntology terms




  
GO:0004782 sulfinoalanine decarboxylase activity
GO:0006508 proteolysis
GO:0008239 dipeptidyl-peptidase activity
GO:0005737 cytoplasm
GO:0019752 carboxylic acid metabolic process
GO:0030170 pyridoxal phosphate binding
InterPro families



  
IPR002129 Pyridoxal phosphate-dependent decarboxylase
IPR015424 Pyridoxal phosphate-dependent transferase, major domain
IPR021115 Pyridoxal-phosphate binding site
IPR015421 Pyridoxal phosphate-dependent transferase, major region, subdomain 1
IPR015422 Pyridoxal phosphate-dependent transferase, major region, subdomain 2
Orthology groupMCL15472

Nucleotide sequence:

ATGGATACTAAGGGTCATTCATTTCTGGACAGAGTTTTTAATATTTTGAAGGATGAAACC
GCTGACGAAGTTCCTCTCATTCAGTTTAAGCACCCAAAGGAGTTAGAAGAAATACTCAGC
AATGACCTCGACATATCTATAAACGTGGATGACAAAGAGTTGGAAAACTCCGTTAGAAAA
ATTCTGCAGTACAGTGTTAAAACAAACAAAGTGACCTTTAGGAACCAGTTGTATGGTCCC
ATGGATCCCTATGGACTGGCCGGGACGTGGATAGCCGAGGCTTTCAACACCAGCCAGTAC
ACATTTGAAGTCGCCCCAGTATTTACATTGATAGAACTCAAAATGATAGAGCACATATTA
AAATTGTTTGGTATTCCTGGAGGGGATGGTATCTTCAGTCCAGGAGGGAGCGTGTCCATG
TTGTATGCCTTAGTAGCTGCGAGATTTAAAAAATTCCCTGAAGTCAAGAGCAAGGGCATG
CAAAAACTGCCAGAGATGACCATATTCACTTCCGAAGATAGTCACTATTCCATTATTAAA
GCTGCTCATTGGCTTGGTTTTGGCACAGAAAGTGTTATATCTATTAAAACTAATAGTTCT
GGTCAGATGATCGTAAACGAACTAAATAAAGCGATAGAACGTCAGTTGGGTCTAGGAAAA
TATCCTGTGTTCGTAAACGCTACTGCGGGTACAACGGTTCTTGGAGCCATTGACGACTTG
GAAGCAATTGCATCTGTTTGTAAAAAATACGATATTTGGATGCATGTTGACGCCTGTTGG
GGTGGAGGCTTGATGCTTTCTGCCACATTAAGAAAAAGATTGCAAGGAATTCAATTTGCA
GATTCCATTTCCTGGAATCCACATAAGATGATCGGCGCTCCGTTACAATGTTCGATATTT
CTGCTGAAAGAAAAGGGTTTGCTGCACGAAGCCAACGCTGCTGCAGCACAGTACCTGTTC
CAGCAAGACAAGTTTTATGACGTTCGATATGACACAGGGGATAAAAGTGTCCAGTGCGGG
AGAAAGATAGATTCTTTTAAATTGTGGATGATGTGGAAGGCGCGAGGAGATATCGGGCTC
TGTAAGGTGATGGATCATGTAATGAGTATATCAGAGTTCTGTTTGCGATCTATTGCTGAA
AGAGAAGGTTTCAGACTAGTTTCGGATACCTTGCAATGTCCTAACATTTGCTTCTGGTAC
ATTCCAGTTTTTATGAGAAAACGGGAGGAAAATGATGAGTGGTGGGCACTCATTCATAAG
ATTACACCAAAACTGAAAGAGCTGTTGACCTTAAGTTCTCGTTTAATGATAGCTTACACC
CCTTTACGTCATCACAAGAATTTTTTCCGTCTAGCGTTCACCTTCCACCCTGTTCTTGAA
GAAAATCACGTCCTGGAGATATTAAAATCTATAGAGGAATGTGGGGAAATGGTAAACACG
GACATGGTCTAG

Protein sequence:

MDTKGHSFLDRVFNILKDETADEVPLIQFKHPKELEEILSNDLDISINVDDKELENSVRK
ILQYSVKTNKVTFRNQLYGPMDPYGLAGTWIAEAFNTSQYTFEVAPVFTLIELKMIEHIL
KLFGIPGGDGIFSPGGSVSMLYALVAARFKKFPEVKSKGMQKLPEMTIFTSEDSHYSIIK
AAHWLGFGTESVISIKTNSSGQMIVNELNKAIERQLGLGKYPVFVNATAGTTVLGAIDDL
EAIASVCKKYDIWMHVDACWGGGLMLSATLRKRLQGIQFADSISWNPHKMIGAPLQCSIF
LLKEKGLLHEANAAAAQYLFQQDKFYDVRYDTGDKSVQCGRKIDSFKLWMMWKARGDIGL
CKVMDHVMSISEFCLRSIAEREGFRLVSDTLQCPNICFWYIPVFMRKREENDEWWALIHK
ITPKLKELLTLSSRLMIAYTPLRHHKNFFRLAFTFHPVLEENHVLEILKSIEECGEMVNT
DMV