DPGLEAN19365 in OGS1.0

New model in OGS2.0  DPOGS210836
Genomic Position  scaffold77:- 100005-109451
CDS Length  2136
Paired RNAseq reads  1783
Single RNAseq reads  4102
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA003910 (0.0)
Best Drosophila hit  CG1486, isoform B (2e-76)
Best Human hit  pyridoxal-dependent decarboxylase domain-containing protein 1 (3e-51)
Best NR hit (blastp)  PREDICTED: similar to CG1486 CG1486-PA [Tribolium castaneum] (4e-154)
Best NR hit (blastx)  PREDICTED: similar to CG1486 CG1486-PA [Tribolium castaneum] (3e-123)
GeneOntology terms
GO:0019752 carboxylic acid metabolic process
GO:0016831 carboxy-lyase activity
GO:0030170 pyridoxal phosphate binding
InterPro families
IPR015424 Pyridoxal phosphate-dependent transferase, major domain
IPR015421 Pyridoxal phosphate-dependent transferase, major region, subdomain 1
IPR002129 Pyridoxal phosphate-dependent decarboxylase
Orthology group  MCL13871
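
The genomic position and CDS length in the record above can be compared with a short script. This is only a sketch: the helper name `parse_position` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re

def parse_position(text):
    """Split a 'scaffold:strand start-end' string into its four parts."""
    m = re.fullmatch(r"\s*(\S+?):([+-])\s*(\d+)-(\d+)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized position: {text!r}")
    return m.group(1), m.group(2), int(m.group(3)), int(m.group(4))

scaffold, strand, start, end = parse_position("scaffold77:- 100005-109451")

# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span = end - start + 1
cds_length = 2136  # value from the "CDS Length" field

print(scaffold, strand, span, span - cds_length)
# prints: scaffold77 - 9447 7311
```

Under that assumption the model spans 9447 bp on the minus strand, of which 2136 bp are coding. The remaining 7311 bp would be non-coding sequence such as introns or untranslated regions.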

Nucleotide sequence:

ATGGGAGATGCCCCGACTTCCGAAATGGACTCAAATAAAGTTAGCCTTGGAAGCGAAAAT
CCATCCGATGTGGACCGGCGACCGTTTGGAGGACTTGAATTCCAAGTTTCCGAAGTAGTA
GAGAGGTTGGAAGCTGGTGTGAATGCTCAAGATGCCATGGAAGAAGAAAAAAAGCCAGAA
GAACGAAAAATAAGCACGGGATTCTTCGAGCCAGAAAAAATGGATATGGATGAAATTTTG
AAGGTCTTAGAACAACTAGTACTTCAAACTGATCCAAGTTGTGAAAGTTTGGAACCACCC
TTATTGCCAACGGATTCTGTGACGCGAGCCGCAATACTTTCTCATAGTATTTCGGCATTA
TTCTCGAGGTTGGAGAGGAGCCACGCTGCCCGGCTAGGAACGCACATAGCTACTGAAACC
ACGCGATGGATGGCGCATTTATTTAGGTTGTCCGATTACGACGCGTTTTATCACCAAGAG
CAGCTCGAGGGTCTGGTCAGAGTCACTCGGATGCTGTTACACCACAAGTACCCGAGATAT
CTCGAAGATGGAGCTCTAGCTTTCTCGAACCGTCTCCCCTCCATCTACAGCTGTGTGGCG
AGTCCTCTGGGCGTGGTCCAACACCTGTGCCGGCAGCTGGGTCTGCCGCTGGCCTGCGTC
AGACCGGTGCCAGTAGATTCATCTGGTAAGGGTATGGATCTGAATGCTCTGGATCGTCTG
TGCGAGGAGGACTCGGCTGGTCGTACTCCGCTGCTGGTGTTAGGCGAGGCGGGCGAGCCT
CCCCTCGGCGGGGGATCCCCGCTGAAAGCGCTGGCTGAACTATGTGGACGTAGAGGGGTC
CACTTACATGTGAGGGGACACGCCCTCGCCCTCCCCGCCGCCGGGGGATTTGAACAGACG
TACAGTATAGCGGACTCGCTGACACTACAACCGGGTCCGTGGTTCGGAATACCGGGGCTG
CCGACTGTTACGTTTTACAAAATACCGGAACCGCTGACGGCGAACGATCACTCCAAGGTT
GTAAATTCGGCGAGTAGTCGCGAGGGTGCTCTGGCCGCACTGGGCGGTCTGACCGCTGGC
GCGGCGCGGCTGGCAGCTCTGCCGCTGTGGACGGCGACGAGGGCGGCCGGCGCTAAGAGG
CTCGCAAGACGGATAGACGCCGCCTTCCGCTCCGCCCGTACAGCGCGGGCCTTAATAGCC
AGCACTGAGCTGAGATTGCTGAGCGATAGACCCGGCGGTGATGAACCTCCTAACATGGAT
ATAGTCGATGCCATAAGTGAATCCTCAGCGTGCGTGTCCTTCCAATTCGCGCCAGCAGGG
TGCGCTGACCGGCCACCCCCCTACTACGATAAACTCAACTCGTGGTTGGGGCAAGTGTTG
CAACGAGAGGCTGATATGATCAATATAGAAATCTGCGAGACGGAGAGTTACGGCGTGGTG
CTCCGCTACTGTCCGCTCGAGGGTATCTTTCTGGAGGAGGACCGTCTGTCGGAGTGGGCG
GCCGTGTTAGACGCTCAGCTGCACGTGCTCACCGCTACGGTCGCGCTACGAGAACCCTTC
CAGAAGACGCTACAGACACATCCCTGTCTACGACTTGTACATGTACCGGGATGGGCTGGT
CTGGGAGGAGTTCGTTACGTGCCACCCGGTTGGGAGAACGCTCCTCTTGAGGAATTGAAC
TCCTTGAATAGACAGCTAGTGGAGACATTGAGGGCTACCGACGGAGCCTTCTCGTGTGGG
GACGGAGAAGACGGTATGGCATGTGTCAGGTTCGGTATGGTCACCGCTGACACAGACGTG
GATGAATTGTTGGATCTGGTGTTGTCAGCGGGCAAGGACGTGGAGGAGAACTCCAAGGCT
CTCACTGATATGACCGAGGTGTTGAAAAAAGGTATATCAGCGGCTCAAGAAGAACTGAAT
CGTTCTGCGTGGCAGGAGGGGCTGCTGCGTCGTGTGCCGGTAGTGGGTCGGGTCGTGTCG
TGGTGGGCGCCGCCTCAGCCCTGCCCCGGCCGCCGGCTACTGTTGACCCACGGCACCCTG
CAGGCGACTGATGATATCTACCGATTCGTTCAGAAGAAAGACAAAGAGGAACCAGCCCGC
GCTCACTCCCCAACGAGACAGAACACGGTTCCATAA

Protein sequence:

MGDAPTSEMDSNKVSLGSENPSDVDRRPFGGLEFQVSEVVERLEAGVNAQDAMEEEKKPE
ERKISTGFFEPEKMDMDEILKVLEQLVLQTDPSCESLEPPLLPTDSVTRAAILSHSISAL
FSRLERSHAARLGTHIATETTRWMAHLFRLSDYDAFYHQEQLEGLVRVTRMLLHHKYPRY
LEDGALAFSNRLPSIYSCVASPLGVVQHLCRQLGLPLACVRPVPVDSSGKGMDLNALDRL
CEEDSAGRTPLLVLGEAGEPPLGGGSPLKALAELCGRRGVHLHVRGHALALPAAGGFEQT
YSIADSLTLQPGPWFGIPGLPTVTFYKIPEPLTANDHSKVVNSASSREGALAALGGLTAG
AARLAALPLWTATRAAGAKRLARRIDAAFRSARTARALIASTELRLLSDRPGGDEPPNMD
IVDAISESSACVSFQFAPAGCADRPPPYYDKLNSWLGQVLQREADMINIEICETESYGVV
LRYCPLEGIFLEEDRLSEWAAVLDAQLHVLTATVALREPFQKTLQTHPCLRLVHVPGWAG
LGGVRYVPPGWENAPLEELNSLNRQLVETLRATDGAFSCGDGEDGMACVRFGMVTADTDV
DELLDLVLSAGKDVEENSKALTDMTEVLKKGISAAQEELNRSAWQEGLLRRVPVVGRVVS
WWAPPQPCPGRRLLLTHGTLQATDDIYRFVQKKDKEEPARAHSPTRQNTVP
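
The nucleotide and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code and reading frame 1. The function name `translate` is illustrative, and only the first and last lines of the nucleotide sequence are used as input to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 1; '*' marks a stop, 'X' an unknown codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X")
                   for i in range(0, len(cds), 3))

# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGGAGATGCCCCGACTTCCGAAATGGACTCAAATAAAGTTAGCCTTGGAAGCGAAAAT"
last_line = "GCTCACTCCCCAACGAGACAGAACACGGTTCCATAA"

print(translate(first_line))  # MGDAPTSEMDSNKVSLGSEN
print(translate(last_line))   # AHSPTRQNTVP*

# A 2136 nt CDS holds 712 codons: 711 residues plus the stop codon.
print(2136 // 3 - 1)          # 711
```

Both translated fragments agree with the start and end of the protein sequence. The CDS length of 2136 is also consistent with a 711-residue protein followed by a TAA stop codon.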