DPGLEAN02199 in OGS1.0

New model in OGS2.0  DPOGS204431
Genomic Position  scaffold1355:- 3285-8763
CDS Length  1302
Paired RNAseq reads  2033
Single RNAseq reads  5421
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA007728 (0.0)
Best Drosophila hit  CG17337 (3e-152)
Best Human hit  cytosolic non-specific dipeptidase isoform 1 (1e-142)
Best NR hit (blastp)  glutamate carboxypeptidase [Aedes aegypti] (4e-170)
Best NR hit (blastx)  glutamate carboxypeptidase [Aedes aegypti] (7e-160)
Gene Ontology terms
GO:0005515 protein binding
GO:0006508 proteolysis
GO:0008237 metallopeptidase activity
InterPro families
IPR001261 ArgE/DapE/ACY1/CPG2/YscS, conserved site
IPR002933 Peptidase M20
IPR011650 Peptidase M20, dimerisation
Orthology group  MCL14127

Nucleotide sequence:

ATGGTATACTGGATGAGGGACAAGTTAAAAGAAGTCGGTGCATCGACTGAAATAAGAGAT
GTAGGTTATCAGAATTTCGATGGCAAGGAAGTAAAACTACCACCGGTTCTGGTTGGCGTT
CTTGAAAATGATCCCAAAAAAAATACAATCTGCATTTATGGCCATTTAGATGTCCAACCT
GCTCTTAAATCTGATGGCTGGGAATCTGAACCATTTGATTTAGTGGAGCGTGATGGAAAG
TTGTTTGGTAGAGGTGCTACAGATGATAAAGGACCAGTACTCGGTTGGCTTCATGCTATC
AATGCATACAAAGCTACTGGCGAGGAGCTGCCAGTGAATCTCAAATTCGTATTTGAATGT
ATGGAAGAATCTGGTTCAGAGGGTCTTGATGAGTTGCTAATGCAGAAATTGAAGCCGGAA
GGTTTCTTTGATTCCGTGGACTTTGTCTGTATTTCTGACAACTATTGGCTGGGAACCACT
AAACCTTGCATCACTTACGGTCTGAGAGGCATTAGCTATTATTTCTTGGAGGTTGAATGC
GCTAAAATGGATCTCCACAGTGGTGTATATGGAGGAACTGTACATGAAGCCATGTCCGAT
CTCATATACCTTATGAACACTCTGGTTGATAAAGATGGTAAGATCTTAATCACCGACATA
TACAAGTCGGTAGCACCGCTCACAGATAATGAACAGAAACTGTACAATACAATCGACTTC
AACCCAGAGGCCTACAGACAATCAATAAGCGCCCATAAACTGGCCCACAATGGTGTAAAG
GAACAACTACTGATGCACCGATGGAGGTATCCAAGCCTGTCACTCCATGGAATTGAAGGC
GCTGCCTTCCAGCCTGGTGCGAAGACTGTCATCCCCGGGAAGGTCATTGGCAAATTCTCA
ATTCGTATCGTCCCTAACCAGGAGCCGGAGGAAGTCGAGAAACTTGTGTTTGACTATGTT
CACAAGAAGTGGGAAGAACGCGGGTCTCCCAACAAGATGCGTATAACTGCTCAGTCCGGA
CGCGCTTGGACCGAGAACCCTGAACATCCACACTACCAGGCCGCTGCTAGAGCCACACGA
CTCATATACAAGACTGAGCCGGACATGTCTCGTGAGGGTGGATCCATACCAGTGACGATC
ACGCTCCAAGAGGCCAGCGCCAAGAACGTGCTGCTGCTGCCCATGGGCGCGGGAGACGAT
ATGGCGCACTCACAGAACGAGAAGATCAACGTCCGGAACTATATAGAGGGGATCAAACTC
TTCGCTGCATACTTATATGAAGTCGGTAAACTACCTAAATAG

Protein sequence:

MVYWMRDKLKEVGASTEIRDVGYQNFDGKEVKLPPVLVGVLENDPKKNTICIYGHLDVQP
ALKSDGWESEPFDLVERDGKLFGRGATDDKGPVLGWLHAINAYKATGEELPVNLKFVFEC
MEESGSEGLDELLMQKLKPEGFFDSVDFVCISDNYWLGTTKPCITYGLRGISYYFLEVEC
AKMDLHSGVYGGTVHEAMSDLIYLMNTLVDKDGKILITDIYKSVAPLTDNEQKLYNTIDF
NPEAYRQSISAHKLAHNGVKEQLLMHRWRYPSLSLHGIEGAAFQPGAKTVIPGKVIGKFS
IRIVPNQEPEEVEKLVFDYVHKKWEERGSPNKMRITAQSGRAWTENPEHPHYQAAARATR
LIYKTEPDMSREGGSIPVTITLQEASAKNVLLLPMGAGDDMAHSQNEKINVRNYIEGIKL
FAAYLYEVGKLPK
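
Consistency check (sketch):

The nucleotide and protein sequences above can be cross-checked against each other and against the CDS Length field. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the function name `translate` and the variables `first_line` and `last_line` are illustrative and not part of the record. Only the first and last lines of the nucleotide sequence are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; a stop codon is shown as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGTATACTGGATGAGGGACAAGTTAAAAGAAGTCGGTGCATCGACTGAAATAAGAGAT"
last_line = "TTCGCTGCATACTTATATGAAGTCGGTAAACTACCTAAATAG"

print(translate(first_line))  # MVYWMRDKLKEVGASTEIRD
print(translate(last_line))   # FAAYLYEVGKLPK*

# A CDS of 1302 nt gives 434 codons: 433 residues plus the stop codon.
print(1302 // 3 - 1)          # 433
```

The two translated fragments match the start and end of the protein sequence, and the 433 residues listed (seven lines of 60 plus 13) agree with a 1302 nt CDS that includes the terminal TAG stop codon.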