DPGLEAN08423 in OGS1.0

New model in OGS2.0DPOGS215632 
Genomic Positionscaffold1007:+ 15475-59025
See gene structure
CDS Length3684
Paired RNAseq reads  399
Single RNAseq reads  1205
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003532 (2e-19)
Best Drosophila hit  CG32627, isoform C (6e-82)
Best Human hitcytosolic carboxypeptidase 1 (6e-132)
Best NR hit (blastp)  PREDICTED: similar to ATP/GTP binding protein 1 [Apis mellifera] (0.0)
Best NR hit (blastx)  PREDICTED: similar to ATP/GTP binding protein 1 [Apis mellifera] (0.0)
GeneOntology terms







  
GO:0008270 zinc ion binding
GO:0008233 peptidase activity
GO:0008237 metallopeptidase activity
GO:0005634 nucleus
GO:0006508 proteolysis
GO:0004181 metallocarboxypeptidase activity
GO:0046872 metal ion binding
GO:0005488 binding
GO:0005737 cytoplasm
InterPro families  IPR000834 Peptidase M14, carboxypeptidase A
Orthology groupMCL16762

Nucleotide sequence:

ATGAATGAGAAATGTCAGGATTTGGTTATCGGAGCCGACTGCTCGGCGCAAACACAGGAA
ATCAAAAAACCAAAGAAAAGAGCGTATCGCATGGCGGACGACGCGGGCGACTGCCTGTTC
GAGCGGCTCCGACTGCACCAGCAACGCGCCCCGGACGCGACTGAGGTCGCGCGTGCTATC
ACAGCTAGAATCAACTCGCGTCTCACATCACACGACAAACACATCCGACAGAGCACTCTT
GACAAACTATGGAACAAGCAAACTGGTGCGATACAAATGCTTCTATCTATATTAGAGAAT
TCAAGAGATACAGCGACATCAACCTATATAACATCAATATTTAGAGAAGCTCTCTGCCTT
AAACAAGGAAAAGGAAAAAAATGTTCAGTAGCGAATGAAGCCTTGGGGTCGAAAAAGAAG
GAAAGCAAAAAAGGCAAAGAAAATAAAACACCACTGAATAAAAAGGCAAACAACGTAGCG
CGACAGCAGTGCTCACAACAATTCATCGCCCTAAACGGCACGCAAGCGGTGATACGTACA
CTACTTGCCACACATAGCAAACGCGACAGCAATGTTAGCACGGAACTAATGCTACAAGAC
CTCGTCTGGATACTTGCAGCACTTGCTCCAAGAGCAAACAACATACAACGGCAGCAATGT
GCGCAGGATTTTGTGGAAATGAAAGGAACACAGACCATTATTAGACGAGTGATATCAACG
GTCCATGATAAGAGGGAGCATCATGCTGGGGTGGAATTAGTATTAAATGATCTGGTGTGG
ATTCTCGCTCCGTTGGGGTCAAAAGATCCAAAATTTGCAATGAAGGTGAGAATGTTAGGA
TGCGTTAGGACGATGCATCTAATTTTGAAAGGACATTTCACTGACAATAAGTTGATATTT
CCACTATTAGTCATCATGAAACAGCTAGCCAAAAATTCCGTAACAACTTCCATTTTAATA
CGCGACGGAGTCATCGCGACTTACGATCGAGTGTTAATAAGTCTAGGCTTCATTCCTACA
GCGAGACTGAGACTCTGTTTGGACGCTATAGACTACTTTAGCAAGAACAAGGTGTGCTGC
ATGCAGATTGTGAAGACAGGACTGTGTGGAGTGCTAATAAGAGTGTTCGACCGCTGGGAC
CGCTACGAGGGGCGCATGAGGCTCAAGATATGCGCACACATCCTCCAGACATTACAGCAC
CTCTGCAACATCAAGGCTGGACGTCGTGCTCTTTGCACCAAGAAGCACGTGCAGACCCTC
CACAGGTTCTGTTCCCAGTGTCCTGATGAAATCGAGTTCGACGGACTGTTGGCTAGAGTC
TGTTCCGTCATAACATTGTGTCTCAAGCATCAGGCATTACCAGTGCCATCATCTAGTCCA
GCTACCTTCAACCTGAACCCTATACTTAAAGGAACAAATACCACATGGCCATGTCACGAA
GATGATGATGACGGCGGAAACTCAGACTCGAAGACAATCAATTCGGATTTGGAAGACGAT
GCTCCTGACAGTGATAACGAGGTCATTGATGACTTCCCAGATATTGATTTCGAAGAAAAT
GACTTAAAGAATAATGAAGATTTGGAGAAATCACATACAAAAAGTGGGGAGAGCATACAG
AGTGCACTGTGGATCAATCCCAATGAGAGAGATATCGAAGACTTAAAGAGATACTATATT
TTCTTCAAGGAGTTCGGTTCCTATAACAAGCAAATAAGGTTGGTAAAAAGCCGGTCCAAT
TCCCGGGGGTCCATACTAGATGACATTTTTATATCTCAAAATAGCGCTAACCGAAGCCAG
CCGTCGCCGACAAATCTTTCTTTAACTGCAGTACTTGGGAATACCGACTACGACAGCGCT
TTAGGATCATCTCAAACGCTTTCATTCCTGCAAGGATATCATAAGATACACGAAAGCACT
TCGACAACATCTTGTTCTTCCTTAAAAATCCATAAAGATATATCAAAATACAGTCCGCTC
GAGTCAGTTTATTCAATAATATCATCGAGAGTTAAAAGCATCATTCCATTTGTAAAAGTT
GCCTACCCAGATATGACAGGCGGGCAGGGTGCAACACAACCAGAGCCATTAAATAAAATG
GAGAGAACAGCTTGCAGAAATAAATTACTCGCTTGCGTCGAGAGAGCAATTAATCCGGAA
GCGTATATGAATGAAGTTGTGTATGATCTGGATGCTTTGAACAGTTCGAGCTCAAACGCA
GACACGACTTCGCAGAAAAGCAACGAGAGTTTATTTTTAATTAACACCGACGAACAAGAA
ATAACAAAAGTCAATAGTTTCTCATCGAGACTAAATTTTGAATCGAGATTTGAGTCGGGA
AATTTAAGAAAAGCCATACAGGTAGGTCCAAGAGAATATGAATTAATTTTAATGCCAGAC
GTAAATTCTCCAAAACGGCATCAGTGGTTTTACTTCGAAGTGCGTAATATGCAACAGGGA
CGGCCCTATATATTTAATATTGTGAATTGTGAGAAATCAGATAGCCAATTCAACTTCGGC
ATGAAGCCTGTTATGTATTCTGTGAAGGAAGCCGTCCTTGGAAGACCCGGGTGGGTGAGA
GCCGGTTCGGACATTTGCTATTACAGGAACAGCTACCACTATTCCAATCAAAGAAACAAC
AAGTGCTACCTAACAGTTACGTTCAACATCGACTTTCCCCACACAAACGACGTCTGCTAC
CTCGCTTACCACTTCCCATTCACTTACTCCATGATGATGACTAGAATTTTCCAATGGAGT
TCTCAATTGCCTCCTGGCGCTTATCTACGAGCTGAGCCCTTATGTTATACACTTAACAAC
AACGAAGTTCCTCTGTTGACTATATCAGCTGATGATACTCCGTCCAATCCCATAGTTGAC
AGGGAGATAGTATTCCTTACGGCTCGAGTCCACCCTGGTGAAAGCAACGCGTCCTGGGTA
ATGGATGGAACGCTGCGTTTCCTGCTCACAGACACTTCATCCGCAGCGGCCCTCCGTAAC
AAGTACGTGTTCAAAATCGTGCCGATGCTCAACGTCGAAGGTGTCGTTAATGGCTGCCAT
CGATGCGGCTTAACTAATGAAGATTTAAATCGACGCTGGTGCAAGCCGAGCCCCGTTTTG
CATCCTTCTATTTACCATACCAAGGGCTTAATAGAATATTTGGTGCGTGTTTGGAAGAAA
CCTCCGGTAGTTTATTGCGACTACCACGGTCATTCGCGCAAGAAGAACGTGTTCTTTTAC
GGTTGCGCCGGCGCAGAGAGCTGGTGCAGCAACGACCGGCTTGTCCCGGACGAGCCTGTT
AAATATCTCATGCTTCCAGCTTTAATGCACCGGCTATCACCGGCGTTCGCTCTTGGTTCG
TGTTCCTTTCGTGTTGAACGTGAGCGTGAGAGCACAGCGCGAGTCACTGTGTGGCGCCAC
CTAGGAGTCACACGGTCCTACACTATGGAAGCATCATTTTGTGGATTTGATAGGGGACCG
TTTAAAGGATTTCATCTCAACACCCAGCATCTGCAGAGCGTGGGCAGTGACTTTTGCGAA
GCTCTCAACGGTCTCGGAGATACAGCCAACAATGTTGACATACAACTCACTAAAGATCTC
AATGGCGAAATAGCAATAGACAGTGAAGCTGGCTCGGGGTCGGACAGCGTGTTGAAAACA
GATTCGGATGAAGATTTCGATTAG

Protein sequence:

MNEKCQDLVIGADCSAQTQEIKKPKKRAYRMADDAGDCLFERLRLHQQRAPDATEVARAI
TARINSRLTSHDKHIRQSTLDKLWNKQTGAIQMLLSILENSRDTATSTYITSIFREALCL
KQGKGKKCSVANEALGSKKKESKKGKENKTPLNKKANNVARQQCSQQFIALNGTQAVIRT
LLATHSKRDSNVSTELMLQDLVWILAALAPRANNIQRQQCAQDFVEMKGTQTIIRRVIST
VHDKREHHAGVELVLNDLVWILAPLGSKDPKFAMKVRMLGCVRTMHLILKGHFTDNKLIF
PLLVIMKQLAKNSVTTSILIRDGVIATYDRVLISLGFIPTARLRLCLDAIDYFSKNKVCC
MQIVKTGLCGVLIRVFDRWDRYEGRMRLKICAHILQTLQHLCNIKAGRRALCTKKHVQTL
HRFCSQCPDEIEFDGLLARVCSVITLCLKHQALPVPSSSPATFNLNPILKGTNTTWPCHE
DDDDGGNSDSKTINSDLEDDAPDSDNEVIDDFPDIDFEENDLKNNEDLEKSHTKSGESIQ
SALWINPNERDIEDLKRYYIFFKEFGSYNKQIRLVKSRSNSRGSILDDIFISQNSANRSQ
PSPTNLSLTAVLGNTDYDSALGSSQTLSFLQGYHKIHESTSTTSCSSLKIHKDISKYSPL
ESVYSIISSRVKSIIPFVKVAYPDMTGGQGATQPEPLNKMERTACRNKLLACVERAINPE
AYMNEVVYDLDALNSSSSNADTTSQKSNESLFLINTDEQEITKVNSFSSRLNFESRFESG
NLRKAIQVGPREYELILMPDVNSPKRHQWFYFEVRNMQQGRPYIFNIVNCEKSDSQFNFG
MKPVMYSVKEAVLGRPGWVRAGSDICYYRNSYHYSNQRNNKCYLTVTFNIDFPHTNDVCY
LAYHFPFTYSMMMTRIFQWSSQLPPGAYLRAEPLCYTLNNNEVPLLTISADDTPSNPIVD
REIVFLTARVHPGESNASWVMDGTLRFLLTDTSSAAALRNKYVFKIVPMLNVEGVVNGCH
RCGLTNEDLNRRWCKPSPVLHPSIYHTKGLIEYLVRVWKKPPVVYCDYHGHSRKKNVFFY
GCAGAESWCSNDRLVPDEPVKYLMLPALMHRLSPAFALGSCSFRVERERESTARVTVWRH
LGVTRSYTMEASFCGFDRGPFKGFHLNTQHLQSVGSDFCEALNGLGDTANNVDIQLTKDL
NGEIAIDSEAGSGSDSVLKTDSDEDFD