DPGLEAN06994 in OGS1.0

New model in OGS2.0  DPOGS202963
Genomic Position  scaffold222:- 8105-18557
CDS Length  3759
Paired RNAseq reads  3978
Single RNAseq reads  10102
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA005752 (0.0)
Best Drosophila hit  tripeptidyl-peptidase II, isoform D (0.0)
Best Human hit  tripeptidyl-peptidase 2 (0.0)
Best NR hit (blastp)  PREDICTED: similar to tripeptidylpeptidase II [Nasonia vitripennis] (0.0)
Best NR hit (blastx)  PREDICTED: similar to tripeptidylpeptidase II [Nasonia vitripennis] (0.0)
Gene Ontology terms
GO:0004175 endopeptidase activity
GO:0008240 tripeptidyl-peptidase activity
GO:0004252 serine-type endopeptidase activity
GO:0005737 cytoplasm
GO:0004177 aminopeptidase activity
GO:0008233 peptidase activity
GO:0006508 proteolysis
InterPro families
IPR015500 Peptidase S8, subtilisin-related
IPR000209 Peptidase S8/S53, subtilisin/kexin/sedolisin
IPR022229 Peptidase S8A, tripeptidyl peptidase II
IPR022232 Peptidase S8A, tripeptidyl peptidase II, arthropoda
IPR022398 Peptidase S8/S53, subtilisin, active site
Orthology group  MCL11107

Nucleotide sequence:

ATGGCAGACGTACCGATCGATTGTGAATTCCCTGTTTGGGGATTAATGCCTAAAAGAGAA
ACAGGGGTCGTCTCATTTTTAAATAAATATCCTGAATATGATGGCAGGAACACAGTTATT
GCGATATTAGACTCAGGTGTGGATCCCGCCGCAGAGGGTCTTAAGGTTACAAGTACAGGA
GAGACTAAAGTTATCGAGAGGTATGACTGCAGCGGCTGTGGTGATGTGGATACTAGCACA
GTGGTGAAAAAGGTTGTGGATGGATACATAACCGGCATTACTGGACGTAAACTAAAGATT
CCAGAAACATGGGACAATCCTAAGGGTGAATGGCGTATAGGTGTTGTTTATCCTTTTAGT
TTATATCCAACTAAGGTGAAGGAAAGGATCCAAGAGCATCGTAAGGAACACGTGTGGGAT
GTTGGCCAGAAGCCAGCTATGGCTAAGGCCACCAAAGATTTGCAGGATTTCGAAAATGAA
GTTTCTTCAAAAACCACCTTGAGTCAAGAGGAGAAGCAAGCGAAGGAGGAGCTGGAAGCC
AGAGTTGAGGTGTTGAAGGAACTCGACAAGAAATACACAGACGTCGGACCCACGTACGAC
TGTGTGCTGTGGCACGACGGAACGGTTTGGAGAGCGTGCATCGACACGTCCGAGGAGGGA
GACTTGTCTTCGGGCGTCCTGCTGGGCGAGTACAGCGCGACGCAGGAACACGCTCACCTC
ACGCCGCTGGACGAGATGACGGTCAGTGTGAACGTTCACAACGACGGAGACACGCTCGAG
GTAGTCGGCATGTGTTCGACTCACGGCACACACGTGGCGGCCATAGCGGCGGGGTACTTC
CCCGACGACCCCGACCGGAACGGGGTGGCGCCGGGTGCCAAGATTATATCGCTCACGATA
GGAGACAGTCGCCTGGGGTCCATGGAGACGGGCACGGCGCTGGTGCGGGCCTGCGTCAAG
GTCATGGAGCTGGCGGCGAGGACGAAGGTGGACGTCATCAACATGAGCTACGGGGAACAC
GCGCACTGGTCCAACGCGGGTCGTGTGGGGGAGATCATCAGTATGGTGGTGAACAAGTAC
GGCGTGTCGTGGGTGGTGTCCGGCGGCAACCACGGCCCCGCGCTCGCCACGGTCGGCGCG
CCGCCTGACATCGCGCAACCCATACTCATAGGCGTGGGCGCGTACGTGTCGTCCGAGATG
ATGTTGGCGGCGTACTCCATGCGGGCGCGCGGCTGCGGGCCGCGGAAGTCGACCTCGTCG
GCGGGGCCCTGCAGCGACGGCGCGCTCGGCATCTCCGTCTGCGCACCCGGGGCTGCGCTC
GCCTCCGTCGCCAGGTTCACTCTGAGGAACTCCCAGCTCATGAACGGCACGTCCATGGCG
GCGCCGCACGTGGCCGGGGCTGTCGCGGCCCTGATCTCGGGTCTGTCGTGCCGCGGCCTG
CCTCACTCGCCCTACTCCATGAAGCGAGCGCTGGAGAACACGGCCACGTACTTAGAACAC
GTGGAGCCCTGGGCGCAGGGGGCCGGCTTGTTGAATATAGAGAAGGCGTTCGAGCACTTG
GTGGAGCATCACGCGGCGGTGGAGCGTGACGTCACCTTCAATATAAAGTGCGGCGCCAAC
AACGCCAAGGGTATCTTCCTCCGTCCGCGGGCCGACGACCCGCCCCGGGACATCAGCATC
ACCGTGGAGCCGCAGTTCCTGGAGGACTTCCGAGACCAGAACAAGCGGGCGGTGATGGAG
CGCCAGTTGTCGTTCGAGGTCCGCCTGGCGCTGACGGCGGCTCCGGCCTGGCTGCACGGG
CCCAAGCACCTGCACCTGGCCGCGGCGCCCCGGGCCTTCGCCCTCAGGGTGCACACCGCG
GACTTACCTCCGGGACCTCACTTCGCCAGTCTGAACGCGTACGACGTGTCGTGCGTGTCC
AAGGGGCCGGTGTTCCGCGTGTCGGTGACGGTGCTGCAGCCTGAGCCGCTGGCAGGTCTG
CCACACGAGCCCCACATACGACTGACGGACGTACTGTTCCGGCCCTCCGCCATCAAGAGA
CACATCATAGTAGTCCCGCCCGAGGCGTCGTGGGGCGTGGTCCGCTTGGTCCGTCGCGGC
GGAGAGAGTTCGTCTCGGTTCCTGGTGCACGTGATGCAGCTCTCGCCGCGCCGCTCCTGC
AGGGACCACGAGACGCACCGCATCATGACGCTCGGACCGCACGCTCCCGCGCAGGCGCCC
TTCAGACTACTGGGCGGCGTGACGGTAGAGGTGGCGATCGCCAAGTACTGGGCGAACGCC
GGAGACGTGCAAGTAGATTATACTATAGAGTTACACGGACTGAGGCCGGACTGCGGGCAC
CGGCTGACGCTGACCAGTGCAGCGCTGGGCAGCGTGCGGCTCACAGCGCTGAGGCCGCTT
GATGTGCAGCCGACGGCGGTCCTCAAACACATCGAGCCCGTGTACAGGCCGTCCGAGTCC
AAGCTGTGTTCCCTGACCGCGCGTGACGTCATCCCCCCCTCCAGGCAGATCTACCAGCTG
CTGAACACGTATACCTTCAATATACCTAAAGCTACCGAAGTGTCGCCCATGGTGCCGATG
TTGTGTGACATGTTGTACGAGTCGGAGTTCGAGTCCCAGATGTGGATGCTGTACAACAGC
TGCAAACAACTCGTGGCTGTAGGGGACGCCTACCCCTCGAAGTACTCAGCCAAGGTGGAT
AAGGGCGAGTACACACTCCGCCTGTCTATTCGTCACGAGAACCGCGCGCTGCTGGAGAGG
CTCACCGAGCTGCCGGTCGTGGTGCAGCAGAGACTCGCGCAACCCATCACGCTGGACGTG
TACAGCGACCAGCCACAGGCGTTGACGGGCGGGAAGAAGTTCACGTCGGCGTCTCTGGCC
AGCGGCGATGTGCTGCCGCTGTACTTCGCGCCGCTCCCCGCTGATAAGATAAGTCGTTCG
AACCTGTCCATCGGCGTGTCCCTGACGGGGACGGTGTCGTTCGTGAAGGACGAGCTGGGT
CACAAGCACCTGCACATGGGCGAGTGTCAGACCCTACTGGACGGACCCCGTAGGACGATC
AAGGACAACAGGAGACACGAGGACTACCACGACGCGCTTAGAGAGTTCACCGTGGGCTGG
ATGACCAAGATGGAGGGAGAAAAGTTGGACCAGGTGTACGAAGAAATATTAGAAAAATTC
CCGAACTTCATTGGAGCTCACGTCGCTTACATGAACAGTCTGGACTCCCCGACAGACCCC
AAGAGGTTACCGAATACAGAAGACGGCACGAACGGACTGAAACCGGCTCAGGACGAACAG
ATCATAAGCATCGCTGACAAGGTCATCAAGAGTATAGACCAGGATAAGTTACTGGCGCAC
CTGGGGACGAAGAACGACATGCGAGCTGACTCCAACAAGATAAAACAAGAGTTCGACCGT
CAGCGCGGCTACCTGATCGAAGCGCTGTGCCGTCGCGGCTCCGCCATGTGTCGCCTGGGG
CGGTCGATCTCCGCCCTGCACGAGAACGCGAACACCTTACTGAAGTTCACGGAGCTGAGC
GAGCCGCGCGCGCTCCAGTACGGCCTTTGGCACTGGACCGCCTTGGAGCAGTGGGGGCGC
GCCATGAGGCTGTGGCTGAGGGTGCACGACGAGCGACCCTCGCGGGAGGTGGACCAGCGA
GCCGCGCGCGCCGCCCGGGCGCTGGGCTGGGGACACGTGGCCGCACACCTCGCCGCCGCC
GCGCCGCACAAGCACGCCGCACACTACCGCCCCTTCTGA

Protein sequence:

MADVPIDCEFPVWGLMPKRETGVVSFLNKYPEYDGRNTVIAILDSGVDPAAEGLKVTSTG
ETKVIERYDCSGCGDVDTSTVVKKVVDGYITGITGRKLKIPETWDNPKGEWRIGVVYPFS
LYPTKVKERIQEHRKEHVWDVGQKPAMAKATKDLQDFENEVSSKTTLSQEEKQAKEELEA
RVEVLKELDKKYTDVGPTYDCVLWHDGTVWRACIDTSEEGDLSSGVLLGEYSATQEHAHL
TPLDEMTVSVNVHNDGDTLEVVGMCSTHGTHVAAIAAGYFPDDPDRNGVAPGAKIISLTI
GDSRLGSMETGTALVRACVKVMELAARTKVDVINMSYGEHAHWSNAGRVGEIISMVVNKY
GVSWVVSGGNHGPALATVGAPPDIAQPILIGVGAYVSSEMMLAAYSMRARGCGPRKSTSS
AGPCSDGALGISVCAPGAALASVARFTLRNSQLMNGTSMAAPHVAGAVAALISGLSCRGL
PHSPYSMKRALENTATYLEHVEPWAQGAGLLNIEKAFEHLVEHHAAVERDVTFNIKCGAN
NAKGIFLRPRADDPPRDISITVEPQFLEDFRDQNKRAVMERQLSFEVRLALTAAPAWLHG
PKHLHLAAAPRAFALRVHTADLPPGPHFASLNAYDVSCVSKGPVFRVSVTVLQPEPLAGL
PHEPHIRLTDVLFRPSAIKRHIIVVPPEASWGVVRLVRRGGESSSRFLVHVMQLSPRRSC
RDHETHRIMTLGPHAPAQAPFRLLGGVTVEVAIAKYWANAGDVQVDYTIELHGLRPDCGH
RLTLTSAALGSVRLTALRPLDVQPTAVLKHIEPVYRPSESKLCSLTARDVIPPSRQIYQL
LNTYTFNIPKATEVSPMVPMLCDMLYESEFESQMWMLYNSCKQLVAVGDAYPSKYSAKVD
KGEYTLRLSIRHENRALLERLTELPVVVQQRLAQPITLDVYSDQPQALTGGKKFTSASLA
SGDVLPLYFAPLPADKISRSNLSIGVSLTGTVSFVKDELGHKHLHMGECQTLLDGPRRTI
KDNRRHEDYHDALREFTVGWMTKMEGEKLDQVYEEILEKFPNFIGAHVAYMNSLDSPTDP
KRLPNTEDGTNGLKPAQDEQIISIADKVIKSIDQDKLLAHLGTKNDMRADSNKIKQEFDR
QRGYLIEALCRRGSAMCRLGRSISALHENANTLLKFTELSEPRALQYGLWHWTALEQWGR
AMRLWLRVHDERPSREVDQRAARAARALGWGHVAAHLAAAAPHKHAAHYRPF
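
Consistency check (illustrative):

The CDS length reported above (3759 nt) should equal three times the protein length plus one stop codon. The sketch below is one way to check that relationship and to translate a CDS. It is not part of the database record: it assumes the standard genetic code (NCBI translation table 1), and the helper names `expected_protein_length` and `translate_cds` are hypothetical.

```python
# Illustrative sketch only; assumes the standard genetic code (NCBI table 1).
# Helper names are hypothetical and not taken from the source record.

BASES = "TCAG"
# Amino acids for all 64 codons, ordered with T, C, A, G at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def expected_protein_length(cds_length: int) -> int:
    """Return the residue count implied by a CDS that ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


def translate_cds(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    # Remove line breaks and spaces so wrapped sequence text can be pasted in.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)
```

For this record, `expected_protein_length(3759)` gives 1252, which matches the 1252 residues listed under the protein sequence. Translating the first six codons of the nucleotide sequence (`ATGGCAGACGTACCGATC`) yields `MADVPI`, the start of the protein sequence.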