DPGLEAN04694 in OGS1.0

New model in OGS2.0  DPOGS204146
Genomic Position  scaffold2055:- 18499-23560
CDS Length  1218
Paired RNAseq reads  359
Single RNAseq reads  939
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA005172 (4e-71)
Best Drosophila hit  melanization protease 1, isoform A (3e-54)
Best Human hit  chymotrypsin-like protease CTRL-1 precursor (2e-27)
Best NR hit (blastp)  hemolymph proteinase 5 [Manduca sexta] (4e-105)
Best NR hit (blastx)  hemolymph proteinase 5 [Manduca sexta] (5e-107)
Gene Ontology terms
GO:0006952 defense response
GO:0004252 serine-type endopeptidase activity
GO:0006508 proteolysis
GO:0008236 serine-type peptidase activity
GO:0035006 melanization defense response
InterPro families
IPR009003 Peptidase cysteine/serine, trypsin-like
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site
IPR001314 Peptidase S1A, chymotrypsin-type
IPR006604 Disulphide knot CLIP
IPR022700 Proteinase, regulatory CLIP domain
Orthology group  MCL10695

Nucleotide sequence:

ATGTTGTCTGTTTGGTTTGTGTGTGTAGTGATCTTGATCTTTGTGCCCGCCGACTGTTTA
TATTCTGGTGAAAGCTGCACGGTCAACGGTCGACCTGGGATTTGTAGGTTGCTCTCACAA
TGTTCTCATTTAGTTAATGAGATCCAAGATGCTGGAACACCGATGCCGCCGTATCTGAGA
AGAAAACTACAAAACCTCTCGTGTGGCTTCGACGATGACGAGCCTATGGTTTGCTGTATT
TCTAACCCTGGAGATACGGGAGATCCAAATCATAATGGGTTGCTTAAACCTTATAATAAT
GACGATATTGATACAGGAAAAAATAGAGTTGATGATAAAAGTGGTGCAAATACAATAGAC
TCCATACCGGATATTCGCTATCACCCAAAACTAAATCTGCTACCAACGAATTGCGGTGTC
ATTGAAAATGACAGGATTTTCGGAGGAAATAGGACAAGGCTATTTGAAATGCCGTGGATG
GTGCTACTGTCATACGACTCTCCTCGCGGTACAAAATTAAGTTGCGGTGGTACTATTATA
ACCAGACGGTACATCTTGACAGCAGCACACTGTGTGTCATTCCTGGGATCAAGACTTACA
TTACGTGACGTCATCCTTGGAGAGTACGACATTAGGTCTGACCCAGATTGTGAGAGGGTT
GAAGGAGAAGTGTTTTGTGCACCAAGAGTTCGGAATGTATCTATAGATGAGACTATACCT
CATCCGGGGTATTCTCCCACGAGGCTAAGAGATGATATTGCTTTAATAAGGCTCTCAGAA
CCAGTAGATTTCACCTTGGACAGCATGAAACCAATCTGCTTGCCGACGACACCAACATTG
TTATCAGAGCAGCTGGAAGGTTTGCAGGGTGTAGTGGCAGGCTGGGGCACCACCGAGGAT
GGACTTCAGTCACCTGTGCTGCTCAGTGTTGATCTACCAATACTCACCAATTCGCAGTGC
CAGTCGGTTTATCACGGATCGCTTCAAATTTACGATACTCAACTGTGCGCAGGAGGAGTT
GTGGATAAAGACTCCTGTGGTGGTGATTCTGGAGGACCATTGATGTACCCTGGAAGAACA
CAATCTGTTGGAGTCAGATACGTTCAACGGGGCATAGTGTCTTACGGCTCCAAGCGTTGT
GGGATTGGAGGATTACCTGGAGTATACACTAGAGTATCCTATTACATGAAATGGATTTTA
GATAATATAAGAGACTAG

Protein sequence:

MLSVWFVCVVILIFVPADCLYSGESCTVNGRPGICRLLSQCSHLVNEIQDAGTPMPPYLR
RKLQNLSCGFDDDEPMVCCISNPGDTGDPNHNGLLKPYNNDDIDTGKNRVDDKSGANTID
SIPDIRYHPKLNLLPTNCGVIENDRIFGGNRTRLFEMPWMVLLSYDSPRGTKLSCGGTII
TRRYILTAAHCVSFLGSRLTLRDVILGEYDIRSDPDCERVEGEVFCAPRVRNVSIDETIP
HPGYSPTRLRDDIALIRLSEPVDFTLDSMKPICLPTTPTLLSEQLEGLQGVVAGWGTTED
GLQSPVLLSVDLPILTNSQCQSVYHGSLQIYDTQLCAGGVVDKDSCGGDSGGPLMYPGRT
QSVGVRYVQRGIVSYGSKRCGIGGLPGVYTRVSYYMKWILDNIRD