DPGLEAN04442 in OGS1.0

New model in OGS2.0DPOGS215183 
Genomic Positionscaffold895:- 45147-52195
See gene structure
CDS Length1389
Paired RNAseq reads  281
Single RNAseq reads  1282
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008668 (8e-94)
Best Drosophila hit  CG1299 (2e-52)
Best Human hitplasma kallikrein precursor (5e-30)
Best NR hit (blastp)  clip domain serine protease 4 [Bombyx mori] (4e-104)
Best NR hit (blastx)  coagulation factor-like protein 3 [Hyphantria cunea] (1e-103)
GeneOntology terms
  
GO:0004252 serine-type endopeptidase activity
GO:0006508 proteolysis
InterPro families




  
IPR006604 Disulphide knot CLIP
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
IPR022700 Proteinase, regulatory CLIP domain
IPR009003 Peptidase cysteine/serine, trypsin-like
IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site
IPR001314 Peptidase S1A, chymotrypsin-type
Orthology groupMCL22674

Nucleotide sequence:

ATGTCACCTCCTCACGTAATGAGTAGTGCTCTGTTGACAGTGCTGTGTGTCTGTTTGTGT
CCACAAGCTGTGTTTTGTCAATACCAGGAAACTTGCACCACATTAGAAGGTGGCATCGGA
ACATGTACGCCAACTATCTTATGCGCTCCGTACTTCAATCTGCTGGGAGTAGCCAAAAAC
CTCACTATTTCTTTTCAACTTAGAGATGCTCAATGTGGGTCACATGGTATCAACATCATG
GTCTGCTGCCCAAATCAAAATGAGCCACCTCCAGAAACAAGCAAATTATTATTGACCTTA
AATACCTCCGATGACTCTAATAAGGATCAAGGGATCACTCAACTTGACGAAACGTGCACC
ACAATTGAAGGTGGTGTTGGCAAGTGTGAGTCGTTAGCAGCCTGCGAGCCGTACCTTCAT
CTGACGAGACAAGCCAAAAACATTCCCTTGGCTATTCAACTTAGAGATGCTCAATGCGGT
TCAGACGGAAACGACCAAAAGGTCTGCTGTCCTACTTCAGGTACCTCCACTTCATCGCCA
ACTGGCGAACCTTCCTTCAGGTCACTATCAGAGTCAGACTACATAACTGCCTTCCCTGAA
CCACCAGATTGTGGATTCAGTTTAGCACACTTTAACAGAGTTGTGGGAGGTGTGAACGCT
AAACTCGGTGACTTCCCCTGGATGGCACTTCTTGGTACCAAACAAGGAAACTGGGACGCA
GCACGTTGGATATGTGGGGGAACTCTGATCTCTCACCGCCACGTCCTGACCGCTGCTCAC
TGTATAAAGAATGAATTGAACGTGGTCCGACTTGGAGAACTGGACTTCGAAAGAGACGAC
GATGGCGCTTCTCCCATAGACTTTTCCATTAAAAGAAAAATCAAACATGAAAACTTCGAC
TACGCTTCCTTCACTAATGACATCGGCCTTTTGATATTGGGAAAGGATGTGGAGTTCACT
CGTCTGATGCGGCCGATCTGTCTGCCGACTCGTGAAGACCTACGTTCAAAATCTTTTGTT
GGCTACCATCCTTTCATCGCCGGTTGGGGAAACGTCGACAACCGTGGTGCTGCTAAATCT
CACATGCAAGTTGCGCAGCTGCCTGTCCTGGAAAACTCCAAATGCAGGAGGGTTTACGAA
TTGCGGGTCATCGACGAAAGGGTCATGTGTGCTGGCGTCACAGGCAAAGACTCCTGCAAT
GGTGACAGTGGCGGACCGCTCATGCAACCGAATACGAACCGGACAACGGGTAAAATATAT
TTCTATCAGACCGGCGTGGTGTCGTATGGTCACACTAGATGTGGTGAAGCGAGTTTCCCA
GGCGTGTACAGCTCAGTGCAGCACTTCCTGCCCTGGATACAGAAACACGTGCTGGGATCG
GACGAATGA

Protein sequence:

MSPPHVMSSALLTVLCVCLCPQAVFCQYQETCTTLEGGIGTCTPTILCAPYFNLLGVAKN
LTISFQLRDAQCGSHGINIMVCCPNQNEPPPETSKLLLTLNTSDDSNKDQGITQLDETCT
TIEGGVGKCESLAACEPYLHLTRQAKNIPLAIQLRDAQCGSDGNDQKVCCPTSGTSTSSP
TGEPSFRSLSESDYITAFPEPPDCGFSLAHFNRVVGGVNAKLGDFPWMALLGTKQGNWDA
ARWICGGTLISHRHVLTAAHCIKNELNVVRLGELDFERDDDGASPIDFSIKRKIKHENFD
YASFTNDIGLLILGKDVEFTRLMRPICLPTREDLRSKSFVGYHPFIAGWGNVDNRGAAKS
HMQVAQLPVLENSKCRRVYELRVIDERVMCAGVTGKDSCNGDSGGPLMQPNTNRTTGKIY
FYQTGVVSYGHTRCGEASFPGVYSSVQHFLPWIQKHVLGSDE