DPGLEAN08842 in OGS1.0

New model in OGS2.0DPOGS215182 
Genomic Positionscaffold4551:- 3848-9421
See gene structure
CDS Length1485
Paired RNAseq reads  583
Single RNAseq reads  2820
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008668 (7e-77)
Best Drosophila hit  CG32260 (5e-37)
Best Human hittryptase beta-2 precursor (8e-21)
Best NR hit (blastp)  coagulation factor-like protein 3 [Hyphantria cunea] (2e-87)
Best NR hit (blastx)  clip domain serine protease 4 [Bombyx mori] (3e-80)
GeneOntology terms
  
GO:0004252 serine-type endopeptidase activity
GO:0006508 proteolysis
InterPro families




  
IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site
IPR001314 Peptidase S1A, chymotrypsin-type
IPR009003 Peptidase cysteine/serine, trypsin-like
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
IPR006604 Disulphide knot CLIP
IPR022700 Proteinase, regulatory CLIP domain
Orthology groupMCL22674

Nucleotide sequence:

GTATTCGTTCCGATATACACGAGACGAGATGAAACCTCTCAGCATTCCTTGGATTCAACG
TACGGATGCCGGGTCCATCGCGTTGCAGGGAGAGGAGCTTCAACAGTGCCTGGAACTTGC
AACCCAGGTTTAGGTTTCAGGGGCCCTTATCTAACGCGCGTTAAACGCTCACCAACTATT
GTCCAAGCTCCTCTCTCTCTACCTAACCAAATTTTAAACGAATCTACGAGGGCTGCCATG
AATGAACTGATGTTAGTTCTTGACCGGCCCAAGTCTTTTAGCAAGGACACTTGCACCACA
TTGGAGGGTGGCATCGGAACGTGTACGCCAGCTGTCTCATGCGCTGCGTACTTCGATCTG
CAGCGACAAGCCAAAAGCTTAACTGCTTCTTTCCAGCTTAGAGACTCGCAATGTAAACCA
AATGGATATAATGATATGATCTGCTGCCCAGTTGCAAATGAGATACCAAGAGAAACGACG
CTGCCATTTCGGGACTTAGACAAAGAATATAACTGTGGTGAAGACCAAAGGAATGCTCAA
CTTGACGAAACGTGCACCACAATTGAAGGTGGTGTTGGCAAGTGTGAGTCGTTAGCAGCC
TGCGAGCCGTACCTTCATCTGACGAGACAAGCCAAAAACATTCCCTTGGCTATTCAACTT
AGAGATGCTCAATGCGGTTCAGACGGAAACGACCAAAAGGTCTGCTGTCCTACTTCAGGT
ACCTCCACTTCATCGCCAACTGGCGAACCTTCCTTCAGGTCACTATCAGAGTCAGACTAC
ATAACTGCCTTCCCTGAACCACCAGATTGTGGATTCAGTTTAGCACACTTTAACAGAGTT
GTGGGAGGTGTGAACGCTAAACTCGGAGGCTTCCCATGGATGGCACTTCTTGGTACCAAA
CAAGAAAACTGGGACACAGCACGTTGGATATGTGGGGGAAGTCTGATCTCTCACCGCCAC
GTCCTGACCGCTGCTCACTGTATAAAGAATGAATTGAACGTGGTCCGACTTGGAGAACTG
GACTTCGAAAGAGACAACGATGGCGCTTCTCCCATAGACTTATCCATTAAAAGAAAAATC
AAACATGAAAACTTCGACTACGCTTCCTTCACTAATGACATCGGCCTCTTGATATTGGGA
AAGGATGTGGAGTTCTCAAGCGGGGCTTCTTCATCTCACCTGCTATATGTGGAGCTGCCT
GTCGTGAACAACTCGGTATGCGAGACAGCTTATGAGTCGCGGGTCATCGATGAGAGAGTT
ATGTGTGTTGGCAGCATCTTTAAAGACTCCTGCTCCGGGGACAGCGGTGGACCGCTCATG
GACAATATAATAAACAAAACAACTTTTCGTACTCATTTCTATCAGACCGGCGTTGTGTCG
TATGGTCACACTAAATGCGGTGAAGCAAATTTTCCAGGCGTCTACAGTTCACTGGCGTAC
TTCTTGCCCTGGATACGGGAAAATGTGCTGGGATTTGTAGAATAG

Protein sequence:

VFVPIYTRRDETSQHSLDSTYGCRVHRVAGRGASTVPGTCNPGLGFRGPYLTRVKRSPTI
VQAPLSLPNQILNESTRAAMNELMLVLDRPKSFSKDTCTTLEGGIGTCTPAVSCAAYFDL
QRQAKSLTASFQLRDSQCKPNGYNDMICCPVANEIPRETTLPFRDLDKEYNCGEDQRNAQ
LDETCTTIEGGVGKCESLAACEPYLHLTRQAKNIPLAIQLRDAQCGSDGNDQKVCCPTSG
TSTSSPTGEPSFRSLSESDYITAFPEPPDCGFSLAHFNRVVGGVNAKLGGFPWMALLGTK
QENWDTARWICGGSLISHRHVLTAAHCIKNELNVVRLGELDFERDNDGASPIDLSIKRKI
KHENFDYASFTNDIGLLILGKDVEFSSGASSSHLLYVELPVVNNSVCETAYESRVIDERV
MCVGSIFKDSCSGDSGGPLMDNIINKTTFRTHFYQTGVVSYGHTKCGEANFPGVYSSLAY
FLPWIRENVLGFVE