DPGLEAN03148 in OGS1.0

New model in OGS2.0DPOGS206563 
Genomic Positionscaffold210:- 63916-66925
See gene structure
CDS Length1221
Paired RNAseq reads  552
Single RNAseq reads  1636
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA013746 (7e-51)
Best Drosophila hit  CG9737 (4e-41)
Best Human hittransmembrane protease serine 4 isoform 4 (4e-28)
Best NR hit (blastp)  pxProphenoloxidase-activating proteinase 3 [Plutella xylostella] (2e-73)
Best NR hit (blastx)  pxProphenoloxidase-activating proteinase 3 [Plutella xylostella] (4e-72)
GeneOntology terms

  
GO:0004252 serine-type endopeptidase activity
GO:0006508 proteolysis
GO:0006911 phagocytosis, engulfment
InterPro families




  
IPR009003 Peptidase cysteine/serine, trypsin-like
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
IPR022700 Proteinase, regulatory CLIP domain
IPR006604 Disulphide knot CLIP
IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site
IPR001314 Peptidase S1A, chymotrypsin-type
Orthology groupND

Nucleotide sequence:

ATGGATTTCGCTATACTACTATGTGCTGCCTTGTACATCGTTCATTTAACGGAATCTGTT
GAAGCTGAAAAAAAATGTAAAGTTCCGAACGGTGACGCCGGTAACTGTGTCAGTATAAAT
ATATGCCCACCACTGAAAAGCCTCTATAAGAAGAAACATAAAACAGTCAATGAAAACAGA
TTTATAAGACAATCTATATGTGGTTCCCGAGACTCCGATCCAATAAAGATCTGCTGTCCA
CCACAGTCTTCGTGGGTTGGGTTCGTCCCCACGCCGGTGGTAGTTCCACCATCACCAGTG
TATAGACGACCTAAACCAGAACCGACCTTGTTACAACCCAATACACTCATGATACCGTCA
TTCAACAGCACCGACAGACAAACAGACAAACCGATGAGACCAAAACCTGACCTGCAAGAT
TACAACAGTGTGTGCGGCATAGACTCATCCAGTGGGAATAGAATTACAAATGGCAACGAA
ACCGCTGTGGACCAGTATCCTTGGTTGGCTTTATTGGAATATTCAAACGGCTTCCTCGGC
TGCGGCGGGAGCTTGATCAGCTCCAGATACGTTCTTACAGCTGCACACTGTCTTAAAAGC
TTACAAAACGGAGAGCCATTATATGTCCGTCTGGGGGAGTACAACATCACTTCCTTCCCC
ACGGACATCGTTGAAATAGACGGCGGTGGTTTTGAAGTTGTCACAGTAACAGTCATAGCC
ATTAGGGCTATGTATACACATCCGTTGTATTATAGAGACCTGAGACTACACGATATAGGT
CTTATTGAAATGGAAGAAGCAGCAAATTTCAGCGATTTCATCAAAGTAATATGTCTTCCC
CAAATGGATTACATGCCAATCTTCAACAGCTCAACCATATTCTACGTAGCTGGCTGGGGC
AGTGACAATTTTAGTTCTGGCACTGAGGTCAAGATGGAGACCAGCGTCCCATACAAACTC
CACAGTCAGTGCCCGTTGGTGATGGAACCGTATCCGATTCACCAGATATGTGCTGGCGGG
GAAGGTGGAAGGGACACCTGCAGCGGGGACTCAGGTGGTCCATTGATGTATGAGACTCCG
AGCCACAGATACGAAGCTGTAGGGATCGTGAGTTACGGCTCCAGGGATTGTGGTAAAGAA
GGGGAACCGGCTGTTTACACTTATGTATACAATTACCTGCCCTGGATAAGGAATATTCTC
AGCGGTAATGTGGACCAATGA

Protein sequence:

MDFAILLCAALYIVHLTESVEAEKKCKVPNGDAGNCVSINICPPLKSLYKKKHKTVNENR
FIRQSICGSRDSDPIKICCPPQSSWVGFVPTPVVVPPSPVYRRPKPEPTLLQPNTLMIPS
FNSTDRQTDKPMRPKPDLQDYNSVCGIDSSSGNRITNGNETAVDQYPWLALLEYSNGFLG
CGGSLISSRYVLTAAHCLKSLQNGEPLYVRLGEYNITSFPTDIVEIDGGGFEVVTVTVIA
IRAMYTHPLYYRDLRLHDIGLIEMEEAANFSDFIKVICLPQMDYMPIFNSSTIFYVAGWG
SDNFSSGTEVKMETSVPYKLHSQCPLVMEPYPIHQICAGGEGGRDTCSGDSGGPLMYETP
SHRYEAVGIVSYGSRDCGKEGEPAVYTYVYNYLPWIRNILSGNVDQ