DPGLEAN19282 in OGS1.0

New model in OGS2.0DPOGS206217 
Genomic Positionscaffold3442:- 5534-8834
See gene structure
CDS Length1290
Paired RNAseq reads  27
Single RNAseq reads  79
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA009694 (5e-128)
Best Drosophila hit  CG13744 (7e-107)
Best Human hittransmembrane protease serine 9 (1e-35)
Best NR hit (blastp)  GE22661 [Drosophila yakuba] (3e-110)
Best NR hit (blastx)  GD10114 [Drosophila simulans] (4e-110)
GeneOntology terms
  
GO:0004252 serine-type endopeptidase activity
GO:0006508 proteolysis
InterPro families


  
IPR009003 Peptidase cysteine/serine, trypsin-like
IPR001314 Peptidase S1A, chymotrypsin-type
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site
Orthology groupMCL17661

Nucleotide sequence:

ATGTTCACGGTGAAGAGGCAAGCGAAGACATGGATACCGATACAGGACAGGAAAATAAAA
AACTGTTTTATCACCAAAAAACTATCGAACAGTCCCAAGAATGTATTTAAAAGGCGAGCC
TTTAAATGGCGGAAACCTAAATGCCAGACGGTTAACATAATCATACCTCTCATGATTTTG
AACTTTGCTGGACACACCAGCTCGGAGAGTCTTAGTAACAGAGTCCTGGCCTCACTCCTG
GGATACCCGACCACGTGTACTGTTGGTTCTCAAGTGCGAGCCTGTTCTCTGTCGCTGACT
TGTTGGCTCCGCGGTGGTATCAGGGTGAAGGGTTGCGGAGGAAGCTGGTTGTTCTCATGC
TGTTACATAGCCCGGGACAGCTATGACTATGATAACTCAATCCCCTCTTCCGACTGGAAA
TACAAAATACCGCCGAAGTTACGTCAAGTACCTCAGAGGAATGTGGTGCCAACTAACGTG
TTCCGACGGAGAGTCGACGACGACATTAGTCAGATGGAGTGCGGCCTCTCCTCAAGTCGC
ATGCTCCAGAAGCGTATCATCGGCGGTCGGGAGGCCAGGGTCGCGGAGTTCCCCTGGCAG
GCTCACGTCAGGATCTCAGAGTTCCAGTGCGGCGGAGTCTTAATATCTCGTTGGTACGTG
GCGACGGCAGCTCACTGCGTGTCCCGAGCTCGTCCTAGGGATGTGGCCGTGTGGCTCGGA
GCACTTGACACCACCTCTGGGGATAAAAGCGCGAGAAAAATTGGGGTCGTCCAGAAAATC
CTCCACCCCCTCTTCCAGTTTCGCATGACCCAACCTGACCGGTACGACATAGCGTTGCTA
AAACTCTCCCGACCTGTGACCTACACTAGTCACATCCTCCCGATCTGTCTGCCCGACGGA
GATTTCGAACTCCGCGGCAAGTCAGGGGTCATCGCCGGCTGGGGCAAGACCGATACCAGC
AACGGCCACACTGGCACTAACTTACTACGGTCCGCTACTGTACCGATTTTGAGCACCGAA
CAATGTATCAACTGGCACCAGAGTAAGCAGATCTCTGTTGAAATACATTCGGAGATGATC
TGCGCCGGACATTCAGACGGACACCAAGATGCGTGTCTAGGTGACTCTGGAGGTCCCCTA
ATTGTGTTGGACAGGGGTCGTTACTACCTGGCCGGTATCACCTCGGCCGGGTTCGGCTGC
GGCGTCGACCACCAGCCAGGGATCTATCACAACGTGCGGGTCACCGCTGGCTGGATCAGA
GACGTCATCACCAGATATGGTGACCTCTAG

Protein sequence:

MFTVKRQAKTWIPIQDRKIKNCFITKKLSNSPKNVFKRRAFKWRKPKCQTVNIIIPLMIL
NFAGHTSSESLSNRVLASLLGYPTTCTVGSQVRACSLSLTCWLRGGIRVKGCGGSWLFSC
CYIARDSYDYDNSIPSSDWKYKIPPKLRQVPQRNVVPTNVFRRRVDDDISQMECGLSSSR
MLQKRIIGGREARVAEFPWQAHVRISEFQCGGVLISRWYVATAAHCVSRARPRDVAVWLG
ALDTTSGDKSARKIGVVQKILHPLFQFRMTQPDRYDIALLKLSRPVTYTSHILPICLPDG
DFELRGKSGVIAGWGKTDTSNGHTGTNLLRSATVPILSTEQCINWHQSKQISVEIHSEMI
CAGHSDGHQDACLGDSGGPLIVLDRGRYYLAGITSAGFGCGVDHQPGIYHNVRVTAGWIR
DVITRYGDL