DPGLEAN14083 in OGS1.0

New model in OGS2.0DPOGS212233 
Genomic Positionscaffold959:- 37576-44652
See gene structure
CDS Length1251
Paired RNAseq reads  189
Single RNAseq reads  478
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004445 (7e-79)
Best Drosophila hit  CG14760 (3e-47)
Best Human hitsuppressor of tumorigenicity 14 protein (7e-33)
Best NR hit (blastp)  trypsin-like proteinase T2b precursor [Ostrinia nubilalis] (4e-136)
Best NR hit (blastx)  trypsin-like proteinase T2a precursor [Ostrinia nubilalis] (4e-130)
GeneOntology terms
  
GO:0004252 serine-type endopeptidase activity
GO:0006508 proteolysis
InterPro families



  
IPR000859 CUB
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site
IPR001314 Peptidase S1A, chymotrypsin-type
IPR009003 Peptidase cysteine/serine, trypsin-like
Orthology groupMCL10255

Nucleotide sequence:

ATGACGGTGATTTATCTGATAAGGCCAACTATCGGCCAATCTCTCTGGCTGGTCATCTGG
AAGATGTTCGGATCAGGAGCTATTTTTTTGTTGGTTGTGGGCTATGCTCAGGCGCAAGAT
GCGAATTGCGATTTCTTTTTAAATGTTGCTGCCGGGAGAAGTTATCCTATCTCCAGTCCG
AACTATCCATACAGCTACAGGCCAGGAGTAACTTGTCGCTGGATTGCACAATGCCCAAAT
GGATATAATTGCAGATTGGATTGCAGCGAAATAAATTTACCGCAGACGCAAAACTGCTAC
ATGGATAGGTTACTAGTATCCAAGACAGGGGATAGTCAATTGGGGTCATCGGAATACCAT
TGTGGTTATGGCACATTGACAGCTGTCTCCGTAGCAAACAGGATAAGTGTTGGTCTCGTT
ACGTCGAGGTCCAGTCGCGGCGGCAGATTTACCTGCACTGTTACTGCCCAAGCGTCGTCA
ACCTGCAGCTGTGGTTACAGAAATGTCCAGAGTTTAAAAGAGAGTTATATCGTCGGCGGT
GAGGAGACACGTCCTAATGAATATCCCATGATGGCTGGCATCGTGTATGTGGGAGAGAAC
ACCATCAAATGCGGTGCAGTCATCATTGATAACGGATACGTATTGACAGCTGCTCACTGC
GTCGTCGGCAAAAATCTCGGTGAACTCGCTGTGGTCGTTGGCGAACATGACGTCAGCACC
GGAGCGGATTCGCCGTCCTTGCAAGTTTTCAGAGTTGCTTCGGTTATAATTCATCCTCAA
TTTAACTCGGATACATATGACAACGACATCGCCATCATACAGATATATGGCAGTATAGTG
TACAGTCAGAAAGTAGGACCTGTCTGTCTGCCATTTAAGTTCATAAACGACGACTTCACC
GGATCCAAAGTTACCATTTTGGGTTGGGGGACGACATTCCCCGGAGGTCCAACATCGAAC
GTGCTCCGGAAGGTGGACGTGAATGTCGTCAGTCAGGCTTCATGCAGCAGAAGTTATCCA
AGTCTCAGTAACAATCAGATGTGTACATTTGCTCAAGGGAAAGATGCTTGTCAGGACGAT
TCTGGCGGTCCTCTGCTCTACCAGGACCCTTCGAACGGCCGTTTATACAGTGCTGGTATC
GTTAGCTTCGGCCGATTCTGTGCCTCCAGTTATCCCGGGGTGAACACAAGAGTCACCTCA
TACCTCTATTGGATCCTTAACAACGCCCCGGCCAACTACTGCAATATTTAA

Protein sequence:

MTVIYLIRPTIGQSLWLVIWKMFGSGAIFLLVVGYAQAQDANCDFFLNVAAGRSYPISSP
NYPYSYRPGVTCRWIAQCPNGYNCRLDCSEINLPQTQNCYMDRLLVSKTGDSQLGSSEYH
CGYGTLTAVSVANRISVGLVTSRSSRGGRFTCTVTAQASSTCSCGYRNVQSLKESYIVGG
EETRPNEYPMMAGIVYVGENTIKCGAVIIDNGYVLTAAHCVVGKNLGELAVVVGEHDVST
GADSPSLQVFRVASVIIHPQFNSDTYDNDIAIIQIYGSIVYSQKVGPVCLPFKFINDDFT
GSKVTILGWGTTFPGGPTSNVLRKVDVNVVSQASCSRSYPSLSNNQMCTFAQGKDACQDD
SGGPLLYQDPSNGRLYSAGIVSFGRFCASSYPGVNTRVTSYLYWILNNAPANYCNI