DPGLEAN10937 in OGS1.0

New model in OGS2.0DPOGS210981 
Genomic Positionscaffold262:- 94988-104765
See gene structure
CDS Length1929
Paired RNAseq reads  1356
Single RNAseq reads  3126
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006406 (1e-22)
Best Drosophila hit  CG4914 (2e-85)
Best Human hittransmembrane protease serine 9 (3e-58)
Best NR hit (blastp)  serine protease-like protein [Bombyx mori] (1e-117)
Best NR hit (blastx)  serine protease-like protein [Bombyx mori] (6e-119)
GeneOntology terms
  
GO:0004252 serine-type endopeptidase activity
GO:0006508 proteolysis
InterPro families


  
IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site
IPR009003 Peptidase cysteine/serine, trypsin-like
IPR001314 Peptidase S1A, chymotrypsin-type
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
Orthology groupMCL10168

Nucleotide sequence:

ATGTTTTTGTATGTACTCTGTCTTATGCTATGCGTGTCTCTATGTTCTACGGATACATTG
ACTAAGAATTTACTGAGAGTAAGCGATGAATACTACGCTCATGGTCGAAACAACGATTTA
CCGCCGTGCCGTGATTGTAGTTGTGGAGAACGTAATGAAGAACCTAGAATCGTGGGTGGT
TCTTCCACCGACGTGAACGCATATCCTTGGACGGCTCGTCTTATCTATTATAAGTCGTTC
GGATGTGGTGCCTCGGTCATCAATGACAGATACGTTATAACGGCAGCCCATTGTGTGAAG
GGATTCATGTGGTTCTTATTCAAAGTGAAATTCGGTGAGCATGATCGTTGCGATACTGGC
CATGTGCCTGAAACTCGTACAGTAGTTAAGATGTATGTACACAACTTTACTTTGACGGAA
TTAACTAATGACATATCACTACTACAGCTCAATAGACCTTTGGAGTATACACATGCTATC
CGACCCGTTTGTCTGCCTAAAACAGCGGATAATTTGTACGTTGGCAAAATAGCTACTGTC
GCTGGCTGGGGCGCCGTCCAAGAAACTGGTAAATGGTCGTGTACGTTACTCGAGGCTCAG
TTACCGATACTGAGCAACGAGAATTGTACCAAGACGAAATATGATGTAACAAAAATTAAG
GAAGTTATGATGTGTGCTGGATATCCAGAAACCGCTCATAAAGACGCTTGCACTGGAGAT
AGCGGTGGACCGTTGTTTATGGAAAATAGTGAACACGCTTACGAATTAATTGGTATAGTA
TCTTGGGGCTACGGATGTGCTAGAAAAGGCTACCCCGGTGTTTACACGAGAGTAACCAAA
TATTTGGACTGGATACGTGATAATACACAAGACGCATATAGTTGTCTTTACAAGAGCTGG
TCCAGCCTAATAATAGAACAGCGCATAATAATGTGGTTGAGGGAGCAACGTCCTCATTCC
CGTTGCATCCTCTCTCTGCTACAGCCTGGGGCTAGTGCTGCCTTAGGTGAAAAGGCCTTT
AATGAAACAAAAGAAACAACCACTGCGGCAAGTGGTAATATTGAGAGTTCCAGTAATACT
AATAGTAGTACTACTTCTACTACTACTCCTGCTACTACATTCGATCAGGAGATGTTAGAC
GAACTATATCAAGATTCGCAAAACAGGTGTAACTGTCGTTGCGGTGAAAGAAACGAGGAA
TCTCGTATTGTGGGTGGAGTGGAAACATCAGTGAACGAGTTCCCTTGGGTCGCTCGTCTG
ACTTACTTTAACAAGTTCTACTGCGGGGGCATGCTGATAAATGATAGATATATCCTAACT
GCGGCCCATTGTGTTAAAGGATTAATGTGGTTCATGATAAAGGTAACTTTGGGAGAGCAC
AACCGTTGTAACGACTCTCGTCCTGTAACACGTTATGTAGTACAAGTTGTTGCCCACAAC
TTTACCTATCTTACATTCAGGGATGATGTTGCCGTTTTGAGATTGAACGAGCCGATCGAA
ATATCAGATACAATTAAACCAGTATGTCTGCCCCAAATTACCGATAATGATTACGTGGGG
GTAAAAGCAATTGCCGTTGGTTGGGGATCGATTGGTGAGCAGAAAAATCATTCGTGCACT
CTATTAAACGTGGAATTGCCAGTGCTTAGTAATGACGTTTGTAGAAACACTATGTATGAG
ACGAGTATGATAGCGGATGGAATGCTCTGCGCCGGTTACCCAGACGAAGGACAAAGGGAC
ACTTGCCAGGGTGACAGTGGTGGACCTCTGACTGCAGAGAGAAAGGATAAACGTTACGAA
CTGCTGGGTATAGTCTCTTGGGGTATTGGGTGTGGAAGACGTGGATATCCAGGGGTTTAC
ACGAGGGTTACAAAATACCTGAATTGGATCAGAGACAACTCCCGCCACGGATGTTTCTGT
TCAGACTAA

Protein sequence:

MFLYVLCLMLCVSLCSTDTLTKNLLRVSDEYYAHGRNNDLPPCRDCSCGERNEEPRIVGG
SSTDVNAYPWTARLIYYKSFGCGASVINDRYVITAAHCVKGFMWFLFKVKFGEHDRCDTG
HVPETRTVVKMYVHNFTLTELTNDISLLQLNRPLEYTHAIRPVCLPKTADNLYVGKIATV
AGWGAVQETGKWSCTLLEAQLPILSNENCTKTKYDVTKIKEVMMCAGYPETAHKDACTGD
SGGPLFMENSEHAYELIGIVSWGYGCARKGYPGVYTRVTKYLDWIRDNTQDAYSCLYKSW
SSLIIEQRIIMWLREQRPHSRCILSLLQPGASAALGEKAFNETKETTTAASGNIESSSNT
NSSTTSTTTPATTFDQEMLDELYQDSQNRCNCRCGERNEESRIVGGVETSVNEFPWVARL
TYFNKFYCGGMLINDRYILTAAHCVKGLMWFMIKVTLGEHNRCNDSRPVTRYVVQVVAHN
FTYLTFRDDVAVLRLNEPIEISDTIKPVCLPQITDNDYVGVKAIAVGWGSIGEQKNHSCT
LLNVELPVLSNDVCRNTMYETSMIADGMLCAGYPDEGQRDTCQGDSGGPLTAERKDKRYE
LLGIVSWGIGCGRRGYPGVYTRVTKYLNWIRDNSRHGCFCSD