DPGLEAN10938 in OGS1.0

New model in OGS2.0  DPOGS210982
Genomic Position  scaffold262:- 83507-88685
CDS Length  1356
Paired RNAseq reads  272
Single RNAseq reads  788
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA006407 (2e-38)
Best Drosophila hit  CG3355, isoform A (3e-40)
Best Human hit  suppressor of tumorigenicity 14 protein (2e-36)
Best NR hit (blastp)  serine protease like protein [Cephonodes hylas] (1e-108)
Best NR hit (blastx)  serine protease like protein [Cephonodes hylas] (3e-108)
GeneOntology terms
  GO:0004252 serine-type endopeptidase activity
  GO:0006508 proteolysis
InterPro families
  IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site
  IPR009003 Peptidase cysteine/serine, trypsin-like
  IPR001314 Peptidase S1A, chymotrypsin-type
  IPR001254 Peptidase S1/S6, chymotrypsin/Hap
Orthology group  MCL20484

Nucleotide sequence:

ATGTTGGGATCGAAACTACATTGTGGAGGTGCAATTATTACCGACCAACACGTTTTAAGT
GCTGGGCACTGTATCACTTTTGGTGTTAATTTTAAAGATCTAACCGTCTACATAGGAATG
CATGATCGTTTGGGGAGCACTCATACCGTCTCTAGACTGAAGAATGGTGTTAAGCATCCC
AGCTTCACTTCAAATGCCGTTCGAGACATCAATGACATTGCGATTTTAACACTCGACAAA
AAGCTTCAATTTTCAGATAAAGTTCGTCCAATATGCTTACCAAGTGAAGGAATGGATTTT
AAAAATGTACCACTAACTGTAGCCGGATGGGGAAAAACTAGACAAGGAGCTCTAACATCA
TCGAGATATTTATTAGAAACTAAGGTTAAAATTGTTCCTAGTAATACGTGTTCCAAGTCG
TCTATATACAAAGATAATCTTGTCACCGATTCCATGATGTGTGCTTATAGTCTTGGAAAA
GACGCTTGTCAGGGTGATAGCGGGGGACCAATTTTCGCTACACATGCACGAACACATAAC
AAGAAATGGTACCAAGTTGGTATCGTCTCTTGGGGTATAGATTGCGCTATGCCGGACTAT
CCTGAATGCGGAACACCATCAGACAAAATAATATCAATGAGAATAGTGGGTGGTAGAAGA
GCTGAGCCTCACTCGTTTCCTTGGACTGTGGCTATCGTGAAGAATGATCGAATGCATTGC
GGTGGTGCCATAATAACAGACCGGCATGTCCTCAGCGCTGGTCATTGTTTTAAATGGGAT
GATAGAAAGCAAATGAAAGTTTATATAGGTCTCGACGATTTGGAAGACATGAATAATGTT
GAAGTTAGGAACATCTCAAATGTGGTCATTCACGAACAGTTCACATCGACCGCTGTTCGA
GACGAAAATGATATAGCAATCGCTACTTTAAACAAACCAGTTACGTTCAGTGACACAATC
GTACCAATATGTTTGCCTTCTCCGGGACAAAAATTTGATGGTAGATCAGGTACTATAGTA
GGATGGGGTCGTCTTGGAACTGATAAAACATCTTCGAAGGTTCTAATGAAAGCCAGTCTT
CGAATTCTCAGTGACGAGGAATGTTTTAAATCCAAATTGGCCAGCCATATAAAGCCAATG
ATGATGTGTGCTTTCACTAAAGGAAAAGACGGTTGTCAGGGCGACAGTGGTGGACCACTT
TTGACGTTTGAATCCGACGGAAGATACGTTCAAGCAGGAATTGTGTCGTGGGGTATTGGA
TGTGCAAACCCAAATTACCCAGGTGTGTACACTAAAGTGAGCAACTACAATGACTGGATC
GAAAAGAATACAGCAAATGGAAAAACATGTGATTAA

Protein sequence:

MLGSKLHCGGAIITDQHVLSAGHCITFGVNFKDLTVYIGMHDRLGSTHTVSRLKNGVKHP
SFTSNAVRDINDIAILTLDKKLQFSDKVRPICLPSEGMDFKNVPLTVAGWGKTRQGALTS
SRYLLETKVKIVPSNTCSKSSIYKDNLVTDSMMCAYSLGKDACQGDSGGPIFATHARTHN
KKWYQVGIVSWGIDCAMPDYPECGTPSDKIISMRIVGGRRAEPHSFPWTVAIVKNDRMHC
GGAIITDRHVLSAGHCFKWDDRKQMKVYIGLDDLEDMNNVEVRNISNVVIHEQFTSTAVR
DENDIAIATLNKPVTFSDTIVPICLPSPGQKFDGRSGTIVGWGRLGTDKTSSKVLMKASL
RILSDEECFKSKLASHIKPMMMCAFTKGKDGCQGDSGGPLLTFESDGRYVQAGIVSWGIG
CANPNYPGVYTKVSNYNDWIEKNTANGKTCD
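
Consistency check (illustrative sketch):

The CDS length of 1356 and the protein sequence above can be checked against each other: 1356 nucleotides are 452 codons, that is 451 residues plus one stop codon. The Python sketch below is not part of the record or of any database tooling. It assumes the standard genetic code, and the names translate and expected_protein_length are hypothetical helpers introduced only for this illustration. It translates the first and last lines of the nucleotide sequence and compares them with the start and end of the protein sequence.

```python
# Standard genetic code (assumed), codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS that ends with exactly one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# First and last lines of the nucleotide sequence, copied from the record.
first_line = "ATGTTGGGATCGAAACTACATTGTGGAGGTGCAATTATTACCGACCAACACGTTTTAAGT"
last_line = "GAAAAGAATACAGCAAATGGAAAAACATGTGATTAA"

print(expected_protein_length(1356))  # 451
print(translate(first_line))          # MLGSKLHCGGAIITDQHVLS
print(translate(last_line))           # EKNTANGKTCD*
```

Both translated fragments agree with the protein sequence, which begins MLGSKLHCGGAIITDQHVLS and ends EKNTANGKTCD, and the protein sequence is 451 residues long.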