DPGLEAN04446 in OGS1.0

New model in OGS2.0DPOGS215188 
Genomic Positionscaffold895:- 1007-5346
See gene structure
CDS Length1152
Paired RNAseq reads  428
Single RNAseq reads  988
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008668 (7e-76)
Best Drosophila hit  CG32260 (7e-44)
Best Human hitcoagulation factor X preproprotein (1e-26)
Best NR hit (blastp)  hemolymph proteinase 17 [Manduca sexta] (9e-99)
Best NR hit (blastx)  hemolymph proteinase 17 short form [Manduca sexta] (2e-92)
GeneOntology terms
  
GO:0004252 serine-type endopeptidase activity
GO:0006508 proteolysis
InterPro families




  
IPR006604 Disulphide knot CLIP
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
IPR022700 Proteinase, regulatory CLIP domain
IPR009003 Peptidase cysteine/serine, trypsin-like
IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site
IPR001314 Peptidase S1A, chymotrypsin-type
Orthology groupMCL10588

Nucleotide sequence:

ATGGTTCTCCGAGTAGTATTTCTGTGTTTGTGTTTCCAGACAGCGCTCTCTCGTATCGAA
TATAGAAGAGAGGAGTCATGCAAGGAAGTGAATGGTGTTACTGGAAGATGTGTCTCCATA
GAATCCTGTCCCCCGTTTGTCCTGATGATGCAGACGGAACTTATTACTCAATACAAAACA
CTTCTAAAGCAATCACACTGTGGGTTTGAGGGAAACGTCCCCATGGTTTGCTGTCCCGAT
ACTTCTCCTGAATCTCCATCCTCTCTCGGTCCCTCATCTCCTGTCGACGTTCCATCTAAG
GGGTCGGCTCGAATGATGAACCTTCAATCACCATTTCTGTCACCACCAACCTGTGGAGTG
TCGAACGCTTCATCTGGCCGGGTTGTGGGAGGAGTTGACGCTAAGCTCGGAGACTTGCCC
TGGATGTGCCTTTTGGGGTACTGGGAGGGTGGCTATGATAAAGGCGGATCAAACGGGGAC
ACCAAGTGGAGATGCGGGGGATCGCTGGTGTCCGCACAACACGTGCTCACAGCCGCTCAC
TGTATTCATCACAGAGAGAAAGAACTATACGTGGTCCGTCTCGGAGAGTTGGATCTCGAT
CGTGATGATGAAGCGGCTCCAATCGACGTCCTCATTAGAAGAGCAATAAAACATGAAGCA
TATAACAGGGACACGTACACTAACGACATAGGACTCCTCGTACTCGAAAGAGGTGTCGAG
TTCACAAACCTGATACGGCCTATTTGTCTTCCGATCCTTCCTGAATTACTGTCTAACACG
TTTGTCAACTACAGTCCGTTCGTTGCTGGCTGGGGCAGAACGTCAGATCGAGGTCCCGGT
TCGAGCCATCTCAAACTGACTCAATTGCAAGTAGTCGATAACCAAAAGTGTAAGAAAACG
TACCTGGAGTACCCCGCCGTGATTGATGATAAGGTCTTGTGTGCTGAAGCGGGAGGACGC
GACGCCTGCGAAGGGGACAGCGGGGGACCCCTTATACAACCATTTTATAATCAGGATAAG
AAAGTGTATTACTTCTACCAGACAGGTGTTGTAGCGTACGGAAGACGTTGTGCTGAAGCC
GGTTACCCCGGGGTATATTCCAGGGTAACTCACTACATACTCTGGATACAGAAGCACATC
ATGGAGAACTAA

Protein sequence:

MVLRVVFLCLCFQTALSRIEYRREESCKEVNGVTGRCVSIESCPPFVLMMQTELITQYKT
LLKQSHCGFEGNVPMVCCPDTSPESPSSLGPSSPVDVPSKGSARMMNLQSPFLSPPTCGV
SNASSGRVVGGVDAKLGDLPWMCLLGYWEGGYDKGGSNGDTKWRCGGSLVSAQHVLTAAH
CIHHREKELYVVRLGELDLDRDDEAAPIDVLIRRAIKHEAYNRDTYTNDIGLLVLERGVE
FTNLIRPICLPILPELLSNTFVNYSPFVAGWGRTSDRGPGSSHLKLTQLQVVDNQKCKKT
YLEYPAVIDDKVLCAEAGGRDACEGDSGGPLIQPFYNQDKKVYYFYQTGVVAYGRRCAEA
GYPGVYSRVTHYILWIQKHIMEN