DPGLEAN14344 in OGS1.0

New model in OGS2.0DPOGS211237 
Genomic Positionscaffold1538:+ 15007-17950
See gene structure
CDS Length1185
Paired RNAseq reads  588
Single RNAseq reads  2389
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005173 (7e-46)
Best Drosophila hit  Spatzle-Processing enzyme (3e-47)
Best Human hittransmembrane protease serine 3 isoform 1 (3e-29)
Best NR hit (blastp)  CLIP-domain serine protease subfamily B (AGAP003252-PA) [Anopheles gambiae str. PEST] (1e-55)
Best NR hit (blastx)  PREDICTED: similar to proclotting enzyme [Tribolium castaneum] (3e-57)
GeneOntology terms









  
GO:0004252 serine-type endopeptidase activity
GO:0006508 proteolysis
GO:0050830 defense response to Gram-positive bacterium
GO:0045087 innate immune response
GO:0050829 defense response to Gram-negative bacterium
GO:0045752 positive regulation of Toll signaling pathway
GO:0050832 defense response to fungus
GO:0008063 Toll signaling pathway
GO:0031638 zymogen activation
GO:0006964 positive regulation of biosynthetic process of antibacterial peptides active against Gram-negative bacteria
GO:0006952 defense response
InterPro families




  
IPR009003 Peptidase cysteine/serine, trypsin-like
IPR006604 Disulphide knot CLIP
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site
IPR001314 Peptidase S1A, chymotrypsin-type
IPR022700 Proteinase, regulatory CLIP domain
Orthology groupND

Nucleotide sequence:

ATGGATAAGATAATTTTAGCCATCCCACTGACTATCGTTATAATTGCCACAGTTTATTCC
GATATAATTGAAGCTCCAAACCAATACTGTAAGACAAAATTTACAAATGGTACATGTGTG
AAAGTAACCGATTGCCCCTATGCGTTGACATTGATATATAAACATGATTATGATACACTA
TCAAATCTCACTTGTGGATTCAATAAGCACCAACCTCAGGTATGTTGTCCCCAACAAGAC
TTTCCTATATTGTACGATACAAAAGAGGAACCGGCTACAAATAGACCGAAACCAATGAAT
TTAAAACCGGTAGCCACAACAACTTTTAGCCCCCACATAGAAACGAAGAATGATAGCTCT
AATGTGTTACCAAACAAAACAATTTGTGGCAAAGTAAAAAATAAGGGCGTCAGTGATAGA
ATCGTTGGTGGATCTGTTGTTGAAGTTGATGAACATCCTTGGTTAGCTCGTATACAACAT
AAATTCGATGACAATACTATTTTCGGATGTTCAGCTGCACTTATCACTAATTTATATCTT
CTTACGGCAGCACATTGCGTGCAAAATCACAAAATTATTCCGTTCAGTGTTCGTTTGGGA
GAGTGGAACACCAAGACAGACATTGACTGTCGCAACAACATTTGTAATAACAGTACAGTT
GACATAAACATTAATAAAATAATTGTCCATCCAAAATATGATGGAAAATTAGGTCATAAC
AGCGACATCGCCTTGATTCGTTTAAGAGATCCCGTGAATTTTACAGATTTCATACAGCCC
ATATGTTTACCCGCTTCTAAATACATTGCCATGCAAGACTCTGTCATCAATAACGCTTAT
TGGACAGCTGGCTGGGGAGAAACAGAATATGAAGAGGAATCTGTTATAAAACGCCAAGTA
CAACTGAATTCTGTACCAATAGAAATTTGTCGAGCTCATTTCAAAGTGGCACCTGAAACT
GAGCCAAACATAATTTGCGCTGGAGGTATAAAAGGAAAAGATACATGCAATGGAGATTCA
GGAGGACCATTAGTAAAAATAGAATCAGAAAATTATGAAGAAAATTGGTACATGTTTGGA
ATAACCAGTTCGGGCTCCAAGACATGTGGCCGGGAAGGTGTACCCGGAATCTATACAAGA
GTCACCTCTTACATTGATTGGATTCTTGAAAATGTTAAAGAATGA

Protein sequence:

MDKIILAIPLTIVIIATVYSDIIEAPNQYCKTKFTNGTCVKVTDCPYALTLIYKHDYDTL
SNLTCGFNKHQPQVCCPQQDFPILYDTKEEPATNRPKPMNLKPVATTTFSPHIETKNDSS
NVLPNKTICGKVKNKGVSDRIVGGSVVEVDEHPWLARIQHKFDDNTIFGCSAALITNLYL
LTAAHCVQNHKIIPFSVRLGEWNTKTDIDCRNNICNNSTVDININKIIVHPKYDGKLGHN
SDIALIRLRDPVNFTDFIQPICLPASKYIAMQDSVINNAYWTAGWGETEYEEESVIKRQV
QLNSVPIEICRAHFKVAPETEPNIICAGGIKGKDTCNGDSGGPLVKIESENYEENWYMFG
ITSSGSKTCGREGVPGIYTRVTSYIDWILENVKE