DPGLEAN14131 in OGS1.0

New model in OGS2.0: DPOGS200027
Genomic Position: scaffold9430:- 25-8310
CDS Length: 1656
Paired RNAseq reads: 81
Single RNAseq reads: 241
Migratory profiles: Query via corresponding ESTs
Best Bombyx hit: BGIBMGA012427 (1e-26)
Best Drosophila hit: snake, isoform A (2e-40)
Best Human hit: transmembrane protease serine 11D (2e-32)
Best NR hit (blastp): hemolymph proteinase 9 [Manduca sexta] (5e-103)
Best NR hit (blastx): hemolymph proteinase 9 [Manduca sexta] (2e-103)
Gene Ontology terms:
GO:0009950 dorsal/ventral axis specification
GO:0008063 Toll signaling pathway
GO:0005576 extracellular region
GO:0016485 protein processing
GO:0007311 maternal specification of dorsal/ventral axis, oocyte, germ-line encoded
GO:0004252 serine-type endopeptidase activity
GO:0008236 serine-type peptidase activity
GO:0008233 peptidase activity
GO:0006508 proteolysis
InterPro families:
IPR009003 Peptidase cysteine/serine, trypsin-like
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site
IPR001314 Peptidase S1A, chymotrypsin-type
Orthology group: MCL15123
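
The genomic position and the CDS length in this record can be compared with a few lines of Python. This is a minimal sketch, not part of the database: the function name `parse_position` is hypothetical, and it assumes the position string reads as scaffold, strand sign, then 1-based inclusive start and end coordinates.

```python
import re

def parse_position(text):
    """Split 'scaffold:strand start-end' into its parts (layout is an assumption)."""
    m = re.fullmatch(r"\s*(\S+?):([+-])\s*(\d+)-(\d+)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand, start, end = m.groups()
    return scaffold, strand, int(start), int(end)

scaffold, strand, start, end = parse_position("scaffold9430:- 25-8310")

# Assumes 1-based inclusive coordinates, so the span includes both endpoints.
genomic_span = end - start + 1
cds_length = 1656  # value taken from the record

print(scaffold, strand, genomic_span, genomic_span - cds_length)
```

Under those assumptions the gene model spans 8286 bp on the minus strand, 6630 bp more than the 1656 nt CDS, which would be consistent with non-coding sequence (for example introns) inside the model.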

Nucleotide sequence:

ATGTTAAAGCTTATATATTTGTTTCTCGCTCTGTTTTTCGTCGTGTCCTCACGGCTGAGG
GACGTGATCAGGGTGAATCGTTTTTGCCACAGCTCTCCAGGCTGGTTGATTCTGAGCCAG
GTGGATATTGGTCAGCATCTTCTAACCTGTGAACTCCGCAATACATCGGTCCATTACATG
GGCATATGTCCTCTACTTCTTTTCTCATCCATGCCGCCAGTTAGTATTAGTTTCTCTAGA
CTCTCCGATCCACTTCTGGCCACATGCCAAAAGAAAGTGAGAGTCCATTGTGCACAGATA
GAGAACATATATTTTGAAGATCCCTGTGACTTCGTCGATCCTGTTCTGCCAGACTTCAGG
TATCCTGGGAAAAGAATTTGCGAAGTTAAATGTGATGAATACGTTTGGCAGAGAATAAAT
CGGGACAAAGCTGCAAAAAGGTTTCAGGAATGTTTGGAATATAATAGAAATCAAGGTGTC
CCCGGACCTTATGGTGCTATTGGTGGAAGAGATTCATTGTCAGGAGAATTTCCACATATG
GGAGCTATAGGGTGGAGAGCTGTTCAGGGTACTTGGGTATTTAAATGCGGCAGCACACTC
ATCAGCCCGAAGTTCACTCTTACAGCAGCCCACTGTACCAGAGCGCCGCCAGACCCAAGA
ACTGTTTCTGAAGTACCACAAATTATAAGGTTCGGAGAAAAAAACATTATTGACGTGTTA
GATCCAGTCGATGCGAACATAGTAAGATTTTTTGTCCACCCACAATATAAATCTCCTTCA
AAATACAACGATATCGCATTGATTGAAATTGATACGGAGCTAAAGTTCTCTAAAAACATT
CAGCCAGCTTGTCTATGGAGCTACATTGACACTAGTGTGCTCGGTTCAAGCGCTACTTTA
ACAGGGTGGGGAGTTATTGACACAGCTACAGGAAAGACGTCTCCAATTCTGCAAGCAGCA
GGCGTAAATGTAATCGACGACGAGCTCTGTAACAGATTGTTGAAACGATCGTGCAGTAGA
CGATGGTGCGGAGTCAAAGATCAGATCTGTGCAGGAAAACTAGAAGGAGGAGTAGATGCT
TGTCAAGGTGATTCCGGTGGACCCTTACAAATAAAAATACCTCTACCGCCATCTGATGAG
GGTTCAATGCACTATGTAATAGGTGTGACGTCATTTGGGATAGGTTGCGCTAGACCAAAT
CTGCCTGGTATATACACTAGAGTTTCCACATTTGTTGACTGGATAGAAAATATTAATACA
GATAAAATAGAGTCGGATCCTTGCGTGCCATACAATGCCACTCTGCCGAACTTCAAGAAA
TATGGACGAAGAATTAGTGAAGTCAGTCTCGCGAAACGGAGAGAAAAGGAAGGACCTGAC
GGGCTTTTCGTTTCGACTGAAGCAATTGGAGGCCGAGATGCGCTGCCAGGGGAATTCCCA
CACATGGGGGCATTAGGTTGGAAAGCTGTAGAGGGTACTTGGATATTCAAATGCGGTAGT
ACTCTTATCAGTCCAAAGTTCACTCTTACGGCAGCTCACTGCTCTAAGACACCTCCAGAC
CCCAAAACGAGTTCCCAAATTCCTCAAATTGTGAGATTTGGAGATAAAAACATAATAGAT
GTGGTAAGGACAATTACGAATATTCAGAACTTTTAA
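
The nucleotide sequence above is 1656 nt long, that is 552 codons: 551 residues plus a terminal stop codon. The sketch below shows one way to check the CDS against the protein sequence. It assumes the standard genetic code; `translate` and `CODON_TABLE` are illustrative names, and only the first and last codons of the record are used as a demonstration.

```python
from itertools import product

# Standard genetic code (an assumption for this record), with codons
# enumerated in TCAG order so they line up with the amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# First seven codons and the final line of the CDS in this record.
print(translate("ATGTTAAAGCTTATATATTTG"))
print(translate("GTGGTAAGGACAATTACGAATATTCAGAACTTTTAA"))
```

Passing the full 1656 nt sequence to `translate` should return a 551-residue string that can be compared directly with the protein sequence in this record.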

Protein sequence:

MLKLIYLFLALFFVVSSRLRDVIRVNRFCHSSPGWLILSQVDIGQHLLTCELRNTSVHYM
GICPLLLFSSMPPVSISFSRLSDPLLATCQKKVRVHCAQIENIYFEDPCDFVDPVLPDFR
YPGKRICEVKCDEYVWQRINRDKAAKRFQECLEYNRNQGVPGPYGAIGGRDSLSGEFPHM
GAIGWRAVQGTWVFKCGSTLISPKFTLTAAHCTRAPPDPRTVSEVPQIIRFGEKNIIDVL
DPVDANIVRFFVHPQYKSPSKYNDIALIEIDTELKFSKNIQPACLWSYIDTSVLGSSATL
TGWGVIDTATGKTSPILQAAGVNVIDDELCNRLLKRSCSRRWCGVKDQICAGKLEGGVDA
CQGDSGGPLQIKIPLPPSDEGSMHYVIGVTSFGIGCARPNLPGIYTRVSTFVDWIENINT
DKIESDPCVPYNATLPNFKKYGRRISEVSLAKRREKEGPDGLFVSTEAIGGRDALPGEFP
HMGALGWKAVEGTWIFKCGSTLISPKFTLTAAHCSKTPPDPKTSSQIPQIVRFGDKNIID
VVRTITNIQNF
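
The InterPro annotations in this record include a chymotrypsin-type active site (IPR018114). A simple way to look for such a signature is to scan the protein for short motifs. The sketch below is illustrative only: the motif strings `TAAHC` and `GDSGGP` are assumptions taken from the sequence context commonly seen around the catalytic histidine and serine of S1 peptidases, not values supplied by the database, and `find_motifs` is a hypothetical helper.

```python
def find_motifs(protein, motifs):
    """Return {motif: [1-based start positions]}, allowing overlapping hits."""
    hits = {}
    for motif in motifs:
        positions = []
        i = protein.find(motif)
        while i != -1:
            positions.append(i + 1)  # convert to 1-based coordinates
            i = protein.find(motif, i + 1)
        hits[motif] = positions
    return hits

# One 60-residue line of the protein sequence in this record; positions
# reported here are relative to this fragment, not to the full protein.
fragment = "CQGDSGGPLQIKIPLPPSDEGSMHYVIGVTSFGIGCARPNLPGIYTRVSTFVDWIENINT"

print(find_motifs(fragment, ["TAAHC", "GDSGGP"]))
```

Running the same function on the full 551-residue sequence gives positions in protein coordinates. A plain substring match is only a rough screen and is not a replacement for the InterPro profile search.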