DPGLEAN21182 in OGS1.0

New model in OGS2.0  DPOGS208138
Genomic Position  scaffold1489:- 31114-40523
CDS Length  4812
Paired RNAseq reads  85
Single RNAseq reads  191
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA014089 (0.0)
Best Drosophila hit  nudel (2e-94)
Best Human hit  prostasin preproprotein (6e-41)
Best NR hit (blastp)  ovarian serine protease [Bombyx mori] (0.0)
Best NR hit (blastx)  ovarian serine protease [Bombyx mori] (0.0)
GeneOntology terms
GO:0009950 dorsal/ventral axis specification
GO:0016485 protein processing
GO:0008063 Toll signaling pathway
GO:0007313 maternal specification of dorsal/ventral axis, oocyte, soma encoded
GO:0007306 eggshell chorion assembly
GO:0008236 serine-type peptidase activity
GO:0007310 oocyte dorsal/ventral axis specification
GO:0004252 serine-type endopeptidase activity
GO:0005576 extracellular region
GO:0006508 proteolysis
GO:0008233 peptidase activity
GO:0007343 egg activation
InterPro families
IPR009003 Peptidase cysteine/serine, trypsin-like
IPR002172 Low-density lipoprotein (LDL) receptor class A repeat
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
IPR015420 Peptidase S1A, nudel
IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site
IPR023415 Low-density lipoprotein (LDL) receptor class A, conserved site
IPR001190 Speract/scavenger receptor
Orthology group  MCL16943

Nucleotide sequence:

ATGGGATGTAATGGTTGTAATGACAACACATTCAGTTGTGAGGATCTAAATGTTAACTCA
AAGAACACATGTTTTTCCAAGGAGCAACGTTGTGACAACTTTGCGGATTGTCCAAATCAG
AAAGACGAAATTGACTGTAGCTTGTTAGCACCGAGCTTACACAAAAAACCCTTGTTTGCC
ATTTCAAATACGGAAGGCTTTCTGCACAGAAATTTTAAAGGAAACTGGTATGCTGTCTGT
AGTAATCCTTACATGTGGGCACATGATGTGTGTCGTCGGGAAACAGGGCTTATAATAAGG
CCTCCTTATATTCAAGTTGTGCAAATAGATCCCTTAATAAAGGTTAAGTATATAAACACG
GCGCTGGGAGGATCAATACACACTACAAATACTTGCGGGAACAATTCCGCAGTGTACGTC
ACGTGTCCTGATTTATTGTGTGGAACCCGAGTGCTATCTTCCTCAGAATTTTTAAGACAA
AACGCTAATATGGAAGACAACCTGTTTGGCCGGAATAAAAGATTCCTTTTCCAAGAATCA
TACCCTGGGATATATTACGGTGACCGAAAAAAGAGATATACAATAAACAACGCCTGGCAA
TCCCAACCATTCCATTATTTGAGAAAAGATTTGGTTAATGATGTAAGGAACAAAAGATCA
GACAGTAGAGTAGTGGGAGGAAAACCTAGTCAACCAGCTGCCTGGCCGTGGGTAGTAGCA
CTTTATAGAGATGGAATGTTTCATTGTGGAGGCGTTATTGTTAACCAAAATTGGATAATG
TCTGCGGCACACTGCGTTAACAAATTTTGGGAACATTACTATGAAGTACAAGTCGGTATG
CTCCGTCGGTTTTCATTCTCACCTCAAGAGCAGAACCACCGTGTTACTCACGTAATAGTG
AATCAAAATTACAATCAGGAAGATATGAAGAATGACTTATCTTTATTGAGAGTTAAACCT
GGCATTCAGTTTAGTCGCTGGGTACGACCTATTTGCTTACCTGGACCTGAAGTGGCTGGT
GCTGACTGGATGTGGGGACCTCCTGCTGGTACGACTTGTACAGCTGTAGGCTGGGGAGCA
ACTGTAGAACGTGGCCCTGATCCGGACCATATGCGTGAGGTAGAAGTTCCTGTATGGGAG
CACTGCAAACATGAGGAAGATCAAAGTGGCAGTGAAATGTGCGCAGGTCTTGCTGAGGGT
GGTAGAGATTCTTGTCAAGGAGATAGCGGAGGACCACTTCTATGCACTAATCCTGCCAAT
CCGCAGCAATGGTATGTAGCAGGTATTGTGAGTCATGGCGATGGTTGTGCACGAAAAGGT
GAACCAGGGGTTTATACAAGAGTCAGCGTTTTTGTTTCTTGGATACGATACCACATTGCA
TCAAAAGCGTTACCGATAATTCAGCCTAAACAAGAATGTCCGGGATTTAGATGTGATTCT
GGGATTTCAAAGTGCTTGCCAAAAAAGAGGATGTGTGATAAAATAATAGATTGTTTAGAT
GGCGAAGATGAACTAAATTGTGAAATAGTGAGATCAGCAGATATTTTTCCAAATAATTTA
TTTCTCAATCCATTGGCTAAAGTCGCAAATATAACAAACAACCAAGAAACTATCAACATA
AGTGATAATGAAAAAAACAATAATCTTCCAAGTAATAATATTATCTTAGTCAATGATACC
AATATGAATACGAAATTAAACTCATCTAATTTAAATGAATTTGAAACTACAACATTTTTA
TCGAATAAAGATATAATAACTGATACAAAAGTATATGACCAATCTACTTTGTTTGTAACC
ACGGAGAAATCTATCATCAAAAATGATGATAAACTTACAAATATTATACCACTCCCAACT
GAAGTAAGTTTAGAACAAAGTTCCCTTTACGATGATATAAAACACGCTTCAACAATAGAA
ATGCCATCTCCTGATTATTCAGGAGAATCCATAGATGATCTGGATCCAAATGAATCAATT
ACATTAGATTCAATTATTACAACGACTACAAGTAAATATTTTGATGATATAAATATAAAC
CTAAACTCTAGATACGCAATAGAGTCTATGTCCTCGATTTTAGATAACCCTCAAAAAGAA
GAGAAAATATTCACAAATATTAAGGTGACAGAGAAAACTTTGGTAACATCACCACTTGGC
ATTTCTAGATTAGATTCGGATATTGATTCTACAACAGTAACTATAAAATCAAGTGATTTT
AATACATCCGACGAAAATAATTTAAATAACAATATATTCAATAATACTATTGATCAAAAG
GATAACTTAAACTTAGCAGATAATAAATATAATAAGACTATTCCAACGACAAATGATTAT
GTGCCAACTTATAATGGTAGCAATAATTTTAATGAAAGTACTACTGAACCAAATATAGAC
GACCAAATAAATTCGGAAACGAATCATGAACAAAACTATTTAACACCCATTAGGCCGATG
GATACTACTTACACTAATAGCGATAAAAATGAAGACACGTTTCTGACTGAACTCCAATCA
GCAAAAAAGAAAAAATATATACCAACGCCTACTGAGTTTCAATGCAGGCGCATTTATCAA
ATCGTCCCCCACACGACTCGTTGTGATCATAAAGCAGACTGTGAAGATGGTTCAGATGAA
CAGGATTGTACGTGTGTTGACTACCTAACAACTTTTGATAATAGACTGTTATGTGATGGA
CACTTCGATTGTGCCGATGGACAGGACGAAGTGAATTGTTATACATGTGAAGAGGATAAG
TTTCTATGTAAACTAAGTGAAATGTGTCTCGATTCAAAGTACGTTTGTGATGGTATACCA
CAATGTCCCTCAGGCGAAGACGAAATGGACTGCTTTGCTCTTACAAACGGCAATCATATT
GAACGTGATATACACGGCAGACCAGAGGCAAAATTGGAGGGTTACTTGACTAAAAAGTAT
CAAAACAGCTGGCATGTTGTGTGTGAAGACAACATGTCGGTTTCAGAACAAGAAGAAGCT
GCTACACATATATGCCGCTATTTGGGATTTAGCTCAGCAAATAAATATGTTATCAAATAT
ATCAATGTGAAACAAAAACTTCATCATATGAAAGATAAAAGGTCGATACGAAATATCGAT
TTAAGGATGCCTGTTCACTTCAGCTATAGAACAGCTAGTGACAACAATGATTCCACGCAT
GTAGTCATAAATGAACCTCAAATAATTAAAGAGGAATGTGTTCCTAATATAACGAAAACC
TGCATGTCGCTTTATGTTTTTTGCGATCATTCCTTGTACACTCATTTTGATAGCATTGAT
GAAGTGAACATCAAGAACGAAATAAAGAAGATGTCTGATCAAATGTGGCCATGGATTGCG
AAATTATATGTGGACGGAAAATATAAATGCACTGGAGTTTTAGTTGATTTGTCTTGGGTT
CTAATAAATCACGTATGCCTACCGAGTTCTGATTATCACTATGTAACAGTTATACTTGGT
TCTCACAAAACTCTTAAATCAACTGTTGGACCTTATGAGCAAGTGTATCGAGTTGATGCA
AAGAAACATTTATATCAAAGGAAAGTTATGCTTCTGCATCTCAACGAACCCGCTGTATAC
ACATCTATGGTGAAGCCGATGGTAGTGACGTCTCTATATTCCGATGATGCTGATAATACG
ATATGCGTAGCAGTTGGCCAGGATAGGAATAATAAAATGTCAAGCGTTTTTCTAAAAGAA
ACTGATAAATGCAATTCCCACAATCGATGTTTCGATCTTTTAGTCAATTCTAGCTATTGT
AACTTTGAAGATGCAAAATGGGCCGGTATAATAAGTTGTCACAACAAACGTGGATGGTAT
CCCGCAGCGTCGTTTGTTAAAGACATGGGAATATGTAAAAATACTGATGGCATAAATGGA
ACAGACATTGGAAATTTAAAAATTGATATAAAATATTTCGAAGATAAACCATTACCTCTT
TCCGATGGGCATTTGTTTACAAATTGCGAAGGAGTCCGGTGTCAAAGAGGGCATTGTGTG
GGGTTACAAGATGTATGTAATGGGGTCACGAATTGTGAAGATTCTTCGGATGAATCTAAA
GAATCATGTCGGAAAAAACATGATGTTTGTACACAAAATCCATTTTATCGTGGATGTGAA
TGTCCGGTTGGTCAGTTAAAATGTCATAATGGTCAATGTATACCCAAAGAATTGTTCAAG
GACGGCCGCAATGATTGCGGAGATGGCACCGACGAGCCTGGTCAAACTTTATGTTCAGAT
TACTTGAGGAGGGTTATGCCTTCAAGACTTTGTGACGGAATTCTTCACTGTCACGACAGG
AGTGACGAAGATCCCACATTCTGTAAATGTTTCGCAAAAAAGGCGTACAAGTGCACAGGA
ATGTCGATTGACGAGGACTACTGCGTAGCAACTGACATGGTTTGTGACGGTGTACTTGAT
TGTCCAAATGGAGATGATGAGCGAACCTGTATAGGTTTGAGCTCGGCTCAGGGAACACCG
CACGGCATTGGCGAAGTAATAATACGCTCCCACGGCGTGTGGTATTCGAAATGCTATACC
AAACAAAACCATACGAAATCAGAACTAGAAGCTATTTGTAGAGAGTTAGGTTTCATTGGC
GGACACGCAAAACAACTGCCAGATCCTAAAGGAATACCAAATCCCTACAACAATATTGTT
ATCGACATGTTTTCTGATGTAATGCTAAATAATAACACAATAATAAAATTGAGAAACACA
CCGAATCCTATCGCCCGCGCTGTGACTCAAGATATAAAAGAGTGTTATCCAGTTTTCATA
GAATGTCTCTAG

Protein sequence:

MGCNGCNDNTFSCEDLNVNSKNTCFSKEQRCDNFADCPNQKDEIDCSLLAPSLHKKPLFA
ISNTEGFLHRNFKGNWYAVCSNPYMWAHDVCRRETGLIIRPPYIQVVQIDPLIKVKYINT
ALGGSIHTTNTCGNNSAVYVTCPDLLCGTRVLSSSEFLRQNANMEDNLFGRNKRFLFQES
YPGIYYGDRKKRYTINNAWQSQPFHYLRKDLVNDVRNKRSDSRVVGGKPSQPAAWPWVVA
LYRDGMFHCGGVIVNQNWIMSAAHCVNKFWEHYYEVQVGMLRRFSFSPQEQNHRVTHVIV
NQNYNQEDMKNDLSLLRVKPGIQFSRWVRPICLPGPEVAGADWMWGPPAGTTCTAVGWGA
TVERGPDPDHMREVEVPVWEHCKHEEDQSGSEMCAGLAEGGRDSCQGDSGGPLLCTNPAN
PQQWYVAGIVSHGDGCARKGEPGVYTRVSVFVSWIRYHIASKALPIIQPKQECPGFRCDS
GISKCLPKKRMCDKIIDCLDGEDELNCEIVRSADIFPNNLFLNPLAKVANITNNQETINI
SDNEKNNNLPSNNIILVNDTNMNTKLNSSNLNEFETTTFLSNKDIITDTKVYDQSTLFVT
TEKSIIKNDDKLTNIIPLPTEVSLEQSSLYDDIKHASTIEMPSPDYSGESIDDLDPNESI
TLDSIITTTTSKYFDDININLNSRYAIESMSSILDNPQKEEKIFTNIKVTEKTLVTSPLG
ISRLDSDIDSTTVTIKSSDFNTSDENNLNNNIFNNTIDQKDNLNLADNKYNKTIPTTNDY
VPTYNGSNNFNESTTEPNIDDQINSETNHEQNYLTPIRPMDTTYTNSDKNEDTFLTELQS
AKKKKYIPTPTEFQCRRIYQIVPHTTRCDHKADCEDGSDEQDCTCVDYLTTFDNRLLCDG
HFDCADGQDEVNCYTCEEDKFLCKLSEMCLDSKYVCDGIPQCPSGEDEMDCFALTNGNHI
ERDIHGRPEAKLEGYLTKKYQNSWHVVCEDNMSVSEQEEAATHICRYLGFSSANKYVIKY
INVKQKLHHMKDKRSIRNIDLRMPVHFSYRTASDNNDSTHVVINEPQIIKEECVPNITKT
CMSLYVFCDHSLYTHFDSIDEVNIKNEIKKMSDQMWPWIAKLYVDGKYKCTGVLVDLSWV
LINHVCLPSSDYHYVTVILGSHKTLKSTVGPYEQVYRVDAKKHLYQRKVMLLHLNEPAVY
TSMVKPMVVTSLYSDDADNTICVAVGQDRNNKMSSVFLKETDKCNSHNRCFDLLVNSSYC
NFEDAKWAGIISCHNKRGWYPAASFVKDMGICKNTDGINGTDIGNLKIDIKYFEDKPLPL
SDGHLFTNCEGVRCQRGHCVGLQDVCNGVTNCEDSSDESKESCRKKHDVCTQNPFYRGCE
CPVGQLKCHNGQCIPKELFKDGRNDCGDGTDEPGQTLCSDYLRRVMPSRLCDGILHCHDR
SDEDPTFCKCFAKKAYKCTGMSIDEDYCVATDMVCDGVLDCPNGDDERTCIGLSSAQGTP
HGIGEVIIRSHGVWYSKCYTKQNHTKSELEAICRELGFIGGHAKQLPDPKGIPNPYNNIV
IDMFSDVMLNNNTIIKLRNTPNPIARAVTQDIKECYPVFIECL
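
The record's numeric fields can be cross-checked against each other. The sketch below is only an illustration: the helper names `parse_position` and `expected_protein_length` are hypothetical, the reading of the position field as "scaffold:strand start-end" with inclusive coordinates is an assumption, and the length check assumes a complete CDS whose last codon is a stop codon (the nucleotide sequence above ends in TAG).

```python
def parse_position(field):
    """Split a 'scaffold:strand start-end' field into its parts.

    The field layout is an assumption based on how this record prints it.
    """
    scaffold, rest = field.split(":", 1)
    strand, span = rest.split()
    start, end = (int(x) for x in span.split("-"))
    return scaffold, strand, start, end


def expected_protein_length(cds_length):
    """Residues encoded by a complete CDS that ends in a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length // 3 - 1


scaffold, strand, start, end = parse_position("scaffold1489:- 31114-40523")
genomic_span = end - start + 1          # inclusive coordinates assumed
protein_length = expected_protein_length(4812)

print(scaffold, strand, genomic_span, protein_length)
```

With the values in this record, the genomic span (9410 bp) is longer than the CDS (4812 bp), which is what one would expect if the gene model contains introns, and the expected protein length (1603 residues) matches the length of the protein sequence listed above.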