New model in OGS2.0 | DPOGS208138  |
---|---|
Genomic Position | scaffold1489:- 31114-40523 |
See gene structure | |
CDS Length | 4812 |
Paired RNAseq reads   | 85 |
Single RNAseq reads   | 191 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA014089 (0.0) |
Best Drosophila hit   | nudel (2e-94) |
Best Human hit | prostasin preproprotein (6e-41) |
Best NR hit (blastp)   | ovarian serine protease [Bombyx mori] (0.0) |
Best NR hit (blastx)   | ovarian serine protease [Bombyx mori] (0.0) |
GeneOntology terms    | GO:0009950 dorsal/ventral axis specification GO:0016485 protein processing GO:0008063 Toll signaling pathway GO:0007313 maternal specification of dorsal/ventral axis, oocyte, soma encoded GO:0007306 eggshell chorion assembly GO:0008236 serine-type peptidase activity GO:0007310 oocyte dorsal/ventral axis specification GO:0004252 serine-type endopeptidase activity GO:0005576 extracellular region GO:0006508 proteolysis GO:0008233 peptidase activity GO:0007343 egg activation |
InterPro families    | IPR009003 Peptidase cysteine/serine, trypsin-like IPR002172 Low-density lipoprotein (LDL) receptor class A repeat IPR001254 Peptidase S1/S6, chymotrypsin/Hap IPR015420 Peptidase S1A, nudel IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site IPR023415 Low-density lipoprotein (LDL) receptor class A, conserved site IPR001190 Speract/scavenger receptor |
Orthology group | MCL16943 |
Nucleotide sequence:
ATGGGATGTAATGGTTGTAATGACAACACATTCAGTTGTGAGGATCTAAATGTTAACTCA
AAGAACACATGTTTTTCCAAGGAGCAACGTTGTGACAACTTTGCGGATTGTCCAAATCAG
AAAGACGAAATTGACTGTAGCTTGTTAGCACCGAGCTTACACAAAAAACCCTTGTTTGCC
ATTTCAAATACGGAAGGCTTTCTGCACAGAAATTTTAAAGGAAACTGGTATGCTGTCTGT
AGTAATCCTTACATGTGGGCACATGATGTGTGTCGTCGGGAAACAGGGCTTATAATAAGG
CCTCCTTATATTCAAGTTGTGCAAATAGATCCCTTAATAAAGGTTAAGTATATAAACACG
GCGCTGGGAGGATCAATACACACTACAAATACTTGCGGGAACAATTCCGCAGTGTACGTC
ACGTGTCCTGATTTATTGTGTGGAACCCGAGTGCTATCTTCCTCAGAATTTTTAAGACAA
AACGCTAATATGGAAGACAACCTGTTTGGCCGGAATAAAAGATTCCTTTTCCAAGAATCA
TACCCTGGGATATATTACGGTGACCGAAAAAAGAGATATACAATAAACAACGCCTGGCAA
TCCCAACCATTCCATTATTTGAGAAAAGATTTGGTTAATGATGTAAGGAACAAAAGATCA
GACAGTAGAGTAGTGGGAGGAAAACCTAGTCAACCAGCTGCCTGGCCGTGGGTAGTAGCA
CTTTATAGAGATGGAATGTTTCATTGTGGAGGCGTTATTGTTAACCAAAATTGGATAATG
TCTGCGGCACACTGCGTTAACAAATTTTGGGAACATTACTATGAAGTACAAGTCGGTATG
CTCCGTCGGTTTTCATTCTCACCTCAAGAGCAGAACCACCGTGTTACTCACGTAATAGTG
AATCAAAATTACAATCAGGAAGATATGAAGAATGACTTATCTTTATTGAGAGTTAAACCT
GGCATTCAGTTTAGTCGCTGGGTACGACCTATTTGCTTACCTGGACCTGAAGTGGCTGGT
GCTGACTGGATGTGGGGACCTCCTGCTGGTACGACTTGTACAGCTGTAGGCTGGGGAGCA
ACTGTAGAACGTGGCCCTGATCCGGACCATATGCGTGAGGTAGAAGTTCCTGTATGGGAG
CACTGCAAACATGAGGAAGATCAAAGTGGCAGTGAAATGTGCGCAGGTCTTGCTGAGGGT
GGTAGAGATTCTTGTCAAGGAGATAGCGGAGGACCACTTCTATGCACTAATCCTGCCAAT
CCGCAGCAATGGTATGTAGCAGGTATTGTGAGTCATGGCGATGGTTGTGCACGAAAAGGT
GAACCAGGGGTTTATACAAGAGTCAGCGTTTTTGTTTCTTGGATACGATACCACATTGCA
TCAAAAGCGTTACCGATAATTCAGCCTAAACAAGAATGTCCGGGATTTAGATGTGATTCT
GGGATTTCAAAGTGCTTGCCAAAAAAGAGGATGTGTGATAAAATAATAGATTGTTTAGAT
GGCGAAGATGAACTAAATTGTGAAATAGTGAGATCAGCAGATATTTTTCCAAATAATTTA
TTTCTCAATCCATTGGCTAAAGTCGCAAATATAACAAACAACCAAGAAACTATCAACATA
AGTGATAATGAAAAAAACAATAATCTTCCAAGTAATAATATTATCTTAGTCAATGATACC
AATATGAATACGAAATTAAACTCATCTAATTTAAATGAATTTGAAACTACAACATTTTTA
TCGAATAAAGATATAATAACTGATACAAAAGTATATGACCAATCTACTTTGTTTGTAACC
ACGGAGAAATCTATCATCAAAAATGATGATAAACTTACAAATATTATACCACTCCCAACT
GAAGTAAGTTTAGAACAAAGTTCCCTTTACGATGATATAAAACACGCTTCAACAATAGAA
ATGCCATCTCCTGATTATTCAGGAGAATCCATAGATGATCTGGATCCAAATGAATCAATT
ACATTAGATTCAATTATTACAACGACTACAAGTAAATATTTTGATGATATAAATATAAAC
CTAAACTCTAGATACGCAATAGAGTCTATGTCCTCGATTTTAGATAACCCTCAAAAAGAA
GAGAAAATATTCACAAATATTAAGGTGACAGAGAAAACTTTGGTAACATCACCACTTGGC
ATTTCTAGATTAGATTCGGATATTGATTCTACAACAGTAACTATAAAATCAAGTGATTTT
AATACATCCGACGAAAATAATTTAAATAACAATATATTCAATAATACTATTGATCAAAAG
GATAACTTAAACTTAGCAGATAATAAATATAATAAGACTATTCCAACGACAAATGATTAT
GTGCCAACTTATAATGGTAGCAATAATTTTAATGAAAGTACTACTGAACCAAATATAGAC
GACCAAATAAATTCGGAAACGAATCATGAACAAAACTATTTAACACCCATTAGGCCGATG
GATACTACTTACACTAATAGCGATAAAAATGAAGACACGTTTCTGACTGAACTCCAATCA
GCAAAAAAGAAAAAATATATACCAACGCCTACTGAGTTTCAATGCAGGCGCATTTATCAA
ATCGTCCCCCACACGACTCGTTGTGATCATAAAGCAGACTGTGAAGATGGTTCAGATGAA
CAGGATTGTACGTGTGTTGACTACCTAACAACTTTTGATAATAGACTGTTATGTGATGGA
CACTTCGATTGTGCCGATGGACAGGACGAAGTGAATTGTTATACATGTGAAGAGGATAAG
TTTCTATGTAAACTAAGTGAAATGTGTCTCGATTCAAAGTACGTTTGTGATGGTATACCA
CAATGTCCCTCAGGCGAAGACGAAATGGACTGCTTTGCTCTTACAAACGGCAATCATATT
GAACGTGATATACACGGCAGACCAGAGGCAAAATTGGAGGGTTACTTGACTAAAAAGTAT
CAAAACAGCTGGCATGTTGTGTGTGAAGACAACATGTCGGTTTCAGAACAAGAAGAAGCT
GCTACACATATATGCCGCTATTTGGGATTTAGCTCAGCAAATAAATATGTTATCAAATAT
ATCAATGTGAAACAAAAACTTCATCATATGAAAGATAAAAGGTCGATACGAAATATCGAT
TTAAGGATGCCTGTTCACTTCAGCTATAGAACAGCTAGTGACAACAATGATTCCACGCAT
GTAGTCATAAATGAACCTCAAATAATTAAAGAGGAATGTGTTCCTAATATAACGAAAACC
TGCATGTCGCTTTATGTTTTTTGCGATCATTCCTTGTACACTCATTTTGATAGCATTGAT
GAAGTGAACATCAAGAACGAAATAAAGAAGATGTCTGATCAAATGTGGCCATGGATTGCG
AAATTATATGTGGACGGAAAATATAAATGCACTGGAGTTTTAGTTGATTTGTCTTGGGTT
CTAATAAATCACGTATGCCTACCGAGTTCTGATTATCACTATGTAACAGTTATACTTGGT
TCTCACAAAACTCTTAAATCAACTGTTGGACCTTATGAGCAAGTGTATCGAGTTGATGCA
AAGAAACATTTATATCAAAGGAAAGTTATGCTTCTGCATCTCAACGAACCCGCTGTATAC
ACATCTATGGTGAAGCCGATGGTAGTGACGTCTCTATATTCCGATGATGCTGATAATACG
ATATGCGTAGCAGTTGGCCAGGATAGGAATAATAAAATGTCAAGCGTTTTTCTAAAAGAA
ACTGATAAATGCAATTCCCACAATCGATGTTTCGATCTTTTAGTCAATTCTAGCTATTGT
AACTTTGAAGATGCAAAATGGGCCGGTATAATAAGTTGTCACAACAAACGTGGATGGTAT
CCCGCAGCGTCGTTTGTTAAAGACATGGGAATATGTAAAAATACTGATGGCATAAATGGA
ACAGACATTGGAAATTTAAAAATTGATATAAAATATTTCGAAGATAAACCATTACCTCTT
TCCGATGGGCATTTGTTTACAAATTGCGAAGGAGTCCGGTGTCAAAGAGGGCATTGTGTG
GGGTTACAAGATGTATGTAATGGGGTCACGAATTGTGAAGATTCTTCGGATGAATCTAAA
GAATCATGTCGGAAAAAACATGATGTTTGTACACAAAATCCATTTTATCGTGGATGTGAA
TGTCCGGTTGGTCAGTTAAAATGTCATAATGGTCAATGTATACCCAAAGAATTGTTCAAG
GACGGCCGCAATGATTGCGGAGATGGCACCGACGAGCCTGGTCAAACTTTATGTTCAGAT
TACTTGAGGAGGGTTATGCCTTCAAGACTTTGTGACGGAATTCTTCACTGTCACGACAGG
AGTGACGAAGATCCCACATTCTGTAAATGTTTCGCAAAAAAGGCGTACAAGTGCACAGGA
ATGTCGATTGACGAGGACTACTGCGTAGCAACTGACATGGTTTGTGACGGTGTACTTGAT
TGTCCAAATGGAGATGATGAGCGAACCTGTATAGGTTTGAGCTCGGCTCAGGGAACACCG
CACGGCATTGGCGAAGTAATAATACGCTCCCACGGCGTGTGGTATTCGAAATGCTATACC
AAACAAAACCATACGAAATCAGAACTAGAAGCTATTTGTAGAGAGTTAGGTTTCATTGGC
GGACACGCAAAACAACTGCCAGATCCTAAAGGAATACCAAATCCCTACAACAATATTGTT
ATCGACATGTTTTCTGATGTAATGCTAAATAATAACACAATAATAAAATTGAGAAACACA
CCGAATCCTATCGCCCGCGCTGTGACTCAAGATATAAAAGAGTGTTATCCAGTTTTCATA
GAATGTCTCTAG
Protein sequence:
MGCNGCNDNTFSCEDLNVNSKNTCFSKEQRCDNFADCPNQKDEIDCSLLAPSLHKKPLFA
ISNTEGFLHRNFKGNWYAVCSNPYMWAHDVCRRETGLIIRPPYIQVVQIDPLIKVKYINT
ALGGSIHTTNTCGNNSAVYVTCPDLLCGTRVLSSSEFLRQNANMEDNLFGRNKRFLFQES
YPGIYYGDRKKRYTINNAWQSQPFHYLRKDLVNDVRNKRSDSRVVGGKPSQPAAWPWVVA
LYRDGMFHCGGVIVNQNWIMSAAHCVNKFWEHYYEVQVGMLRRFSFSPQEQNHRVTHVIV
NQNYNQEDMKNDLSLLRVKPGIQFSRWVRPICLPGPEVAGADWMWGPPAGTTCTAVGWGA
TVERGPDPDHMREVEVPVWEHCKHEEDQSGSEMCAGLAEGGRDSCQGDSGGPLLCTNPAN
PQQWYVAGIVSHGDGCARKGEPGVYTRVSVFVSWIRYHIASKALPIIQPKQECPGFRCDS
GISKCLPKKRMCDKIIDCLDGEDELNCEIVRSADIFPNNLFLNPLAKVANITNNQETINI
SDNEKNNNLPSNNIILVNDTNMNTKLNSSNLNEFETTTFLSNKDIITDTKVYDQSTLFVT
TEKSIIKNDDKLTNIIPLPTEVSLEQSSLYDDIKHASTIEMPSPDYSGESIDDLDPNESI
TLDSIITTTTSKYFDDININLNSRYAIESMSSILDNPQKEEKIFTNIKVTEKTLVTSPLG
ISRLDSDIDSTTVTIKSSDFNTSDENNLNNNIFNNTIDQKDNLNLADNKYNKTIPTTNDY
VPTYNGSNNFNESTTEPNIDDQINSETNHEQNYLTPIRPMDTTYTNSDKNEDTFLTELQS
AKKKKYIPTPTEFQCRRIYQIVPHTTRCDHKADCEDGSDEQDCTCVDYLTTFDNRLLCDG
HFDCADGQDEVNCYTCEEDKFLCKLSEMCLDSKYVCDGIPQCPSGEDEMDCFALTNGNHI
ERDIHGRPEAKLEGYLTKKYQNSWHVVCEDNMSVSEQEEAATHICRYLGFSSANKYVIKY
INVKQKLHHMKDKRSIRNIDLRMPVHFSYRTASDNNDSTHVVINEPQIIKEECVPNITKT
CMSLYVFCDHSLYTHFDSIDEVNIKNEIKKMSDQMWPWIAKLYVDGKYKCTGVLVDLSWV
LINHVCLPSSDYHYVTVILGSHKTLKSTVGPYEQVYRVDAKKHLYQRKVMLLHLNEPAVY
TSMVKPMVVTSLYSDDADNTICVAVGQDRNNKMSSVFLKETDKCNSHNRCFDLLVNSSYC
NFEDAKWAGIISCHNKRGWYPAASFVKDMGICKNTDGINGTDIGNLKIDIKYFEDKPLPL
SDGHLFTNCEGVRCQRGHCVGLQDVCNGVTNCEDSSDESKESCRKKHDVCTQNPFYRGCE
CPVGQLKCHNGQCIPKELFKDGRNDCGDGTDEPGQTLCSDYLRRVMPSRLCDGILHCHDR
SDEDPTFCKCFAKKAYKCTGMSIDEDYCVATDMVCDGVLDCPNGDDERTCIGLSSAQGTP
HGIGEVIIRSHGVWYSKCYTKQNHTKSELEAICRELGFIGGHAKQLPDPKGIPNPYNNIV
IDMFSDVMLNNNTIIKLRNTPNPIARAVTQDIKECYPVFIECL