DPGLEAN07950 in OGS1.0

New model in OGS2.0DPOGS214599 
Genomic Positionscaffold34:- 97965-113038
See gene structure
CDS Length6867
Paired RNAseq reads  16590
Single RNAseq reads  38864
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005129 (8e-19)
Best Drosophila hit  papilin, isoform E (0.0)
Best Human hittissue factor pathway inhibitor isoform a precursor (5e-22)
Best NR hit (blastp)  lacunin [Manduca sexta] (0.0)
Best NR hit (blastx)  lacunin [Manduca sexta] (0.0)
GeneOntology terms





  
GO:0005604 basement membrane
GO:0005578 proteinaceous extracellular matrix
GO:0030198 extracellular matrix organization
GO:0005201 extracellular matrix structural constituent
GO:0008270 zinc ion binding
GO:0004222 metalloendopeptidase activity
GO:0004867 serine-type endopeptidase inhibitor activity
InterPro families







  
IPR002223 Proteinase inhibitor I2, Kunitz metazoa
IPR008197 Whey acidic protein, 4-disulphide core
IPR003599 Immunoglobulin subtype
IPR003598 Immunoglobulin subtype 2
IPR007110 Immunoglobulin-like
IPR010909 PLAC
IPR020901 Proteinase inhibitor I2, Kunitz, conserved site
IPR013098 Immunoglobulin I-set
IPR013783 Immunoglobulin-like fold
Orthology groupMCL10728

Nucleotide sequence:

ATGTCATCGGACGACACCGGTTCTGGTTTCGAGACAACTGGGACTTACGAAACAACAGAT
ATAACTGGTTATACTGAAACAACCGAGACTGATTTAATAGAAGGGTCAGCAAGTGGTTCG
ACTGAAACAACAGATATAAGTGTAGTAACAGAAGTTAGTACTGAAAACTCAGATTCAACT
GTAACTGATGAAACAGAAGCGAGCACAACCGAAGTTTCTGATGTTACTTCGTCAGGAAGT
AGTGATTCCAGTGAAACAGACAGTACAGATTCAGTTTCAACAGAAGAAACAGAATCTACT
TCAGAATATTCCCAATCTAGTACAACAGAAAGTAGTTTCAGTAGTACAGATAATGGAATG
AGCACAACAGAAGATGAATTAAAAACAAATGAAAGTGAATTCAGTACAACCGAAAGTGGT
ATTAGTACAACGGAGAGTTCAACTGAAAGAGAATCTAGCACACAAACAACAGATTTCGTT
AGTACAATTGGAAGTTCTGAATCTAGTGAATCCAGCACAACAGAAAGTGATTCATCGAGT
ACGATGGAATCTTCATCCCTCAGCACAGTAATAAACGAATTTGAAACAAGTACAATTTTG
TCTAGCACGACCGAGGATGAATTGAGTACACTTGAGAGTGAACCTAGTACTACAGAAAAT
TATTCGAGTTCATCAGAAAGTGGGTCCAGTGTGACAGAAAGTGAGTCTAGTACAACTGAG
GGAACGGAATCTAGCACTATATCAAGTTCTGAATCTGAAAGTACAGAAGGTAGTACATTA
GAATACGATGAATCTAGCACACCAATAACTGATTTATCTAGTACAACGGAAACTGATGAA
TCAAGTACAACTGATAATGTTTTAACAAGCACGACTGAACTCTTAGAATCTAGTTCAACA
GAGAGATCAGAATCTACCTCAACAGAAAGTGTTGAAGTATCCACAACAGAGAGTGTTCAA
GTAAGTACAACAGAAAGTGATAAAATAAGCACAACAGACAGTATCGAGTCAAGTTCAACG
GAAAGTGTTGAGTCGAGCACAACAGATAATATTGAACTAAGCAGTACAGAAAGTAGCCAG
TCATCTACAACAGAAACTGATATTTCAAGTACATCGGAAAATATATTATCTAGTGTGACC
GACGTTAGCGAATCTAGTACTACGGAAAGTGAAGTGTCTAGTTCAACAGCTACTGATGAA
ACAAGTACAATGACTACTTCCGAATCCAGTACAACAGACACATCCCTATACAGTACATCA
GAGGAATTTGAAACTACCGAGACTAGTGAATTTTCATCTACGGAAGGTGTTAGTGATAGT
ACCACAGTCGTTAGCAGTACAGAATCTACAGAATCAAATCCAACAGAATCTTCTGATATA
ACCGGAACTAGTGAAAGCTCCACGGCAACAGGTAGCACTGAGTCTAGTTCAATTACTGAA
GAAACAGTTACCGAAGAATCTGTTGCACCAAAATCGACACCTTGGGATTGGGCGTCAACG
ATTGAAGTTTTCACAAAGAAACCTTGTAGGCCCAGAAAGAGAACCGCAAAATGTGTCAAA
AGTAAATTTGGATGCTGTCCGGACAAAAGAACACCTGCTGCTGGGCCATTTGACGAAGGT
TGTCCAAATCCCAAAACATGCAAAGAGTCGAAATTCGGTTGCTGCCCCGATGGTGTTTCA
CCAGCACCAGAACCGAAAGGCAAAGGCTGCCCCGTTACTCCTTGTAACGAAACACTTTAT
GGATGCTGTAAGTCTGACAACATCACTGCTGCTGAGGGTAACAACCAAGAGGGTTGCCCA
CCTCCGCCACCTGTTTGCAAGTCATCTGAATTTGGTTGCTGTGAAGATAATGAAACAATT
GCTAAAGGTCCTAATAAAGAAGGATGCTCAGAAACAGTAACTGAAAAGGTAACGCCAGTC
GTTGCAGTGGGATGTGCTTCCTCAGAGTTCGGTTGTTGCTACGACAACGAGACTGATGCC
TCTGGTCCAAATGGTGAAGGATGTCCTTGCAGTATTAGTGAATTCGGCTGCTGTCCTGAT
GGTCTCACTACAGCTGGTGGCGCAAATATGGAGGGCTGTTTAATGTCATGTAACACGTCG
GCTTACGGCTGTTGTCCTGATGGAGAAACTCCAGCACATGGACCGGATTCTGAAGGCTGC
TGTGTTCAGACTTCGTTTGGTTGCTGTCCTGACAATTACAAACCTGCTGAGGGACCGCAT
CTTGAAGGTTGTGGTTGTCAATACGCTCATTACGGTTGTTGCCCTGATAACGTCACAGTC
GCCCGGGGACCCAATATGGATGGATGTGGCTGCGCGCACTCTCAATATGGATGCTGTCCA
GATAGACATACACAAGCTCAGGGTCCTGAATTTGAAGGATGTGGTTGTCACACGTATCAA
TTTGGTTGTTGTTTGGACGGTGTTACTATCGCTTCGGGTCCTGAAATGCAGGGATGTCGC
TGTGTGGACTCCAAGTATGGATGCTGCGGGGATGAAAAAACTCACGCTAAGGGACCCAAT
AACGAGGGATGCGATTGTTCAAATAGCAAATATGGCTGTTGTCCTGATGGCATAACTGAA
GCTCAGGGTGAAAAGTTCCTTAATTGTACCGACGCTCCTATAAATAGACAAGCGGCCTGT
GCACTTGCTAATGATGGAGGGCCATGCCGCAATTACTCAGTTTATTGGTTCTACGATATG
ACCTATGGCGGATGTTCCCGATTCTGGTACGGTGGTTGTGAAGGCAACGGTAACCGATTT
CTTAGTGAGGAGGAATGCAAAGACGTGTGCGTTCAGCCATCACCAAAAGACGCTTGTAAT
CTTCCGAAAGTCAAAGGAGCATGCCAGGGTTACCACGTTCGTTGGTATTACGATTCACAA
CGTGAACAGTGCTCTCAGTTTGTATTCGGCGGATGTCTCGGCAATGCAAACAATTTCGAC
TCTAAAGAGCTCTGTCAGGAACGCTGTGAACCGGAGAAAACTGAAGATACATGTAACTTA
CCTATCGAGCGTGGTCCCTGCGCCGGTAACTTTGCCCGTTGGGGCTTCAATCCGGAAAAA
CGGCGATGCGAACAGTTCGTCTGGGGAGGCTGCGAAGGGAACGCCAATCGATTTAATTCA
GAAGCTGCCTGCCTGTTACAGTGTGATCCACCCGGAACTCCGAAACAGGCATGCTCGCAG
TTACAAGATGTAGGAAACTGCACTGAGAAGCATGCGGTGTGGTCGTTTAGTCAGACGGAG
AACCGCTGCATCCCCTTCTATTACACTGGTTGTGGTGGCAATGATAACCGCTTTGAAAGT
GAAAGCTCATGCGCCAAAAGCTGTCCCAGTGTTTATGAGCAAGAAATTTGTACTCTCCCT
GCTTTGACCGGTGAGTGCGCGGACTATACACAAAGGTGGTTCTTCGACACTACAAAACAG
AGATGTCGGCCATTCTACTACGGAGGGTGTGGTGGAAATGAAAATAACTTCTACTCGGAA
ATGGAATGTGAGACGCGGTGTTCGGAACAACCGGTCACGACCACCGTGCAGCCGCTCACG
ATGCCGCCGACACAAACACAACCGAACGTCCCTACGCCGGAAAGATCTGAATTTTGTTAC
CTGGAAATCGATAGCGGTCCATGCACGCAGCCTCAGACACGTTATGCGTTCGATGCTTCC
CGCGGTACGTGTGTGCAGTTCCAATACGGCGGCTGTGGTGGCAACCGGAACCACTTCCCC
AGCCTCGAATACTGCCAGTACTATTGCGGAGTCAGTCAAGATGTATGCCAGCTGCCGTTC
GCGGAGGGTCCTTGCGACCAGTCCATCATGCAATGGTTCTATGATGCTGCCTCCGACTCC
TGCAGCCAGTTCACGTACGGAGGCTGTGAGGGCAACGGGAACAGGTTTAATACACTTGAG
GAATGCGAAAGCCGATGTCGTCAAAGTCTTCCAGCTACTACGACAACCTCGTCTACTACT
ACTATAACACCGGTGTACGTTTCGAGCGAATGTCAAGTATCTCCGGCGTTAGAGGAGTGC
CGGGAGAGCGGCGAGGTGTGGTATCTGGACCAGGAGCTTCGCACTTGCGTATCTTTCGTT
AACGAGGCCGAGGGCTCAGGTTGCAGACACACGGGGGCCTTCCACTCTCAAGAAGCCTGC
GAGCGAGCTTGCGGAGCCTTCCGAGGACTCGATGTTTGCCGCTACTCCCTCGATCCGGGT
CCTTGTCGCGAGATGGTTCCCAAGTTCTATTACAACGAAGCCACTGGCCGCTGCGAATCA
TTTACATACGGAGGTTGTCACGGAGGTCCCAACCGTTTCTCTTCTCTTGAAGAATGTGAA
CAAATATGCAGACCAAATACTGATCCTTGCATCCAATCTCCTGAATCGGGCAACTGCTTG
GCTTACTTCGTTATGTGGTACTACGATAGTTCCCGAGATGAATGCGGTCAGTTTGTGTAT
GGAGGCTGCAATGGCAACGATAATAGATTTGAAACACAAGCAGAATGTGAAGGTCGGTGC
AAGAAGGGTATTATAACTACGACAGCGTTTACCCTCGCCTCGGTTCCGTCCACGACCTCG
ACTTCAACCTCGACAACAACTACGACGACAACGACGACGACCACTACACAGGCGCCATCA
CCAACACCACAGTTTATAGTAGAAGCGGAATGCTCAACCCCAGAGTCATTAGCGGTGTGC
GGCAAAAACATTACGGTGTATTACTTCGATACTAGGACTCAAGCTTGCTTAGCCGGAGAT
TTTGGCGGCTGTAGGTATGCCAACAGTTATCGCACTGAAGAAGAGTGTCAGAGACGATGT
GGCGCCTTCAGAGGACTGGACGTTTGCGGTTCTCGTCTTGACCCTGGTCCTTGTTTGAAC
ACCATTCCTAAATTCTACTGGGATCCTCTCTCTGGCCGATGTCTGAGCTTTGCTTATGGA
GGATGTCACGGTGGACCGAATCGCTTCTCTACTGTGGAGGAATGTGAAGAAATCTGCGGA
GCGACTGGACCAGAAGCGCGTTGTCTGGTGCCGGTGTCGTCGGGTACTCCGGGCTGTGGG
GTTCCCTCGCGTCGCTGGTACTACAGCGTCAGCTTCGGGGACTGCCTGGCTTTCGTCTAC
TCTGGCTGTGGCGGCAACGAAAATAACTTCCACACGTATGAAGAGTGCGCTGCCTGCAAG
AGTGATTACTTGATCCCCGATAAAGAAACAGGCAACGAAGTTTTACCCGACTGTGATGAT
TTTAATGCGGAGTGCGCAGCCCTGGAGTGCAAATATGGAGTGCAGAGGATACGCGTGGGA
GGAGGATGCGAGCGATGTTCATGTATCGCGCCGGAGGTTGACTGTGAACCGTTAGCCAAG
GAATGCAAGAACCTCAAGTGTACTTATGGACTTCAAAAGACTACTGACGATGATGGCTGC
GAGAGGTGTAATTGCATCGATCATCCTTGCGCCAACAAGGAGTGCGAAATTGGAGAGCGT
TGTGTTGCAACGCCCTACAGAGATGCAATCTCGCAAGAAATTCTTTACTCCTCTGACTGT
AGAATTGCAAACAAATCCGGATCGTGTCCATCGGAAGCTGTGTCATTAACGGCGACGGAG
AGTCAGTGTAGGCGCCAGTGTAACGATGACGCAGACTGTCCTGGTGTAGGAAAGTGCTGC
GAACGTGGCTGTAGTCATCTCTGTCTGGAACCAGTATCACCCACCAGCCCCACGGCACGA
CCTGTGCCTATCTATGTACCTGAACTGCCCCAAGTGCCATACGCGAATGAGGCGACGGAG
CCGGAAGTCCACGCGACTCTGGGTGGCAAGGTTACTCTACGCTGTTTGTTCCACGGCAAC
CCCCCGCCCAAGATCACCTGGCAGCGAGGACAGATCACGATCGAAGGAGACGTTGGTCGG
TACCGACTAATGTCTGATGGCTCGTTGGAGATTGTTTCCCTTTATCGCAACGACTCCGGG
GTCTACATCTGTGTAGCGGACAATGGACTTGGAATAGCACGCCAAGAAATCAACCTACAG
GTTGAAGATGGAGTGGACGGACCGGCGGGCATAGCTGGCCTCACTGATACCGTGGTGGTC
GGGGAGCTGGGCCAGCCGCTCAGCGTCAGGTGCATGGCTTACGGATATCCGACACCATCG
ATCTACTGGTACCACGGCCGGAACGGACCCATGGTGCCCTTCAGCAGCCCGCAGTATGAA
GCCAGAGATAACATTCTACAAATAAGGAAGTTGTCCATTGACACACTCGGGGAATACATC
TGCCAAGCTTATAATGGAATCGGCAAACCGGTAGACTGGTCGTTAATCGTGCAGGCGTAT
AGATCCGACGACTCTGTTGACTCGCCGTACTTAGTGTCACGACAGCATGAAGTATTGATA
ACGCCTAGGGAACCTCAAACTGAAGCTACCACGACGATCGCACCGGAAATTGAGATACCC
GTCTACACCGTTCCCGTTACAACTCGTATTGTGTCTGAACGCACGCGGCTGGCCGCGGGA
TCGGAGCTTAATCTGTTGTGTGAAGTCGATGGCTATCCCGTGCCGGAAGTCTACTGGACC
AAGGACTCAGTCAGAATATCATCGGATGAGAAGGCGCGATTGACGGTGATGAGAACGAAC
ACGAACGACTCCGGCGTGTACAGCTGCCACGCATTCAACGCCTACAACTCTCATTACTCC
AGTGTGGAGATTAGCGTCGAAGGTCTGTACATTCCACCCACTTGCAAGGATAATCCGTAC
TTCGCCAACTGTCACCTCATAGTACGCAGCAAGTTCTGTCACCACAAATATTACTCTGGA
TTCTGCTGCAAGTCTTGCGTGGAGGCTGGACAGCTGGACCCTCGAGAGTTGGAGCTGCAG
GCGGACAGTCCCCTGTACCGGAAGTAG

Protein sequence:

MSSDDTGSGFETTGTYETTDITGYTETTETDLIEGSASGSTETTDISVVTEVSTENSDST
VTDETEASTTEVSDVTSSGSSDSSETDSTDSVSTEETESTSEYSQSSTTESSFSSTDNGM
STTEDELKTNESEFSTTESGISTTESSTERESSTQTTDFVSTIGSSESSESSTTESDSSS
TMESSSLSTVINEFETSTILSSTTEDELSTLESEPSTTENYSSSSESGSSVTESESSTTE
GTESSTISSSESESTEGSTLEYDESSTPITDLSSTTETDESSTTDNVLTSTTELLESSST
ERSESTSTESVEVSTTESVQVSTTESDKISTTDSIESSSTESVESSTTDNIELSSTESSQ
SSTTETDISSTSENILSSVTDVSESSTTESEVSSSTATDETSTMTTSESSTTDTSLYSTS
EEFETTETSEFSSTEGVSDSTTVVSSTESTESNPTESSDITGTSESSTATGSTESSSITE
ETVTEESVAPKSTPWDWASTIEVFTKKPCRPRKRTAKCVKSKFGCCPDKRTPAAGPFDEG
CPNPKTCKESKFGCCPDGVSPAPEPKGKGCPVTPCNETLYGCCKSDNITAAEGNNQEGCP
PPPPVCKSSEFGCCEDNETIAKGPNKEGCSETVTEKVTPVVAVGCASSEFGCCYDNETDA
SGPNGEGCPCSISEFGCCPDGLTTAGGANMEGCLMSCNTSAYGCCPDGETPAHGPDSEGC
CVQTSFGCCPDNYKPAEGPHLEGCGCQYAHYGCCPDNVTVARGPNMDGCGCAHSQYGCCP
DRHTQAQGPEFEGCGCHTYQFGCCLDGVTIASGPEMQGCRCVDSKYGCCGDEKTHAKGPN
NEGCDCSNSKYGCCPDGITEAQGEKFLNCTDAPINRQAACALANDGGPCRNYSVYWFYDM
TYGGCSRFWYGGCEGNGNRFLSEEECKDVCVQPSPKDACNLPKVKGACQGYHVRWYYDSQ
REQCSQFVFGGCLGNANNFDSKELCQERCEPEKTEDTCNLPIERGPCAGNFARWGFNPEK
RRCEQFVWGGCEGNANRFNSEAACLLQCDPPGTPKQACSQLQDVGNCTEKHAVWSFSQTE
NRCIPFYYTGCGGNDNRFESESSCAKSCPSVYEQEICTLPALTGECADYTQRWFFDTTKQ
RCRPFYYGGCGGNENNFYSEMECETRCSEQPVTTTVQPLTMPPTQTQPNVPTPERSEFCY
LEIDSGPCTQPQTRYAFDASRGTCVQFQYGGCGGNRNHFPSLEYCQYYCGVSQDVCQLPF
AEGPCDQSIMQWFYDAASDSCSQFTYGGCEGNGNRFNTLEECESRCRQSLPATTTTSSTT
TITPVYVSSECQVSPALEECRESGEVWYLDQELRTCVSFVNEAEGSGCRHTGAFHSQEAC
ERACGAFRGLDVCRYSLDPGPCREMVPKFYYNEATGRCESFTYGGCHGGPNRFSSLEECE
QICRPNTDPCIQSPESGNCLAYFVMWYYDSSRDECGQFVYGGCNGNDNRFETQAECEGRC
KKGIITTTAFTLASVPSTTSTSTSTTTTTTTTTTTTQAPSPTPQFIVEAECSTPESLAVC
GKNITVYYFDTRTQACLAGDFGGCRYANSYRTEEECQRRCGAFRGLDVCGSRLDPGPCLN
TIPKFYWDPLSGRCLSFAYGGCHGGPNRFSTVEECEEICGATGPEARCLVPVSSGTPGCG
VPSRRWYYSVSFGDCLAFVYSGCGGNENNFHTYEECAACKSDYLIPDKETGNEVLPDCDD
FNAECAALECKYGVQRIRVGGGCERCSCIAPEVDCEPLAKECKNLKCTYGLQKTTDDDGC
ERCNCIDHPCANKECEIGERCVATPYRDAISQEILYSSDCRIANKSGSCPSEAVSLTATE
SQCRRQCNDDADCPGVGKCCERGCSHLCLEPVSPTSPTARPVPIYVPELPQVPYANEATE
PEVHATLGGKVTLRCLFHGNPPPKITWQRGQITIEGDVGRYRLMSDGSLEIVSLYRNDSG
VYICVADNGLGIARQEINLQVEDGVDGPAGIAGLTDTVVVGELGQPLSVRCMAYGYPTPS
IYWYHGRNGPMVPFSSPQYEARDNILQIRKLSIDTLGEYICQAYNGIGKPVDWSLIVQAY
RSDDSVDSPYLVSRQHEVLITPREPQTEATTTIAPEIEIPVYTVPVTTRIVSERTRLAAG
SELNLLCEVDGYPVPEVYWTKDSVRISSDEKARLTVMRTNTNDSGVYSCHAFNAYNSHYS
SVEISVEGLYIPPTCKDNPYFANCHLIVRSKFCHHKYYSGFCCKSCVEAGQLDPRELELQ
ADSPLYRK