New model in OGS2.0 | DPOGS214599  |
---|---|
Genomic Position | scaffold34:- 97965-113038 |
CDS Length | 6867 |
Paired RNAseq reads   | 16590 |
Single RNAseq reads   | 38864 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA005129 (8e-19) |
Best Drosophila hit   | papilin, isoform E (0.0) |
Best Human hit | tissue factor pathway inhibitor isoform a precursor (5e-22) |
Best NR hit (blastp)   | lacunin [Manduca sexta] (0.0) |
Best NR hit (blastx)   | lacunin [Manduca sexta] (0.0) |
GeneOntology terms    | GO:0005604 basement membrane; GO:0005578 proteinaceous extracellular matrix; GO:0030198 extracellular matrix organization; GO:0005201 extracellular matrix structural constituent; GO:0008270 zinc ion binding; GO:0004222 metalloendopeptidase activity; GO:0004867 serine-type endopeptidase inhibitor activity |
InterPro families    | IPR002223 Proteinase inhibitor I2, Kunitz metazoa; IPR008197 Whey acidic protein, 4-disulphide core; IPR003599 Immunoglobulin subtype; IPR003598 Immunoglobulin subtype 2; IPR007110 Immunoglobulin-like; IPR010909 PLAC; IPR020901 Proteinase inhibitor I2, Kunitz, conserved site; IPR013098 Immunoglobulin I-set; IPR013783 Immunoglobulin-like fold |
Orthology group | MCL10728 |
Nucleotide sequence:
ATGTCATCGGACGACACCGGTTCTGGTTTCGAGACAACTGGGACTTACGAAACAACAGAT
ATAACTGGTTATACTGAAACAACCGAGACTGATTTAATAGAAGGGTCAGCAAGTGGTTCG
ACTGAAACAACAGATATAAGTGTAGTAACAGAAGTTAGTACTGAAAACTCAGATTCAACT
GTAACTGATGAAACAGAAGCGAGCACAACCGAAGTTTCTGATGTTACTTCGTCAGGAAGT
AGTGATTCCAGTGAAACAGACAGTACAGATTCAGTTTCAACAGAAGAAACAGAATCTACT
TCAGAATATTCCCAATCTAGTACAACAGAAAGTAGTTTCAGTAGTACAGATAATGGAATG
AGCACAACAGAAGATGAATTAAAAACAAATGAAAGTGAATTCAGTACAACCGAAAGTGGT
ATTAGTACAACGGAGAGTTCAACTGAAAGAGAATCTAGCACACAAACAACAGATTTCGTT
AGTACAATTGGAAGTTCTGAATCTAGTGAATCCAGCACAACAGAAAGTGATTCATCGAGT
ACGATGGAATCTTCATCCCTCAGCACAGTAATAAACGAATTTGAAACAAGTACAATTTTG
TCTAGCACGACCGAGGATGAATTGAGTACACTTGAGAGTGAACCTAGTACTACAGAAAAT
TATTCGAGTTCATCAGAAAGTGGGTCCAGTGTGACAGAAAGTGAGTCTAGTACAACTGAG
GGAACGGAATCTAGCACTATATCAAGTTCTGAATCTGAAAGTACAGAAGGTAGTACATTA
GAATACGATGAATCTAGCACACCAATAACTGATTTATCTAGTACAACGGAAACTGATGAA
TCAAGTACAACTGATAATGTTTTAACAAGCACGACTGAACTCTTAGAATCTAGTTCAACA
GAGAGATCAGAATCTACCTCAACAGAAAGTGTTGAAGTATCCACAACAGAGAGTGTTCAA
GTAAGTACAACAGAAAGTGATAAAATAAGCACAACAGACAGTATCGAGTCAAGTTCAACG
GAAAGTGTTGAGTCGAGCACAACAGATAATATTGAACTAAGCAGTACAGAAAGTAGCCAG
TCATCTACAACAGAAACTGATATTTCAAGTACATCGGAAAATATATTATCTAGTGTGACC
GACGTTAGCGAATCTAGTACTACGGAAAGTGAAGTGTCTAGTTCAACAGCTACTGATGAA
ACAAGTACAATGACTACTTCCGAATCCAGTACAACAGACACATCCCTATACAGTACATCA
GAGGAATTTGAAACTACCGAGACTAGTGAATTTTCATCTACGGAAGGTGTTAGTGATAGT
ACCACAGTCGTTAGCAGTACAGAATCTACAGAATCAAATCCAACAGAATCTTCTGATATA
ACCGGAACTAGTGAAAGCTCCACGGCAACAGGTAGCACTGAGTCTAGTTCAATTACTGAA
GAAACAGTTACCGAAGAATCTGTTGCACCAAAATCGACACCTTGGGATTGGGCGTCAACG
ATTGAAGTTTTCACAAAGAAACCTTGTAGGCCCAGAAAGAGAACCGCAAAATGTGTCAAA
AGTAAATTTGGATGCTGTCCGGACAAAAGAACACCTGCTGCTGGGCCATTTGACGAAGGT
TGTCCAAATCCCAAAACATGCAAAGAGTCGAAATTCGGTTGCTGCCCCGATGGTGTTTCA
CCAGCACCAGAACCGAAAGGCAAAGGCTGCCCCGTTACTCCTTGTAACGAAACACTTTAT
GGATGCTGTAAGTCTGACAACATCACTGCTGCTGAGGGTAACAACCAAGAGGGTTGCCCA
CCTCCGCCACCTGTTTGCAAGTCATCTGAATTTGGTTGCTGTGAAGATAATGAAACAATT
GCTAAAGGTCCTAATAAAGAAGGATGCTCAGAAACAGTAACTGAAAAGGTAACGCCAGTC
GTTGCAGTGGGATGTGCTTCCTCAGAGTTCGGTTGTTGCTACGACAACGAGACTGATGCC
TCTGGTCCAAATGGTGAAGGATGTCCTTGCAGTATTAGTGAATTCGGCTGCTGTCCTGAT
GGTCTCACTACAGCTGGTGGCGCAAATATGGAGGGCTGTTTAATGTCATGTAACACGTCG
GCTTACGGCTGTTGTCCTGATGGAGAAACTCCAGCACATGGACCGGATTCTGAAGGCTGC
TGTGTTCAGACTTCGTTTGGTTGCTGTCCTGACAATTACAAACCTGCTGAGGGACCGCAT
CTTGAAGGTTGTGGTTGTCAATACGCTCATTACGGTTGTTGCCCTGATAACGTCACAGTC
GCCCGGGGACCCAATATGGATGGATGTGGCTGCGCGCACTCTCAATATGGATGCTGTCCA
GATAGACATACACAAGCTCAGGGTCCTGAATTTGAAGGATGTGGTTGTCACACGTATCAA
TTTGGTTGTTGTTTGGACGGTGTTACTATCGCTTCGGGTCCTGAAATGCAGGGATGTCGC
TGTGTGGACTCCAAGTATGGATGCTGCGGGGATGAAAAAACTCACGCTAAGGGACCCAAT
AACGAGGGATGCGATTGTTCAAATAGCAAATATGGCTGTTGTCCTGATGGCATAACTGAA
GCTCAGGGTGAAAAGTTCCTTAATTGTACCGACGCTCCTATAAATAGACAAGCGGCCTGT
GCACTTGCTAATGATGGAGGGCCATGCCGCAATTACTCAGTTTATTGGTTCTACGATATG
ACCTATGGCGGATGTTCCCGATTCTGGTACGGTGGTTGTGAAGGCAACGGTAACCGATTT
CTTAGTGAGGAGGAATGCAAAGACGTGTGCGTTCAGCCATCACCAAAAGACGCTTGTAAT
CTTCCGAAAGTCAAAGGAGCATGCCAGGGTTACCACGTTCGTTGGTATTACGATTCACAA
CGTGAACAGTGCTCTCAGTTTGTATTCGGCGGATGTCTCGGCAATGCAAACAATTTCGAC
TCTAAAGAGCTCTGTCAGGAACGCTGTGAACCGGAGAAAACTGAAGATACATGTAACTTA
CCTATCGAGCGTGGTCCCTGCGCCGGTAACTTTGCCCGTTGGGGCTTCAATCCGGAAAAA
CGGCGATGCGAACAGTTCGTCTGGGGAGGCTGCGAAGGGAACGCCAATCGATTTAATTCA
GAAGCTGCCTGCCTGTTACAGTGTGATCCACCCGGAACTCCGAAACAGGCATGCTCGCAG
TTACAAGATGTAGGAAACTGCACTGAGAAGCATGCGGTGTGGTCGTTTAGTCAGACGGAG
AACCGCTGCATCCCCTTCTATTACACTGGTTGTGGTGGCAATGATAACCGCTTTGAAAGT
GAAAGCTCATGCGCCAAAAGCTGTCCCAGTGTTTATGAGCAAGAAATTTGTACTCTCCCT
GCTTTGACCGGTGAGTGCGCGGACTATACACAAAGGTGGTTCTTCGACACTACAAAACAG
AGATGTCGGCCATTCTACTACGGAGGGTGTGGTGGAAATGAAAATAACTTCTACTCGGAA
ATGGAATGTGAGACGCGGTGTTCGGAACAACCGGTCACGACCACCGTGCAGCCGCTCACG
ATGCCGCCGACACAAACACAACCGAACGTCCCTACGCCGGAAAGATCTGAATTTTGTTAC
CTGGAAATCGATAGCGGTCCATGCACGCAGCCTCAGACACGTTATGCGTTCGATGCTTCC
CGCGGTACGTGTGTGCAGTTCCAATACGGCGGCTGTGGTGGCAACCGGAACCACTTCCCC
AGCCTCGAATACTGCCAGTACTATTGCGGAGTCAGTCAAGATGTATGCCAGCTGCCGTTC
GCGGAGGGTCCTTGCGACCAGTCCATCATGCAATGGTTCTATGATGCTGCCTCCGACTCC
TGCAGCCAGTTCACGTACGGAGGCTGTGAGGGCAACGGGAACAGGTTTAATACACTTGAG
GAATGCGAAAGCCGATGTCGTCAAAGTCTTCCAGCTACTACGACAACCTCGTCTACTACT
ACTATAACACCGGTGTACGTTTCGAGCGAATGTCAAGTATCTCCGGCGTTAGAGGAGTGC
CGGGAGAGCGGCGAGGTGTGGTATCTGGACCAGGAGCTTCGCACTTGCGTATCTTTCGTT
AACGAGGCCGAGGGCTCAGGTTGCAGACACACGGGGGCCTTCCACTCTCAAGAAGCCTGC
GAGCGAGCTTGCGGAGCCTTCCGAGGACTCGATGTTTGCCGCTACTCCCTCGATCCGGGT
CCTTGTCGCGAGATGGTTCCCAAGTTCTATTACAACGAAGCCACTGGCCGCTGCGAATCA
TTTACATACGGAGGTTGTCACGGAGGTCCCAACCGTTTCTCTTCTCTTGAAGAATGTGAA
CAAATATGCAGACCAAATACTGATCCTTGCATCCAATCTCCTGAATCGGGCAACTGCTTG
GCTTACTTCGTTATGTGGTACTACGATAGTTCCCGAGATGAATGCGGTCAGTTTGTGTAT
GGAGGCTGCAATGGCAACGATAATAGATTTGAAACACAAGCAGAATGTGAAGGTCGGTGC
AAGAAGGGTATTATAACTACGACAGCGTTTACCCTCGCCTCGGTTCCGTCCACGACCTCG
ACTTCAACCTCGACAACAACTACGACGACAACGACGACGACCACTACACAGGCGCCATCA
CCAACACCACAGTTTATAGTAGAAGCGGAATGCTCAACCCCAGAGTCATTAGCGGTGTGC
GGCAAAAACATTACGGTGTATTACTTCGATACTAGGACTCAAGCTTGCTTAGCCGGAGAT
TTTGGCGGCTGTAGGTATGCCAACAGTTATCGCACTGAAGAAGAGTGTCAGAGACGATGT
GGCGCCTTCAGAGGACTGGACGTTTGCGGTTCTCGTCTTGACCCTGGTCCTTGTTTGAAC
ACCATTCCTAAATTCTACTGGGATCCTCTCTCTGGCCGATGTCTGAGCTTTGCTTATGGA
GGATGTCACGGTGGACCGAATCGCTTCTCTACTGTGGAGGAATGTGAAGAAATCTGCGGA
GCGACTGGACCAGAAGCGCGTTGTCTGGTGCCGGTGTCGTCGGGTACTCCGGGCTGTGGG
GTTCCCTCGCGTCGCTGGTACTACAGCGTCAGCTTCGGGGACTGCCTGGCTTTCGTCTAC
TCTGGCTGTGGCGGCAACGAAAATAACTTCCACACGTATGAAGAGTGCGCTGCCTGCAAG
AGTGATTACTTGATCCCCGATAAAGAAACAGGCAACGAAGTTTTACCCGACTGTGATGAT
TTTAATGCGGAGTGCGCAGCCCTGGAGTGCAAATATGGAGTGCAGAGGATACGCGTGGGA
GGAGGATGCGAGCGATGTTCATGTATCGCGCCGGAGGTTGACTGTGAACCGTTAGCCAAG
GAATGCAAGAACCTCAAGTGTACTTATGGACTTCAAAAGACTACTGACGATGATGGCTGC
GAGAGGTGTAATTGCATCGATCATCCTTGCGCCAACAAGGAGTGCGAAATTGGAGAGCGT
TGTGTTGCAACGCCCTACAGAGATGCAATCTCGCAAGAAATTCTTTACTCCTCTGACTGT
AGAATTGCAAACAAATCCGGATCGTGTCCATCGGAAGCTGTGTCATTAACGGCGACGGAG
AGTCAGTGTAGGCGCCAGTGTAACGATGACGCAGACTGTCCTGGTGTAGGAAAGTGCTGC
GAACGTGGCTGTAGTCATCTCTGTCTGGAACCAGTATCACCCACCAGCCCCACGGCACGA
CCTGTGCCTATCTATGTACCTGAACTGCCCCAAGTGCCATACGCGAATGAGGCGACGGAG
CCGGAAGTCCACGCGACTCTGGGTGGCAAGGTTACTCTACGCTGTTTGTTCCACGGCAAC
CCCCCGCCCAAGATCACCTGGCAGCGAGGACAGATCACGATCGAAGGAGACGTTGGTCGG
TACCGACTAATGTCTGATGGCTCGTTGGAGATTGTTTCCCTTTATCGCAACGACTCCGGG
GTCTACATCTGTGTAGCGGACAATGGACTTGGAATAGCACGCCAAGAAATCAACCTACAG
GTTGAAGATGGAGTGGACGGACCGGCGGGCATAGCTGGCCTCACTGATACCGTGGTGGTC
GGGGAGCTGGGCCAGCCGCTCAGCGTCAGGTGCATGGCTTACGGATATCCGACACCATCG
ATCTACTGGTACCACGGCCGGAACGGACCCATGGTGCCCTTCAGCAGCCCGCAGTATGAA
GCCAGAGATAACATTCTACAAATAAGGAAGTTGTCCATTGACACACTCGGGGAATACATC
TGCCAAGCTTATAATGGAATCGGCAAACCGGTAGACTGGTCGTTAATCGTGCAGGCGTAT
AGATCCGACGACTCTGTTGACTCGCCGTACTTAGTGTCACGACAGCATGAAGTATTGATA
ACGCCTAGGGAACCTCAAACTGAAGCTACCACGACGATCGCACCGGAAATTGAGATACCC
GTCTACACCGTTCCCGTTACAACTCGTATTGTGTCTGAACGCACGCGGCTGGCCGCGGGA
TCGGAGCTTAATCTGTTGTGTGAAGTCGATGGCTATCCCGTGCCGGAAGTCTACTGGACC
AAGGACTCAGTCAGAATATCATCGGATGAGAAGGCGCGATTGACGGTGATGAGAACGAAC
ACGAACGACTCCGGCGTGTACAGCTGCCACGCATTCAACGCCTACAACTCTCATTACTCC
AGTGTGGAGATTAGCGTCGAAGGTCTGTACATTCCACCCACTTGCAAGGATAATCCGTAC
TTCGCCAACTGTCACCTCATAGTACGCAGCAAGTTCTGTCACCACAAATATTACTCTGGA
TTCTGCTGCAAGTCTTGCGTGGAGGCTGGACAGCTGGACCCTCGAGAGTTGGAGCTGCAG
GCGGACAGTCCCCTGTACCGGAAGTAG
Protein sequence:
MSSDDTGSGFETTGTYETTDITGYTETTETDLIEGSASGSTETTDISVVTEVSTENSDST
VTDETEASTTEVSDVTSSGSSDSSETDSTDSVSTEETESTSEYSQSSTTESSFSSTDNGM
STTEDELKTNESEFSTTESGISTTESSTERESSTQTTDFVSTIGSSESSESSTTESDSSS
TMESSSLSTVINEFETSTILSSTTEDELSTLESEPSTTENYSSSSESGSSVTESESSTTE
GTESSTISSSESESTEGSTLEYDESSTPITDLSSTTETDESSTTDNVLTSTTELLESSST
ERSESTSTESVEVSTTESVQVSTTESDKISTTDSIESSSTESVESSTTDNIELSSTESSQ
SSTTETDISSTSENILSSVTDVSESSTTESEVSSSTATDETSTMTTSESSTTDTSLYSTS
EEFETTETSEFSSTEGVSDSTTVVSSTESTESNPTESSDITGTSESSTATGSTESSSITE
ETVTEESVAPKSTPWDWASTIEVFTKKPCRPRKRTAKCVKSKFGCCPDKRTPAAGPFDEG
CPNPKTCKESKFGCCPDGVSPAPEPKGKGCPVTPCNETLYGCCKSDNITAAEGNNQEGCP
PPPPVCKSSEFGCCEDNETIAKGPNKEGCSETVTEKVTPVVAVGCASSEFGCCYDNETDA
SGPNGEGCPCSISEFGCCPDGLTTAGGANMEGCLMSCNTSAYGCCPDGETPAHGPDSEGC
CVQTSFGCCPDNYKPAEGPHLEGCGCQYAHYGCCPDNVTVARGPNMDGCGCAHSQYGCCP
DRHTQAQGPEFEGCGCHTYQFGCCLDGVTIASGPEMQGCRCVDSKYGCCGDEKTHAKGPN
NEGCDCSNSKYGCCPDGITEAQGEKFLNCTDAPINRQAACALANDGGPCRNYSVYWFYDM
TYGGCSRFWYGGCEGNGNRFLSEEECKDVCVQPSPKDACNLPKVKGACQGYHVRWYYDSQ
REQCSQFVFGGCLGNANNFDSKELCQERCEPEKTEDTCNLPIERGPCAGNFARWGFNPEK
RRCEQFVWGGCEGNANRFNSEAACLLQCDPPGTPKQACSQLQDVGNCTEKHAVWSFSQTE
NRCIPFYYTGCGGNDNRFESESSCAKSCPSVYEQEICTLPALTGECADYTQRWFFDTTKQ
RCRPFYYGGCGGNENNFYSEMECETRCSEQPVTTTVQPLTMPPTQTQPNVPTPERSEFCY
LEIDSGPCTQPQTRYAFDASRGTCVQFQYGGCGGNRNHFPSLEYCQYYCGVSQDVCQLPF
AEGPCDQSIMQWFYDAASDSCSQFTYGGCEGNGNRFNTLEECESRCRQSLPATTTTSSTT
TITPVYVSSECQVSPALEECRESGEVWYLDQELRTCVSFVNEAEGSGCRHTGAFHSQEAC
ERACGAFRGLDVCRYSLDPGPCREMVPKFYYNEATGRCESFTYGGCHGGPNRFSSLEECE
QICRPNTDPCIQSPESGNCLAYFVMWYYDSSRDECGQFVYGGCNGNDNRFETQAECEGRC
KKGIITTTAFTLASVPSTTSTSTSTTTTTTTTTTTTQAPSPTPQFIVEAECSTPESLAVC
GKNITVYYFDTRTQACLAGDFGGCRYANSYRTEEECQRRCGAFRGLDVCGSRLDPGPCLN
TIPKFYWDPLSGRCLSFAYGGCHGGPNRFSTVEECEEICGATGPEARCLVPVSSGTPGCG
VPSRRWYYSVSFGDCLAFVYSGCGGNENNFHTYEECAACKSDYLIPDKETGNEVLPDCDD
FNAECAALECKYGVQRIRVGGGCERCSCIAPEVDCEPLAKECKNLKCTYGLQKTTDDDGC
ERCNCIDHPCANKECEIGERCVATPYRDAISQEILYSSDCRIANKSGSCPSEAVSLTATE
SQCRRQCNDDADCPGVGKCCERGCSHLCLEPVSPTSPTARPVPIYVPELPQVPYANEATE
PEVHATLGGKVTLRCLFHGNPPPKITWQRGQITIEGDVGRYRLMSDGSLEIVSLYRNDSG
VYICVADNGLGIARQEINLQVEDGVDGPAGIAGLTDTVVVGELGQPLSVRCMAYGYPTPS
IYWYHGRNGPMVPFSSPQYEARDNILQIRKLSIDTLGEYICQAYNGIGKPVDWSLIVQAY
RSDDSVDSPYLVSRQHEVLITPREPQTEATTTIAPEIEIPVYTVPVTTRIVSERTRLAAG
SELNLLCEVDGYPVPEVYWTKDSVRISSDEKARLTVMRTNTNDSGVYSCHAFNAYNSHYS
SVEISVEGLYIPPTCKDNPYFANCHLIVRSKFCHHKYYSGFCCKSCVEAGQLDPRELELQ
ADSPLYRK
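
The Genomic Position and CDS Length fields in the table can be cross-checked against the two sequences with a few lines of Python. The sketch below is illustrative only and not part of the record: the function names `parse_position` and `expected_protein_length` are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the CDS length is assumed to include the terminal stop codon.

```python
import re

def parse_position(text):
    """Split a position string such as 'scaffold34:- 97965-113038'
    into (scaffold, strand, start, end)."""
    m = re.fullmatch(r"(\S+?):([+-])\s*(\d+)-(\d+)", text.strip())
    if m is None:
        raise ValueError(f"unrecognized position: {text!r}")
    return m.group(1), m.group(2), int(m.group(3)), int(m.group(4))

def expected_protein_length(cds_length):
    """Number of residues encoded by a CDS whose length includes the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon

scaffold, strand, start, end = parse_position("scaffold34:- 97965-113038")
# Assumption: coordinates are 1-based and inclusive, so the locus span is end - start + 1.
genomic_span = end - start + 1
residues = expected_protein_length(6867)
```

Under these assumptions the locus spans 15074 bp on the minus strand of scaffold34, and a 6867 nt CDS corresponds to 2288 residues plus a stop codon, which can be compared with the length of the protein sequence listed above.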