DPGLEAN06552 in OGS1.0

New model in OGS2.0  DPOGS214724
Genomic Position  scaffold6029:- 3247-6275
CDS Length  2718
Paired RNAseq reads  4394
Single RNAseq reads  11173
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA005131 (2e-07)
Best Drosophila hit  CG12163, isoform B (2e-33)
Best Human hit  cathepsin F precursor (6e-30)
Best NR hit (blastp)  cysteine proteinase inhibitor precursor [Manduca sexta] (0.0)
Best NR hit (blastx)  cysteine proteinase inhibitor precursor [Manduca sexta] (0.0)
GeneOntology terms  GO:0005515 protein binding
InterPro families
IPR000010 Proteinase inhibitor I25, cystatin
IPR013201 Proteinase inhibitor I29, cathepsin propeptide
IPR000668 Peptidase C1A, papain C-terminal
IPR000169 Peptidase, cysteine peptidase active site
IPR013128 Peptidase C1A, papain
Orthology group  MCL12947

Nucleotide sequence:

GTGACTTGTGAAGAGTCACTTAAAAATAGACAGTCGCGGAAAAAGAGGTCAATAAGCATT
TCTAAGCAAAAACGTTCCTTCAAAGGACGCCCCATTAAACAGCAACCCAATAAGGCCGAA
TATAAAGTTTTAGCTGAAAAATCACTGCGAACATATTTACAAGTCCAAAAAATAACAAAT
AAACACAAAGTGATATCTGTTGAGCGAGTGACGTCACAGGTCGTATCAGGAACTATCTAT
GATATTGACTTTATTGCGTCACCTATTTGCTCTAAAGTAAACCAAAACAAAAAAAAGTGT
GATATAAATGATGTCAGCAAACTCTATTGTAACGCGAAAATTTGGAATCAACCTTGGAGG
AGCCAGGAAAAAATCGATGTTGATTGTAATAATGGTATTAGTGAAGAAGATGAAAAATAT
TCCAGAAGAAAGCGCAATATCCTAGGAGCTCCCGCAAACTATGTTGATGACGAAAATATC
AAATCTTTGGTTGAAGAAGCTGTTACAAAATACCAGAAATTATCAAACACCAAGTATGTG
CATAAGATTGTAAAAATCCACAATGTGAGTGAACAAATTGTATCTGGTATTATTACTAAA
TTGGATTTTTCAATTTCGCCTACAAACTGTTTGTTAGAAGATAATGCTATGCATATCGAT
GGTTGTCAGACACAATCATCCGATAAAATATTACATTGTAATGCGCAAGTGTGGGTTCAA
CCATGGATTCAATCATCTAAAGAAATTGAAATAAAGTGTGAAAAAAATAGTGAGGAAAGT
AAAGGGGACTTAGAGAATGATAAGTTTTCATCTGATCGAATAAAGAGGCAAATTACGCAT
GATGAAGATGATATTGATGAAGACACAAAGTATTATTACGCTGATCGAGCTGTACATTAC
ATAAATGAAAAAGAATCGACAAACAATTTGAATAAACTCATTACTATCCACGCGTTTGAA
AGCAGTACTAATATGGGAGTCAACATGATAAAAATGTACATCGAAATAGGTTTAACTTAT
TGCTTAAGACATGAAGATGAAGCGGAACTACAGAACTGTGAGGAATTGTCTGGTATCTAT
CACAAACTTTGTTATGTTCGGTTGTGGCCATCACCCGATGATGAACTAGTAGTTCAAAGC
TTGGCTGTGGTCTGCGACGACGAACGGGACTTCAAGAGCGTCACAGGCCTATCGATAACG
AATCTTATCAAAGAAGCTGTTAAGGAATTAGAATCTTCGCCTAAAATTAAAAACAAGTTG
GTTCACCTCGGTGAACCACACGTAGTACCTAGCCTGGATTCCCGTAAGCCCACGCAATTA
AGTTTTATAGTGCGAGCTACAAATTGTTCCAAGTACGTAGATATTGAAAAAGACCGTTTC
CAATGTTACATTGATAATTCTAGACTTCCTAAACCTTGCACATCAAGTATCTGGATGGCA
GCCAACACAAAAAAAATAAGAAAAGTCACAACACGCTGCAGTAGATCATTACCGAATCGT
AGCAGAAGATCGCTTTCGTTTGATACGACAAACACAACTTCCGACGAAAAACTTATCCAA
GGTATGGTAAGGGAATCATTGGACAAGCTAGAAATGTCGTCGCTGTTAAACTATAAACAG
AAGTTGCTACAGATTAACAGTTTTATGACTAATATAACCAGAGGAAGACTAACAACTATA
GACTTTGATGTAGCTTATACGACTTGCTTAAAATACGAATGGGTCGATAATATGACTGCT
TGTGAGATAATAGAGCACCTGCCCAGAAGACATTGCATATCACAGGTGAGGGAGCGGCTG
TGGATACAAAATGGCAGAGAAATAACAGTGAACTGCGACGACGACGAAACGCCGCTAGAA
TCTCATATAGAGTATGAGACCGCTGATAACGGAATGGCTTTGGCTAACGAGGCTTTGAAG
CACATCGAAGCTAAATATCCTCATCCAAATAAACAGAAAATTGTAAGAGTGTTTTCGTTG
GAAAAACAGCAGGTTGCTGGGTTGCATTTTAGATTGAAATTAGAGGTAGGCATTACAGAC
TGTTTAGCTTTGAGTGCCAAGAAGGACTGTAAGCTAACAAAAAACATGTCAACAAATAAG
TTCTGTCGAGTAAATATTTGGTTGCGTCCATGGTCTGAACATCCACCGCTTTATAGGGTG
ATATGTGACTATCAGGATGAAGCGTCACACGAGTTTTTCTTCGAAGTTCAAGCTGAACGT
CTCTTTTCCGACTTCCTAACTACCTACATGCCGGATTACATCGATAATAAATCAGAAATG
GTCAAAAGATACAACATATTTAAGGACAACGTTAAAAGAATACACGAATTAAATATCCAC
GAGCGTGGAACAGCAACTTACGGAGTTACTAGGTATTCGGATCTGACTTATGACGAGTTC
GTATCAAAATATATGGGCCTTAAGACACATATGAGAAATGAGAATCTGATTCCGATGAGA
CAAGCGGACATCCCAGAGGTGGCTCTTCCTGAAAACTTTGATTGGCGCGAATATAATGCT
GTCACTGAGGTCAAAGATCAAGGTTCCTGTGGAAGCTGCTGGGCGTTCAGTGTTACCGGT
AATATAGAAGGTCAATATAAGATCCAGAACGACGAGCTGGTCTCTCTGTCGGAGCAAGAA
TTGGTAGACTGTGACAAACTGGACGACGGCTGCAACGGAGGCCTCCCAGACAACGCCTAC
AGGTACTATTTTATCTAA

Protein sequence:

VTCEESLKNRQSRKKRSISISKQKRSFKGRPIKQQPNKAEYKVLAEKSLRTYLQVQKITN
KHKVISVERVTSQVVSGTIYDIDFIASPICSKVNQNKKKCDINDVSKLYCNAKIWNQPWR
SQEKIDVDCNNGISEEDEKYSRRKRNILGAPANYVDDENIKSLVEEAVTKYQKLSNTKYV
HKIVKIHNVSEQIVSGIITKLDFSISPTNCLLEDNAMHIDGCQTQSSDKILHCNAQVWVQ
PWIQSSKEIEIKCEKNSEESKGDLENDKFSSDRIKRQITHDEDDIDEDTKYYYADRAVHY
INEKESTNNLNKLITIHAFESSTNMGVNMIKMYIEIGLTYCLRHEDEAELQNCEELSGIY
HKLCYVRLWPSPDDELVVQSLAVVCDDERDFKSVTGLSITNLIKEAVKELESSPKIKNKL
VHLGEPHVVPSLDSRKPTQLSFIVRATNCSKYVDIEKDRFQCYIDNSRLPKPCTSSIWMA
ANTKKIRKVTTRCSRSLPNRSRRSLSFDTTNTTSDEKLIQGMVRESLDKLEMSSLLNYKQ
KLLQINSFMTNITRGRLTTIDFDVAYTTCLKYEWVDNMTACEIIEHLPRRHCISQVRERL
WIQNGREITVNCDDDETPLESHIEYETADNGMALANEALKHIEAKYPHPNKQKIVRVFSL
EKQQVAGLHFRLKLEVGITDCLALSAKKDCKLTKNMSTNKFCRVNIWLRPWSEHPPLYRV
ICDYQDEASHEFFFEVQAERLFSDFLTTYMPDYIDNKSEMVKRYNIFKDNVKRIHELNIH
ERGTATYGVTRYSDLTYDEFVSKYMGLKTHMRNENLIPMRQADIPEVALPENFDWREYNA
VTEVKDQGSCGSCWAFSVTGNIEGQYKIQNDELVSLSEQELVDCDKLDDGCNGGLPDNAY
RYYFI
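
The record's numeric fields can be cross-checked against each other. The following is a minimal sketch, not part of the database itself: the helper names `parse_position` and `expected_protein_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive. It parses the Genomic Position field and derives the protein length implied by the CDS Length field, given that the nucleotide sequence above ends in a TAA stop codon.

```python
import re


def parse_position(text):
    """Parse a position string such as 'scaffold6029:- 3247-6275'.

    Returns (scaffold, strand, start, end). The field layout is inferred
    from this one record and is an assumption, not a documented format.
    """
    m = re.fullmatch(r"([^:\s]+):([+-])\s*(\d+)-(\d+)", text.strip())
    if m is None:
        raise ValueError(f"unrecognized position: {text!r}")
    return m.group(1), m.group(2), int(m.group(3)), int(m.group(4))


def expected_protein_length(cds_length, has_stop=True):
    """Number of residues encoded by a CDS of the given length in bases."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length // 3
    # The terminal stop codon does not encode a residue.
    return codons - 1 if has_stop else codons


scaffold, strand, start, end = parse_position("scaffold6029:- 3247-6275")
span = end - start + 1          # assumes 1-based inclusive coordinates
cds_length = 2718               # CDS Length field of this record

print(scaffold, strand, span)                 # scaffold6029 - 3029
print(expected_protein_length(cds_length))    # 905
print(span - cds_length)                      # 311
```

The 905 residues agree with the protein sequence listed above (15 lines of 60 plus a final line of 5). Under the coordinate assumption, the genomic span is 311 bases longer than the CDS; the record does not say what accounts for the difference.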