DPGLEAN11983 in OGS1.0

New model in OGS2.0  DPOGS213340
Genomic Position  scaffold163:- 8323-14378
CDS Length  1590
Paired RNAseq reads  7065
Single RNAseq reads  15920
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA009139 (9e-69)
Best Drosophila hit  26-29kD-proteinase (3e-73)
Best Human hit  cathepsin L2 preproprotein (4e-50)
Best NR hit (blastp)  putative C1A cysteine protease precursor [Spodoptera frugiperda] (2e-141)
Best NR hit (blastx)  putative C1A cysteine protease precursor [Spodoptera frugiperda] (2e-143)
GeneOntology terms
GO:0004197 cysteine-type endopeptidase activity
GO:0006508 proteolysis
GO:0005811 lipid particle
GO:0005875 microtubule associated complex
InterPro families
IPR013128 Peptidase C1A, papain
IPR013201 Proteinase inhibitor I29, cathepsin propeptide
IPR000668 Peptidase C1A, papain C-terminal
IPR000169 Peptidase, cysteine peptidase active site
Orthology group  ND

Nucleotide sequence:

ATGTTTTCAATATATTTTATTTTAGCGTTTCTTTTCATTCATACAGGCTGTGCTGCTGTC
CTGGGCGAGGATGGAGTCGTGTGGCCCAAGGAGTATCATTTGAAAGGCGAAATAACTTTC
ATGAGCGTAGGACTCCAGGAGCCGTTTGAAATCTGGTATAGTGCTGCAGAAAATAAGTCC
AGGATTGATTTTTACGATGGCACCGTGAAGAGGTACGTCATCGGGGAGGAGGAGGACGAC
GGTGAAGAATATAAGGTATTTCCGGTGTTCATTGACAAGGAGATGACCGTCATGTGTGTG
AGAGAACCAACAGACGGGGAGTCTATGGACTTTTTAATAGATCCTAGTAATTTTACATAT
TTCGACACAACGTCATATAATGGAAAAACGGTTCAGGTGTGGAAGAGTATTGAAGTGGAA
TTAAACGAACAGAAAGTCGAAAAAGTGTTGTTTGTGTATAAACAAGATGGGTTTCATGTG
CCGATAAGAGTTGAAGAAATCAAACATAATCTATGGACGGGTGCTTTAGAGGGACATAAA
ATTACCACCTTCTACGATTACAGGAAACCGACAAAGGACGACTTCAATGTCGCCTTAGTG
ACTGAATGTGAAGACGCCACTGACTTTTACAAAGACCTGCGCGTGCTTCATCCGATGATA
CCTTCCGATGTGGATAGGCTCTATCACAGTTATACAAAGCATCACAACAGAAACTACAAA
GCTGAAGAGCATAGTTTGCGTAAATCAATATTAGAACAGAATTGGCAGCGCGTCCTCCTT
CACAACAAAAAAAACTTAGGCTTCAAGTTGACTCTCAACAAATATTCTGATCGCACTAAG
GAAGAACTATCCTTCCTAACTGGCACGAGACCCTCATTAGGGACAGGCACCGTCTCCTTT
CCGCACACTGATGAGGAAGTGGAACAGATGGTGTTGGATCTTCCTGAGAATTATGATATG
AGGCTGGAAGGAACTATTAGTGCAGTCAAGAATCAGGGTCGCTGTGGCTCCTGCTGGACG
TTCTCAACTGCAGCTGCTGTGGAGGGAGCGCTGGCCAGGAAGAACGGGGGACGAGACCTG
GACCTGAGCGAGCAGTCCATCGTGGATTGTGCGTGGGGATATCATAACGCTGGCTGTGAC
GGAGGCATGATAGACACGGCGTTTAAGTACATCCTGGACTACGGCATCCCGACTCAGATA
GAATACGGAGATTACTTAGGAGAAGATGGCTACTGCCACATCGAGAACGTCACTGACGTC
TACAATATCATTGGGTTCGTGCAAGTGCCGTCCAAGAGTGTGAATGCTATGAAAGTAGCC
CTTTACAAGTACGGGCCGGTGTCCGTGGCTATTAATGCGAACAAGCTCTTGGTGGCCTAT
GAAAGTGGCATCTTCTTCGACCCTGAGTGTAACGAGGACCACATCAACCACGCCGTGACC
GTAGTAGGTTATGGTGTCCGCGATGGTGCCACCTACTGGATCGTGAAGAACTCCTGGGGA
GAGGACTGGGGTCAGGACGGCTACCTGCTCATCTCTGCTACCGACAATAACTGTCATATA
CTAGAATACGCCTACTATCCTCTAGTCTGA

Protein sequence:

MFSIYFILAFLFIHTGCAAVLGEDGVVWPKEYHLKGEITFMSVGLQEPFEIWYSAAENKS
RIDFYDGTVKRYVIGEEEDDGEEYKVFPVFIDKEMTVMCVREPTDGESMDFLIDPSNFTY
FDTTSYNGKTVQVWKSIEVELNEQKVEKVLFVYKQDGFHVPIRVEEIKHNLWTGALEGHK
ITTFYDYRKPTKDDFNVALVTECEDATDFYKDLRVLHPMIPSDVDRLYHSYTKHHNRNYK
AEEHSLRKSILEQNWQRVLLHNKKNLGFKLTLNKYSDRTKEELSFLTGTRPSLGTGTVSF
PHTDEEVEQMVLDLPENYDMRLEGTISAVKNQGRCGSCWTFSTAAAVEGALARKNGGRDL
DLSEQSIVDCAWGYHNAGCDGGMIDTAFKYILDYGIPTQIEYGDYLGEDGYCHIENVTDV
YNIIGFVQVPSKSVNAMKVALYKYGPVSVAINANKLLVAYESGIFFDPECNEDHINHAVT
VVGYGVRDGATYWIVKNSWGEDWGQDGYLLISATDNNCHILEYAYYPLV
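
The CDS length of 1590 and the two sequence blocks above can be cross-checked: 1590 nucleotides correspond to 530 codons, that is 529 residues plus a stop codon. The following is a minimal sketch of such a check, assuming the standard genetic code applies to this gene model. The helper names `translate` and `expected_protein_length` are hypothetical and not part of any database tooling; only the first and last lines of the nucleotide sequence are used as input.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
# Assumption: the standard (nuclear) code applies to this gene model.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose length includes one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGTTTTCAATATATTTTATTTTAGCGTTTCTTTTCATTCATACAGGCTGTGCTGCTGTC"
last_line = "CTAGAATACGCCTACTATCCTCTAGTCTGA"

print(expected_protein_length(1590))  # 529
print(translate(first_line))          # MFSIYFILAFLFIHTGCAAV
print(translate(last_line))           # LEYAYYPLV*
```

The translated fragments match the start and end of the protein sequence listed above, and the protein block contains 529 residues, which agrees with the stated CDS length.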