New model in OGS2.0 | DPOGS213340  |
---|---|
Genomic Position | scaffold163:- 8323-14378 |
CDS Length | 1590 |
Paired RNAseq reads   | 7065 |
Single RNAseq reads   | 15920 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA009139 (9e-69) |
Best Drosophila hit   | 26-29kD-proteinase (3e-73) |
Best Human hit | cathepsin L2 preproprotein (4e-50) |
Best NR hit (blastp)   | putative C1A cysteine protease precursor [Spodoptera frugiperda] (2e-141) |
Best NR hit (blastx)   | putative C1A cysteine protease precursor [Spodoptera frugiperda] (2e-143) |
GeneOntology terms    | GO:0004197 cysteine-type endopeptidase activity; GO:0006508 proteolysis; GO:0005811 lipid particle; GO:0005875 microtubule associated complex |
InterPro families    | IPR013128 Peptidase C1A, papain; IPR013201 Proteinase inhibitor I29, cathepsin propeptide; IPR000668 Peptidase C1A, papain C-terminal; IPR000169 Peptidase, cysteine peptidase active site |
Orthology group | ND |
Nucleotide sequence:
ATGTTTTCAATATATTTTATTTTAGCGTTTCTTTTCATTCATACAGGCTGTGCTGCTGTC
CTGGGCGAGGATGGAGTCGTGTGGCCCAAGGAGTATCATTTGAAAGGCGAAATAACTTTC
ATGAGCGTAGGACTCCAGGAGCCGTTTGAAATCTGGTATAGTGCTGCAGAAAATAAGTCC
AGGATTGATTTTTACGATGGCACCGTGAAGAGGTACGTCATCGGGGAGGAGGAGGACGAC
GGTGAAGAATATAAGGTATTTCCGGTGTTCATTGACAAGGAGATGACCGTCATGTGTGTG
AGAGAACCAACAGACGGGGAGTCTATGGACTTTTTAATAGATCCTAGTAATTTTACATAT
TTCGACACAACGTCATATAATGGAAAAACGGTTCAGGTGTGGAAGAGTATTGAAGTGGAA
TTAAACGAACAGAAAGTCGAAAAAGTGTTGTTTGTGTATAAACAAGATGGGTTTCATGTG
CCGATAAGAGTTGAAGAAATCAAACATAATCTATGGACGGGTGCTTTAGAGGGACATAAA
ATTACCACCTTCTACGATTACAGGAAACCGACAAAGGACGACTTCAATGTCGCCTTAGTG
ACTGAATGTGAAGACGCCACTGACTTTTACAAAGACCTGCGCGTGCTTCATCCGATGATA
CCTTCCGATGTGGATAGGCTCTATCACAGTTATACAAAGCATCACAACAGAAACTACAAA
GCTGAAGAGCATAGTTTGCGTAAATCAATATTAGAACAGAATTGGCAGCGCGTCCTCCTT
CACAACAAAAAAAACTTAGGCTTCAAGTTGACTCTCAACAAATATTCTGATCGCACTAAG
GAAGAACTATCCTTCCTAACTGGCACGAGACCCTCATTAGGGACAGGCACCGTCTCCTTT
CCGCACACTGATGAGGAAGTGGAACAGATGGTGTTGGATCTTCCTGAGAATTATGATATG
AGGCTGGAAGGAACTATTAGTGCAGTCAAGAATCAGGGTCGCTGTGGCTCCTGCTGGACG
TTCTCAACTGCAGCTGCTGTGGAGGGAGCGCTGGCCAGGAAGAACGGGGGACGAGACCTG
GACCTGAGCGAGCAGTCCATCGTGGATTGTGCGTGGGGATATCATAACGCTGGCTGTGAC
GGAGGCATGATAGACACGGCGTTTAAGTACATCCTGGACTACGGCATCCCGACTCAGATA
GAATACGGAGATTACTTAGGAGAAGATGGCTACTGCCACATCGAGAACGTCACTGACGTC
TACAATATCATTGGGTTCGTGCAAGTGCCGTCCAAGAGTGTGAATGCTATGAAAGTAGCC
CTTTACAAGTACGGGCCGGTGTCCGTGGCTATTAATGCGAACAAGCTCTTGGTGGCCTAT
GAAAGTGGCATCTTCTTCGACCCTGAGTGTAACGAGGACCACATCAACCACGCCGTGACC
GTAGTAGGTTATGGTGTCCGCGATGGTGCCACCTACTGGATCGTGAAGAACTCCTGGGGA
GAGGACTGGGGTCAGGACGGCTACCTGCTCATCTCTGCTACCGACAATAACTGTCATATA
CTAGAATACGCCTACTATCCTCTAGTCTGA
Protein sequence:
MFSIYFILAFLFIHTGCAAVLGEDGVVWPKEYHLKGEITFMSVGLQEPFEIWYSAAENKS
RIDFYDGTVKRYVIGEEEDDGEEYKVFPVFIDKEMTVMCVREPTDGESMDFLIDPSNFTY
FDTTSYNGKTVQVWKSIEVELNEQKVEKVLFVYKQDGFHVPIRVEEIKHNLWTGALEGHK
ITTFYDYRKPTKDDFNVALVTECEDATDFYKDLRVLHPMIPSDVDRLYHSYTKHHNRNYK
AEEHSLRKSILEQNWQRVLLHNKKNLGFKLTLNKYSDRTKEELSFLTGTRPSLGTGTVSF
PHTDEEVEQMVLDLPENYDMRLEGTISAVKNQGRCGSCWTFSTAAAVEGALARKNGGRDL
DLSEQSIVDCAWGYHNAGCDGGMIDTAFKYILDYGIPTQIEYGDYLGEDGYCHIENVTDV
YNIIGFVQVPSKSVNAMKVALYKYGPVSVAINANKLLVAYESGIFFDPECNEDHINHAVT
VVGYGVRDGATYWIVKNSWGEDWGQDGYLLISATDNNCHILEYAYYPLV
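
The CDS length reported above (1590 nt) can be cross-checked against the protein sequence: a complete CDS contains one codon per residue plus a terminal stop codon, so 1590 nt should correspond to 1590 / 3 - 1 = 529 residues. The following is a minimal sketch of that check, not part of the original record. The function names are hypothetical, and it assumes a standard ATG start codon and a standard stop codon (TAA, TAG or TGA). It compares lengths and boundary codons only and does not translate the sequence.

```python
# Hypothetical helpers for a length/boundary sanity check of a CDS record.
# Assumption: standard ATG start and standard stop codons.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def expected_protein_length(cds_length: int) -> int:
    """Residue count implied by a complete CDS (stop codon excluded)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


def cds_consistent_with_protein(cds: str, protein: str) -> bool:
    """Check length and boundary codons only; no translation is performed."""
    # Remove line breaks and spaces so wrapped sequence blocks can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()
    return (
        len(cds) % 3 == 0
        and cds[:3] == "ATG"
        and cds[-3:] in STOP_CODONS
        and len(cds) == 3 * (len(protein) + 1)
    )


print(expected_protein_length(1590))
```

Pasting the nucleotide and protein blocks from this record into `cds_consistent_with_protein` would apply the same check to the full sequences.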