DPGLEAN18014 in OGS1.0

New model in OGS2.0  DPOGS202493
Genomic Position  scaffold1426:+ 24835-32620
CDS Length  1641
Paired RNAseq reads  986
Single RNAseq reads  4763
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA000097 (5e-160)
Best Drosophila hit  Calpain-A, isoform C (7e-86)
Best Human hit  calpain-9 isoform 2 (5e-51)
Best NR hit (blastp)  PREDICTED: similar to calpain B [Acyrthosiphon pisum] (1e-96)
Best NR hit (blastx)  PREDICTED: similar to calpain B [Acyrthosiphon pisum] (8e-101)
GeneOntology terms
GO:0004198 calcium-dependent cysteine-type endopeptidase activity
GO:0030036 actin cytoskeleton organization
GO:0005737 cytoplasm
GO:0006508 proteolysis
GO:0005509 calcium ion binding
GO:0005622 intracellular
GO:0016540 protein autoprocessing
GO:0006911 phagocytosis, engulfment
GO:0009953 dorsal/ventral pattern formation
GO:0021919 BMP signaling pathway involved in spinal cord dorsal/ventral patterning
GO:0042335 cuticle development
InterPro families
IPR022682 Peptidase C2, calpain, large subunit, domain III
IPR022683 Peptidase C2, calpain, domain III
IPR002048 Calcium-binding EF-hand
IPR022684 Peptidase C2, calpain family
IPR011992 EF-hand-like domain
IPR018249 EF-HAND 2
Orthology group  MCL10179

Nucleotide sequence:

ATGATTCAATCATTCAAGGACTTCACCAGCAACTTTGATAGAGTCGAAATCTGCAACTTG
AACCCCGACTCCCTGGACCCCGAAGAATGTCCTGAGGGCTGCACCAAGAAGTGGGAGATG
TCTGTGTTTGAAGGGGAGTGGGTCAGAGGTGTAACCGCTGGCGGCTGTAGGAATTACCTA
GAATCGTTTTGGAAGAATCCTCAATACACCGTTACACTGAAAGACCCCGACGAAGATGAC
GCGGAGAACAAGTGTACCATAATAGTGGCGTTGATGCAAAAGAACCGTCGCTCTCAGCGT
CACCAGGGGCTCGAGTGCCTCACCATAGGGTTCGCGGTGTACCGCCTGCCCGACTACGGC
CATGTGCCCAAGCCCTTAGATGTCAACTTCTTCAAATACAACGCCAGTGTGGGCAGGTCG
CAGGCCTTCATCAATCTGAGGGAGGTCAGCGCCAGATTCAAATTCGAGCCCGGAAGCTAC
GTCATAGTGCCGTCCACCTTCGAACCTGACGAGGAAGGGGAGTTCCTGTTGCGTGTGTTC
TCCGAGAAAACGAATAATATGACAGAGAACGACGAAGAAGTAGGGATGGGAGACGTGGAT
GACAGAAAAATGTTAGATTATGTCATTACGGCGGTCAAACGGACGCTGGGAAGACACGCA
GGGGACCTGGCATTCCGTGGAATTTATCGCTATAAGAATGGACAAAATGGCCATCATGAC
CAAAACGCGGATGGGATTGTTAGAACTAGAGTGAAGGAAATAACTCCCAACCCGGAGCCC
GCGGATCCTGTTAGAGAGTTCTTCACCCGCCTGGCTGGGAGCGACGGGGAGGTGGACTGG
CAGGAACTGAAGGAGATACTGGACTACGCCATGAGAGAGGACCTAATGCCGCTTTGTAAT
TGTCCGCCAAACGAACCGATGACATCTAAATGGCTATGCGGCATGGCGCTTATGACGGGG
GCGGGGGATCCCCGCCCGATCTGCAAGGAGATAGGAATAGACCTACAGCAGATGCAAACA
CAGCCGGAGCAGGTCCACCTCAACCAGCCGCAGGCGCAAGAACTAAAAGGACAAGGTTTC
TCTAAGGAAGTCTGTAGGAGTATGGTTGCTATGTTGGACAAAGACAACTCTGGCGGACTC
GGCTTCGAAGAGTTCAAATCTCTTTGGATCGATTTGCGCAACTGGAGGGTAGGGGGCGGG
GGATCCCCGCCGAACGAACAGAGCTGTTTGGAACAGCTGCTTTGCGCGCTCTGCACGCCG
ATCTGCAAGGAGATAGGAATAGACCTACAGCAGATGCAAACACAGCCGGAGCAGGTCCAC
CTCAACCAGCCGCAGGCGCAAGAACTAAAAGGACAAGGTTTCTCTAAGGAAGTCTGTAGG
AGTATGGTTGCTATGTTGGACAAAGACAACTCTGGCGGACTCGGCTTCGAAGAGTTCAAA
TCTCTTTGGATCGATTTGCGCAACTGGAGGGTAGGTGACACGCTATACGGGTCAAGCGAT
GGCTACATACAGTTTGATGACTTCATCATGTGTTCGGTGCGGCTGAAGACCATGATCGAC
GCTTTCCAAGGCAGGTCGTCGGGCGGCGACTACGCCACGTTTTCCCTGGACGAATGGCTG
AATCGCACAGTCTACTCCTAA

Protein sequence:

MIQSFKDFTSNFDRVEICNLNPDSLDPEECPEGCTKKWEMSVFEGEWVRGVTAGGCRNYL
ESFWKNPQYTVTLKDPDEDDAENKCTIIVALMQKNRRSQRHQGLECLTIGFAVYRLPDYG
HVPKPLDVNFFKYNASVGRSQAFINLREVSARFKFEPGSYVIVPSTFEPDEEGEFLLRVF
SEKTNNMTENDEEVGMGDVDDRKMLDYVITAVKRTLGRHAGDLAFRGIYRYKNGQNGHHD
QNADGIVRTRVKEITPNPEPADPVREFFTRLAGSDGEVDWQELKEILDYAMREDLMPLCN
CPPNEPMTSKWLCGMALMTGAGDPRPICKEIGIDLQQMQTQPEQVHLNQPQAQELKGQGF
SKEVCRSMVAMLDKDNSGGLGFEEFKSLWIDLRNWRVGGGGSPPNEQSCLEQLLCALCTP
ICKEIGIDLQQMQTQPEQVHLNQPQAQELKGQGFSKEVCRSMVAMLDKDNSGGLGFEEFK
SLWIDLRNWRVGDTLYGSSDGYIQFDDFIMCSVRLKTMIDAFQGRSSGGDYATFSLDEWL
NRTVYS
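
Consistency check (illustrative sketch):

The nucleotide and protein sequences above can be cross-checked against each other and against the stated CDS length. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record itself does not state. The helper names `translate` and `expected_protein_length` are hypothetical and are not part of any database tooling. It translates the first and last stretches of the CDS and derives the residue count implied by a 1641 nt CDS that ends in a stop codon.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return cds_length // 3 - 1


# First 30 nt and last 21 nt of the nucleotide sequence in this record.
print(translate("ATGATTCAATCATTCAAGGACTTCACCAGC"))  # MIQSFKDFTS
print(translate("AATCGCACAGTCTACTCCTAA"))           # NRTVYS
print(expected_protein_length(1641))                # 546
```

The two translated fragments match the start and end of the protein sequence above, and 546 residues agrees with its length (nine full lines of 60 plus a final line of 6).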