New model in OGS2.0 | DPOGS203981  |
---|---|
Genomic Position | scaffold2:+ 937292-942884 |
CDS Length | 1896 |
Paired RNAseq reads   | 318 |
Single RNAseq reads   | 850 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA002120 (4e-175) |
Best Drosophila hit   | calpain C (5e-149) |
Best Human hit | calpain-9 isoform 1 (2e-86) |
Best NR hit (blastp)   | PREDICTED: similar to calpain-c [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to calpain-c [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0004198 calcium-dependent cysteine-type endopeptidase activity; GO:0005737 cytoplasm; GO:0006508 proteolysis; GO:0005622 intracellular; GO:0005509 calcium ion binding |
InterPro families    | IPR001300 Peptidase C2, calpain, catalytic domain; IPR022683 Peptidase C2, calpain, domain III; IPR018249 EF-HAND 2; IPR022682 Peptidase C2, calpain, large subunit, domain III; IPR022684 Peptidase C2, calpain family; IPR011992 EF-hand-like domain |
Orthology group | MCL16375 |
Nucleotide sequence:
ATGACGGATTACGAGCGCATCAAGGCTAGCTGCCTCCAGCGCGGCCAGCTATGGGAGGAC
CCTGACTTCCCCGCTATCCAGCCATCCGTGTTCTACCATCAGGTGCCACCCTTCAAATTT
GAGTGGAAGCGCGCCAAAGAACTATACCAGTATCCAAAGTTCATATTGGACAACAGTGAC
AGCTTCGACATAGTTACAGGCAGACTTGGAGATAAATGGCTTTTATCGTGTATAGGAGTG
CTATACCTGTGTAAAGGGCTATTCTACAGAGTGGTTCCAGCCGATCAACAAATCGATTCT
AACTACGCAGGTATTTTCAGGTTTCGTCTTTGGTGGTGTGGACAATGGGTTGAAGTTCTT
GTTGACGATAGATTACCTACGGTTAACGGTAAATTAGCTTTCATGCATTGCAGTCACTCT
GAGCAATTGTGGCCGGCTCTACTTGAAAAGGCATATGCAAAAATGCATGGTTCGTACGAA
GCTCTAAAATACGGTAACTTACTGGATGGACTGGCAGATCTTACTGGAGGAATCACAGAA
TCCCTGAATATTTCTGACCTTGCCGATGCCACAGCTCTACACAACTTAATGAAGACTACG
AGCGTTGTTACAGCTTACCGTTTACCTAATGCCGCCACACATTCTGTGAAAAGCATTGAA
TCTGGAATGAATTACAGACTTTACAACGTGGAAAGGGTAGATACTTCTGATGGTCCAGTG
TATTTAGTGCGATTGGGACGACCATTAACACCTGGTGATACGCACATTACTCATTTTGTT
TTAGACCAAGCCACATGGACCCATTCTATTCCTCTACACGAACGTCAACGTTTGACATCT
ATAACAAAAGGTTTCTGGATGCTTTATAATGACTTCACTTCTATGTTTTCGCGTGTGGAA
ATAGTCCATCTTGATCTAGAAACGAGCAAAGCGGAAGCATCTCTCTCAGATAAAAACAAA
TGGCTAGTAAAAAGTCACCAAGGAAGGTGGAGAAAGGGCGTCACTGCTGGTGGTTGTAGA
AATCACGTTAATCTGTTTCATATGAACCCACAAATACAAATTGTATTAAATGATCCTGAC
ACGGTGATTATATCACTCAATCAGCATAGTATTATGGAACCTAAAGTTATAGGATTCAGC
ATTTACAAAATACCTAAGAGCTTAACAGAAACAGCATCATCACTTTTTTTTAAGAAGACT
AAAAGTTCGATTAATTCCCAATACACAAACAGTAGGCAAGTTAGTGAAAGATGTCACTTA
GAACCAGGTGCATACTTAGTAATACCCACGACTTTTGAACCTAGACAGGAAGCAAATTTT
TCATTAAGAGTGTACTCTGTAAAGCAACTCAAACTGAAAGTATTAGATTGTGCCCCACAG
ATGTTAAAAGCAGCTATTTTAAAAGCACCTCCTGGGTTCGAAACTAGTAGTTTCACACAA
TATGAGTCACAATTTCTACAGCTGGCTGATGAACACAAAACTATAAATGCCTTCGAACTA
CAAGAATTATTAGAAAAGTGCTTGCCAAATGATTACATAAAGAGCTGTGCAACAATCGAA
ACATGTAGACAAATCGTCTTATCATTGGAAAAAGATGGCTCTGGTCGTATAACATTATCT
GATTTCAAAGATCTCATATGCAGCCTGAAGCACTGGCAGATTGTATTCCGAGCTCACGCT
CCAGAGAAAATGAGCGTCCTCAAGATTGAAAGGTTTCGAGATGCACTTCGCGATGTCGGC
TTTGTAATTCCAGAACGGGCATTGTCATTACTTGTATTGAAGTACATGAGAAAAGATGGC
ATGCTGAGATTTGGGGACTTTGTATCTGCAGTAGTTCTTCTCCATAGAGCGTTTCGTAAG
TTCATAAGTATTATTTCTGTCAATCATAGGAAGTAA
Protein sequence:
MTDYERIKASCLQRGQLWEDPDFPAIQPSVFYHQVPPFKFEWKRAKELYQYPKFILDNSD
SFDIVTGRLGDKWLLSCIGVLYLCKGLFYRVVPADQQIDSNYAGIFRFRLWWCGQWVEVL
VDDRLPTVNGKLAFMHCSHSEQLWPALLEKAYAKMHGSYEALKYGNLLDGLADLTGGITE
SLNISDLADATALHNLMKTTSVVTAYRLPNAATHSVKSIESGMNYRLYNVERVDTSDGPV
YLVRLGRPLTPGDTHITHFVLDQATWTHSIPLHERQRLTSITKGFWMLYNDFTSMFSRVE
IVHLDLETSKAEASLSDKNKWLVKSHQGRWRKGVTAGGCRNHVNLFHMNPQIQIVLNDPD
TVIISLNQHSIMEPKVIGFSIYKIPKSLTETASSLFFKKTKSSINSQYTNSRQVSERCHL
EPGAYLVIPTTFEPRQEANFSLRVYSVKQLKLKVLDCAPQMLKAAILKAPPGFETSSFTQ
YESQFLQLADEHKTINAFELQELLEKCLPNDYIKSCATIETCRQIVLSLEKDGSGRITLS
DFKDLICSLKHWQIVFRAHAPEKMSVLKIERFRDALRDVGFVIPERALSLLVLKYMRKDG
MLRFGDFVSAVVLLHRAFRKFISIISVNHRK
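
The fields of this record can be cross-checked against each other. The listed CDS length of 1896 nt is a multiple of three (632 codons), so a complete coding sequence would be expected to start with ATG, end in a stop codon, and encode one residue per codon apart from the stop. The sketch below shows one way to run that check in Python. The function name `check_cds` and the short toy sequence are illustrative assumptions and are not part of the database or its tooling.

```python
# Hypothetical helper (not part of the database): basic consistency checks
# between a coding sequence and the length of its predicted protein.
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons of the standard genetic code


def check_cds(cds: str, protein_length: int) -> dict:
    """Return simple sanity checks for a CDS given as a nucleotide string."""
    # Remove line breaks and spaces so a multi-line sequence can be pasted in.
    seq = "".join(cds.split()).upper()
    return {
        "length": len(seq),
        "multiple_of_three": len(seq) % 3 == 0,
        "starts_with_atg": seq.startswith("ATG"),
        "ends_with_stop": seq[-3:] in STOP_CODONS,
        # One codon per residue, plus one terminal stop codon.
        "matches_protein_length": len(seq) // 3 - 1 == protein_length,
    }


# Toy input: the first three codons of the nucleotide sequence above,
# followed by a TAA stop codon. It stands in for the full sequence.
toy_cds = "ATGACGGAT\nTAA"
print(check_cds(toy_cds, protein_length=3))
```

To check the full record, pass the complete nucleotide sequence as `cds` (line breaks are stripped) together with the number of residues in the protein sequence.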