New model in OGS2.0 | DPOGS213187  |
---|---|
Genomic Position | scaffold188:+ 189256-201336 |
See gene structure | |
CDS Length | 2238 |
Paired RNAseq reads   | 1036 |
Single RNAseq reads   | 2555 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA007376 (3e-21) |
Best Drosophila hit   | Calpain-A, isoform C (1e-30) |
Best Human hit | calpain-11 (4e-34) |
Best NR hit (blastp)   | PREDICTED: similar to calpain [Tribolium castaneum] (4e-155) |
Best NR hit (blastx)   | PREDICTED: similar to calpain [Tribolium castaneum] (2e-113) |
GeneOntology terms    | GO:0031410 cytoplasmic vesicle GO:0008233 peptidase activity GO:0001669 acrosomal vesicle GO:0006508 proteolysis GO:0005622 intracellular GO:0004198 calcium-dependent cysteine-type endopeptidase activity GO:0005509 calcium ion binding |
InterPro families    | IPR001300 Peptidase C2, calpain, catalytic domain IPR022682 Peptidase C2, calpain, large subunit, domain III IPR022684 Peptidase C2, calpain family IPR022683 Peptidase C2, calpain, domain III |
Orthology group | MCL19603 |
Nucleotide sequence:
ATGAGGAAGTTCATGCGAACGCGTCCCGGACACCCTGGTCACCCCGGACACACCACACTA
CCCGGGCATGTCGGACATCCCGCGCTCATCCCGCACGCAGCGATCGGGCACAAGACCAAC
GGTGTTACCAATGGCGTCGGAGGTGGCCCCAGGAAGGAGGAGCTCTCCACTCTTCCGAGA
TACTGGAGCTCGTCGGAGGACGAGGGGGTGGAGTGTGCGGATGGAGTGGAGACGTTGTTC
GTGGACCGAGACTTTCCAGCCGCCGCTTCCAGTCTGGGCGACAGACGCCTCGCCTCGCGC
ACTTCGTGGATAAGACCTCACGAGATCTGTCCTCGGCCGCGGTTCCTGGGAGACGCAGCG
TTGGAAGCCGGGCTGTCGAACGACCAGCTCGAGGACGAACCCGCAGAGCTTCAGGCTGCT
CGCTGGGACGTGCGGCCCGGGGCGGCCGGGGACGCATTGTCCCTGGCGACCGCCGCACTC
TCGCACACGCCGCGTCTGCTTGCACGCGTCGCGCCACCACACTCCTTCCGCACCCGATAC
ACTGGGAAATTCAGGTTCCGGTTTTGGGTATTCGGATCGTGGCGTGAGGTGGTGGTCGAT
GACCTGTTGCCGACACGCGGTGGCGTGTTGCTCACTGCGCGTGGTGGTCTGGTAGACGAC
TTCACATTGCCGCTTCTCGAGAAGGCTTACGCTAAACTACAAGGGTCGCTGGCATCACTG
CGCGGGTGCGGCGCGGCTCAAGTACTACAAGACCTCACCGGTGCAGTAGTGCAGAGCTTC
TCACCGCCGCGACAGCCCCGCTCGCTTTTGTTACAGGTGCTGCACTCGGCGGTGCCACGA
TCTACTCTGTTAGTGGCGTCTACGGAGCGCGGGACTTCTGGCCTTGTTCCTGGCCGCGCC
TATCTGGTGACCGGTTTAGCCCGAGTACGTGAGACGGGAGGTGAGGGGGCTCTAGTGAGG
CTGCTGGCTGCGGGTGGGCCTGCCGCGTGGTCGGGTGCATGGTCGCGAGGCTCTCCCGAG
TGGCGAGCGTTGCCCCCTGCTGATCTGGACCTGCTGGCCGGCCGCCTCACACACCCTGGA
CACTTTTGGATGTCGTTCCAGGAGTTCGCGCGGTTATTTTCGCGATTAGAGCTGGTGCAT
ATAGGGCCGGATGACTGGCTCCTGGAGCCAGCGCTGCACGCTAGACGGCCGTGGCGCGCG
GTGTTGGCTCGTCGTCGCTGGAGACGCGGTTACAACGCGGGTGGGCCGCCGGCTTGCACT
CGTACTGCTCATGCCAATCCCCAGTTTCACGTACATGTGCCACGATCTGAATCCGGCAAG
TGCCACGTAGTGGTGTCTGTCACCCAGCAGTACTCTCCCGCTGGTTCGCCTGACCGCCTT
CACGGCATCGGATTCGCGGTATATGAACTGCCCCCGGGTGCTCCTCCCCCTCGCGCCCCC
CGAGCCCCCGCTGCTTTGGCTGACCTCCGAGCACTGGACGTGACACATTGGTCTCGAGCA
CGCGAGGTGGCCACGTTCTTCACGTTGCCGGCAGGACAGTACTTGGTGGTGCCTCACACG
CACCGACCACACCTCGAGACTACCTTCCTGCTGCGCATCCTGACTGATGAGCACACGGAC
GTATGGGAGGTCAACGATGACAATGTGATCGTCCGTGACGTCGCGACCGAGTTCTTAGAC
GAAGGATGCCCTTTGGAGCCCGAGGTTCAGGCCGCGATCGCGAAAACGATCGGAAAAAGA
GGCGTCGAAGAGGTGAGGCTCGACGTGAGAGGGATGAGGGGGGGACTGCTGAGAGACCGG
GGTATGACGAGGACGGTGTGGTGTTCGCAGATGGACGCGCGCGCGCTCAGGAACTTGCTG
CGGCGCGTATGGCGGCGCGTGCTGCCGGCGCGGCCGTCGCGGGCGCTGTGCGGTGCGCTG
GTGGCGCTGGGCGACCCTGCCGCGGCGGGGAGGCTGGAACGGGGTGCGGTGCTGGGCGCC
GCCCCAGGCTGCCGGGCCGCCGTCAGCGCCTACTGCTTGCGCGCGCTGCTGTGGGCGTGC
GGCGTGCGAGCCTCCAACAAGGTGCTGGAGTGCCTCGTGCTGAGGTTCGCACGCGGGACT
CGCCTCTCGCCCGACGCCTGCGTGTTGGCGCTGGCCCGTCTGCATCTCGCCCACGAGAGA
TTCCGAAGCCTCGACAACAAACTCAAATCTAATCCCATTTCGCTGGAGGAGATGCTCCTC
ATGACCATCTACTCCTGA
Protein sequence:
MRKFMRTRPGHPGHPGHTTLPGHVGHPALIPHAAIGHKTNGVTNGVGGGPRKEELSTLPR
YWSSSEDEGVECADGVETLFVDRDFPAAASSLGDRRLASRTSWIRPHEICPRPRFLGDAA
LEAGLSNDQLEDEPAELQAARWDVRPGAAGDALSLATAALSHTPRLLARVAPPHSFRTRY
TGKFRFRFWVFGSWREVVVDDLLPTRGGVLLTARGGLVDDFTLPLLEKAYAKLQGSLASL
RGCGAAQVLQDLTGAVVQSFSPPRQPRSLLLQVLHSAVPRSTLLVASTERGTSGLVPGRA
YLVTGLARVRETGGEGALVRLLAAGGPAAWSGAWSRGSPEWRALPPADLDLLAGRLTHPG
HFWMSFQEFARLFSRLELVHIGPDDWLLEPALHARRPWRAVLARRRWRRGYNAGGPPACT
RTAHANPQFHVHVPRSESGKCHVVVSVTQQYSPAGSPDRLHGIGFAVYELPPGAPPPRAP
RAPAALADLRALDVTHWSRAREVATFFTLPAGQYLVVPHTHRPHLETTFLLRILTDEHTD
VWEVNDDNVIVRDVATEFLDEGCPLEPEVQAAIAKTIGKRGVEEVRLDVRGMRGGLLRDR
GMTRTVWCSQMDARALRNLLRRVWRRVLPARPSRALCGALVALGDPAAAGRLERGAVLGA
APGCRAAVSAYCLRALLWACGVRASNKVLECLVLRFARGTRLSPDACVLALARLHLAHER
FRSLDNKLKSNPISLEEMLLMTIYS