DPGLEAN18013 in OGS1.0

New model in OGS2.0DPOGS202493 
Genomic Positionscaffold1426:+ 15506-23983
See gene structure
CDS Length1566
Paired RNAseq reads  1072
Single RNAseq reads  3902
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA000097 (4e-134)
Best Drosophila hit  Calpain-B (4e-89)
Best Human hitcalpain-9 isoform 2 (6e-73)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC002116 [Tribolium castaneum] (4e-102)
Best NR hit (blastx)  calpain-B [Apis mellifera] (1e-100)
GeneOntology terms




  
GO:0005509 calcium ion binding
GO:0006508 proteolysis
GO:0004198 calcium-dependent cysteine-type endopeptidase activity
GO:0016020 membrane
GO:0005737 cytoplasm
GO:0016540 protein autoprocessing
InterPro families

  
IPR001300 Peptidase C2, calpain, catalytic domain
IPR000169 Peptidase, cysteine peptidase active site
IPR022684 Peptidase C2, calpain family
Orthology groupMCL10179

Nucleotide sequence:

ATGGGGCAAAATGGCGGCGCTGATTTCGGCGCGCTATTGTCCGGGGCCGGTCGACAAATT
TTGAACCAAGGCGGTCAAGCATTGTTGAATTATGGAGCTCAGGCGCTTGGGAACATAATT
AACGAAGTGTTCCAAAAGAAAGAGGCCGAACATAAGAGAGTTTTACCGAGCATCAAAAAT
TATAAAGTGAATGTGATTGTACGAACGGCAGTTGAGGCTTTCAATAAACCTGAATCCATA
CCTAAAACAGATCGAGATGTTAAAAACGCTGGGTTTTTTGCGAAATTACTAAGGAAATCG
AATGTGTCGGATATCGATAAGACAAAAGGAAAACCTCTTTGGGCCAACAAAGTCGATTCA
GATGAAAGAATTGATATCGAGAAGCGAAAAATAGTATCGACTACAGACAGATCAAACATC
TCGAATAGATTTCATTTCGCCAAACCGGAAGTTACGTCGGAAGTTACATCGGAAGTTAAT
AACGATGAAATCAAAAATAAATCACCACGTAAAAAATCGATCGTCGTACCAAGTAGGGTA
AATCGATTCAAAACGCCAGCAAATTACAACGGCAGTCACATGTTCATGCCTACAGGCGAA
CGGTTGTTCTGGCTCGGTGAGACCCGTCCATCATCATTCGGTCCAGCCACGTACCAGGAT
TTCAAGGAGATCAGATCTCGCTGTCTCTCCGAAGGCAGGCTGTTCGAGGATCCGGAATTC
CCGGCCACCGATCGCAGTTTGTACTACAAGGAACGTCTGGATAGACCCTTAACATGGCTA
AGACCTGGGGAAATCAGCGAAGATCCGCAGCTATTCGTGGAGGGCTACAGTCGCTTCGAC
GTGCAACAGGGCGAGTTAGGAGACTGTTGGTTGCTGGCTGCCGTCGCCAATCTGACGCTC
CATAGAAAACTCTTCTTCCAAGTAGTGCCGGACGACCAGAGCTTCGATGAAGAATACGCT
GGTGTCTTCCACTTCCGGTTCTGGCAGTATGGTCGCTGGGTGGACGTTGTCGTCGACGAC
CGCCTGCCGACCTACCGCGGAAAACTGGTCTTTCTTCACTCATCAGAGAGAAATGAGTTT
TGGAGTGCCTTATTGGAGAAGGCCTATGCTAAACTCCACGGTTCCTATGAAGCCTTAAAG
GGAGGTTCTACCTGTGAAGCCATGGAAGATTTCACGGGCGGTGTGACCGAAATGTACGAA
ATGACGGAACTACCGCCCAACTTCTATACTATACTACTGAAAGCATACGAACGTAACTCA
CTCATGGGATGCAGTATTGAGGTAGGAAAAGGTTTTTTAAATAATTATTTTTCGTCGCAA
GAGTCTTTGCAGACTCGTCGTGGTCATCACGTATCAGTGATCCGTATACATGATGACTCA
TGGCAAGATAGTCGCAGTGGTTTCCCCTTGTCTTCTGCACCAGCACTTTTTTGGGGCTAT
TTAGACCCCAAGGCTTGCTGTAGCCTGTTCCAGCCAGTGGTATCTATATACTCTCAGAAA
GTACACATGACTCGGAAAAGATCACATTGGTACTTGCCAGGTTCGAACCCGCGCCCTCAC
GCATGA

Protein sequence:

MGQNGGADFGALLSGAGRQILNQGGQALLNYGAQALGNIINEVFQKKEAEHKRVLPSIKN
YKVNVIVRTAVEAFNKPESIPKTDRDVKNAGFFAKLLRKSNVSDIDKTKGKPLWANKVDS
DERIDIEKRKIVSTTDRSNISNRFHFAKPEVTSEVTSEVNNDEIKNKSPRKKSIVVPSRV
NRFKTPANYNGSHMFMPTGERLFWLGETRPSSFGPATYQDFKEIRSRCLSEGRLFEDPEF
PATDRSLYYKERLDRPLTWLRPGEISEDPQLFVEGYSRFDVQQGELGDCWLLAAVANLTL
HRKLFFQVVPDDQSFDEEYAGVFHFRFWQYGRWVDVVVDDRLPTYRGKLVFLHSSERNEF
WSALLEKAYAKLHGSYEALKGGSTCEAMEDFTGGVTEMYEMTELPPNFYTILLKAYERNS
LMGCSIEVGKGFLNNYFSSQESLQTRRGHHVSVIRIHDDSWQDSRSGFPLSSAPALFWGY
LDPKACCSLFQPVVSIYSQKVHMTRKRSHWYLPGSNPRPHA