DPGLEAN20567 in OGS1.0

New model in OGS2.0  DPOGS206355
Genomic Position  scaffold2239:- 16871-19819
CDS Length  1257
Paired RNAseq reads  949
Single RNAseq reads  2385
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA014111 (4e-180)
Best Drosophila hit  small optic lobes, isoform D (4e-11)
Best Human hit  calpain-7 (4e-115)
Best NR hit (blastp)  calpain [Aedes aegypti] (1e-157)
Best NR hit (blastx)  PREDICTED: similar to calpain [Tribolium castaneum] (6e-159)
GeneOntology terms
GO:0008234 cysteine-type peptidase activity
GO:0008233 peptidase activity
GO:0005634 nucleus
GO:0006508 proteolysis
GO:0005622 intracellular
GO:0004198 calcium-dependent cysteine-type endopeptidase activity
InterPro families
IPR001300 Peptidase C2, calpain, catalytic domain
IPR022682 Peptidase C2, calpain, large subunit, domain III
IPR022683 Peptidase C2, calpain, domain III
Orthology group  MCL16651

Nucleotide sequence:

AACATCGACCTCCACGCGCTAACCGGTTGGATCCCGGAGCGGTGCGCGATCCGGTCCGAG
GCGGACTTCAACGCGGATGGTTTGTATGAGATAGTGCGGGCGAGGCTGGAGGCGGGACAC
GTGCTGGCCAGCGTCGCCACCGGGGACCTGTCCGACGACGACGCCGAGCGGACCGGCCTC
GTCGCGTCGCACGCTTACGCCGTGCTCGACGTGCGGCTGGTCAATGGCGTAAAGCTTCTG
AAGCTCAAGAACCCGTGGTCTCACCTCCGCTGGCGAGGCAACTACAGCGAGCTGGACACT
GTCCACTGGTCGCCGAACCTCTGCTCCGCACTGGACTACGACCCGGACAGCGCCGCTCAG
TACGACAACGGAGTGTTCTGGATAGATTACGCGAGTATACTGAAGTTCTTTGATGTTTTT
TATCTCAATTGGAACCCAGAGCTGTTTAAGTTCACTTATTGCATACATCAGAAATGGGAA
GCCGGTAACGGTCCTATTAAAGACGCGTATACGATATCAGAGAATCCTCAGTACTCTTTG
AAGGTGAACGGCACTGGCGCCGTCTGGTTGTTACTGACGAGACACATCACTAAGATAGAA
GACTTCAGGAACAACCAGGAGTACATAACACTACTCGTTTATAAGAACGGGAAGCGAGTA
TACTATCCACACGACCCACCTCCTTATATAGACGGAATACGTATCAACAGTCCCCACTAC
CTCGTGAAGATAATAGTGGGGGAGAACAGTTCGGACAAATACACACTGGTCGTGTCCCAG
TATGAGAAGACTCGCACCATATACTACACGCTGAGGGCCTACGCCACGTGTCCGTTCGCA
TTGGCAAAGCTGGACCCATATCCCTATACCAAAACTATCAGAGGTGAATGGTCGGGCAGA
ACAGCCGGCGGTTGTGAAAATCACAGACAAACTTATCAGAATAACCCAAAATATATAATA
ACGGTCCCAGAAAGCAGGAACCCGTGCCACGTCACCATAGAACTGAAAGGTCCCAAAGAA
TACCAGATAGGAGTAGACGCGAGGGTTGAATCCTTGGACGATCCAAATATAACCGCGCCG
TTCTTGAGGGAATCCTCAGGAGCGTACAGATCTGGTTTCGTTGTGCTGGAGTTAAATAAT
TTACCAGGCGGACGGTATCTGCTCACACCATCTACCTTCTATCCGGGACAGGAAGGGCCA
TTTTTCCTTGAACTGAGATCTACTTGCAGCATCACAGCCGAGAGGAAGAATGAATGA

Protein sequence:

NIDLHALTGWIPERCAIRSEADFNADGLYEIVRARLEAGHVLASVATGDLSDDDAERTGL
VASHAYAVLDVRLVNGVKLLKLKNPWSHLRWRGNYSELDTVHWSPNLCSALDYDPDSAAQ
YDNGVFWIDYASILKFFDVFYLNWNPELFKFTYCIHQKWEAGNGPIKDAYTISENPQYSL
KVNGTGAVWLLLTRHITKIEDFRNNQEYITLLVYKNGKRVYYPHDPPPYIDGIRINSPHY
LVKIIVGENSSDKYTLVVSQYEKTRTIYYTLRAYATCPFALAKLDPYPYTKTIRGEWSGR
TAGGCENHRQTYQNNPKYIITVPESRNPCHVTIELKGPKEYQIGVDARVESLDDPNITAP
FLRESSGAYRSGFVVLELNNLPGGRYLLTPSTFYPGQEGPFFLELRSTCSITAERKNE
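
Consistency check (illustrative):

The listed CDS length, nucleotide sequence, and protein sequence can be cross-checked against each other. The following is a minimal sketch, not part of the original record, assuming the standard genetic code (NCBI translation table 1) applies to this gene model. The helper names `translate` and `span_length` are hypothetical and chosen only for this example. It translates the first and last lines of the nucleotide sequence above and computes the inclusive length of the genomic interval.

```python
from itertools import product

# Standard genetic code (assumed here), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Codons containing characters other than A/C/G/T are reported as 'X'.
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


def span_length(start, end):
    """Length of an inclusive, 1-based genomic interval."""
    return end - start + 1


# First and last lines of the nucleotide sequence in this record.
first_line = "AACATCGACCTCCACGCGCTAACCGGTTGGATCCCGGAGCGGTGCGCGATCCGGTCCGAG"
last_line = "TTTTTCCTTGAACTGAGATCTACTTGCAGCATCACAGCCGAGAGGAAGAATGAATGA"

print(translate(first_line))      # NIDLHALTGWIPERCAIRSE
print(translate(last_line))       # FFLELRSTCSITAERKNE*
print(span_length(16871, 19819))  # 2949
print(1257 // 3 - 1)              # 418 residues once the stop codon is excluded
```

Both translated fragments match the start and end of the protein sequence listed above, and 1257 nt corresponds to 418 residues plus a TGA stop codon. The genomic interval spans 2949 bp, more than the 1257 nt of coding sequence.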