DPGLEAN04352 in OGS1.0

New model in OGS2.0  DPOGS200937
Genomic Position  scaffold267:+ 61367-68457
CDS Length  1419
Paired RNAseq reads  2783
Single RNAseq reads  8498
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA000307 (0.0)
Best Drosophila hit  silver, isoform E (1e-65)
Best Human hit  carboxypeptidase E preproprotein (1e-100)
Best NR hit (blastp)  PREDICTED: similar to Zinc carboxypeptidase family protein [Tribolium castaneum] (2e-148)
Best NR hit (blastx)  PREDICTED: similar to Zinc carboxypeptidase family protein [Tribolium castaneum] (8e-147)
Gene Ontology terms
GO:0004181 metallocarboxypeptidase activity
GO:0006508 proteolysis
GO:0004180 carboxypeptidase activity
GO:0008270 zinc ion binding
GO:0005575 cellular_component
InterPro families
IPR014766 Carboxypeptidase, regulatory domain
IPR000834 Peptidase M14, carboxypeptidase A
IPR008969 Carboxypeptidase-like, regulatory domain
Orthology group  MCL19654

Nucleotide sequence:

ATGTATGCGTTTTTCTACTGTGCTCTTTTTCTTGGGGTATCAACTGAATTTGTTTGGAAA
CACCATAATAATGAGGAGTTATCTTCGATACTGGAAGAAGTTCACGAAAACTGCCCTAAT
ATTACTAGAGTTTATGCCTTAACAGAACCATCAGTGAGGAATGTACCATTGTATGTTATT
GAATTTTCTGACACCCCGGGATTTCACCAACCATATAAACCAGAAGTTAAATATGTCGGA
AATATACATGGTAATGAGGTTTTGGGACGTGAATTACTTTTGGGCTTGGCATATTACCTT
TGTGAAGAATATAATAAACACGATCGTCGTATAAGAAATTTGATTCACAACACTCGCATA
CATTTATTGCCTTCCATGAACCCTGATGGCTGGCAGTTATCAACTGACACAGGTGGTCAG
GATTTTTTGTTGGGACGTAACAACAATCATTCAGTGGATTTAAACAGGAACTTCCCAGAT
CTGGATGCAATAACATTTGAATTTGAAAGACAAGGCATCAGTCACAACAATCATTTACTC
AAAGACCTCACACGTCTTGCAGCACCACTGGAGCCGGAAACTCGAGCTGTTATGAGATGG
ATAATGTCCGTTCCATTTGTACTGAGTGCAGCCATGCATGGTGGAGATTTGGTAGCAAAC
TATCCTTATGATGAGAGCAGGAGTGGAGCTCCTGTGTCTGAATATTCAGCCAGTCCGGAT
GATGAGACTTTTAGGGAGTTAGCTATGACATATGCCGAAGCTCATGCAGATATGGCATCT
GCTAATAGACCAGGCTGTCGTTTTGGGGATGAAACTAATGCATACAACTTTGGAAAGCAA
GGAGGTGTTACTAACGGAGCAGCCTGGTATAGTCTGAGAGGAGGCATGCAGGATTTTAAT
TATCTAGCGACGAATGCTTTCGAAGTGACTCTAGAGCTGGGATGCCAGAAGTATCCTTAC
GAGAAAGACCTGGAAAAGGAGTGGTTTCGTAACAAGGACGCGTTGTTAGCTTATATATGG
AAAGCCCATACTGGCATCAAGGGTATTGTGAAAGATGACTCCGGCTTCATACAAAACGCT
GTGATATCCGTCGTCAACATAACTGGATCTGTACCACGGCCGATAAGACACGACATTACC
AGCGGTATATACGGTGATTACTACCGTCTCCTGACCCCTGGTCACTACGAGGTGACAGCG
AGTCACCCCGGGTACTTCCCCGTGTCACGCGTCGTCACCGTCCCCACACACCAGACCTCG
GCCAGGATAGTCAACTTCAAACTGGAGCCTACAACGAGCTGGTTCGATGATTATACTTTC
GGCGTATACCCTCACGGTCTGAGAGACGGCCAGCCGAGGATTTACAAGCGATCGCTCTAC
CACAAAGTCGCCAACGCCATGCTGGATAAGACGCACTGA

Protein sequence:

MYAFFYCALFLGVSTEFVWKHHNNEELSSILEEVHENCPNITRVYALTEPSVRNVPLYVI
EFSDTPGFHQPYKPEVKYVGNIHGNEVLGRELLLGLAYYLCEEYNKHDRRIRNLIHNTRI
HLLPSMNPDGWQLSTDTGGQDFLLGRNNNHSVDLNRNFPDLDAITFEFERQGISHNNHLL
KDLTRLAAPLEPETRAVMRWIMSVPFVLSAAMHGGDLVANYPYDESRSGAPVSEYSASPD
DETFRELAMTYAEAHADMASANRPGCRFGDETNAYNFGKQGGVTNGAAWYSLRGGMQDFN
YLATNAFEVTLELGCQKYPYEKDLEKEWFRNKDALLAYIWKAHTGIKGIVKDDSGFIQNA
VISVVNITGSVPRPIRHDITSGIYGDYYRLLTPGHYEVTASHPGYFPVSRVVTVPTHQTS
ARIVNFKLEPTTSWFDDYTFGVYPHGLRDGQPRIYKRSLYHKVANAMLDKTH
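
Consistency check (illustrative sketch):

The nucleotide and protein sequences above can be cross-checked against each other and against the stated CDS length. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The function names `translate` and `expected_protein_length` are hypothetical helpers introduced for illustration and are not part of the database.

```python
# Standard genetic code (NCBI translation table 1), listed in TCAG order.
# Assumption: this table applies to the CDS in this record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Codons that are not in the table (for example, those containing N)
    are reported as 'X'. A trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon."""
    return cds_length // 3 - 1


# First 15 and last 12 nucleotides of the CDS shown in this record.
print(translate("ATGTATGCGTTTTTC"))   # MYAFF
print(translate("AAGACGCACTGA"))      # KTH
print(expected_protein_length(1419))  # 472
```

The two fragments translate to the first five and last three residues of the protein sequence listed above, and a CDS of 1419 nucleotides (473 codons including the TGA stop) corresponds to the 472 residues shown. Passing the full nucleotide sequence, with line breaks removed, to `translate` should reproduce the full protein sequence under the same assumption.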