DPGLEAN03991 in OGS1.0

New model in OGS2.0DPOGS201318 
Genomic Positionscaffold6907:+ 3182-7514
See gene structure
CDS Length1566
Paired RNAseq reads  1336
Single RNAseq reads  3708
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003111 (3e-84)
Best Drosophila hit  CG4572, isoform B (7e-65)
Best Human hitprobable serine carboxypeptidase CPVL precursor (2e-70)
Best NR hit (blastp)  venom serine carboxypeptidase [Apis mellifera] (3e-80)
Best NR hit (blastx)  venom serine carboxypeptidase [Apis mellifera] (5e-80)
GeneOntology terms






  
GO:0016787 hydrolase activity
GO:0008233 peptidase activity
GO:0004180 carboxypeptidase activity
GO:0006508 proteolysis
GO:0004185 serine-type carboxypeptidase activity
GO:0003674 molecular_function
GO:0008150 biological_process
GO:0005575 cellular_component
InterPro families
  
IPR001563 Peptidase S10, serine carboxypeptidase
IPR018202 Peptidase S10, serine carboxypeptidase, active site
Orthology groupMCL39726

Nucleotide sequence:

ATGTGTACAGAAAGACTCCTGTTAATTATAACACTCGCGGCTGTAGCAGATGCCGTACAA
ATAGATACACCTCTTTTTCTCACCGCTTTCATTAAAGAGAATAAAACTGCGGAGGCGAGA
AACGCGTCTCTCGTAAATGCGGACGAATTTCTAAACGTCACAAGTTATTCAGGTTTTTTA
ACTGTTGACGATAACTATGATTCTAATTTATTCTTCTGGTACTTTCCCGTTGCTAATAAA
GATGTAAAGAGAACTCCATGGATAATTTGGCTCCAAGGAGGTCCGGGAGCTACAAGCTTA
GCCGGCCTTTTCGACGAAATGGGTCCATTCGAATTGGATAGCAATTTAAATTTAAAAAAA
CGCAAGTACACGTGGACGGATGACTTCTCTATGGTATACATAGATAATCCCGTGGGAGCG
GGTTTCAGTTTCACGAAACATGATGAGGGTTATCCGAACAATATGGATATGTACACCGAA
AGCCTATATAGAGCAGTGAATCAGCTGATCGTATTATATCCAGAGTTAAGTGAGGCGCCT
CTGTATGTAGCTGGTGAGTCCTATGCTGGGCGGTACGTGCCAGCTTTAGCCGAGAGAATC
ATGAAAGATAAGGAGAAAGACGGCCACATTAATTTACAGGGTATCATGCTGGGTAATCCT
TTACTAGACCGCGAGAGTGTAATTGATTATACTCGAGCGTTCTACTCTTGGGGACTCATA
GACGAGCAGGGCGCTCTAGCAGCAGAACCTCTTCAGAAGCAGTTCCAAAAGGAAATCGAT
GAAGGGAATGCCCAAGAGGCATATAAGCTGCGTGACGAGCTTCTCGATAAGCTCCAAGGT
ATAGCGGAGCAGTCGTCTCTATACAACGTCATCACACCTATAGAAGGTTTGGAACACTTC
ATCAATTTCATCACCAGTTCGAAAATCAGGAACTTGATCCACGCCGGGAATGTGACCTTT
CACTTTTCAAACGACAAGGTCCATAAACATCTCGTAGCTGATTTCTTGGCCCCCGTTTCC
AGTAAAGTCCTAACTGTTCTCGAACACTACAGGGTTCTTATATACTGCGGCCAGTTGGAC
CTCACGACTCCCTGTGTTCTGAACAGCGAGGCTCGCAGGAAGAGGTGGATGTGGTCTGGG
AGGGAAGAGTTTCTTAGATCACCGCGGACACCATGGTGGTTCAATAATACCGTGGCTGGC
TTCGTGAAATCAGGCGGAGGCTTCACGGAGGTTCTCGTAAAGGGGGCCGGACATCTAGTA
CCCAAGGAAAAACCAGCTGAAGCCAAGGCACTAATATCATACTTCATCAATGGAACAGGT
CTACCAACACCACCTTCATACAAAATACATCCGGAAGACACTCCATACTACGAGGAGTAC
TTTGACCTAAAAACATCAGGAGCTGTCCCGGCGGTGGGGCTAAGGGCTGGCTTAATCGCC
AGTGTCGTAGTGAACGTTCTGCTGTTAGCTGGTATCGCTTTAGGAGTCTACAAGTTTCTG
AAATGGAAGAGAGAATCCGATTATTTCTATTCGCCCTTAAACGACGGCATTTTAACTATG
TCGTAG

Protein sequence:

MCTERLLLIITLAAVADAVQIDTPLFLTAFIKENKTAEARNASLVNADEFLNVTSYSGFL
TVDDNYDSNLFFWYFPVANKDVKRTPWIIWLQGGPGATSLAGLFDEMGPFELDSNLNLKK
RKYTWTDDFSMVYIDNPVGAGFSFTKHDEGYPNNMDMYTESLYRAVNQLIVLYPELSEAP
LYVAGESYAGRYVPALAERIMKDKEKDGHINLQGIMLGNPLLDRESVIDYTRAFYSWGLI
DEQGALAAEPLQKQFQKEIDEGNAQEAYKLRDELLDKLQGIAEQSSLYNVITPIEGLEHF
INFITSSKIRNLIHAGNVTFHFSNDKVHKHLVADFLAPVSSKVLTVLEHYRVLIYCGQLD
LTTPCVLNSEARRKRWMWSGREEFLRSPRTPWWFNNTVAGFVKSGGGFTEVLVKGAGHLV
PKEKPAEAKALISYFINGTGLPTPPSYKIHPEDTPYYEEYFDLKTSGAVPAVGLRAGLIA
SVVVNVLLLAGIALGVYKFLKWKRESDYFYSPLNDGILTMS