New model in OGS2.0 | DPOGS201318  |
---|---|
Genomic Position | scaffold6907:+ 3182-7514 |
CDS Length | 1566 |
Paired RNAseq reads   | 1336 |
Single RNAseq reads   | 3708 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA003111 (3e-84) |
Best Drosophila hit   | CG4572, isoform B (7e-65) |
Best Human hit | probable serine carboxypeptidase CPVL precursor (2e-70) |
Best NR hit (blastp)   | venom serine carboxypeptidase [Apis mellifera] (3e-80) |
Best NR hit (blastx)   | venom serine carboxypeptidase [Apis mellifera] (5e-80) |
GeneOntology terms    | GO:0016787 hydrolase activity; GO:0008233 peptidase activity; GO:0004180 carboxypeptidase activity; GO:0006508 proteolysis; GO:0004185 serine-type carboxypeptidase activity; GO:0003674 molecular_function; GO:0008150 biological_process; GO:0005575 cellular_component |
InterPro families    | IPR001563 Peptidase S10, serine carboxypeptidase; IPR018202 Peptidase S10, serine carboxypeptidase, active site |
Orthology group | MCL39726 |
Nucleotide sequence:
ATGTGTACAGAAAGACTCCTGTTAATTATAACACTCGCGGCTGTAGCAGATGCCGTACAA
ATAGATACACCTCTTTTTCTCACCGCTTTCATTAAAGAGAATAAAACTGCGGAGGCGAGA
AACGCGTCTCTCGTAAATGCGGACGAATTTCTAAACGTCACAAGTTATTCAGGTTTTTTA
ACTGTTGACGATAACTATGATTCTAATTTATTCTTCTGGTACTTTCCCGTTGCTAATAAA
GATGTAAAGAGAACTCCATGGATAATTTGGCTCCAAGGAGGTCCGGGAGCTACAAGCTTA
GCCGGCCTTTTCGACGAAATGGGTCCATTCGAATTGGATAGCAATTTAAATTTAAAAAAA
CGCAAGTACACGTGGACGGATGACTTCTCTATGGTATACATAGATAATCCCGTGGGAGCG
GGTTTCAGTTTCACGAAACATGATGAGGGTTATCCGAACAATATGGATATGTACACCGAA
AGCCTATATAGAGCAGTGAATCAGCTGATCGTATTATATCCAGAGTTAAGTGAGGCGCCT
CTGTATGTAGCTGGTGAGTCCTATGCTGGGCGGTACGTGCCAGCTTTAGCCGAGAGAATC
ATGAAAGATAAGGAGAAAGACGGCCACATTAATTTACAGGGTATCATGCTGGGTAATCCT
TTACTAGACCGCGAGAGTGTAATTGATTATACTCGAGCGTTCTACTCTTGGGGACTCATA
GACGAGCAGGGCGCTCTAGCAGCAGAACCTCTTCAGAAGCAGTTCCAAAAGGAAATCGAT
GAAGGGAATGCCCAAGAGGCATATAAGCTGCGTGACGAGCTTCTCGATAAGCTCCAAGGT
ATAGCGGAGCAGTCGTCTCTATACAACGTCATCACACCTATAGAAGGTTTGGAACACTTC
ATCAATTTCATCACCAGTTCGAAAATCAGGAACTTGATCCACGCCGGGAATGTGACCTTT
CACTTTTCAAACGACAAGGTCCATAAACATCTCGTAGCTGATTTCTTGGCCCCCGTTTCC
AGTAAAGTCCTAACTGTTCTCGAACACTACAGGGTTCTTATATACTGCGGCCAGTTGGAC
CTCACGACTCCCTGTGTTCTGAACAGCGAGGCTCGCAGGAAGAGGTGGATGTGGTCTGGG
AGGGAAGAGTTTCTTAGATCACCGCGGACACCATGGTGGTTCAATAATACCGTGGCTGGC
TTCGTGAAATCAGGCGGAGGCTTCACGGAGGTTCTCGTAAAGGGGGCCGGACATCTAGTA
CCCAAGGAAAAACCAGCTGAAGCCAAGGCACTAATATCATACTTCATCAATGGAACAGGT
CTACCAACACCACCTTCATACAAAATACATCCGGAAGACACTCCATACTACGAGGAGTAC
TTTGACCTAAAAACATCAGGAGCTGTCCCGGCGGTGGGGCTAAGGGCTGGCTTAATCGCC
AGTGTCGTAGTGAACGTTCTGCTGTTAGCTGGTATCGCTTTAGGAGTCTACAAGTTTCTG
AAATGGAAGAGAGAATCCGATTATTTCTATTCGCCCTTAAACGACGGCATTTTAACTATG
TCGTAG
Protein sequence:
MCTERLLLIITLAAVADAVQIDTPLFLTAFIKENKTAEARNASLVNADEFLNVTSYSGFL
TVDDNYDSNLFFWYFPVANKDVKRTPWIIWLQGGPGATSLAGLFDEMGPFELDSNLNLKK
RKYTWTDDFSMVYIDNPVGAGFSFTKHDEGYPNNMDMYTESLYRAVNQLIVLYPELSEAP
LYVAGESYAGRYVPALAERIMKDKEKDGHINLQGIMLGNPLLDRESVIDYTRAFYSWGLI
DEQGALAAEPLQKQFQKEIDEGNAQEAYKLRDELLDKLQGIAEQSSLYNVITPIEGLEHF
INFITSSKIRNLIHAGNVTFHFSNDKVHKHLVADFLAPVSSKVLTVLEHYRVLIYCGQLD
LTTPCVLNSEARRKRWMWSGREEFLRSPRTPWWFNNTVAGFVKSGGGFTEVLVKGAGHLV
PKEKPAEAKALISYFINGTGLPTPPSYKIHPEDTPYYEEYFDLKTSGAVPAVGLRAGLIA
SVVVNVLLLAGIALGVYKFLKWKRESDYFYSPLNDGILTMS
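
The CDS length and the two sequences in this record can be cross-checked against each other. The following is a minimal sketch, assuming the standard nuclear genetic code and translation in frame 0; the names `CODON_TABLE` and `translate` are illustrative and not part of the database. It translates the first and last codons of the nucleotide sequence and confirms that a 1566 nt CDS (522 codons including the stop) corresponds to a 521-residue protein.

```python
# Standard genetic code (assumed), built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons and last four codons of the nucleotide sequence above.
first_codons = "ATGTGTACAGAAAGACTCCTG"
last_codons = "ACTATGTCGTAG"

print(translate(first_codons))  # MCTERLL
print(translate(last_codons))   # TMS

# 1566 nt = 522 codons; removing the stop codon leaves 521 residues.
expected_protein_length = 1566 // 3 - 1
print(expected_protein_length)  # 521
```

Passing the full nucleotide sequence to `translate` (line breaks are ignored) should yield a string that can be compared directly with the protein sequence listed above.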