DPGLEAN16475 in OGS1.0

New model in OGS2.0DPOGS207768 
Genomic Positionscaffold2482:+ 9613-21451
See gene structure
CDS Length1722
Paired RNAseq reads  146
Single RNAseq reads  416
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005320 (3e-149)
Best Drosophila hit  CG4678, isoform D (3e-169)
Best Human hitcarboxypeptidase M precursor (8e-99)
Best NR hit (blastp)  GH12395 [Drosophila grimshawi] (0.0)
Best NR hit (blastx)  PREDICTED: similar to GA18350-PA [Acyrthosiphon pisum] (2e-180)
GeneOntology terms


  
GO:0004185 serine-type carboxypeptidase activity
GO:0006508 proteolysis
GO:0008270 zinc ion binding
GO:0004181 metallocarboxypeptidase activity
InterPro families

  
IPR000834 Peptidase M14, carboxypeptidase A
IPR014766 Carboxypeptidase, regulatory domain
IPR008969 Carboxypeptidase-like, regulatory domain
Orthology groupMCL15534

Nucleotide sequence:

ATGTTGGCGTGCTTGGTTCATGGTTGGGTGGCGATGTTCCAGTGTACAAAATCCCTCTCA
AGGGTTTTCCTGTTGCTACTTATATTTAAAACATCTCACGGACAATATTCTCCCGGATCG
ACTTTTACAGAAGATGACGCAACCAACCCTAGCGATGTGGAGTCTCGTGATGTCGGTACT
GAGCTGCAGTACAAATACCATGACCACGAGGAGATGACACGCTACCTGCGGGCCGTCTCC
GCCAGATATCCAGCGCTCACAGCGCTGTACTCCATAGGAAAGTCCGTCCAAGGTCGAGAT
CTCTGGGTGATGGTGGTGTCGGCATCGCCCTATGAGCATATGATTGGCAAGCCAGATGTC
AAATACGTAGCCAATATACACGGCAACGAAGCTGTCGGAAGGGAGATGCTTTTACATCTT
ATACAGTACTTAGTGACTTCCTACGAGACGGATTCCTACATAAAATGGTTATTAGACAAC
ACACGCATACATTTGATGCCATCGATGAACCCTGATGGTTTCCTCATATCTCGTGAAGGA
CAGTGTGACACCATTCATGGCAGGCACAACGCCCGTCGCTACGACTTGAACCGTAACTTC
CCGGATTTCTTCAAACGGAACACGAAGCAGCCTCAACCGGAGACGGAAGCTGTAAAGGAA
TGGATAAGCAAGATACAGTTCGTACTATCAGGATCGCTGCACGGCGGCGCCCTGGTCGCC
TCCTACCCTTACGACAACACGCCCAGCGCTATTTTCCAAAGCTACGCGCACAGTCCGTCG
GTATCTCCCGATGATGACGTCTTCCAGCACTTGGCCCGCGTGTACTCCAGCAACCACGAC
AAGATGTCCCGAGGAGTCTCTTGCAAATCCGGATCACCTAAGTTTGATAACGGAATCACT
AATGGCGCGGCCTGGTATCCACTGACAGGAGGAATGCAAGACTACAACTACCTGTGGCAT
GGATGTATGGAAATTACTCTAGAGATCTCATGCTGCAAATATCCTTTGGCTCATGAGCTA
CCGAAATACTGGCAGGACAACAAACAGGCGCTTATAAAGTATCTAGCTGAAGCCCACCGC
GGCGCCCACGGGTTCGTGATGGACGAACACGGTAACCCGGTGGAAAAGGCTTCGATCAAG
GTCAAAGGACGCGAGGTCACGTTCCATACAACTAAATACGGAGAATTCTGGCGTATACTA
CTCCCTGGAACTTATAGACTAGAGGTCGGAGCCGATGGATATTTACCACAAGAAGTAGAA
TTCTTCGTCATAGACAGCCACCCCACTCTGTTGAACGTAACGCTGCATTCCGCCAAGCGT
ATCGATGGCGGGGGTCCTTACTACCGGCCAGCACCGCGCCCGCCGCCACCGCCCGCGCCG
GGTCTGTTCTCAACATTCACTAATTCCATCAACAAATTCACGGTTCCTTGGGAAGCCAAC
ATCCCCAAAGATCACGCCATCAAGGTCAATAAACATTACCAGCTCACAAACGAACTCACT
AAGAATAGTTTCTTCGTAAATTTGTATGCGGTAGAAGTGGGAGCGAGAGGCATAACAGCT
AGATCTCTCTACAACCTACTAGACTTGGGCCTGTCCAGAACTGACATCGATTCATTCTTA
GTACGTACTTCGAAGACAGCCCTAGTAGGTTCTTTTCAAATTTGGTTAAGTAGAGAGAGG
AGCTTGGACGGTAGTGGCGAGCGTTTAACGCGCGTTAGATAG

Protein sequence:

MLACLVHGWVAMFQCTKSLSRVFLLLLIFKTSHGQYSPGSTFTEDDATNPSDVESRDVGT
ELQYKYHDHEEMTRYLRAVSARYPALTALYSIGKSVQGRDLWVMVVSASPYEHMIGKPDV
KYVANIHGNEAVGREMLLHLIQYLVTSYETDSYIKWLLDNTRIHLMPSMNPDGFLISREG
QCDTIHGRHNARRYDLNRNFPDFFKRNTKQPQPETEAVKEWISKIQFVLSGSLHGGALVA
SYPYDNTPSAIFQSYAHSPSVSPDDDVFQHLARVYSSNHDKMSRGVSCKSGSPKFDNGIT
NGAAWYPLTGGMQDYNYLWHGCMEITLEISCCKYPLAHELPKYWQDNKQALIKYLAEAHR
GAHGFVMDEHGNPVEKASIKVKGREVTFHTTKYGEFWRILLPGTYRLEVGADGYLPQEVE
FFVIDSHPTLLNVTLHSAKRIDGGGPYYRPAPRPPPPPAPGLFSTFTNSINKFTVPWEAN
IPKDHAIKVNKHYQLTNELTKNSFFVNLYAVEVGARGITARSLYNLLDLGLSRTDIDSFL
VRTSKTALVGSFQIWLSRERSLDGSGERLTRVR