DPGLEAN11864 in OGS1.0

New model in OGS2.0DPOGS202239 
Genomic Positionscaffold2891:+ 14958-16959
See gene structure
CDS Length1266
Paired RNAseq reads  193
Single RNAseq reads  574
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004830 (4e-149)
Best Drosophila hit  CG8560 (2e-62)
Best Human hitmast cell carboxypeptidase A precursor (8e-40)
Best NR hit (blastp)  carboxypeptidase precursor [Helicoverpa armigera] (1e-165)
Best NR hit (blastx)  carboxypeptidase precursor [Helicoverpa armigera] (6e-162)
GeneOntology terms


  
GO:0004180 carboxypeptidase activity
GO:0006508 proteolysis
GO:0008270 zinc ion binding
GO:0004181 metallocarboxypeptidase activity
InterPro families
  
IPR000834 Peptidase M14, carboxypeptidase A
IPR009020 Proteinase inhibitor, propeptide
Orthology groupMCL21843

Nucleotide sequence:

ATGGCAAGGTGGATTACCTTCCTGCTATTCATCGCAGTTACCCACGCTCGACATGAGCAA
TACGAGGGGCACTCTCTCTACCGGGTAGCTGGTCCATCTGACCAATTTCAATACCTAGAA
GCCTACATAGATTTCCTCTCTATTACACCAGCAGCCAAATCCACTTCCAGACAGCTAGAG
GTTTTGATGAGACTATCATCAGAGGAAAAAGGCAAATGGCTGAAATATTTCGAAGACCAT
GACATGCCCTATTCTTTAGTATCAAACAACTTGGCACAGGTTCTTCGTGCTGAGGATTCT
TATTTAATGAGGTCTAAAGAAGGCGAAGCTGAAGATACAAACAGTACAATGACATGGGAT
AGTTATTATAACGCGGAAGAGATCAACAAATACATAGACGAGATGGGCGCAAAATACCCT
GACCTCATAACAGTTATCAACGCCGGCAGGAGTTACGAAGGTCGACAGATCAAATATGTC
AGGATTTCCACCACACGCTTTGAAAACCTTCGCAAAAGAGTAATAGTTATAGACGCTGGT
GTACATGCTAGAGAATGGGTTACCACACCAGTAGCTCTGTATTTGATCAAGCAATTGGCC
GAAGGCGCTGATAAATTACTGACTGAAAACCTCGACTGGATCATTATACCTTTAGCAAAC
CCAGATGGTTACGAATACTCCATAAATGAGGATCGTTTATGGCGTAAAACCCGTTCTAAA
TCTCACGCTGGCTCAGACGCATGTCCTGGTGTTGATGGAAACCGAAACTTCGATTTCGAC
TGGGGCTCCAGACCTGACTCTAACATAGCCTGCTCCATTATTTACGAAGGACCATCACCC
TTCTCGGAACCAGAGACACGTATCATAAGAGACGCTGTTTTGTCAAATTTGGCTCGTACT
TCCCTTTACATTTCTCTACACAGCTATGGCAACATGTTCCTTTACGCTTGGGGAACTAAC
GGTACACTTCCTTCAAATGGCCTATCTCTTCACCTTGCCGGTATCATCATGGCAACAGCT
ATTGAGGAAGTCAAATTAGAAAAAGCTGATTCTTACATTGTCGGTAATGCTGCTAACGTT
CTGTACTACACCAGCGGTACCTCAAGAGACTGGACTCGTGGCATGGGAATACCATTCACC
TATACCATGGAACTTCCTGGTTATGAGTACGGCTTCCTCGTCCCACCCACTTACATCAAG
CAAATAGTGACCGAATCTTTCGTAGGAATAGCTGCTGGAGCTCGTTACGTACTCTCACTA
TACTGA

Protein sequence:

MARWITFLLFIAVTHARHEQYEGHSLYRVAGPSDQFQYLEAYIDFLSITPAAKSTSRQLE
VLMRLSSEEKGKWLKYFEDHDMPYSLVSNNLAQVLRAEDSYLMRSKEGEAEDTNSTMTWD
SYYNAEEINKYIDEMGAKYPDLITVINAGRSYEGRQIKYVRISTTRFENLRKRVIVIDAG
VHAREWVTTPVALYLIKQLAEGADKLLTENLDWIIIPLANPDGYEYSINEDRLWRKTRSK
SHAGSDACPGVDGNRNFDFDWGSRPDSNIACSIIYEGPSPFSEPETRIIRDAVLSNLART
SLYISLHSYGNMFLYAWGTNGTLPSNGLSLHLAGIIMATAIEEVKLEKADSYIVGNAANV
LYYTSGTSRDWTRGMGIPFTYTMELPGYEYGFLVPPTYIKQIVTESFVGIAAGARYVLSL
Y