DPGLEAN10171 in OGS1.0

New model in OGS2.0  DPOGS212730
Genomic Position  scaffold10:+ 94653-98269
CDS Length  1248
Paired RNAseq reads  84
Single RNAseq reads  224
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA013276 (9e-103)
Best Drosophila hit  CG8560 (2e-51)
Best Human hit  carboxypeptidase A2 precursor (2e-36)
Best NR hit (blastp)  carboxypeptidase [Spodoptera frugiperda] (7e-87)
Best NR hit (blastx)  carboxypeptidase [Helicoverpa armigera] (6e-89)
GeneOntology terms
GO:0004180 carboxypeptidase activity
GO:0006508 proteolysis
GO:0008270 zinc ion binding
GO:0004181 metallocarboxypeptidase activity
InterPro families
IPR000834 Peptidase M14, carboxypeptidase A
IPR003146 Proteinase inhibitor, carboxypeptidase propeptide
IPR009020 Proteinase inhibitor, propeptide
Orthology group  MCL20847

Nucleotide sequence:

ATGGGTTGTTTTGTAATATTCTTATTTGCTTCGCTATTTTATGTTACTTTAGCTGGAAAA
CATGATGTTTATCGCGGTTACATTGTGTATGGTGTTAAGTTAGAAGATCAGGTGGACCAA
GAGGTACTTTATGGGCTACAATCAGAACTGGATTTGGATTTATGGGAGTACGGAGTTCCC
AAAGTTAAAGATGTTCTCGTCATGGTGTCACCGGACAAGAAAGAGAGATTTTTGGACATT
TTAGATAGAAATTATATCAAACACTACCTTCACCTATCTGACGTGGCCCAGGCTTTGGAA
GAGAACGACAACGATTTGTCTAGTTGGGAACGTGAATCAAGCAGAGTTTTTGAAAAGTAT
GCAAGATATGCTGAGATAGATGCATACCTTGAAGAGGTAGCACAAGCTCATCCCCGGATT
GTGACGCTCGTGAATGCTGGACTTAGTTTCGAAGGACGTCCTATCAAATATTTAAAGATA
TCCACTTCAAACTTCACCGACCCCAGCAAGCCTGTCTACTTCATCGACGCCGCGATGCAC
GCTCGCGAGTGGATTACGATTCCACCAGCTCTGTACAGCATTCATCGTCTGGTGGAAGAC
CTTCGAGAACAAGACCGAGACTTACTGGAAGAAATCGACTGGATTGTGATGCCGCTGGAA
CGCTTATGGCGGAAGACGCGTTCCTTCAATGTCACAAGACATCCTGAATGTTACGGAGTG
GATGCGAATCGAAACTTCGACGTAGACTTCTATGGCACCGGCTCCAGTACCAATCCCTGC
GTGAACACATTCCGTGGTCACGAACCATTCTCAGAGCCGGAAACCCGCTGTGTTCGAGAC
GTCATTCTAGAACACATAGATCGCTTGCAAGTGTACCTTAATGTACACAGTCATGGTAAC
CTCATCTTGTATGGTTACGGCAATAAAACCTTACCCTCCAATGTTGTCCAACTACATCAA
GTCGGTGCCATCATGGGAGCAGCTATAGATCATAAAAAACTCCTCGAGGCTCCGTATTAT
CTGGTGGGAAATAGTGCGCTTGTACTGTACACAAGCTCCGGCAGCGCACAGGATTATGGA
CAGGTGGTCGGTGTACCCTTCTCCTATACACTGGAGTTGCCTGGAATGGGTTATGGGTTC
CAGATTCCCGTCAGGTTCGTCAACCAAGTCAATATGGAAACCTGGGAAGGCATTGCTGCA
TCAGCACGAATCGCTAAAATATATTATAGAGCGAGAGATCAAAAGTAA

Protein sequence:

MGCFVIFLFASLFYVTLAGKHDVYRGYIVYGVKLEDQVDQEVLYGLQSELDLDLWEYGVP
KVKDVLVMVSPDKKERFLDILDRNYIKHYLHLSDVAQALEENDNDLSSWERESSRVFEKY
ARYAEIDAYLEEVAQAHPRIVTLVNAGLSFEGRPIKYLKISTSNFTDPSKPVYFIDAAMH
AREWITIPPALYSIHRLVEDLREQDRDLLEEIDWIVMPLERLWRKTRSFNVTRHPECYGV
DANRNFDVDFYGTGSSTNPCVNTFRGHEPFSEPETRCVRDVILEHIDRLQVYLNVHSHGN
LILYGYGNKTLPSNVVQLHQVGAIMGAAIDHKKLLEAPYYLVGNSALVLYTSSGSAQDYG
QVVGVPFSYTLELPGMGYGFQIPVRFVNQVNMETWEGIAASARIAKIYYRARDQK
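
Consistency check (illustrative sketch):

The nucleotide and protein sequences above can be cross-checked against the stated CDS length. The following is a minimal sketch, not part of the original record; it assumes the standard genetic code (NCBI translation table 1) and translation in frame 0. The helper names `translate` and `expected_protein_length` are hypothetical and chosen only for this illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the sequence can be pasted
    as the 60-column block shown in the record. Ambiguous bases such as N
    are not handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


def expected_protein_length(cds_length: int) -> int:
    """Residue count implied by a CDS length that includes the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# First 18 nucleotides of the record's CDS.
print(translate("ATGGGTTGTTTTGTAATA"))   # MGCFVI
# Last 12 nucleotides of the record's CDS, including the TAA stop.
print(translate("GATCAAAAGTAA"))         # DQK
# CDS Length 1248 implies 415 residues plus one stop codon.
print(expected_protein_length(1248))     # 415
```

Passing the full nucleotide block to `translate` and comparing the result with the protein block is the complete check. The listed protein has 415 residues, which agrees with a 1248-nucleotide CDS ending in a stop codon.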