DPGLEAN21071 in OGS1.0

New model in OGS2.0  DPOGS200623
Genomic Position  scaffold173:- 1157-6782
CDS Length  1569
Paired RNAseq reads  1166
Single RNAseq reads  2935
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA008910 (8e-173)
Best Drosophila hit  CG3108 (3e-98)
Best Human hit  carboxypeptidase B2 isoform a preproprotein (2e-57)
Best NR hit (blastp)  molting fluid carboxypeptidase A [Bombyx mori] (0.0)
Best NR hit (blastx)  molting fluid carboxypeptidase A [Bombyx mori] (5e-180)
Gene Ontology terms:

GO:0004181 metallocarboxypeptidase activity
GO:0008270 zinc ion binding
GO:0006508 proteolysis
InterPro families:

IPR000834 Peptidase M14, carboxypeptidase A
IPR003146 Proteinase inhibitor, carboxypeptidase propeptide
IPR009020 Proteinase inhibitor, propeptide
Orthology group  MCL12092

Nucleotide sequence:

ATGGCAAAGCTCTGGTGGACGGTCGTGTGCCTCCTCGCATCCTTGGAGCTCTGCACTCCG
CTTATAAACGAATTGCAACCGGGTAAAGAATGGCCCAAACGTCAATCAGTAAGACAACCT
CAGGACGAGCTAGACAACCCTGATGTCACAACGGTTATAGCTGATACAACGGTAGGAGGT
TCAGTAGATATACCCGAAGATGTACAAGAAGATGTTGAAGAGAACATTCAAACAAAAGCT
ATAGATGTAGAAGACTCCAAAGTAGATTACTCCGGAGCACAACTGTGGAAAGTGGCGACT
GATAAGAACGGAGTAAGAGTACTTTTAGGTCGATTGCGTCGTAGAAATCTCATTTCGACG
TGGTCGGGGAACCAAACGTACATCGATGTCCTTGTGAAACCCGACGCCGTACAGAACGTT
ACACGGATATTCAAGAGAGAGAACATTACTTTTGACGTCATTATCGAGGACCTACAAAGG
AGAATCAATGAAGAAAACCCTCCGCTCGATGAGAACGAAATTGAGCTACAGGACAGACGA
GCAAGATCTTACAATTTTTGGCGTTACAGGGCCACCCGTTTAGGTGTGATAAAAGCCTTC
ATGGGCTACAGCTCTCTTAAACTTCCAATGGAGTTTTGGCCATATGCCAACTGGGGTCAC
CGTATGACATGGAAACAATATCACAGATTGGAAGACATTCACGGCTTTATGGATTATTTA
GCCAAAACGTATCCCAAGATCGTGAGTGTGAACTCAATAGGAAAATCCTATGAAGGAAGA
GACCTTAAAGTTCTCCGTATATCAGATGGCAAGCCTTCAAATAAGGCGGTTTTTATCGAC
GGTGGTATACACGCTAGGGAATGGATCAGCCCGGCTACGGTTACATACTTCATCAACCAA
ATAGCTGAAAACTTCGACGAAGAATCCGATGACATAAGGGATATTGATTGGTATTTCTTG
CCTGTTGTCAATCCTGATGGATACGAATACACGCATATCAAAGATCGTTTGTGGAGAAAA
AATAGAAAGCCGGCAGTTTACGGTGTGAGACAGTGTGTCGGGACTGATTTGAACAGAAAT
TTCGGTTATCGTTGGGGTGGTAAAGGTTCCTCGAGTAATCCCTGCAGTGAAATATATAGA
GGAAATAGAGCTTTTTCTGAACCAGAATCCAGAGCAGTATCGGAATTCATCAAAACAAGT
GCAGCTAATTTCTCAGCATACCTGACATACCACAGTTATGGTCAATATTTATTATACCCT
TGGGGATATGACAACGCAGTCCCACCAGATCATAAAGAATTAGATCTTGTTGGCAAAAAT
ATAGCAGCGGCTATTCAAGCGACTGGAGGCTCTAAATATTCTGTTGGGTCGTCTAGTGGC
CTCCTTTATCCCGCTTCAGGCGGTTCAGATGACTGGGCCAAAGGCCAGGGCATTAAATAT
GCATACACAATTGAACTTAGCGATACTGGCCGCCATGGATTTGTTTTGCCGACAACCTTC
ATTGAGCCAGTAGCAAGGGAATCATTGTCAGGCTTAAGAGTGCTTGCAGCCCAATTAAGA
AAGAACTAA

Protein sequence:

MAKLWWTVVCLLASLELCTPLINELQPGKEWPKRQSVRQPQDELDNPDVTTVIADTTVGG
SVDIPEDVQEDVEENIQTKAIDVEDSKVDYSGAQLWKVATDKNGVRVLLGRLRRRNLIST
WSGNQTYIDVLVKPDAVQNVTRIFKRENITFDVIIEDLQRRINEENPPLDENEIELQDRR
ARSYNFWRYRATRLGVIKAFMGYSSLKLPMEFWPYANWGHRMTWKQYHRLEDIHGFMDYL
AKTYPKIVSVNSIGKSYEGRDLKVLRISDGKPSNKAVFIDGGIHAREWISPATVTYFINQ
IAENFDEESDDIRDIDWYFLPVVNPDGYEYTHIKDRLWRKNRKPAVYGVRQCVGTDLNRN
FGYRWGGKGSSSNPCSEIYRGNRAFSEPESRAVSEFIKTSAANFSAYLTYHSYGQYLLYP
WGYDNAVPPDHKELDLVGKNIAAAIQATGGSKYSVGSSSGLLYPASGGSDDWAKGQGIKY
AYTIELSDTGRHGFVLPTTFIEPVARESLSGLRVLAAQLRKN
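
Consistency check (illustrative): the CDS length of 1569 nt corresponds to 523 codons, that is, 522 residues plus a stop codon, which matches the protein sequence above. The sketch below shows one way to confirm that the nucleotide and protein sequences agree. It assumes the standard genetic code, which this record does not state explicitly. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tooling, and only the first and last lines of the nucleotide sequence are used for brevity.

```python
from itertools import product

# Standard genetic code (an assumption; the record does not name a table).
# Codons are enumerated in T, C, A, G order with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGCAAAGCTCTGGTGGACGGTCGTGTGCCTCCTCGCATCCTTGGAGCTCTGCACTCCG"
last_line = "AAGAACTAA"

print(translate(first_line))  # MAKLWWTVVCLLASLELCTP
print(translate(last_line))   # KN*
print(1569 // 3 - 1)          # 522 residues, excluding the stop codon
```

Passing the full 1569-nt sequence to `translate` and removing the trailing `*` should reproduce the 522-residue protein sequence listed in this record.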