DPGLEAN10170 in OGS1.0

New model in OGS2.0  DPOGS212730
Genomic Position  scaffold10:+ 88962-92458
CDS Length  1425
Paired RNAseq reads  176
Single RNAseq reads  562
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA013276 (9e-96)
Best Drosophila hit  CG8560 (4e-53)
Best Human hit  carboxypeptidase A4 isoform 2 preproprotein (2e-34)
Best NR hit (blastp)  carboxypeptidase B precursor [Helicoverpa zea] (2e-83)
Best NR hit (blastx)  midgut carboxypeptidase A2 [Trichoplusia ni] (2e-83)
Gene Ontology terms
GO:0004180 carboxypeptidase activity
GO:0006508 proteolysis
GO:0008270 zinc ion binding
GO:0004181 metallocarboxypeptidase activity
InterPro families
IPR000834 Peptidase M14, carboxypeptidase A
IPR003146 Proteinase inhibitor, carboxypeptidase propeptide
IPR009020 Proteinase inhibitor, propeptide
Orthology group  MCL20847
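
The Genomic Position field in the record above appears to follow a scaffold:strand start-end pattern. The sketch below shows one way to parse it and compute the genomic span. It assumes the coordinates are 1-based and inclusive, which the record does not state. The names GenomicPosition and parse_position are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple


class GenomicPosition(NamedTuple):
    """Parsed form of a 'scaffold:strand start-end' position string."""
    scaffold: str
    strand: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Scaffold name, a colon, the strand sign, optional whitespace, then start-end.
_POSITION_RE = re.compile(r"^(\S+):([+-])\s*(\d+)-(\d+)$")


def parse_position(text: str) -> GenomicPosition:
    """Parse a position string such as 'scaffold10:+ 88962-92458'."""
    match = _POSITION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return GenomicPosition(scaffold, strand, int(start), int(end))


pos = parse_position("scaffold10:+ 88962-92458")
print(pos.scaffold, pos.strand, pos.span)  # scaffold10 + 3497
```

Under that assumption the locus spans 3497 bases, which is longer than the 1425-base CDS. That difference would be expected if the gene model contains introns, although the record itself does not list the exon structure.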

Nucleotide sequence:

ATGTGGAGGTGTAACGGACACGACATGAGGGCCTTAGTATGTTTGTTCCTTGGTCTTGTG
GCATCGACCTTAGCTGGGAACCATGACAAATATTCCGGGTATACCGTCTATGGGGTTCAT
ATAGATGATCACTACCACCAGAAGGTACTTTCGGTTTTACAGAGGGATATCGATTTAGAC
GTTTGGCGACATGGGGTTCCGAAAGCCAGAGATGCTCTGGTTATGGTTTCGCCAGAAAAT
AGACTGCAATTCCTAAAGATTTTGGAAGAAAATAACATGCACCATTACATTCATCTCCAT
GATGTTGCGAGATCTCTGAAACAAAGTGATGATGATTTTTTACGGTGGAAACTGAGCAGA
GGAAATGAATTGTCTATTTTTGAGGATTATCCAACATACGGTGATGTAATAACAGATGGT
GATATAGATAAATTAAGATATATATATAGACTTAACCAAGGTTCCGTGCTCGAACCTTGG
ACGAGGCGTAAAGATATATGTAGAGAATGTGTTATGTTGCAGGTCATCGACTACATGGAG
AGAATAGCTCGTAATAATTCAGCCATAGCGACTTTAGTGAACGCCGGCAATAGTTTCGAG
GGACGACCTGTGAAATACTTGAAGATATCAACCACAAACTTTACTGACACCAGTAAACCT
ATCTACTTCCTGGAGGCAACGATGCATGCTCGTGAGTGGGTGACAACCCAAACAGCTCTG
TACACCATACATCGATTGATCGAAGACCTGAAGACTGAGGATAGGGATCTGATAGAAGGG
ATCGATTGGATCATATTTCCCGTGGTCAATCCTGATGGATATGAATTTTCTCATACAACC
GATCGCATGTGGAGGAAAACACGTTCTTTCAATGCAACGATCAGCGCCACGTGCTACGGT
GTCGATCCAAACAGGAACTTTGACATTGAATTTAATACAGTAAGCGTGTCTCCGGATCCC
TGTTCCCAAATATATCCGGGACACGAAGCTTTCTCGGAACCAGAAACCCGTTACGTCAGA
GATATTTTATTGGAATACAATAATCGCATCCAAGTGTTCATGGACGTGCATAGTTACGGC
AACTACATTGTTTACGGCTTCGATAACGGAACGCTGCCACAGAACGCGCTGCACATACAT
CACGTGGGTGCGTTAATGGGTGCGGCTATCGATACTCTTAAATTACAAAAAGCACCCTTC
TACATTGTCGGCAACTCGAGGTACGTCTTCTACGCTGTCTCTGGAAGTGGTCAGGATTAC
GCACAGGCGGTTGGCGTAGGATTCTCGTACACTATAGAGTTGCCAGGATACGAGTATGAC
TTCCGAGTTCCTCCCTCGTACATCAATCAAATCAACACGGAGACCTGGGAAGGTGTCGCA
GCCTCGGCTCGAGCTGCAAGATCGTATTATAGAGCAAGAAATTAA

Protein sequence:

MWRCNGHDMRALVCLFLGLVASTLAGNHDKYSGYTVYGVHIDDHYHQKVLSVLQRDIDLD
VWRHGVPKARDALVMVSPENRLQFLKILEENNMHHYIHLHDVARSLKQSDDDFLRWKLSR
GNELSIFEDYPTYGDVITDGDIDKLRYIYRLNQGSVLEPWTRRKDICRECVMLQVIDYME
RIARNNSAIATLVNAGNSFEGRPVKYLKISTTNFTDTSKPIYFLEATMHAREWVTTQTAL
YTIHRLIEDLKTEDRDLIEGIDWIIFPVVNPDGYEFSHTTDRMWRKTRSFNATISATCYG
VDPNRNFDIEFNTVSVSPDPCSQIYPGHEAFSEPETRYVRDILLEYNNRIQVFMDVHSYG
NYIVYGFDNGTLPQNALHIHHVGALMGAAIDTLKLQKAPFYIVGNSRYVFYAVSGSGQDY
AQAVGVGFSYTIELPGYEYDFRVPPSYINQINTETWEGVAASARAARSYYRARN
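
The nucleotide and protein sequences above can be cross-checked against each other. The sketch below translates the first and last lines of the CDS and compares the lengths: 1425 bases is 475 codons, that is 474 residues plus a stop codon. It assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly. The translate function is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGTGGAGGTGTAACGGACACGACATGAGGGCCTTAGTATGTTTGTTCCTTGGTCTTGTG"
last_line = "GCCTCGGCTCGAGCTGCAAGATCGTATTATAGAGCAAGAAATTAA"

print(translate(first_line))  # MWRCNGHDMRALVCLFLGLV
print(translate(last_line))   # ASARAARSYYRARN*

# Length check: 1425 bases = 475 codons = 474 residues + 1 stop codon.
cds_length = 1425
protein_length = 474
print(cds_length // 3 - 1 == protein_length)  # True
```

Both translated fragments match the start and end of the listed protein sequence, and the stated CDS length agrees with a 474-residue protein followed by a TAA stop codon.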