New model in OGS2.0 | DPOGS212730  |
---|---|
Genomic Position | scaffold10:+ 88962-92458 |
See gene structure | |
CDS Length | 1425 |
Paired RNAseq reads   | 176 |
Single RNAseq reads   | 562 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA013276 (9e-96) |
Best Drosophila hit   | CG8560 (4e-53) |
Best Human hit | carboxypeptidase A4 isoform 2 preproprotein (2e-34) |
Best NR hit (blastp)   | carboxypeptidase B precursor [Helicoverpa zea] (2e-83) |
Best NR hit (blastx)   | midgut carboxypeptidase A2 [Trichoplusia ni] (2e-83) |
GeneOntology terms    | GO:0004180 carboxypeptidase activity GO:0006508 proteolysis GO:0008270 zinc ion binding GO:0004181 metallocarboxypeptidase activity |
InterPro families    | IPR000834 Peptidase M14, carboxypeptidase A IPR003146 Proteinase inhibitor, carboxypeptidase propeptide IPR009020 Proteinase inhibitor, propeptide |
Orthology group | MCL20847 |
Nucleotide sequence:
ATGTGGAGGTGTAACGGACACGACATGAGGGCCTTAGTATGTTTGTTCCTTGGTCTTGTG
GCATCGACCTTAGCTGGGAACCATGACAAATATTCCGGGTATACCGTCTATGGGGTTCAT
ATAGATGATCACTACCACCAGAAGGTACTTTCGGTTTTACAGAGGGATATCGATTTAGAC
GTTTGGCGACATGGGGTTCCGAAAGCCAGAGATGCTCTGGTTATGGTTTCGCCAGAAAAT
AGACTGCAATTCCTAAAGATTTTGGAAGAAAATAACATGCACCATTACATTCATCTCCAT
GATGTTGCGAGATCTCTGAAACAAAGTGATGATGATTTTTTACGGTGGAAACTGAGCAGA
GGAAATGAATTGTCTATTTTTGAGGATTATCCAACATACGGTGATGTAATAACAGATGGT
GATATAGATAAATTAAGATATATATATAGACTTAACCAAGGTTCCGTGCTCGAACCTTGG
ACGAGGCGTAAAGATATATGTAGAGAATGTGTTATGTTGCAGGTCATCGACTACATGGAG
AGAATAGCTCGTAATAATTCAGCCATAGCGACTTTAGTGAACGCCGGCAATAGTTTCGAG
GGACGACCTGTGAAATACTTGAAGATATCAACCACAAACTTTACTGACACCAGTAAACCT
ATCTACTTCCTGGAGGCAACGATGCATGCTCGTGAGTGGGTGACAACCCAAACAGCTCTG
TACACCATACATCGATTGATCGAAGACCTGAAGACTGAGGATAGGGATCTGATAGAAGGG
ATCGATTGGATCATATTTCCCGTGGTCAATCCTGATGGATATGAATTTTCTCATACAACC
GATCGCATGTGGAGGAAAACACGTTCTTTCAATGCAACGATCAGCGCCACGTGCTACGGT
GTCGATCCAAACAGGAACTTTGACATTGAATTTAATACAGTAAGCGTGTCTCCGGATCCC
TGTTCCCAAATATATCCGGGACACGAAGCTTTCTCGGAACCAGAAACCCGTTACGTCAGA
GATATTTTATTGGAATACAATAATCGCATCCAAGTGTTCATGGACGTGCATAGTTACGGC
AACTACATTGTTTACGGCTTCGATAACGGAACGCTGCCACAGAACGCGCTGCACATACAT
CACGTGGGTGCGTTAATGGGTGCGGCTATCGATACTCTTAAATTACAAAAAGCACCCTTC
TACATTGTCGGCAACTCGAGGTACGTCTTCTACGCTGTCTCTGGAAGTGGTCAGGATTAC
GCACAGGCGGTTGGCGTAGGATTCTCGTACACTATAGAGTTGCCAGGATACGAGTATGAC
TTCCGAGTTCCTCCCTCGTACATCAATCAAATCAACACGGAGACCTGGGAAGGTGTCGCA
GCCTCGGCTCGAGCTGCAAGATCGTATTATAGAGCAAGAAATTAA
Protein sequence:
MWRCNGHDMRALVCLFLGLVASTLAGNHDKYSGYTVYGVHIDDHYHQKVLSVLQRDIDLD
VWRHGVPKARDALVMVSPENRLQFLKILEENNMHHYIHLHDVARSLKQSDDDFLRWKLSR
GNELSIFEDYPTYGDVITDGDIDKLRYIYRLNQGSVLEPWTRRKDICRECVMLQVIDYME
RIARNNSAIATLVNAGNSFEGRPVKYLKISTTNFTDTSKPIYFLEATMHAREWVTTQTAL
YTIHRLIEDLKTEDRDLIEGIDWIIFPVVNPDGYEFSHTTDRMWRKTRSFNATISATCYG
VDPNRNFDIEFNTVSVSPDPCSQIYPGHEAFSEPETRYVRDILLEYNNRIQVFMDVHSYG
NYIVYGFDNGTLPQNALHIHHVGALMGAAIDTLKLQKAPFYIVGNSRYVFYAVSGSGQDY
AQAVGVGFSYTIELPGYEYDFRVPPSYINQINTETWEGVAASARAARSYYRARN