New model in OGS2.0 | DPOGS202239  |
---|---|
Genomic Position | scaffold2891:+ 14958-16959 |
See gene structure | |
CDS Length | 1266 |
Paired RNAseq reads   | 193 |
Single RNAseq reads   | 574 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA004830 (4e-149) |
Best Drosophila hit   | CG8560 (2e-62) |
Best Human hit | mast cell carboxypeptidase A precursor (8e-40) |
Best NR hit (blastp)   | carboxypeptidase precursor [Helicoverpa armigera] (1e-165) |
Best NR hit (blastx)   | carboxypeptidase precursor [Helicoverpa armigera] (6e-162) |
GeneOntology terms    | GO:0004180 carboxypeptidase activity GO:0006508 proteolysis GO:0008270 zinc ion binding GO:0004181 metallocarboxypeptidase activity |
InterPro families    | IPR000834 Peptidase M14, carboxypeptidase A IPR009020 Proteinase inhibitor, propeptide |
Orthology group | MCL21843 |
Nucleotide sequence:
ATGGCAAGGTGGATTACCTTCCTGCTATTCATCGCAGTTACCCACGCTCGACATGAGCAA
TACGAGGGGCACTCTCTCTACCGGGTAGCTGGTCCATCTGACCAATTTCAATACCTAGAA
GCCTACATAGATTTCCTCTCTATTACACCAGCAGCCAAATCCACTTCCAGACAGCTAGAG
GTTTTGATGAGACTATCATCAGAGGAAAAAGGCAAATGGCTGAAATATTTCGAAGACCAT
GACATGCCCTATTCTTTAGTATCAAACAACTTGGCACAGGTTCTTCGTGCTGAGGATTCT
TATTTAATGAGGTCTAAAGAAGGCGAAGCTGAAGATACAAACAGTACAATGACATGGGAT
AGTTATTATAACGCGGAAGAGATCAACAAATACATAGACGAGATGGGCGCAAAATACCCT
GACCTCATAACAGTTATCAACGCCGGCAGGAGTTACGAAGGTCGACAGATCAAATATGTC
AGGATTTCCACCACACGCTTTGAAAACCTTCGCAAAAGAGTAATAGTTATAGACGCTGGT
GTACATGCTAGAGAATGGGTTACCACACCAGTAGCTCTGTATTTGATCAAGCAATTGGCC
GAAGGCGCTGATAAATTACTGACTGAAAACCTCGACTGGATCATTATACCTTTAGCAAAC
CCAGATGGTTACGAATACTCCATAAATGAGGATCGTTTATGGCGTAAAACCCGTTCTAAA
TCTCACGCTGGCTCAGACGCATGTCCTGGTGTTGATGGAAACCGAAACTTCGATTTCGAC
TGGGGCTCCAGACCTGACTCTAACATAGCCTGCTCCATTATTTACGAAGGACCATCACCC
TTCTCGGAACCAGAGACACGTATCATAAGAGACGCTGTTTTGTCAAATTTGGCTCGTACT
TCCCTTTACATTTCTCTACACAGCTATGGCAACATGTTCCTTTACGCTTGGGGAACTAAC
GGTACACTTCCTTCAAATGGCCTATCTCTTCACCTTGCCGGTATCATCATGGCAACAGCT
ATTGAGGAAGTCAAATTAGAAAAAGCTGATTCTTACATTGTCGGTAATGCTGCTAACGTT
CTGTACTACACCAGCGGTACCTCAAGAGACTGGACTCGTGGCATGGGAATACCATTCACC
TATACCATGGAACTTCCTGGTTATGAGTACGGCTTCCTCGTCCCACCCACTTACATCAAG
CAAATAGTGACCGAATCTTTCGTAGGAATAGCTGCTGGAGCTCGTTACGTACTCTCACTA
TACTGA
Protein sequence:
MARWITFLLFIAVTHARHEQYEGHSLYRVAGPSDQFQYLEAYIDFLSITPAAKSTSRQLE
VLMRLSSEEKGKWLKYFEDHDMPYSLVSNNLAQVLRAEDSYLMRSKEGEAEDTNSTMTWD
SYYNAEEINKYIDEMGAKYPDLITVINAGRSYEGRQIKYVRISTTRFENLRKRVIVIDAG
VHAREWVTTPVALYLIKQLAEGADKLLTENLDWIIIPLANPDGYEYSINEDRLWRKTRSK
SHAGSDACPGVDGNRNFDFDWGSRPDSNIACSIIYEGPSPFSEPETRIIRDAVLSNLART
SLYISLHSYGNMFLYAWGTNGTLPSNGLSLHLAGIIMATAIEEVKLEKADSYIVGNAANV
LYYTSGTSRDWTRGMGIPFTYTMELPGYEYGFLVPPTYIKQIVTESFVGIAAGARYVLSL
Y