New model in OGS2.0 | DPOGS204330 |
---|---|
Genomic Position | scaffold794:+ 9321-14841 |
CDS Length | 2316 bp |
Paired RNAseq reads | 77 |
Single RNAseq reads | 214 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA007227 (2e-19) |
Best Drosophila hit | CG4408 (2e-14) |
Best Human hit | mast cell carboxypeptidase A precursor (3e-10) |
Best NR hit (blastp) | Carboxypeptidase B [Lepeophtheirus salmonis] (9e-20) |
Best NR hit (blastx) | GF18768 [Drosophila ananassae] (6e-18) |
GeneOntology terms | GO:0004181 metallocarboxypeptidase activity; GO:0006508 proteolysis; GO:0008270 zinc ion binding |
InterPro families | IPR000834 Peptidase M14, carboxypeptidase A |
Orthology group | MCL40138 |
Nucleotide sequence:
ATGAATTTTGGTATATTTTTGATAATATTAGGTTTTGCTCATTTAGTTTGTTGTTCTAAT
AGTAATGTCACTAATAATGAGTGCGATGTATGTACATTATCATGGTTCCGTCGTAAACGG
GATCTTAAGGAGTACACGCGATGGAAGTCGAACGTAGAGACGACACAAGAATTTATTGTT
AAAGTTTTAGAAGACGATGTGTCATTACCTCAAATTAAGAAGTTTGCGGCTATAAACATT
GCGCCGTTCACTAAAGAAAGAGAAAAAATTATAATTGGTGGTCCAATAACTCCATCACCT
ACAACGCCTTCAGATTCAGATAAGTTAAAAGATTATACTCGATTTGGTCCTCCCTCGACC
TTGTCTGTTACTGCCATTGATGGAAACATTATCCTTCAGCCCATTAAAATAAACGAAAAT
GCAATTGCACCGATTCGACCCCCAGGTCGTAAATCCGGATTAGAACCTGCAAAAAACCCA
CTTCGAGATGAAAGTCCTCCAGCGCGACCCCAATTCTTACCAAAACAAGAGCCATTAATA
GCTCGGCAATTCGCTCCAATAGCGCCGGTATCTTTTACTAAACGTAATTCTCTGGGCCCT
TTGACTGGTCGTGGAATAGACAAATCGATGTTGTTCAGAGAAACAACATTTCAACATCCC
ACAAGAACTTCCCATGCAGAGCTAGAAGTCCCAAAAAGGACAACAGAATTTACGACTTCT
GTTACATATGACTACAATTATGAATTAAGATCTCAATTTGTTCCAATGCGGACTTCACAA
AAGTCGATAATAGATTTACATAGAACAATAACTGACCAAATAGAAACGGAAAATAGTGAT
TTTATGCCCACGATTGCCAGTAAAAAAATAGCAGCGCAGAAAGGTTCTTCAAAAATAGGA
CACAAAGAAACAAAAATGGCTTACGATCTACTTTACCACGATGAAGATGAAATAAGCGAC
CCGAGAAGAAAAGAACATCAAAGAATCACAGAGAGTTATGACGATCTCCATGTAATTGAC
AACGAGAATGTTACACGAGATGATGCTTTAAATAATGCTGAAGAACGTGAAAGTACCACA
GACGACTCCGATTCAAATAATAACGATAAAGACAATTTGGAAACGACTATAAGTGGTTTT
CAAGCTGAACAAAAGCTATTTGGTGTGGCGACTATAGAATCAGTTTCTAATAAATGGGAT
AGCGAGGAAGATGAAGATAAAAGCAAAACAACGCTGAAAGAAAAAAAAGTATTTTTATGT
AACATATTAAAAAGAAGACAACTTTCTTTCAGCACTCCTTTGACTTTGTACGAGATAGTA
ACGCAACTAAAACAATGGGCCGATGAAAGTCCTGTTGCCAAATGGATGGACCTTACCGAA
GGAAATTACACGATAATGGAAAATCCGATACACATGATGATGGTGGATGATCCCAGTAGT
GGACAAATTATGTCTGCTAAAAAAACCGTTATGATCGTCGCAGGAATCCAAGGAAGAGAT
CATCATGCTGTTACGGCGGCGATGTACGTTCTGTATCAGTTGATTGAACGCAGTGAAGCA
CATTCTGATTTGCTTACAAAGTTCCGCTTTTGGATTGTTCCTGTTTTTAATCCAGACGGA
TATGATTATTCAATGACTTTCCCCCAAAGACGTGAATGGACGAAGAATGTACGACAGTCG
TGGGATTCGTGCCAAGGTAGGATTCTATGTAGGACCTGCGAAGAGTTCGGAGTGGGTTGT
ACGATACGGCCATGTTACGGGGTCAACTTGGATCGCAATTTTGAATATCAATGGATACCC
ACAGAAGAGTTGCGCTCTGAACATCCGTGTGGAATGCTGTATGCTGGACCTCGTCAACTG
AGCGAGGCTGAGACGATAGCGCTCACGCATTTTCTTCACAGTCAGCGCACGCCGATTCAC
ACTTTCATAGCTTTCAAAGAAGGAGATGTTCTAAGACACAGAGCCTCGAGAGCTGCAGCC
GCTGCTAATAGCATTAGCGGTCGGCCATACGTTGCTGGTCAAACCTCCGAGTTTCTACCG
CTTTACGCTGGTGGCGTTGAAGACTGGGTGGATGGTCATCTGGGGATAGACAATACTTAT
ACCGTCATGATGTTCCGACCCTCTGATTCCTATAGTTCAAAACTTATCACGGAGCGCGTC
GTTCACGAGGCGTACGCTGCGGTGGATACTTTGCTTCTAGAGAGTGTCGAGACTCCGAGA
TCGCCGCAAACAGTTTTAACCAGAGCCAAAGCTTCTGCAAACACCTTATTCGCGAATATC
TTTATATTATTACCTATGGTTCTGGGCTTCGGTTAA
Protein sequence:
MNFGIFLIILGFAHLVCCSNSNVTNNECDVCTLSWFRRKRDLKEYTRWKSNVETTQEFIV
KVLEDDVSLPQIKKFAAINIAPFTKEREKIIIGGPITPSPTTPSDSDKLKDYTRFGPPST
LSVTAIDGNIILQPIKINENAIAPIRPPGRKSGLEPAKNPLRDESPPARPQFLPKQEPLI
ARQFAPIAPVSFTKRNSLGPLTGRGIDKSMLFRETTFQHPTRTSHAELEVPKRTTEFTTS
VTYDYNYELRSQFVPMRTSQKSIIDLHRTITDQIETENSDFMPTIASKKIAAQKGSSKIG
HKETKMAYDLLYHDEDEISDPRRKEHQRITESYDDLHVIDNENVTRDDALNNAEERESTT
DDSDSNNNDKDNLETTISGFQAEQKLFGVATIESVSNKWDSEEDEDKSKTTLKEKKVFLC
NILKRRQLSFSTPLTLYEIVTQLKQWADESPVAKWMDLTEGNYTIMENPIHMMMVDDPSS
GQIMSAKKTVMIVAGIQGRDHHAVTAAMYVLYQLIERSEAHSDLLTKFRFWIVPVFNPDG
YDYSMTFPQRREWTKNVRQSWDSCQGRILCRTCEEFGVGCTIRPCYGVNLDRNFEYQWIP
TEELRSEHPCGMLYAGPRQLSEAETIALTHFLHSQRTPIHTFIAFKEGDVLRHRASRAAA
AANSISGRPYVAGQTSEFLPLYAGGVEDWVDGHLGIDNTYTVMMFRPSDSYSSKLITERV
VHEAYAAVDTLLLESVETPRSPQTVLTRAKASANTLFANIFILLPMVLGFG
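
The protein sequence above can be checked against the nucleotide sequence by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene model. The helper name `translate` and the two short test fragments (the first 21 and last 12 bases of the CDS) are illustrative choices, not part of the record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the standard code is the right one for this gene model.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into one-letter amino acids ('*' marks a stop codon)."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 21 and last 12 bases of the nucleotide sequence listed in this record.
first_residues = translate("ATGAATTTTGGTATATTTTTG")
last_residues = translate("GGCTTCGGTTAA")
print(first_residues, last_residues)
```

Passing the full nucleotide sequence to `translate` in the same way should give the listed protein followed by `*`: the 2316-base CDS holds 772 codons, that is, 771 residues plus the terminal TAA stop.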