New model in OGS2.0 | DPOGS204330  |
---|---|
Genomic Position | scaffold794:+ 9321-14841 |
See gene structure | |
CDS Length | 2316 |
Paired RNAseq reads   | 77 |
Single RNAseq reads   | 214 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA007227 (2e-19) |
Best Drosophila hit   | CG4408 (2e-14) |
Best Human hit | mast cell carboxypeptidase A precursor (3e-10) |
Best NR hit (blastp)   | Carboxypeptidase B [Lepeophtheirus salmonis] (9e-20) |
Best NR hit (blastx)   | GF18768 [Drosophila ananassae] (6e-18) |
GeneOntology terms    | GO:0004181 metallocarboxypeptidase activity GO:0006508 proteolysis GO:0008270 zinc ion binding |
InterPro families   | IPR000834 Peptidase M14, carboxypeptidase A |
Orthology group | MCL40138 |
Nucleotide sequence:
ATGAATTTTGGTATATTTTTGATAATATTAGGTTTTGCTCATTTAGTTTGTTGTTCTAAT
AGTAATGTCACTAATAATGAGTGCGATGTATGTACATTATCATGGTTCCGTCGTAAACGG
GATCTTAAGGAGTACACGCGATGGAAGTCGAACGTAGAGACGACACAAGAATTTATTGTT
AAAGTTTTAGAAGACGATGTGTCATTACCTCAAATTAAGAAGTTTGCGGCTATAAACATT
GCGCCGTTCACTAAAGAAAGAGAAAAAATTATAATTGGTGGTCCAATAACTCCATCACCT
ACAACGCCTTCAGATTCAGATAAGTTAAAAGATTATACTCGATTTGGTCCTCCCTCGACC
TTGTCTGTTACTGCCATTGATGGAAACATTATCCTTCAGCCCATTAAAATAAACGAAAAT
GCAATTGCACCGATTCGACCCCCAGGTCGTAAATCCGGATTAGAACCTGCAAAAAACCCA
CTTCGAGATGAAAGTCCTCCAGCGCGACCCCAATTCTTACCAAAACAAGAGCCATTAATA
GCTCGGCAATTCGCTCCAATAGCGCCGGTATCTTTTACTAAACGTAATTCTCTGGGCCCT
TTGACTGGTCGTGGAATAGACAAATCGATGTTGTTCAGAGAAACAACATTTCAACATCCC
ACAAGAACTTCCCATGCAGAGCTAGAAGTCCCAAAAAGGACAACAGAATTTACGACTTCT
GTTACATATGACTACAATTATGAATTAAGATCTCAATTTGTTCCAATGCGGACTTCACAA
AAGTCGATAATAGATTTACATAGAACAATAACTGACCAAATAGAAACGGAAAATAGTGAT
TTTATGCCCACGATTGCCAGTAAAAAAATAGCAGCGCAGAAAGGTTCTTCAAAAATAGGA
CACAAAGAAACAAAAATGGCTTACGATCTACTTTACCACGATGAAGATGAAATAAGCGAC
CCGAGAAGAAAAGAACATCAAAGAATCACAGAGAGTTATGACGATCTCCATGTAATTGAC
AACGAGAATGTTACACGAGATGATGCTTTAAATAATGCTGAAGAACGTGAAAGTACCACA
GACGACTCCGATTCAAATAATAACGATAAAGACAATTTGGAAACGACTATAAGTGGTTTT
CAAGCTGAACAAAAGCTATTTGGTGTGGCGACTATAGAATCAGTTTCTAATAAATGGGAT
AGCGAGGAAGATGAAGATAAAAGCAAAACAACGCTGAAAGAAAAAAAAGTATTTTTATGT
AACATATTAAAAAGAAGACAACTTTCTTTCAGCACTCCTTTGACTTTGTACGAGATAGTA
ACGCAACTAAAACAATGGGCCGATGAAAGTCCTGTTGCCAAATGGATGGACCTTACCGAA
GGAAATTACACGATAATGGAAAATCCGATACACATGATGATGGTGGATGATCCCAGTAGT
GGACAAATTATGTCTGCTAAAAAAACCGTTATGATCGTCGCAGGAATCCAAGGAAGAGAT
CATCATGCTGTTACGGCGGCGATGTACGTTCTGTATCAGTTGATTGAACGCAGTGAAGCA
CATTCTGATTTGCTTACAAAGTTCCGCTTTTGGATTGTTCCTGTTTTTAATCCAGACGGA
TATGATTATTCAATGACTTTCCCCCAAAGACGTGAATGGACGAAGAATGTACGACAGTCG
TGGGATTCGTGCCAAGGTAGGATTCTATGTAGGACCTGCGAAGAGTTCGGAGTGGGTTGT
ACGATACGGCCATGTTACGGGGTCAACTTGGATCGCAATTTTGAATATCAATGGATACCC
ACAGAAGAGTTGCGCTCTGAACATCCGTGTGGAATGCTGTATGCTGGACCTCGTCAACTG
AGCGAGGCTGAGACGATAGCGCTCACGCATTTTCTTCACAGTCAGCGCACGCCGATTCAC
ACTTTCATAGCTTTCAAAGAAGGAGATGTTCTAAGACACAGAGCCTCGAGAGCTGCAGCC
GCTGCTAATAGCATTAGCGGTCGGCCATACGTTGCTGGTCAAACCTCCGAGTTTCTACCG
CTTTACGCTGGTGGCGTTGAAGACTGGGTGGATGGTCATCTGGGGATAGACAATACTTAT
ACCGTCATGATGTTCCGACCCTCTGATTCCTATAGTTCAAAACTTATCACGGAGCGCGTC
GTTCACGAGGCGTACGCTGCGGTGGATACTTTGCTTCTAGAGAGTGTCGAGACTCCGAGA
TCGCCGCAAACAGTTTTAACCAGAGCCAAAGCTTCTGCAAACACCTTATTCGCGAATATC
TTTATATTATTACCTATGGTTCTGGGCTTCGGTTAA
Protein sequence:
MNFGIFLIILGFAHLVCCSNSNVTNNECDVCTLSWFRRKRDLKEYTRWKSNVETTQEFIV
KVLEDDVSLPQIKKFAAINIAPFTKEREKIIIGGPITPSPTTPSDSDKLKDYTRFGPPST
LSVTAIDGNIILQPIKINENAIAPIRPPGRKSGLEPAKNPLRDESPPARPQFLPKQEPLI
ARQFAPIAPVSFTKRNSLGPLTGRGIDKSMLFRETTFQHPTRTSHAELEVPKRTTEFTTS
VTYDYNYELRSQFVPMRTSQKSIIDLHRTITDQIETENSDFMPTIASKKIAAQKGSSKIG
HKETKMAYDLLYHDEDEISDPRRKEHQRITESYDDLHVIDNENVTRDDALNNAEERESTT
DDSDSNNNDKDNLETTISGFQAEQKLFGVATIESVSNKWDSEEDEDKSKTTLKEKKVFLC
NILKRRQLSFSTPLTLYEIVTQLKQWADESPVAKWMDLTEGNYTIMENPIHMMMVDDPSS
GQIMSAKKTVMIVAGIQGRDHHAVTAAMYVLYQLIERSEAHSDLLTKFRFWIVPVFNPDG
YDYSMTFPQRREWTKNVRQSWDSCQGRILCRTCEEFGVGCTIRPCYGVNLDRNFEYQWIP
TEELRSEHPCGMLYAGPRQLSEAETIALTHFLHSQRTPIHTFIAFKEGDVLRHRASRAAA
AANSISGRPYVAGQTSEFLPLYAGGVEDWVDGHLGIDNTYTVMMFRPSDSYSSKLITERV
VHEAYAAVDTLLLESVETPRSPQTVLTRAKASANTLFANIFILLPMVLGFG