New model in OGS2.0 | DPOGS202235 |
---|---|
Genomic Position | scaffold747:- 13819-17747 |
See gene structure | |
CDS Length | 1404 |
Paired RNAseq reads | 1821 |
Single RNAseq reads | 5040 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA013534 (3e-138) |
Best Drosophila hit | CG2493 (5e-115) |
Best Human hit | lysosomal Pro-X carboxypeptidase isoform 1 preproprotein (3e-113) |
Best NR hit (blastp) | Lysosomal Pro-X carboxypeptidase, putative [Pediculus humanus corporis] (8e-137) |
Best NR hit (blastx) | Lysosomal Pro-X carboxypeptidase, putative [Pediculus humanus corporis] (5e-136) |
GeneOntology terms | GO:0004185 serine-type carboxypeptidase activity GO:0006508 proteolysis |
InterPro families | IPR008758 Peptidase S28 |
Orthology group | MCL13075 |
Nucleotide sequence:
ATGTTCAGAGTAATATCGCTTGTATTGTTCATAAATTATGTGACCTGTGACTACAAGTTC
GAGACGAAATGGTTCAATGTGCCCCTGGATCACTTCGGATTCCAGAGAAACGAAACTTTC
AACATAAAATATCTGATCAACGAGGAGTATTGGGACAAGGGAGGCGGACCGATATTCTTC
TATACAGGAAATGAGGGACAAATTGAGGTATTCGCGAAACACACCGGCTTTATGTGGGAC
ATCGCTGAGGAATTTAAAGCGAAATTGGTATTTGCAGAACATAGATACTATGGTCAATCA
ATGCCCTTCGGTAATAAGTCACTGGACAACGAGCACATTGGCTACTTGACATCCGAACAG
GCGTTAGCTGATTACGCAGACCTCATAAACTATCTACAGGGAAACAAACAGAGACCGACA
TACCCCGTCATTGCTTTTGGAGGATCTTACGGTGGAATGCTCTCCGCCTACATACGCATC
AAGTACCCTCACCTGGTGACGGGCGCCATAGCCGCCTCGGCCCCGATCCACATGTACCCC
GGGATGGTGCCGTGCGAAGTGTTCCACAGGATTGTGACTTCCAGCTTCAAAATAGCGGAT
GAAAAATGCGTTAAAAATATAAGAAGCTCGTGGGGTGTTCTTAGAAAATTTCTCGAAAGT
CAAAACAATACCGATTGGCTTCACAAGAACTGGAACCTGTGCGAGCCCGTGAAGCCTGCG
GATGTGAACACCTTGATGGAGTTCCTCCAGTCGATGTACGAAACCCTCGCGATGGTGAAC
TATCCCTTCCCGTCGGACTTCCTGCTGCCCCTGCCCGCGCAGCCGGTGCGAGTAGTGTGT
CAGTACTTGAACGAGACCCTCAGTGGACAAAAACTCATTGAGGCTATTGGTAAGGTGATC
AAAGTGTACAGCAACTATGATGGCAAAGCCCCCTGTGTCGACTACAAGAAGGGAGACGAC
TTCGGCAATCTTGACGCTAGCGGATGGGACTATCAGGCGTGCACAGAGATGATAATGCCG
ATGTGCACTACCGGAAACCAAGATATGTTCGAGCCCTCCCCTTGGAACTTCACCAAATAC
GCTGAAGACTGTCACAGGAAATACAACGTGTACCCGCGACAGGAGGCGGCTCGGATACAG
TACGGAGGAGACAGGCTTCGAGCGGCGACCAATATTGTGTTCAGTAATGGACTGCTGGAT
CCCTGGGCGGGAGGCGGCATCCTGAATAGTATCAGTAATTCAGTGAAGGCAGTTGTTATC
ATCGACGCGGCCCATCACCTTGACCTGATGCCTTCCAACCCAGCTGATCCCAATTCAGTA
AAACTCGCCAGAAACATACACAAACAGAACATAGACAAATGGATACGAGAGTTCCGCACG
GAACGCTCCGACAGACACCATTAG
Protein sequence:
MFRVISLVLFINYVTCDYKFETKWFNVPLDHFGFQRNETFNIKYLINEEYWDKGGGPIFF
YTGNEGQIEVFAKHTGFMWDIAEEFKAKLVFAEHRYYGQSMPFGNKSLDNEHIGYLTSEQ
ALADYADLINYLQGNKQRPTYPVIAFGGSYGGMLSAYIRIKYPHLVTGAIAASAPIHMYP
GMVPCEVFHRIVTSSFKIADEKCVKNIRSSWGVLRKFLESQNNTDWLHKNWNLCEPVKPA
DVNTLMEFLQSMYETLAMVNYPFPSDFLLPLPAQPVRVVCQYLNETLSGQKLIEAIGKVI
KVYSNYDGKAPCVDYKKGDDFGNLDASGWDYQACTEMIMPMCTTGNQDMFEPSPWNFTKY
AEDCHRKYNVYPRQEAARIQYGGDRLRAATNIVFSNGLLDPWAGGGILNSISNSVKAVVI
IDAAHHLDLMPSNPADPNSVKLARNIHKQNIDKWIREFRTERSDRHH