New model in OGS2.0 | DPOGS202963  |
---|---|
Genomic Position | scaffold222:- 8105-18557 |
See gene structure | |
CDS Length | 3759 |
Paired RNAseq reads   | 3978 |
Single RNAseq reads   | 10102 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA005752 (0.0) |
Best Drosophila hit   | tripeptidyl-peptidase II, isoform D (0.0) |
Best Human hit | tripeptidyl-peptidase 2 (0.0) |
Best NR hit (blastp)   | PREDICTED: similar to tripeptidylpeptidase II [Nasonia vitripennis] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to tripeptidylpeptidase II [Nasonia vitripennis] (0.0) |
GeneOntology terms    | GO:0004175 endopeptidase activity GO:0008240 tripeptidyl-peptidase activity GO:0004252 serine-type endopeptidase activity GO:0005737 cytoplasm GO:0004177 aminopeptidase activity GO:0008233 peptidase activity GO:0006508 proteolysis |
InterPro families    | IPR015500 Peptidase S8, subtilisin-related IPR000209 Peptidase S8/S53, subtilisin/kexin/sedolisin IPR022229 Peptidase S8A, tripeptidyl peptidase II IPR022232 Peptidase S8A, tripeptidyl peptidase II, arthropoda IPR022398 Peptidase S8/S53, subtilisin, active site |
Orthology group | MCL11107 |
Nucleotide sequence:
ATGGCAGACGTACCGATCGATTGTGAATTCCCTGTTTGGGGATTAATGCCTAAAAGAGAA
ACAGGGGTCGTCTCATTTTTAAATAAATATCCTGAATATGATGGCAGGAACACAGTTATT
GCGATATTAGACTCAGGTGTGGATCCCGCCGCAGAGGGTCTTAAGGTTACAAGTACAGGA
GAGACTAAAGTTATCGAGAGGTATGACTGCAGCGGCTGTGGTGATGTGGATACTAGCACA
GTGGTGAAAAAGGTTGTGGATGGATACATAACCGGCATTACTGGACGTAAACTAAAGATT
CCAGAAACATGGGACAATCCTAAGGGTGAATGGCGTATAGGTGTTGTTTATCCTTTTAGT
TTATATCCAACTAAGGTGAAGGAAAGGATCCAAGAGCATCGTAAGGAACACGTGTGGGAT
GTTGGCCAGAAGCCAGCTATGGCTAAGGCCACCAAAGATTTGCAGGATTTCGAAAATGAA
GTTTCTTCAAAAACCACCTTGAGTCAAGAGGAGAAGCAAGCGAAGGAGGAGCTGGAAGCC
AGAGTTGAGGTGTTGAAGGAACTCGACAAGAAATACACAGACGTCGGACCCACGTACGAC
TGTGTGCTGTGGCACGACGGAACGGTTTGGAGAGCGTGCATCGACACGTCCGAGGAGGGA
GACTTGTCTTCGGGCGTCCTGCTGGGCGAGTACAGCGCGACGCAGGAACACGCTCACCTC
ACGCCGCTGGACGAGATGACGGTCAGTGTGAACGTTCACAACGACGGAGACACGCTCGAG
GTAGTCGGCATGTGTTCGACTCACGGCACACACGTGGCGGCCATAGCGGCGGGGTACTTC
CCCGACGACCCCGACCGGAACGGGGTGGCGCCGGGTGCCAAGATTATATCGCTCACGATA
GGAGACAGTCGCCTGGGGTCCATGGAGACGGGCACGGCGCTGGTGCGGGCCTGCGTCAAG
GTCATGGAGCTGGCGGCGAGGACGAAGGTGGACGTCATCAACATGAGCTACGGGGAACAC
GCGCACTGGTCCAACGCGGGTCGTGTGGGGGAGATCATCAGTATGGTGGTGAACAAGTAC
GGCGTGTCGTGGGTGGTGTCCGGCGGCAACCACGGCCCCGCGCTCGCCACGGTCGGCGCG
CCGCCTGACATCGCGCAACCCATACTCATAGGCGTGGGCGCGTACGTGTCGTCCGAGATG
ATGTTGGCGGCGTACTCCATGCGGGCGCGCGGCTGCGGGCCGCGGAAGTCGACCTCGTCG
GCGGGGCCCTGCAGCGACGGCGCGCTCGGCATCTCCGTCTGCGCACCCGGGGCTGCGCTC
GCCTCCGTCGCCAGGTTCACTCTGAGGAACTCCCAGCTCATGAACGGCACGTCCATGGCG
GCGCCGCACGTGGCCGGGGCTGTCGCGGCCCTGATCTCGGGTCTGTCGTGCCGCGGCCTG
CCTCACTCGCCCTACTCCATGAAGCGAGCGCTGGAGAACACGGCCACGTACTTAGAACAC
GTGGAGCCCTGGGCGCAGGGGGCCGGCTTGTTGAATATAGAGAAGGCGTTCGAGCACTTG
GTGGAGCATCACGCGGCGGTGGAGCGTGACGTCACCTTCAATATAAAGTGCGGCGCCAAC
AACGCCAAGGGTATCTTCCTCCGTCCGCGGGCCGACGACCCGCCCCGGGACATCAGCATC
ACCGTGGAGCCGCAGTTCCTGGAGGACTTCCGAGACCAGAACAAGCGGGCGGTGATGGAG
CGCCAGTTGTCGTTCGAGGTCCGCCTGGCGCTGACGGCGGCTCCGGCCTGGCTGCACGGG
CCCAAGCACCTGCACCTGGCCGCGGCGCCCCGGGCCTTCGCCCTCAGGGTGCACACCGCG
GACTTACCTCCGGGACCTCACTTCGCCAGTCTGAACGCGTACGACGTGTCGTGCGTGTCC
AAGGGGCCGGTGTTCCGCGTGTCGGTGACGGTGCTGCAGCCTGAGCCGCTGGCAGGTCTG
CCACACGAGCCCCACATACGACTGACGGACGTACTGTTCCGGCCCTCCGCCATCAAGAGA
CACATCATAGTAGTCCCGCCCGAGGCGTCGTGGGGCGTGGTCCGCTTGGTCCGTCGCGGC
GGAGAGAGTTCGTCTCGGTTCCTGGTGCACGTGATGCAGCTCTCGCCGCGCCGCTCCTGC
AGGGACCACGAGACGCACCGCATCATGACGCTCGGACCGCACGCTCCCGCGCAGGCGCCC
TTCAGACTACTGGGCGGCGTGACGGTAGAGGTGGCGATCGCCAAGTACTGGGCGAACGCC
GGAGACGTGCAAGTAGATTATACTATAGAGTTACACGGACTGAGGCCGGACTGCGGGCAC
CGGCTGACGCTGACCAGTGCAGCGCTGGGCAGCGTGCGGCTCACAGCGCTGAGGCCGCTT
GATGTGCAGCCGACGGCGGTCCTCAAACACATCGAGCCCGTGTACAGGCCGTCCGAGTCC
AAGCTGTGTTCCCTGACCGCGCGTGACGTCATCCCCCCCTCCAGGCAGATCTACCAGCTG
CTGAACACGTATACCTTCAATATACCTAAAGCTACCGAAGTGTCGCCCATGGTGCCGATG
TTGTGTGACATGTTGTACGAGTCGGAGTTCGAGTCCCAGATGTGGATGCTGTACAACAGC
TGCAAACAACTCGTGGCTGTAGGGGACGCCTACCCCTCGAAGTACTCAGCCAAGGTGGAT
AAGGGCGAGTACACACTCCGCCTGTCTATTCGTCACGAGAACCGCGCGCTGCTGGAGAGG
CTCACCGAGCTGCCGGTCGTGGTGCAGCAGAGACTCGCGCAACCCATCACGCTGGACGTG
TACAGCGACCAGCCACAGGCGTTGACGGGCGGGAAGAAGTTCACGTCGGCGTCTCTGGCC
AGCGGCGATGTGCTGCCGCTGTACTTCGCGCCGCTCCCCGCTGATAAGATAAGTCGTTCG
AACCTGTCCATCGGCGTGTCCCTGACGGGGACGGTGTCGTTCGTGAAGGACGAGCTGGGT
CACAAGCACCTGCACATGGGCGAGTGTCAGACCCTACTGGACGGACCCCGTAGGACGATC
AAGGACAACAGGAGACACGAGGACTACCACGACGCGCTTAGAGAGTTCACCGTGGGCTGG
ATGACCAAGATGGAGGGAGAAAAGTTGGACCAGGTGTACGAAGAAATATTAGAAAAATTC
CCGAACTTCATTGGAGCTCACGTCGCTTACATGAACAGTCTGGACTCCCCGACAGACCCC
AAGAGGTTACCGAATACAGAAGACGGCACGAACGGACTGAAACCGGCTCAGGACGAACAG
ATCATAAGCATCGCTGACAAGGTCATCAAGAGTATAGACCAGGATAAGTTACTGGCGCAC
CTGGGGACGAAGAACGACATGCGAGCTGACTCCAACAAGATAAAACAAGAGTTCGACCGT
CAGCGCGGCTACCTGATCGAAGCGCTGTGCCGTCGCGGCTCCGCCATGTGTCGCCTGGGG
CGGTCGATCTCCGCCCTGCACGAGAACGCGAACACCTTACTGAAGTTCACGGAGCTGAGC
GAGCCGCGCGCGCTCCAGTACGGCCTTTGGCACTGGACCGCCTTGGAGCAGTGGGGGCGC
GCCATGAGGCTGTGGCTGAGGGTGCACGACGAGCGACCCTCGCGGGAGGTGGACCAGCGA
GCCGCGCGCGCCGCCCGGGCGCTGGGCTGGGGACACGTGGCCGCACACCTCGCCGCCGCC
GCGCCGCACAAGCACGCCGCACACTACCGCCCCTTCTGA
Protein sequence:
MADVPIDCEFPVWGLMPKRETGVVSFLNKYPEYDGRNTVIAILDSGVDPAAEGLKVTSTG
ETKVIERYDCSGCGDVDTSTVVKKVVDGYITGITGRKLKIPETWDNPKGEWRIGVVYPFS
LYPTKVKERIQEHRKEHVWDVGQKPAMAKATKDLQDFENEVSSKTTLSQEEKQAKEELEA
RVEVLKELDKKYTDVGPTYDCVLWHDGTVWRACIDTSEEGDLSSGVLLGEYSATQEHAHL
TPLDEMTVSVNVHNDGDTLEVVGMCSTHGTHVAAIAAGYFPDDPDRNGVAPGAKIISLTI
GDSRLGSMETGTALVRACVKVMELAARTKVDVINMSYGEHAHWSNAGRVGEIISMVVNKY
GVSWVVSGGNHGPALATVGAPPDIAQPILIGVGAYVSSEMMLAAYSMRARGCGPRKSTSS
AGPCSDGALGISVCAPGAALASVARFTLRNSQLMNGTSMAAPHVAGAVAALISGLSCRGL
PHSPYSMKRALENTATYLEHVEPWAQGAGLLNIEKAFEHLVEHHAAVERDVTFNIKCGAN
NAKGIFLRPRADDPPRDISITVEPQFLEDFRDQNKRAVMERQLSFEVRLALTAAPAWLHG
PKHLHLAAAPRAFALRVHTADLPPGPHFASLNAYDVSCVSKGPVFRVSVTVLQPEPLAGL
PHEPHIRLTDVLFRPSAIKRHIIVVPPEASWGVVRLVRRGGESSSRFLVHVMQLSPRRSC
RDHETHRIMTLGPHAPAQAPFRLLGGVTVEVAIAKYWANAGDVQVDYTIELHGLRPDCGH
RLTLTSAALGSVRLTALRPLDVQPTAVLKHIEPVYRPSESKLCSLTARDVIPPSRQIYQL
LNTYTFNIPKATEVSPMVPMLCDMLYESEFESQMWMLYNSCKQLVAVGDAYPSKYSAKVD
KGEYTLRLSIRHENRALLERLTELPVVVQQRLAQPITLDVYSDQPQALTGGKKFTSASLA
SGDVLPLYFAPLPADKISRSNLSIGVSLTGTVSFVKDELGHKHLHMGECQTLLDGPRRTI
KDNRRHEDYHDALREFTVGWMTKMEGEKLDQVYEEILEKFPNFIGAHVAYMNSLDSPTDP
KRLPNTEDGTNGLKPAQDEQIISIADKVIKSIDQDKLLAHLGTKNDMRADSNKIKQEFDR
QRGYLIEALCRRGSAMCRLGRSISALHENANTLLKFTELSEPRALQYGLWHWTALEQWGR
AMRLWLRVHDERPSREVDQRAARAARALGWGHVAAHLAAAAPHKHAAHYRPF