New model in OGS2.0 | DPOGS201501  |
---|---|
Genomic Position | scaffold1102:17998-25159 (- strand) |
CDS Length | 2121 |
Paired RNAseq reads   | 3030 |
Single RNAseq reads   | 8022 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA002593 (0.0) |
Best Drosophila hit   | CG5355 (0.0) |
Best Human hit | prolyl endopeptidase (0.0) |
Best NR hit (blastp)   | PREDICTED: similar to prolyl endopeptidase isoform 1 [Apis mellifera] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to prolyl endopeptidase isoform 1 [Apis mellifera] (0.0) |
GeneOntology terms    | GO:0004252 serine-type endopeptidase activity; GO:0006508 proteolysis |
InterPro families    | IPR004106 Peptidase S9A/B/C, oligopeptidase, N-terminal beta-propeller; IPR002471 Peptidase S9, serine active site; IPR002470 Peptidase S9A, prolyl oligopeptidase; IPR023302 Peptidase S9A, oligopeptidase, N-terminal; IPR001375 Peptidase S9, prolyl oligopeptidase, catalytic domain |
Orthology group | MCL12048 |
Nucleotide sequence:
ATGACCTTCGACTATCCAGAGGCGAGGAGGGATGAGACGGTGGTCGACAACTATCATGGT
ACCAACGTATCAGACCCATACCGCTGGTTGGAGGATCCAGATTCAGACGAGACCAAAGCG
TTTATAGAAGCCGAAAACAATATAACTCGTCCGTTCTTAGACTCGTGTCCCGTTAAAACC
GATATAAACAGTAGGCTCACAGAGCTCTGGAACTATCCCAAATACTCCTGTCCATTCAGA
AGAGGAGATAGATACTTCTTCTTCAAGAACACCGGACTGCAGAATCAGAATGTGCTATAC
GTCCAAGATAGTTTGGACGGCGAGCCGCGCGTGTTCCTCGATCCGAATACGTTGTCTGAG
GACGGTACGATCGCTTTGTCCGGGAGTCGCTTCACCGAGGACGGTTCAACCTTCGCGTAC
GGTCTGTCAGCCAGCGGATCGGATTGGATAACGATCCATTTGAAGGATGTTGCTACTGGC
GTAGACTATCCCGAGGTTTTAGAGAAAGTTAAGTTCGCCTCAATGTCGTGGACAAAGGAC
AATAAGGGACTCTTTTATTCTCGGTATCCAGAGCAGACCGGCAAGACGGATGGTTCGGAG
ACGGACGTGAACAGAGATCAGAAGCTGTGCTACCACAGGCTGAACACGCCGCAGGAGGAT
GACGTCATCGTGGTAGAATTCCCCCAGGAACCTCTGTGGAGGATCGGTGCGGAGGTGTCG
GACTGCGGCAGGTATCTCCTCGTGAGTCCGGTGAGAGACTGTCGCGACAACCTGCTGTTC
TTCGCCGACCTGTCCTCCGCCTCGCTCACAGGACACCTCCAACTAACACAGATCGTGCAC
AAGTTCGAAGCCGACTATGAGTACATAACGAACGAGGGTTCCGTATGCATATTCCGGACA
AACAAGAACGCACCCAACTACAGACTCATAAAAATCGACCTGAATAACCCAGCTGAGGAA
AATTGGGAAACTTTAATAGCGGAACATCCCACTGATGTCCTGGACTGGGCTTCTGCGGTC
GACAAAGATAAGTTAGTCATACACTACATAAGGGACGTTAAGAGCGTACTGCAGTTACAC
AGTATGAAGACGGGTGATTTGATGCAAAACTTCGATTTAGGTGTTGGCTCCATAGTGGGG
TTCTCGGGGAAGAAAGAACAGAGCGAAATATTCTATCACTTCATGTCATTCCTTACACCC
GGCGTCATCTATCACGTGGACTTCAAGAAACAACCGTACGCACCAACCATATTCAGAGAA
GTTAAAGTGAAAGGCTTCGACGCTTCGCAGTATGAAGCCAAACAAGTTTTCTATAGCAGC
AAAGATGGCACGAGAGTTCCTATGTTCATAGTATCTAAGAAAGGTTTACCGCGTGATGGG
TCCCGCCCGGCGCTGCTCTACGGCTACGGCGGGTTCAACATCAACGTCCAGCCGAGCTTC
AGCGTGACGCGGATCGTGTTCATGCAGCACTTCGAAGGTTCCGTAGCGGTTCCGAACATC
AGAGGCGGCGGTGAATACGGCGAGCGGTGGCACAACGCCGGCAGACTGCTGAACAAGCAG
AATGTCTTCGATGATTTCATATCCGCCGGCGAGTATTTGGTGCGGGAAGGGTACACCAGA
CCCGGCCTGCTCGCGGTCCAGGGCGGCTCAAACGGCGGGCTGCTGGTTGCAGCGGTCGCA
AATCAGCGGCCCGACCTGCTGGGCGCAGCGATCGTTCAAGTCGGAGTGCTGGACATGCTG
CGCTTCCAGAAGTTCACCATCGGACACGCCTGGATATCGGACTACGGCAGCTCAGATAAT
AAGACACATTTCGAAAACCTGCTTAAGTACTCGCCGCTGCACAACATCCAGTCGCCAGAT
AACGTAAGCCGTGCCGAGTACCCGGCGACGTTGGTGCTAACTGCGGATCACGATGACCGC
GTAGTGCCGCTTCATTCCCTCAAGTATATAGCGACATTACAGCACGCTGTTAGAGGCACG
CCGCAAAGACGACCGCTGTTAGCACGGATCGACACGAAGGCTGGTCACGGAGGAGGAAAA
CCGACCGCGAAAATAATCGATGAACACACAGACATCCTGTGCTTCCTCGCTCAAACCCTG
GGACTTAAGTTCCTGAAGTGA
Protein sequence:
MTFDYPEARRDETVVDNYHGTNVSDPYRWLEDPDSDETKAFIEAENNITRPFLDSCPVKT
DINSRLTELWNYPKYSCPFRRGDRYFFFKNTGLQNQNVLYVQDSLDGEPRVFLDPNTLSE
DGTIALSGSRFTEDGSTFAYGLSASGSDWITIHLKDVATGVDYPEVLEKVKFASMSWTKD
NKGLFYSRYPEQTGKTDGSETDVNRDQKLCYHRLNTPQEDDVIVVEFPQEPLWRIGAEVS
DCGRYLLVSPVRDCRDNLLFFADLSSASLTGHLQLTQIVHKFEADYEYITNEGSVCIFRT
NKNAPNYRLIKIDLNNPAEENWETLIAEHPTDVLDWASAVDKDKLVIHYIRDVKSVLQLH
SMKTGDLMQNFDLGVGSIVGFSGKKEQSEIFYHFMSFLTPGVIYHVDFKKQPYAPTIFRE
VKVKGFDASQYEAKQVFYSSKDGTRVPMFIVSKKGLPRDGSRPALLYGYGGFNINVQPSF
SVTRIVFMQHFEGSVAVPNIRGGGEYGERWHNAGRLLNKQNVFDDFISAGEYLVREGYTR
PGLLAVQGGSNGGLLVAAVANQRPDLLGAAIVQVGVLDMLRFQKFTIGHAWISDYGSSDN
KTHFENLLKYSPLHNIQSPDNVSRAEYPATLVLTADHDDRVVPLHSLKYIATLQHAVRGT
PQRRPLLARIDTKAGHGGGKPTAKIIDEHTDILCFLAQTLGLKFLK
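
The CDS length in the table and the two sequences above can be cross-checked with simple arithmetic. The sketch below is only an illustrative consistency check, not part of the original record: the helper name `expected_protein_length` is hypothetical, and the protein length of 706 is obtained by counting the residues listed above. It assumes the CDS includes its terminal stop codon, which holds here because the nucleotide sequence ends in `TGA`.

```python
def expected_protein_length(cds_length: int, includes_stop: bool = True) -> int:
    """Return the number of amino acids encoded by a CDS of the given length.

    Hypothetical helper for illustration; assumes a complete,
    in-frame coding sequence.
    """
    if cds_length % 3 != 0:
        raise ValueError(f"CDS length {cds_length} is not a multiple of 3")
    codons = cds_length // 3
    # The terminal stop codon is part of the CDS but encodes no residue.
    return codons - 1 if includes_stop else codons


cds_length = 2121     # "CDS Length" row of the table
protein_length = 706  # residues counted in the protein sequence above

# Compare the length implied by the CDS with the listed protein length.
print(expected_protein_length(cds_length) == protein_length)
```

A mismatch in this check would usually point to a truncated sequence or a frame error in the model, so it is a cheap first test before any downstream analysis.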