New model in OGS2.0 | DPOGS207085  |
---|---|
Genomic Position | scaffold1:+ 2375321-2385335 |
See gene structure | |
CDS Length | 1659 |
Paired RNAseq reads   | 4585 |
Single RNAseq reads   | 12815 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA013049 (1e-105) |
Best Drosophila hit   | CG31326 (4e-36) |
Best Human hit | plasma kallikrein precursor (8e-27) |
Best NR hit (blastp)   | hemolymph proteinase 19 [Manduca sexta] (0.0) |
Best NR hit (blastx)   | hemolymph proteinase 19 [Manduca sexta] (0.0) |
GeneOntology terms    | GO:0004252 serine-type endopeptidase activity GO:0006508 proteolysis |
InterPro families    | IPR009003 Peptidase cysteine/serine, trypsin-like IPR001254 Peptidase S1/S6, chymotrypsin/Hap IPR001314 Peptidase S1A, chymotrypsin-type IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site |
Orthology group | MCL18899 |
Nucleotide sequence:
ATAGGTCTTTTTGTGCCAACACACGAACAAAGTACAACTATATCGCCATGCCCCAACGTG
TTTATGTACGAACCCTCCGGCACAGAACCGGGAAGATGGTACGGGGTTGTCAATCTGTCA
ACAGATAGCACCTTACACTCTCTATGGTTGAATATCGTGCTTGATAGCAAGGCTGATATT
TTAGGGAATTGGGTAGGAGATGTAACGACCACAGACAATATAGATTTCAAAATTGAAAAC
ACAAGAATGAAAATACATCCTGGCCCGGCTGTGGCAGTTCGTTTCTTCGTACAATACAGC
CCTCTCAATAAAGCACCACGCTTGAGTGCTATCAGACTAAATGGTAGAGAAATCTGCAAT
GCCAAAACACCACAACCAGCGATTGAAGTGGTTGAAACCGCGAGGCCTGATCCAACATCA
ATCAGACCAAGGCCTGAAACCTCAAGGCCTGTAGACCGGCCGGTAGATAGGCCAATAGAT
CGCCCTGTTGAAAGACCGATCGATCGTCCGGTAGACAGGCCCAGCGTCCAAACAAGACCT
GTTCTGAGACCTGAAGATAAAACAAAACCTGTGTATGGAAGACCTTCTGGGGATGGACCT
GTGTATGTACCAACATCTAATCCACCAATCGAGCAGAGCAATGTTAACCCATATAGCATC
GGTGGTGGCCAGGTACCAGCTCAGAGTGTAACCCATTCAAGGCCAACATACCAGTTGACC
ACGACTTCTTATACATCCACCACGCCCGAAGACGAAGATAGTGACAGCGATGCTGACCCT
TCAGAATACTTCAACGGCGGTCAACTACTGGTCACACCGGTACCCAGCGGCCAAGGATAT
GTACAGCCTAAAAATGAACAATGCGGTAAAGTGCTCCGAAACAATCCGAATCCTCTGGTG
GTGAACGGCAAGCCGACGCTCGAAGGACAATGGCCCTGGCAGATAGCCCTTTATCAAACA
CAGACGGTGGATAGCAAGTACATTTGCGGCGGTACTCTCGTCTCCCACAAGCACGTGGTG
ACGGCAGCGCACTGCGTCACCCGCAAAGGTTCCAGTCGTACTGTGAACAAGAACACCCTC
ACCGTGTACTTGGGAAAACACAACCTCCGGACCTCTGTAGAGGGAGTTGAAATCAGACTT
GTGGGTGAGATAACTGTCCACCCTCAGTACAACGCGTCCTCGTTCAGTCGTGATCTCAGC
ATCCTCAAGCTCCGCAAAGCCGTCGAGTACACAGAATTCATACGTGCCGCCTGCCTCTGG
CCGGAGAACCAGATCGATTTGACGAACGTCATCGGCAAAAAGGGCTCCGTGGTAGGGTGG
GGTTTCGACGAGACGGGAGTCGCAACTGAAGAACTGACACTAGTGGAGATGCCGGTGGTG
GATCAAGAAACTTGCATCCGCTCTTACAGCGAGTTCTTCGCCAGATTCACTTCTGAGTAC
ACATACTGCGCTGGATATAGAGATGGCACGTCAGTGTGTAATGGTGACAGCGGTGGGGGT
ATGGTGTTCGAGATGCAAGGATCGTGGTATCTGAGAGGCCTGGTATCCCTCTCAGTGGCG
AGACAAAACGAATACAGATGTGACCCAACACACTACGTAGTATTTACAGACTTAGCCAAA
TTTTTATCTTGGATAAAGCAGCATGTAACTAGCGTCTAA
Protein sequence:
IGLFVPTHEQSTTISPCPNVFMYEPSGTEPGRWYGVVNLSTDSTLHSLWLNIVLDSKADI
LGNWVGDVTTTDNIDFKIENTRMKIHPGPAVAVRFFVQYSPLNKAPRLSAIRLNGREICN
AKTPQPAIEVVETARPDPTSIRPRPETSRPVDRPVDRPIDRPVERPIDRPVDRPSVQTRP
VLRPEDKTKPVYGRPSGDGPVYVPTSNPPIEQSNVNPYSIGGGQVPAQSVTHSRPTYQLT
TTSYTSTTPEDEDSDSDADPSEYFNGGQLLVTPVPSGQGYVQPKNEQCGKVLRNNPNPLV
VNGKPTLEGQWPWQIALYQTQTVDSKYICGGTLVSHKHVVTAAHCVTRKGSSRTVNKNTL
TVYLGKHNLRTSVEGVEIRLVGEITVHPQYNASSFSRDLSILKLRKAVEYTEFIRAACLW
PENQIDLTNVIGKKGSVVGWGFDETGVATEELTLVEMPVVDQETCIRSYSEFFARFTSEY
TYCAGYRDGTSVCNGDSGGGMVFEMQGSWYLRGLVSLSVARQNEYRCDPTHYVVFTDLAK
FLSWIKQHVTSV