New model in OGS2.0 | DPOGS215183  |
---|---|
Genomic Position | scaffold895:- 45147-52195 |
See gene structure | |
CDS Length | 1389 |
Paired RNAseq reads   | 281 |
Single RNAseq reads   | 1282 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA008668 (8e-94) |
Best Drosophila hit   | CG1299 (2e-52) |
Best Human hit | plasma kallikrein precursor (5e-30) |
Best NR hit (blastp)   | clip domain serine protease 4 [Bombyx mori] (4e-104) |
Best NR hit (blastx)   | coagulation factor-like protein 3 [Hyphantria cunea] (1e-103) |
GeneOntology terms    | GO:0004252 serine-type endopeptidase activity GO:0006508 proteolysis |
InterPro families    | IPR006604 Disulphide knot CLIP IPR001254 Peptidase S1/S6, chymotrypsin/Hap IPR022700 Proteinase, regulatory CLIP domain IPR009003 Peptidase cysteine/serine, trypsin-like IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site IPR001314 Peptidase S1A, chymotrypsin-type |
Orthology group | MCL22674 |
Nucleotide sequence:
ATGTCACCTCCTCACGTAATGAGTAGTGCTCTGTTGACAGTGCTGTGTGTCTGTTTGTGT
CCACAAGCTGTGTTTTGTCAATACCAGGAAACTTGCACCACATTAGAAGGTGGCATCGGA
ACATGTACGCCAACTATCTTATGCGCTCCGTACTTCAATCTGCTGGGAGTAGCCAAAAAC
CTCACTATTTCTTTTCAACTTAGAGATGCTCAATGTGGGTCACATGGTATCAACATCATG
GTCTGCTGCCCAAATCAAAATGAGCCACCTCCAGAAACAAGCAAATTATTATTGACCTTA
AATACCTCCGATGACTCTAATAAGGATCAAGGGATCACTCAACTTGACGAAACGTGCACC
ACAATTGAAGGTGGTGTTGGCAAGTGTGAGTCGTTAGCAGCCTGCGAGCCGTACCTTCAT
CTGACGAGACAAGCCAAAAACATTCCCTTGGCTATTCAACTTAGAGATGCTCAATGCGGT
TCAGACGGAAACGACCAAAAGGTCTGCTGTCCTACTTCAGGTACCTCCACTTCATCGCCA
ACTGGCGAACCTTCCTTCAGGTCACTATCAGAGTCAGACTACATAACTGCCTTCCCTGAA
CCACCAGATTGTGGATTCAGTTTAGCACACTTTAACAGAGTTGTGGGAGGTGTGAACGCT
AAACTCGGTGACTTCCCCTGGATGGCACTTCTTGGTACCAAACAAGGAAACTGGGACGCA
GCACGTTGGATATGTGGGGGAACTCTGATCTCTCACCGCCACGTCCTGACCGCTGCTCAC
TGTATAAAGAATGAATTGAACGTGGTCCGACTTGGAGAACTGGACTTCGAAAGAGACGAC
GATGGCGCTTCTCCCATAGACTTTTCCATTAAAAGAAAAATCAAACATGAAAACTTCGAC
TACGCTTCCTTCACTAATGACATCGGCCTTTTGATATTGGGAAAGGATGTGGAGTTCACT
CGTCTGATGCGGCCGATCTGTCTGCCGACTCGTGAAGACCTACGTTCAAAATCTTTTGTT
GGCTACCATCCTTTCATCGCCGGTTGGGGAAACGTCGACAACCGTGGTGCTGCTAAATCT
CACATGCAAGTTGCGCAGCTGCCTGTCCTGGAAAACTCCAAATGCAGGAGGGTTTACGAA
TTGCGGGTCATCGACGAAAGGGTCATGTGTGCTGGCGTCACAGGCAAAGACTCCTGCAAT
GGTGACAGTGGCGGACCGCTCATGCAACCGAATACGAACCGGACAACGGGTAAAATATAT
TTCTATCAGACCGGCGTGGTGTCGTATGGTCACACTAGATGTGGTGAAGCGAGTTTCCCA
GGCGTGTACAGCTCAGTGCAGCACTTCCTGCCCTGGATACAGAAACACGTGCTGGGATCG
GACGAATGA
Protein sequence:
MSPPHVMSSALLTVLCVCLCPQAVFCQYQETCTTLEGGIGTCTPTILCAPYFNLLGVAKN
LTISFQLRDAQCGSHGINIMVCCPNQNEPPPETSKLLLTLNTSDDSNKDQGITQLDETCT
TIEGGVGKCESLAACEPYLHLTRQAKNIPLAIQLRDAQCGSDGNDQKVCCPTSGTSTSSP
TGEPSFRSLSESDYITAFPEPPDCGFSLAHFNRVVGGVNAKLGDFPWMALLGTKQGNWDA
ARWICGGTLISHRHVLTAAHCIKNELNVVRLGELDFERDDDGASPIDFSIKRKIKHENFD
YASFTNDIGLLILGKDVEFTRLMRPICLPTREDLRSKSFVGYHPFIAGWGNVDNRGAAKS
HMQVAQLPVLENSKCRRVYELRVIDERVMCAGVTGKDSCNGDSGGPLMQPNTNRTTGKIY
FYQTGVVSYGHTRCGEASFPGVYSSVQHFLPWIQKHVLGSDE