New model in OGS2.0 | DPOGS215188  |
---|---|
Genomic Position | scaffold895:- 1007-5346 |
See gene structure | |
CDS Length | 1152 |
Paired RNAseq reads   | 428 |
Single RNAseq reads   | 988 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA008668 (7e-76) |
Best Drosophila hit   | CG32260 (7e-44) |
Best Human hit | coagulation factor X preproprotein (1e-26) |
Best NR hit (blastp)   | hemolymph proteinase 17 [Manduca sexta] (9e-99) |
Best NR hit (blastx)   | hemolymph proteinase 17 short form [Manduca sexta] (2e-92) |
GeneOntology terms    | GO:0004252 serine-type endopeptidase activity GO:0006508 proteolysis |
InterPro families    | IPR006604 Disulphide knot CLIP IPR001254 Peptidase S1/S6, chymotrypsin/Hap IPR022700 Proteinase, regulatory CLIP domain IPR009003 Peptidase cysteine/serine, trypsin-like IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site IPR001314 Peptidase S1A, chymotrypsin-type |
Orthology group | MCL10588 |
Nucleotide sequence:
ATGGTTCTCCGAGTAGTATTTCTGTGTTTGTGTTTCCAGACAGCGCTCTCTCGTATCGAA
TATAGAAGAGAGGAGTCATGCAAGGAAGTGAATGGTGTTACTGGAAGATGTGTCTCCATA
GAATCCTGTCCCCCGTTTGTCCTGATGATGCAGACGGAACTTATTACTCAATACAAAACA
CTTCTAAAGCAATCACACTGTGGGTTTGAGGGAAACGTCCCCATGGTTTGCTGTCCCGAT
ACTTCTCCTGAATCTCCATCCTCTCTCGGTCCCTCATCTCCTGTCGACGTTCCATCTAAG
GGGTCGGCTCGAATGATGAACCTTCAATCACCATTTCTGTCACCACCAACCTGTGGAGTG
TCGAACGCTTCATCTGGCCGGGTTGTGGGAGGAGTTGACGCTAAGCTCGGAGACTTGCCC
TGGATGTGCCTTTTGGGGTACTGGGAGGGTGGCTATGATAAAGGCGGATCAAACGGGGAC
ACCAAGTGGAGATGCGGGGGATCGCTGGTGTCCGCACAACACGTGCTCACAGCCGCTCAC
TGTATTCATCACAGAGAGAAAGAACTATACGTGGTCCGTCTCGGAGAGTTGGATCTCGAT
CGTGATGATGAAGCGGCTCCAATCGACGTCCTCATTAGAAGAGCAATAAAACATGAAGCA
TATAACAGGGACACGTACACTAACGACATAGGACTCCTCGTACTCGAAAGAGGTGTCGAG
TTCACAAACCTGATACGGCCTATTTGTCTTCCGATCCTTCCTGAATTACTGTCTAACACG
TTTGTCAACTACAGTCCGTTCGTTGCTGGCTGGGGCAGAACGTCAGATCGAGGTCCCGGT
TCGAGCCATCTCAAACTGACTCAATTGCAAGTAGTCGATAACCAAAAGTGTAAGAAAACG
TACCTGGAGTACCCCGCCGTGATTGATGATAAGGTCTTGTGTGCTGAAGCGGGAGGACGC
GACGCCTGCGAAGGGGACAGCGGGGGACCCCTTATACAACCATTTTATAATCAGGATAAG
AAAGTGTATTACTTCTACCAGACAGGTGTTGTAGCGTACGGAAGACGTTGTGCTGAAGCC
GGTTACCCCGGGGTATATTCCAGGGTAACTCACTACATACTCTGGATACAGAAGCACATC
ATGGAGAACTAA
Protein sequence:
MVLRVVFLCLCFQTALSRIEYRREESCKEVNGVTGRCVSIESCPPFVLMMQTELITQYKT
LLKQSHCGFEGNVPMVCCPDTSPESPSSLGPSSPVDVPSKGSARMMNLQSPFLSPPTCGV
SNASSGRVVGGVDAKLGDLPWMCLLGYWEGGYDKGGSNGDTKWRCGGSLVSAQHVLTAAH
CIHHREKELYVVRLGELDLDRDDEAAPIDVLIRRAIKHEAYNRDTYTNDIGLLVLERGVE
FTNLIRPICLPILPELLSNTFVNYSPFVAGWGRTSDRGPGSSHLKLTQLQVVDNQKCKKT
YLEYPAVIDDKVLCAEAGGRDACEGDSGGPLIQPFYNQDKKVYYFYQTGVVAYGRRCAEA
GYPGVYSRVTHYILWIQKHIMEN