New model in OGS2.0 | DPOGS210981  |
---|---|
Genomic Position | scaffold262:94988-104765 (- strand) |
CDS Length | 1929 |
Paired RNAseq reads   | 1356 |
Single RNAseq reads   | 3126 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA006406 (1e-22) |
Best Drosophila hit   | CG4914 (2e-85) |
Best Human hit | transmembrane protease serine 9 (3e-58) |
Best NR hit (blastp)   | serine protease-like protein [Bombyx mori] (1e-117) |
Best NR hit (blastx)   | serine protease-like protein [Bombyx mori] (6e-119) |
GeneOntology terms    | GO:0004252 serine-type endopeptidase activity; GO:0006508 proteolysis |
InterPro families    | IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site; IPR009003 Peptidase cysteine/serine, trypsin-like; IPR001314 Peptidase S1A, chymotrypsin-type; IPR001254 Peptidase S1/S6, chymotrypsin/Hap |
Orthology group | MCL10168 |
Nucleotide sequence:
ATGTTTTTGTATGTACTCTGTCTTATGCTATGCGTGTCTCTATGTTCTACGGATACATTG
ACTAAGAATTTACTGAGAGTAAGCGATGAATACTACGCTCATGGTCGAAACAACGATTTA
CCGCCGTGCCGTGATTGTAGTTGTGGAGAACGTAATGAAGAACCTAGAATCGTGGGTGGT
TCTTCCACCGACGTGAACGCATATCCTTGGACGGCTCGTCTTATCTATTATAAGTCGTTC
GGATGTGGTGCCTCGGTCATCAATGACAGATACGTTATAACGGCAGCCCATTGTGTGAAG
GGATTCATGTGGTTCTTATTCAAAGTGAAATTCGGTGAGCATGATCGTTGCGATACTGGC
CATGTGCCTGAAACTCGTACAGTAGTTAAGATGTATGTACACAACTTTACTTTGACGGAA
TTAACTAATGACATATCACTACTACAGCTCAATAGACCTTTGGAGTATACACATGCTATC
CGACCCGTTTGTCTGCCTAAAACAGCGGATAATTTGTACGTTGGCAAAATAGCTACTGTC
GCTGGCTGGGGCGCCGTCCAAGAAACTGGTAAATGGTCGTGTACGTTACTCGAGGCTCAG
TTACCGATACTGAGCAACGAGAATTGTACCAAGACGAAATATGATGTAACAAAAATTAAG
GAAGTTATGATGTGTGCTGGATATCCAGAAACCGCTCATAAAGACGCTTGCACTGGAGAT
AGCGGTGGACCGTTGTTTATGGAAAATAGTGAACACGCTTACGAATTAATTGGTATAGTA
TCTTGGGGCTACGGATGTGCTAGAAAAGGCTACCCCGGTGTTTACACGAGAGTAACCAAA
TATTTGGACTGGATACGTGATAATACACAAGACGCATATAGTTGTCTTTACAAGAGCTGG
TCCAGCCTAATAATAGAACAGCGCATAATAATGTGGTTGAGGGAGCAACGTCCTCATTCC
CGTTGCATCCTCTCTCTGCTACAGCCTGGGGCTAGTGCTGCCTTAGGTGAAAAGGCCTTT
AATGAAACAAAAGAAACAACCACTGCGGCAAGTGGTAATATTGAGAGTTCCAGTAATACT
AATAGTAGTACTACTTCTACTACTACTCCTGCTACTACATTCGATCAGGAGATGTTAGAC
GAACTATATCAAGATTCGCAAAACAGGTGTAACTGTCGTTGCGGTGAAAGAAACGAGGAA
TCTCGTATTGTGGGTGGAGTGGAAACATCAGTGAACGAGTTCCCTTGGGTCGCTCGTCTG
ACTTACTTTAACAAGTTCTACTGCGGGGGCATGCTGATAAATGATAGATATATCCTAACT
GCGGCCCATTGTGTTAAAGGATTAATGTGGTTCATGATAAAGGTAACTTTGGGAGAGCAC
AACCGTTGTAACGACTCTCGTCCTGTAACACGTTATGTAGTACAAGTTGTTGCCCACAAC
TTTACCTATCTTACATTCAGGGATGATGTTGCCGTTTTGAGATTGAACGAGCCGATCGAA
ATATCAGATACAATTAAACCAGTATGTCTGCCCCAAATTACCGATAATGATTACGTGGGG
GTAAAAGCAATTGCCGTTGGTTGGGGATCGATTGGTGAGCAGAAAAATCATTCGTGCACT
CTATTAAACGTGGAATTGCCAGTGCTTAGTAATGACGTTTGTAGAAACACTATGTATGAG
ACGAGTATGATAGCGGATGGAATGCTCTGCGCCGGTTACCCAGACGAAGGACAAAGGGAC
ACTTGCCAGGGTGACAGTGGTGGACCTCTGACTGCAGAGAGAAAGGATAAACGTTACGAA
CTGCTGGGTATAGTCTCTTGGGGTATTGGGTGTGGAAGACGTGGATATCCAGGGGTTTAC
ACGAGGGTTACAAAATACCTGAATTGGATCAGAGACAACTCCCGCCACGGATGTTTCTGT
TCAGACTAA
Protein sequence:
MFLYVLCLMLCVSLCSTDTLTKNLLRVSDEYYAHGRNNDLPPCRDCSCGERNEEPRIVGG
SSTDVNAYPWTARLIYYKSFGCGASVINDRYVITAAHCVKGFMWFLFKVKFGEHDRCDTG
HVPETRTVVKMYVHNFTLTELTNDISLLQLNRPLEYTHAIRPVCLPKTADNLYVGKIATV
AGWGAVQETGKWSCTLLEAQLPILSNENCTKTKYDVTKIKEVMMCAGYPETAHKDACTGD
SGGPLFMENSEHAYELIGIVSWGYGCARKGYPGVYTRVTKYLDWIRDNTQDAYSCLYKSW
SSLIIEQRIIMWLREQRPHSRCILSLLQPGASAALGEKAFNETKETTTAASGNIESSSNT
NSSTTSTTTPATTFDQEMLDELYQDSQNRCNCRCGERNEESRIVGGVETSVNEFPWVARL
TYFNKFYCGGMLINDRYILTAAHCVKGLMWFMIKVTLGEHNRCNDSRPVTRYVVQVVAHN
FTYLTFRDDVAVLRLNEPIEISDTIKPVCLPQITDNDYVGVKAIAVGWGSIGEQKNHSCT
LLNVELPVLSNDVCRNTMYETSMIADGMLCAGYPDEGQRDTCQGDSGGPLTAERKDKRYE
LLGIVSWGIGCGRRGYPGVYTRVTKYLNWIRDNSRHGCFCSD
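
Consistency check (illustrative sketch, not part of the original record): the snippet below assumes the standard genetic code and uses only values copied from this record. It checks that the stated CDS length equals three times the protein length plus one stop codon, and that the first and last codons of the nucleotide sequence translate to the start and end of the protein sequence. The names `translate`, `FIRST_NT`, and `LAST_NT` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: the
# record uses the standard nuclear code).
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product("TCAG", repeat=3), AMINO)}


def translate(nt: str) -> str:
    """Translate an in-frame nucleotide string; trailing partial codons are ignored."""
    usable = len(nt) - len(nt) % 3
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


# Values copied from the record above.
CDS_LENGTH = 1929
PROTEIN_LENGTH = 642            # 10 lines of 60 residues + 42 on the last line
FIRST_NT = "ATGTTTTTGTATGTACTC"  # first 18 nt of the nucleotide sequence
LAST_NT = "TGTTCAGACTAA"         # last 12 nt, including the stop codon

# The CDS should cover every residue plus one stop codon.
length_ok = CDS_LENGTH == 3 * (PROTEIN_LENGTH + 1)
start_ok = translate(FIRST_NT) == "MFLYVL"   # protein begins MFLYVL...
end_ok = translate(LAST_NT) == "CSD*"        # protein ends ...CSD, then stop

print(length_ok, start_ok, end_ok)
```

The same `translate` helper can be applied to the full nucleotide sequence (with line breaks removed) to compare against the full protein sequence.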