New model in OGS2.0 | DPOGS205232  |
---|---|
Genomic Position | scaffold3784:5478-11404 (- strand) |
CDS Length | 1542 |
Paired RNAseq reads   | 2833 |
Single RNAseq reads   | 6373 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA014404 (2e-62) |
Best Drosophila hit   | CG5390 (2e-57) |
Best Human hit | hyaluronan-binding protein 2 isoform 1 preproprotein (2e-25) |
Best NR hit (blastp)   | serine proteinase-like protein 1 [Helicoverpa armigera] (3e-71) |
Best NR hit (blastx)   | serine proteinase-like protein 1 [Helicoverpa armigera] (5e-72) |
GeneOntology terms    | GO:0004252 serine-type endopeptidase activity; GO:0006508 proteolysis |
InterPro families    | IPR001254 Peptidase S1/S6, chymotrypsin/Hap; IPR009003 Peptidase cysteine/serine, trypsin-like; IPR001314 Peptidase S1A, chymotrypsin-type |
Orthology group | MCL22649 |
Nucleotide sequence:
ATGACGCCGAAAACCTGTTCGTGTAATGCACATGTTTGTGTCTCAACACAAAACTTGTTT
GGCTACAAGCGTTCAGTCGTGTTTCGCGCTTCTTGGCGAGCGGTCGTCAATATGAAGTCC
ATATCACTTATAGTCGTCGTGTCACTCGTGACTAGGAGTTTCGGGGCAAACATATCTTTC
CCAAACGATAAATTGGATGAAGCCAACGAATGGCTGAACAGGGTTTACGAAATGAACAGC
ACTGGTGATGTTTTGGATAGATTTGGTAACAAACAACAAGAATCCACAGAGGGGAGATCT
TGTGTATCAGTGAAACAAAGGAACGGCGTGTGTGTTGCTAAGGAACTGTGCTTGGAACAC
GAACCGGACGTGGATCCTACAACATTATTTATTCCGAGAGAAAGGATTGCTTGCACCCCA
TCAGAATACTGTTGCTTTAAAAGCGCTCCAATGAAAGCTGGGGAGGTATCTGGGGAGGGG
TCCGGGGAGACCATAAACAAATCTGGGGCAGGTCGTAAGTCGTGCACCAACGATAAAAAT
GAACAGGGACTGTGTGTCAGGAAGGCGTCCTGCGTGAACGCTGTGCAGACAGTTGACCCC
ACAATCAATTTCAATCTCAGGGAACGCGAATTATGCCACTACTTAGAGACCTGCTGCTTG
GAGAAGAACATCAAAAAGAAAGCTGTGAAACCGATTCGTCAGCAAAACACCGGCTGCGGC
TGGAGCAACCCTGGCGCCAACGTGTTCAGGGAGAAGAACTCGCCAACAGGTTTCGCTGAT
TACGGGGAATTCCCGTGGATGGTAGCTTTAATCCAGAAGGGTACTGGAAAAGATGGCTTC
AACGAAAGTTACGCTGGCGGAGGAGTTCTCATCCATCCGTCAGTAGTCATGACAGCGGCA
CATAAAGTACAAAATTTCAAACCGGAAGTGGTAAAAATCCGAGCCGGCGAATGGGACACT
CAAACAGACGCGGAAGTGGAGCCCTATCAAGAGAGAGACGTCTCTAAGATCATTATACAC
GAAGGTCACAATGAAAAACAGCACAACGATGTGGCGCTTCTGATTCTGAAGTCGCCAGTG
GATCTGTCAGATGCTCCTCACATCGCTGTAGGTTGTCTAGCATCTCGTCTCCCCCCACCT
GGAACGAGGTGTTACAGCATGGGATGGGGCGAAGACTTCCTCAATGACAACAAATACGCC
GTCATTTTAAAGAAGGTGGAACTGCCCCTGGTAGAAGCCTCGGACTGCGAGAGTCGTTAC
AAACGCACGGTTCTCTCGAGCGCTTACGTTCTGGATAAGACGTTGATGTGTGCGGGGGGC
GAGCAGGGAGTTGACACGTGTCGCGGAGATGGGGGGAGTCCGCTGGTGTGTCCCATTAAG
GGTCAGCCTGATAGGTTTGAGGTCGTCGGTCTGGTGGTGTACGGCCTGCAGTGCGGGACG
GGCGGTCTCCCAGGGGTATACCTGAACGTCCCGCAAGTACACGACTGGGTCGGTCAACAA
CTCGAAAAGGAATCTTTCGGAAGGACTTCATACGTCTATTAA
Protein sequence:
MTPKTCSCNAHVCVSTQNLFGYKRSVVFRASWRAVVNMKSISLIVVVSLVTRSFGANISF
PNDKLDEANEWLNRVYEMNSTGDVLDRFGNKQQESTEGRSCVSVKQRNGVCVAKELCLEH
EPDVDPTTLFIPRERIACTPSEYCCFKSAPMKAGEVSGEGSGETINKSGAGRKSCTNDKN
EQGLCVRKASCVNAVQTVDPTINFNLRERELCHYLETCCLEKNIKKKAVKPIRQQNTGCG
WSNPGANVFREKNSPTGFADYGEFPWMVALIQKGTGKDGFNESYAGGGVLIHPSVVMTAA
HKVQNFKPEVVKIRAGEWDTQTDAEVEPYQERDVSKIIIHEGHNEKQHNDVALLILKSPV
DLSDAPHIAVGCLASRLPPPGTRCYSMGWGEDFLNDNKYAVILKKVELPLVEASDCESRY
KRTVLSSAYVLDKTLMCAGGEQGVDTCRGDGGSPLVCPIKGQPDRFEVVGLVVYGLQCGT
GGLPGVYLNVPQVHDWVGQQLEKESFGRTSYVY
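
The CDS length reported in the table (1542) can be cross-checked against the two sequences: 1542 nucleotides make 514 codons, which is the 513-residue protein plus one stop codon. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1) and a CDS that ends in a single stop codon. The names `translate` and `protein_length` are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS; stop codons are rendered as '*'."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def protein_length(cds_length: int) -> int:
    """Expected residue count, assuming one terminal stop codon (assumption)."""
    return cds_length // 3 - 1


# First five and last five codons of the nucleotide sequence above.
print(translate("ATGACGCCGAAAACC"))  # MTPKT
print(translate("TCATACGTCTATTAA"))  # SYVY*
print(protein_length(1542))          # 513
```

Passing the full nucleotide sequence to `translate` and stripping the trailing `*` should give a string to compare directly against the protein sequence listed above.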