New model in OGS2.0 | DPOGS213975 |
---|---|
Genomic Position | scaffold3433:+ 13697-16882 |
CDS Length | 1314 |
Paired RNAseq reads | 153 |
Single RNAseq reads | 479 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA013049 (3e-38) |
Best Drosophila hit | CG31326 (3e-25) |
Best Human hit | chymotrypsin-like protease CTRL-1 precursor (1e-20) |
Best NR hit (blastp) | hemolymph proteinase 16 [Manduca sexta] (3e-95) |
Best NR hit (blastx) | hemolymph proteinase 16 [Manduca sexta] (5e-92) |
GeneOntology terms | GO:0004252 serine-type endopeptidase activity; GO:0006508 proteolysis |
InterPro families | IPR001254 Peptidase S1/S6, chymotrypsin/Hap; IPR009003 Peptidase cysteine/serine, trypsin-like; IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site; IPR001314 Peptidase S1A, chymotrypsin-type |
Orthology group | MCL12074 |
Nucleotide sequence:
ATGAAAGTATTAAATATTAATTCAATAGTAATAATAATTTATTGTGTATTAAATGTAGAA
ACTATAAAATTTGCTAGACGTGTTCGGAAACGAGCGTCCTACGATGCATTTCCATGTAGT
GATAGTGAAAAAATAAAATCGTATTTCTCTGCTGGATTAGTAGATGGTTATAATATAAAA
ATTGATGAGGAGGTCGTTCCTGGAACACTCATTAAATTAAAATTTGATTCTGAAGCGGCA
GTTACGTTGGATAAGGCGTTCGCAAATTTTTATAATAATAAAGATATCATAAACAATATT
TATGATATAATATTGTTTCGATTTAATAATACTTTCGTCCTGAATGTTAAAGGACAACCT
GCTCCATACCCTCCGCCTTATTTGACGAGCATTAAAATAAATGGCAATGAATTATGTAGC
CAGCCTAATTTGACATACTTTGAGGATTATCCGGTTGGTGTTGTGAGACGTCCCAACGTT
CCTGATCGATTTTGTGGAAGACGTAAAGTCATTCATAGTGAGCTTATAACAAATGGATTG
AAAACTAAACCAGGGGAATGGCCTTTTCATGCTGCACTACACCGTCGGGAAAAAATGGGT
CTAAGATACACTTGTGGAGGGACATTAATTTCCAAGTTTTTTGTTTTAACAGCTGCCCAT
TGCACTACCGTTAGAGGAGTAGCCATTTTACCTGAAATCTTTAGTGTTTTTCTGGGAAAA
TATAATTTGTTTGGTGGTGACGTGTCAGTACAAGAAAAAGAGGTTTACAAGGTGTATGTT
CATGACGAATTTACGTACAGGACTCTGGATAATGACATCTCTCTATTGAAATTGAAAACT
GAAGCCGTATATGATAATTACGTGCAACCAGCTTGTTTATGGTTCAACAACGTGTACGAT
CAGCTACCTTCATCGCAAATTCAGGGCACGGTACCAGGTTGGGGTTTTGATATAACTGAC
TCCTTGTCTCCGACTCTCCACGCAGCCAGCATGCCTTTAGTTCCTGACAGGACTTGTGAA
TTAACCAATCCTTTATTTTATGTACAAGCTCTGCGTACTGCTAAAAAATTCTGTGCCGGC
TATACAAATGGAACCTCTGCATGTAATGGTGATAGCGGAGGTGGTTTTCACGTCTTTGTT
CCTGATTTAGCAAAAAGCAATATCCCAGACGTACCCGGAGCTTGGTATATAAGAGGCATT
GTGTCCACCAGCTTATCGAGAACTGATGCTGCTATTTGTAATCCCAAAGCTTACGCCGTA
TTCACAGACGTTGAAAAATATCTAGATTGGATAAATATTTACGTAAATTCATAG
Protein sequence:
MKVLNINSIVIIIYCVLNVETIKFARRVRKRASYDAFPCSDSEKIKSYFSAGLVDGYNIK
IDEEVVPGTLIKLKFDSEAAVTLDKAFANFYNNKDIINNIYDIILFRFNNTFVLNVKGQP
APYPPPYLTSIKINGNELCSQPNLTYFEDYPVGVVRRPNVPDRFCGRRKVIHSELITNGL
KTKPGEWPFHAALHRREKMGLRYTCGGTLISKFFVLTAAHCTTVRGVAILPEIFSVFLGK
YNLFGGDVSVQEKEVYKVYVHDEFTYRTLDNDISLLKLKTEAVYDNYVQPACLWFNNVYD
QLPSSQIQGTVPGWGFDITDSLSPTLHAASMPLVPDRTCELTNPLFYVQALRTAKKFCAG
YTNGTSACNGDSGGGFHVFVPDLAKSNIPDVPGAWYIRGIVSTSLSRTDAAICNPKAYAV
FTDVEKYLDWINIYVNS
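
The listed CDS length can be cross-checked against the protein sequence with simple arithmetic. The sketch below is illustrative only: the helper name `expected_cds_length` is hypothetical, the residue count of 437 was obtained by counting the protein sequence above, and it assumes the CDS ends with a single stop codon (here `TAG`).

```python
def expected_cds_length(protein_length: int, has_stop_codon: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_length + (1 if has_stop_codon else 0)
    return 3 * codons


# Values from this record: 437 residues counted in the protein sequence,
# and a CDS Length field of 1314.
protein_length = 437
listed_cds_length = 1314

# True when the CDS Length field agrees with the protein plus one stop codon.
consistent = expected_cds_length(protein_length) == listed_cds_length
print(consistent)
```

A mismatch here would usually point to a truncated sequence or a missing stop codon rather than an error in the annotation itself.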