New model in OGS2.0 | DPOGS211355  |
---|---|
Genomic Position | scaffold695:16474-20075 (- strand) |
CDS Length (bp) | 1089 |
Paired RNAseq reads   | 124 |
Single RNAseq reads   | 368 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA008471 (6e-149) |
Best Drosophila hit   | CG17572 (4e-73) |
Best Human hit | chymotrypsin-like protease CTRL-1 precursor (2e-28) |
Best NR hit (blastp)   | PREDICTED: similar to CLIP-domain serine protease subfamily B (AGAP009263-PA) [Tribolium castaneum] (4e-86) |
Best NR hit (blastx)   | serine protease H99 [Tribolium castaneum] (2e-86) |
GeneOntology terms    | GO:0004252 serine-type endopeptidase activity; GO:0006508 proteolysis |
InterPro families    | IPR001254 Peptidase S1/S6, chymotrypsin/Hap; IPR009003 Peptidase cysteine/serine, trypsin-like; IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site; IPR001314 Peptidase S1A, chymotrypsin-type |
Orthology group | MCL17915 |
Nucleotide sequence:
ATGTTTAAACTGTATGCGTGTTTATGTTTAAGTATTTTAAAATTAACAGTTAACGCACAG
TTTGGGACCGTATCCATAAGTGTCAATATGGATCCACGAGCAGATATTGATATGAGTACT
CACAATCGCTGTCCAGTTAATATGTCGTGTGTCCCAATAAGTTCGTGTCCATTGTTAGAA
GATCTATTAGATTTCTCGTGTTTTTCATCCGATCGGTATTTCCATCGTCTGAACTCGTTA
ACATGTGGCAATGTTAACAATGAAGACTATGTGTGCTGTCCGTCGTGCGAGTGCGGGCGA
GTTTACGCACCAGGAACTGAATCCTGCGGGAAGAGCATGGTCCAAGGGATTGATTACAGC
GGCATTGGAGCTCATCCCTGGGTTGCCAGAATAGGATTCGCAAATAAAGACACGGGCAAC
GTAAGATTTGCTTGCAGTGGCTCCATTATTGCGAAGCGGGTTATTTTGACAGCGGCGCAT
TGTGCTTTGGCGAAACCTGAAGGATACAAATTGTCTACGATAGTAGTCGGTGAGTGGGAC
ATCAGTAGTAGTCCGGATTGCAGCGATTATTTCTGTGCTCCTGCCACACAAGCCATCAAA
GTGGAGAGCGTGTCTGTGCATCCAGGATACGAACAGAAGATATTCAGACATGACATAGCG
ATGATTATATTAAAGGATGAGATAAAATTTTCTGTGACAGCTGCTCCGATCTGTTTGAAT
GATAAGCCGGAAGTGGTGATCAACGAACGCGCTTCGCTTGTCGGATGGGGGAAACTGTCC
GGACAAAACAACTTGATTGGTCGCCAACAACAGTTAGAAGTACCGTTGGTGTCGCTGGAG
ATTTGTGAGAAGGTTTTTGGTGAATCCGTGCCTATTCATGAAGGGCAGCTTTGTGCGGGC
GGCGAAGAGGGCAAGGACGCATGTTCGGGCTTTGGAGGAGCTCCTTTGATTCTTAATAGA
GACGGCCAATTTGTACAGATTGGCATTGTATCCTTCGGGTCGGAGAACTGTGGCAGTGAA
GGCATCCCCAGCGTGTACACAAACATCGCACATTATTATAGGTGGATTGTTGACAACATG
CCTTCTTGA
Protein sequence:
MFKLYACLCLSILKLTVNAQFGTVSISVNMDPRADIDMSTHNRCPVNMSCVPISSCPLLE
DLLDFSCFSSDRYFHRLNSLTCGNVNNEDYVCCPSCECGRVYAPGTESCGKSMVQGIDYS
GIGAHPWVARIGFANKDTGNVRFACSGSIIAKRVILTAAHCALAKPEGYKLSTIVVGEWD
ISSSPDCSDYFCAPATQAIKVESVSVHPGYEQKIFRHDIAMIILKDEIKFSVTAAPICLN
DKPEVVINERASLVGWGKLSGQNNLIGRQQQLEVPLVSLEICEKVFGESVPIHEGQLCAG
GEEGKDACSGFGGAPLILNRDGQFVQIGIVSFGSENCGSEGIPSVYTNIAHYYRWIVDNM
PS
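
The nucleotide and protein sequences above can be cross-checked against each other. The following is a minimal sketch, assuming the CDS is translated with the standard genetic code (NCBI translation table 1) and that the record lists the protein without its stop codon. The names `translate` and `CODON_TABLE` are illustrative and not part of the record.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# written in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; a trailing stop codon is dropped."""
    # Remove line breaks and spaces so the sequence can be pasted as displayed.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 18 nt and last 9 nt of the nucleotide sequence in this record.
print(translate("ATGTTTAAACTGTATGCG"))  # MFKLYA
print(translate("CCTTCTTGA"))           # PS
```

Both fragments agree with the start (`MFKLYA`) and end (`PS`) of the listed protein. The lengths are consistent as well: 1089 nt is 363 codons, that is 362 residues plus the stop codon, which matches the 362 residues shown under "Protein sequence".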