New model in OGS2.0 | DPOGS206217  |
---|---|
Genomic Position | scaffold3442:5534-8834 (- strand) |
CDS Length | 1290 |
Paired RNAseq reads   | 27 |
Single RNAseq reads   | 79 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA009694 (5e-128) |
Best Drosophila hit   | CG13744 (7e-107) |
Best Human hit | transmembrane protease serine 9 (1e-35) |
Best NR hit (blastp)   | GE22661 [Drosophila yakuba] (3e-110) |
Best NR hit (blastx)   | GD10114 [Drosophila simulans] (4e-110) |
GeneOntology terms    | GO:0004252 serine-type endopeptidase activity; GO:0006508 proteolysis |
InterPro families    | IPR009003 Peptidase cysteine/serine, trypsin-like; IPR001314 Peptidase S1A, chymotrypsin-type; IPR001254 Peptidase S1/S6, chymotrypsin/Hap; IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site |
Orthology group | MCL17661 |
Nucleotide sequence:
ATGTTCACGGTGAAGAGGCAAGCGAAGACATGGATACCGATACAGGACAGGAAAATAAAA
AACTGTTTTATCACCAAAAAACTATCGAACAGTCCCAAGAATGTATTTAAAAGGCGAGCC
TTTAAATGGCGGAAACCTAAATGCCAGACGGTTAACATAATCATACCTCTCATGATTTTG
AACTTTGCTGGACACACCAGCTCGGAGAGTCTTAGTAACAGAGTCCTGGCCTCACTCCTG
GGATACCCGACCACGTGTACTGTTGGTTCTCAAGTGCGAGCCTGTTCTCTGTCGCTGACT
TGTTGGCTCCGCGGTGGTATCAGGGTGAAGGGTTGCGGAGGAAGCTGGTTGTTCTCATGC
TGTTACATAGCCCGGGACAGCTATGACTATGATAACTCAATCCCCTCTTCCGACTGGAAA
TACAAAATACCGCCGAAGTTACGTCAAGTACCTCAGAGGAATGTGGTGCCAACTAACGTG
TTCCGACGGAGAGTCGACGACGACATTAGTCAGATGGAGTGCGGCCTCTCCTCAAGTCGC
ATGCTCCAGAAGCGTATCATCGGCGGTCGGGAGGCCAGGGTCGCGGAGTTCCCCTGGCAG
GCTCACGTCAGGATCTCAGAGTTCCAGTGCGGCGGAGTCTTAATATCTCGTTGGTACGTG
GCGACGGCAGCTCACTGCGTGTCCCGAGCTCGTCCTAGGGATGTGGCCGTGTGGCTCGGA
GCACTTGACACCACCTCTGGGGATAAAAGCGCGAGAAAAATTGGGGTCGTCCAGAAAATC
CTCCACCCCCTCTTCCAGTTTCGCATGACCCAACCTGACCGGTACGACATAGCGTTGCTA
AAACTCTCCCGACCTGTGACCTACACTAGTCACATCCTCCCGATCTGTCTGCCCGACGGA
GATTTCGAACTCCGCGGCAAGTCAGGGGTCATCGCCGGCTGGGGCAAGACCGATACCAGC
AACGGCCACACTGGCACTAACTTACTACGGTCCGCTACTGTACCGATTTTGAGCACCGAA
CAATGTATCAACTGGCACCAGAGTAAGCAGATCTCTGTTGAAATACATTCGGAGATGATC
TGCGCCGGACATTCAGACGGACACCAAGATGCGTGTCTAGGTGACTCTGGAGGTCCCCTA
ATTGTGTTGGACAGGGGTCGTTACTACCTGGCCGGTATCACCTCGGCCGGGTTCGGCTGC
GGCGTCGACCACCAGCCAGGGATCTATCACAACGTGCGGGTCACCGCTGGCTGGATCAGA
GACGTCATCACCAGATATGGTGACCTCTAG
Protein sequence:
MFTVKRQAKTWIPIQDRKIKNCFITKKLSNSPKNVFKRRAFKWRKPKCQTVNIIIPLMIL
NFAGHTSSESLSNRVLASLLGYPTTCTVGSQVRACSLSLTCWLRGGIRVKGCGGSWLFSC
CYIARDSYDYDNSIPSSDWKYKIPPKLRQVPQRNVVPTNVFRRRVDDDISQMECGLSSSR
MLQKRIIGGREARVAEFPWQAHVRISEFQCGGVLISRWYVATAAHCVSRARPRDVAVWLG
ALDTTSGDKSARKIGVVQKILHPLFQFRMTQPDRYDIALLKLSRPVTYTSHILPICLPDG
DFELRGKSGVIAGWGKTDTSNGHTGTNLLRSATVPILSTEQCINWHQSKQISVEIHSEMI
CAGHSDGHQDACLGDSGGPLIVLDRGRYYLAGITSAGFGCGVDHQPGIYHNVRVTAGWIR
DVITRYGDL
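
The nucleotide and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The sketch below is illustrative only: the helper name `translate` and the choice of NCBI translation table 1 are assumptions, not part of this record. It translates the first and last lines of the nucleotide sequence and confirms that a 1290-nt CDS with one stop codon corresponds to a 429-residue protein.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: this record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence; stop codons appear as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGTTCACGGTGAAGAGGCAAGCGAAGACATGGATACCGATACAGGACAGGAAAATAAAA"
last_line = "GACGTCATCACCAGATATGGTGACCTCTAG"

print(translate(first_line))  # MFTVKRQAKTWIPIQDRKIK
print(translate(last_line))   # DVITRYGDL*

# 1290 nt = 430 codons; minus the stop codon, the protein has 429 residues.
cds_length = 1290
protein_length = cds_length // 3 - 1
print(protein_length)  # 429
```

Each 60-nt line of the CDS is in frame (60 is a multiple of 3), so individual lines can be translated independently and compared against the matching stretch of the protein sequence.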