New model in OGS2.0 | DPOGS204450  |
---|---|
Genomic Position | scaffold3648:+ 9433-12475 |
See gene structure | |
CDS Length | 1149 |
Paired RNAseq reads   | 42 |
Single RNAseq reads   | 129 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA013645 (2e-144) |
Best Drosophila hit   | CG14892 (1e-67) |
Best Human hit | enteropeptidase precursor (1e-17) |
Best NR hit (blastp)   | transmembrane protease, putative [Pediculus humanus corporis] (8e-80) |
Best NR hit (blastx)   | transmembrane protease, putative [Pediculus humanus corporis] (4e-79) |
GeneOntology terms    | GO:0006508 proteolysis GO:0004252 serine-type endopeptidase activity |
InterPro families    | IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site IPR001254 Peptidase S1/S6, chymotrypsin/Hap IPR001314 Peptidase S1A, chymotrypsin-type IPR009003 Peptidase cysteine/serine, trypsin-like |
Orthology group | MCL16592 |
Nucleotide sequence:
GAATGTGGTGTTCCTCTCCGTCGCCACACCAGGCAGCCCCGAGAGAGGTCCGCCAGCCAG
CTCAGGATTATCAAGGGAAGGGAGTCCAAGAGAGGAGCCTGGCCTTGGCAGGTTTCTCTC
CAGCTGTTACATCCTAACTACGGTCTGATCGGCCACTGGTGCGGCGGAGTGCTGGTTCAT
CCGCAGTGGCTGCTCACCACCGCGCATTGCGTTCACAACGAGCTGTTCAACCTGCCGCTA
CCAGCTCTATGGACGGCGGTGCTCGGGGAGTGGGACCGTAACGAACAACGCGGCTCCTTC
CTGCCCATCGAGAGGATCATCCTGCATCACCGCTTCCACAACTACCAGCATGATATAGCT
CTGATGAAGATGACAAAGTCAGCGGACGTGAGCACGAGGAGTCGCATCCGCGCCATCTGT
CTGCCGCCATACGAGCCTGTGGACGACAACATAGAGAGGAGCACCTCCTACACCAGCACA
CAGGAAGTGAGGCGGAAGACGAGGCCGCCGCGACCCAAGCCCGACACCGCCAACAAATAC
TTGGAGAAAATCAACAACCTCACGAAGACCGTCCACTCGGCCAAGGACAAGAAGAAGAAC
ACCAGGTATAACGTGCGGGTCTCCAACGACGACGGGCTCAGAGATAGGAAGATAGAGGAC
AGAGAGGCGGTCTACGACGGAGCCTCCCTGGACAGCGTCGTGAACCTCATACGGAAAGGG
AAAGATGTCTCCAGGAGCGACAAGATGATAGCGAGCTACCACGAGATAGACCCCTTCATA
GACAACTCCATAGACGCCAAAGAGGAGTGTTACGCCACTGGCTGGGGACGGCAGCAGACC
AACGGCAGTCTCACGGACGTGCTGCTGGAGGCCGAGGTGCCGATACTGCCGCTCAAGACG
TGCAGGGAGCGGTACTCGCTCAGTCTGCCGCTCAACGACGGACACCTGTGCGCCGGCAGC
ACGGACGGCAGCAGCGGAGCCTGCGTGGGTGACAGCGGCGGTCCCCTCCAGTGTGTGGTG
GGCGGCAGGTGGGTGCTCCGCGGCCTGACGTCGTTCGGGTCGGGCTGCGCCCTGACCGGA
GTCCCCGACGTCTACACCAACGTTAGACATTACGTTGCCTGGATCTACGCTCACGTTTAC
GCTGGGTAG
Protein sequence:
ECGVPLRRHTRQPRERSASQLRIIKGRESKRGAWPWQVSLQLLHPNYGLIGHWCGGVLVH
PQWLLTTAHCVHNELFNLPLPALWTAVLGEWDRNEQRGSFLPIERIILHHRFHNYQHDIA
LMKMTKSADVSTRSRIRAICLPPYEPVDDNIERSTSYTSTQEVRRKTRPPRPKPDTANKY
LEKINNLTKTVHSAKDKKKNTRYNVRVSNDDGLRDRKIEDREAVYDGASLDSVVNLIRKG
KDVSRSDKMIASYHEIDPFIDNSIDAKEECYATGWGRQQTNGSLTDVLLEAEVPILPLKT
CRERYSLSLPLNDGHLCAGSTDGSSGACVGDSGGPLQCVVGGRWVLRGLTSFGSGCALTG
VPDVYTNVRHYVAWIYAHVYAG