New model in OGS2.0 | DPOGS205231  |
---|---|
Genomic Position | scaffold3349:+ 10749-33848 |
See gene structure | |
CDS Length | 1284 |
Paired RNAseq reads   | 546 |
Single RNAseq reads   | 1520 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA014404 (6e-62) |
Best Drosophila hit   | CG5390 (7e-51) |
Best Human hit | serine protease 55 isoform 1 precursor (3e-17) |
Best NR hit (blastp)   | serine proteinase-like protein 1 [Helicoverpa armigera] (1e-72) |
Best NR hit (blastx)   | prophenoloxidase activating factor 1 [Lonomia obliqua] (6e-69) |
GeneOntology terms    | GO:0004252 serine-type endopeptidase activity GO:0006508 proteolysis |
InterPro families    | IPR001314 Peptidase S1A, chymotrypsin-type IPR001254 Peptidase S1/S6, chymotrypsin/Hap IPR009003 Peptidase cysteine/serine, trypsin-like |
Orthology group | MCL22649 |
Nucleotide sequence:
ATGGAAAAGGTTACAGCTGGCCTTTGCGCCTGGCAAACAGAAGATAACGACTCGGAGGAA
GATTCAGTTGTTGATTGGGTAAACAAGATCATATCTGAATCAAAGATAAACGTAACGAAC
AGAGAAGTAACAAATCTAAATTCAGAAAGCGTTACGAATATAAATTGCACCGCGACTGAC
AACAGACCTGGGACTTGTGTGTTGTACTATCAATGCGACGAAGACAGTAACACTATTATA
GATGACGGAGCGTCCATAGTTAATTTTAGAACCGAGGCGTCCTGTCCTCATTATCTCAAG
GTCTGCTGTGCGATGGATAAAATTAAATCAGACGATAAAGCTAACACTATCCGGAGAGGA
AGTAATAACGAGTCTGCCCAGGAGTTGGATTCGAAGGACGACTCCGACAGCAGCAGTGCT
GTAGTTGACTTGGGGAAGTGTGGTTGGAACAATCCAGCGCTTTATGTGTTTCAACCCAAA
AGAAACAATTCCGAGGCAGAGCCGTTCTACGCAAATTATGGGGAATTCCCCTGGATGATC
GCCGTCATCAGGAGGTCCAATGATACGGATCTGTGGGCAAGAAAAAATTACGTCGGAGGT
GGAACTCTCATTCATCCGGGGGTAGTTGTCACTGCGGCTCACATAGTTCGGAATAAAAAG
CCCGATGACCTGAAATGTCGCGCTGGTGAATGGGACACTGAAGTGACCTTCGAGATATTT
CCACACCAAGAGAGGAATGTGAAGAATATTATCATCCACCCAGATTACTACAGGCCATCT
CTATACAACGACATGGGACTCCTGCTGTTAGAGGAACCGTTTGAACTGCTCCTCGCGCCA
CACATAGGTCTGGCTTGCGTTGGGAACAGCCTGCCGGCTCCCGGCACCGTCTGCTATGGA
ATGGGCTGGGGCAGGAAAATCGACAAGAAGTACGCAATTATTCTTAAAAAAATGCGGCTT
CCGTTAGTGGAAAGAGAGGAGTGCCAGGCCCTCCTGCGGAGTATACGTTTGGGGCCATTT
TTCCAACTGCACGAGTCCCTGACGTGTGCTGGCGGGGAAGATCGCATGGACATGTGCAAA
GGAGACGGCGGGTCCTCATTAGTATGCCCTATTCAGACTAATGGTAGAAATGTCAAATAC
GCCATGTTCGGGATGGTGGCGTACGGCCTGGGATGTCACTCGAGGAAAGTGCCCGGCGTG
TTCGTCAATGTGCCAAACCTGAAGTCGTGGCTGGACAGCACCATGGAGGCCGAAGGCTAT
TCTAAAGACACATACACTTACTAA
Protein sequence:
MEKVTAGLCAWQTEDNDSEEDSVVDWVNKIISESKINVTNREVTNLNSESVTNINCTATD
NRPGTCVLYYQCDEDSNTIIDDGASIVNFRTEASCPHYLKVCCAMDKIKSDDKANTIRRG
SNNESAQELDSKDDSDSSSAVVDLGKCGWNNPALYVFQPKRNNSEAEPFYANYGEFPWMI
AVIRRSNDTDLWARKNYVGGGTLIHPGVVVTAAHIVRNKKPDDLKCRAGEWDTEVTFEIF
PHQERNVKNIIIHPDYYRPSLYNDMGLLLLEEPFELLLAPHIGLACVGNSLPAPGTVCYG
MGWGRKIDKKYAIILKKMRLPLVEREECQALLRSIRLGPFFQLHESLTCAGGEDRMDMCK
GDGGSSLVCPIQTNGRNVKYAMFGMVAYGLGCHSRKVPGVFVNVPNLKSWLDSTMEAEGY
SKDTYTY