New model in OGS2.0 | DPOGS203698  |
---|---|
Genomic Position | scaffold21:- 178672-181171 |
See gene structure | |
CDS Length | 1599 |
Paired RNAseq reads   | 92 |
Single RNAseq reads   | 338 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003491 (9e-106) |
Best Drosophila hit   | CG13318 (1e-62) |
Best Human hit | serine protease 42 precursor (2e-19) |
Best NR hit (blastp)   | serine protease, putative [Aedes aegypti] (9e-81) |
Best NR hit (blastx)   | serine protease, putative [Aedes aegypti] (8e-79) |
GeneOntology terms    | GO:0004252 serine-type endopeptidase activity GO:0006508 proteolysis |
InterPro families    | IPR009003 Peptidase cysteine/serine, trypsin-like IPR001254 Peptidase S1/S6, chymotrypsin/Hap IPR001314 Peptidase S1A, chymotrypsin-type |
Orthology group | MCL15883 |
Nucleotide sequence:
ATGGCCCCAACAGCAGCGCCCACACCTGCGCCAACAACAGCCCCCACACCGGCGCCAACT
ACAGCCCCCACACCGGCGCCAACTACAGCCCCCACACCGGCGCCAACAACAGCCCCTACA
CCGGCGCCAACAACAGCTCCAACACCTGCGCCCACAACAGCTACTACAGCAGCTCCAACC
CCACCGCCAACAATGCCACCTACAGCAATGACAACAAGAAATGGTAATAGTTCCAATGTT
ACGGGAAACTTAACCTCATCGTCTACCACGACAGCATCACTAACCATCATAACTAATCAA
AGTTCATATAATACAACCGCAATGCCAACCTCGACCCCAACTAACCCACCGACGACAAGT
GGAGAAACACTGCCGGTTTTGCTTTGTAAAGATCCTGATGTGATATGTGTCTTTAATCCT
GATGAAAACGCTGCAGGCTCGTTCCCGATTGACCCACGGCTTGGCACGACACCATCAGCA
GCCCTGCAATCGGCGCAACCTATAATGGGTTACAGTGGCCAAATATCATCTTTAGACTCC
GCAGTCAGATTTCCAAGAGAACGTAGAAGCGTAAACCGTAAAATAATGACAGTAAAAGAA
TCGTTTAAAAAGCTAAATTATATAGATCCCCCTAAGCATCAAATTCGTAAGAGGCAAAGT
TGTCGTTGTGTACCCGCTGGAACTTGTGCATCAGGGGGGGCTGGTATGATCGACTTCAGG
ATTGTAACCCCCGTGAATGCGTGTCCTGCTGGCCAAGTGTATTGCTGTGGCGACACGACT
GCAGTTACAGTACGTTGTGGAGTCGTACAAGCTGCTCCATCAACTGGTGTCACTCCAGCA
GCGGGGGAAGCAAATTTTGGGGAATATCCCTGGCAGGCATTGGTTCTTACCAAACAGAAT
GATTATATTGCTGGTGGTGTGCTTATAGATCAATTGAATGTACTGACGGTGACACATAGA
ATGATGCCGTATGTTGTTTCAGGTACAGCACCTAATGTGAAAGTGAGGTTGGGAGAATGG
GACGCTGCAGGGACAAATGAACCAGTTCCTTTCCAAGAGTATAATGTAGCTAAAGTTTTC
AGTCACCCCTCTTACAACGCCAATACTCTACAATACGATATAATGGTACTGAGATTGTCT
TCTTCTGTACCACTGACACCAATGACGGGTTCAACGACTACAATCAACCGAGCATGTCTA
CCTCCATCCTCGACTGCAACTTACACAGGACTTACATGCTGGGTATCAGGATGGGGAAAA
AATATGTTTGGATTACAAGGACAATACCAAAACATATTAAAGAAAGTGGATGTACCTATA
GTGGCACCAGCAACTTGCCAGAGTCAGTTACAGGCAGCTCGTCTTGGGCCCACTTACGTA
CTGGATACTACCTCTTTTATCTGTGCTGGCGGCGAAAGCAGTAAGGATTCTTGCACGGGT
GACGGAGGATCAGGTTTAGTCTGTTCTATTAATGGGCAATGGATTGTAGTAGGTTTAGTG
GCATGGGGTCTCGGCTGTGCTTCCGCAAATGTACCAGCGGCTTACGTGAATGTTGCTGCC
CTACTACCTTGGATACAACAGCAAGTTGCCACTGCGTAG
Protein sequence:
MAPTAAPTPAPTTAPTPAPTTAPTPAPTTAPTPAPTTAPTPAPTTAPTPAPTTATTAAPT
PPPTMPPTAMTTRNGNSSNVTGNLTSSSTTTASLTIITNQSSYNTTAMPTSTPTNPPTTS
GETLPVLLCKDPDVICVFNPDENAAGSFPIDPRLGTTPSAALQSAQPIMGYSGQISSLDS
AVRFPRERRSVNRKIMTVKESFKKLNYIDPPKHQIRKRQSCRCVPAGTCASGGAGMIDFR
IVTPVNACPAGQVYCCGDTTAVTVRCGVVQAAPSTGVTPAAGEANFGEYPWQALVLTKQN
DYIAGGVLIDQLNVLTVTHRMMPYVVSGTAPNVKVRLGEWDAAGTNEPVPFQEYNVAKVF
SHPSYNANTLQYDIMVLRLSSSVPLTPMTGSTTTINRACLPPSSTATYTGLTCWVSGWGK
NMFGLQGQYQNILKKVDVPIVAPATCQSQLQAARLGPTYVLDTTSFICAGGESSKDSCTG
DGGSGLVCSINGQWIVVGLVAWGLGCASANVPAAYVNVAALLPWIQQQVATA