DPGLEAN01855 in OGS1.0

New model in OGS2.0DPOGS203698 
Genomic Positionscaffold21:- 178672-181171
See gene structure
CDS Length1599
Paired RNAseq reads  92
Single RNAseq reads  338
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003491 (9e-106)
Best Drosophila hit  CG13318 (1e-62)
Best Human hitserine protease 42 precursor (2e-19)
Best NR hit (blastp)  serine protease, putative [Aedes aegypti] (9e-81)
Best NR hit (blastx)  serine protease, putative [Aedes aegypti] (8e-79)
GeneOntology terms
  
GO:0004252 serine-type endopeptidase activity
GO:0006508 proteolysis
InterPro families

  
IPR009003 Peptidase cysteine/serine, trypsin-like
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
IPR001314 Peptidase S1A, chymotrypsin-type
Orthology groupMCL15883

Nucleotide sequence:

ATGGCCCCAACAGCAGCGCCCACACCTGCGCCAACAACAGCCCCCACACCGGCGCCAACT
ACAGCCCCCACACCGGCGCCAACTACAGCCCCCACACCGGCGCCAACAACAGCCCCTACA
CCGGCGCCAACAACAGCTCCAACACCTGCGCCCACAACAGCTACTACAGCAGCTCCAACC
CCACCGCCAACAATGCCACCTACAGCAATGACAACAAGAAATGGTAATAGTTCCAATGTT
ACGGGAAACTTAACCTCATCGTCTACCACGACAGCATCACTAACCATCATAACTAATCAA
AGTTCATATAATACAACCGCAATGCCAACCTCGACCCCAACTAACCCACCGACGACAAGT
GGAGAAACACTGCCGGTTTTGCTTTGTAAAGATCCTGATGTGATATGTGTCTTTAATCCT
GATGAAAACGCTGCAGGCTCGTTCCCGATTGACCCACGGCTTGGCACGACACCATCAGCA
GCCCTGCAATCGGCGCAACCTATAATGGGTTACAGTGGCCAAATATCATCTTTAGACTCC
GCAGTCAGATTTCCAAGAGAACGTAGAAGCGTAAACCGTAAAATAATGACAGTAAAAGAA
TCGTTTAAAAAGCTAAATTATATAGATCCCCCTAAGCATCAAATTCGTAAGAGGCAAAGT
TGTCGTTGTGTACCCGCTGGAACTTGTGCATCAGGGGGGGCTGGTATGATCGACTTCAGG
ATTGTAACCCCCGTGAATGCGTGTCCTGCTGGCCAAGTGTATTGCTGTGGCGACACGACT
GCAGTTACAGTACGTTGTGGAGTCGTACAAGCTGCTCCATCAACTGGTGTCACTCCAGCA
GCGGGGGAAGCAAATTTTGGGGAATATCCCTGGCAGGCATTGGTTCTTACCAAACAGAAT
GATTATATTGCTGGTGGTGTGCTTATAGATCAATTGAATGTACTGACGGTGACACATAGA
ATGATGCCGTATGTTGTTTCAGGTACAGCACCTAATGTGAAAGTGAGGTTGGGAGAATGG
GACGCTGCAGGGACAAATGAACCAGTTCCTTTCCAAGAGTATAATGTAGCTAAAGTTTTC
AGTCACCCCTCTTACAACGCCAATACTCTACAATACGATATAATGGTACTGAGATTGTCT
TCTTCTGTACCACTGACACCAATGACGGGTTCAACGACTACAATCAACCGAGCATGTCTA
CCTCCATCCTCGACTGCAACTTACACAGGACTTACATGCTGGGTATCAGGATGGGGAAAA
AATATGTTTGGATTACAAGGACAATACCAAAACATATTAAAGAAAGTGGATGTACCTATA
GTGGCACCAGCAACTTGCCAGAGTCAGTTACAGGCAGCTCGTCTTGGGCCCACTTACGTA
CTGGATACTACCTCTTTTATCTGTGCTGGCGGCGAAAGCAGTAAGGATTCTTGCACGGGT
GACGGAGGATCAGGTTTAGTCTGTTCTATTAATGGGCAATGGATTGTAGTAGGTTTAGTG
GCATGGGGTCTCGGCTGTGCTTCCGCAAATGTACCAGCGGCTTACGTGAATGTTGCTGCC
CTACTACCTTGGATACAACAGCAAGTTGCCACTGCGTAG

Protein sequence:

MAPTAAPTPAPTTAPTPAPTTAPTPAPTTAPTPAPTTAPTPAPTTAPTPAPTTATTAAPT
PPPTMPPTAMTTRNGNSSNVTGNLTSSSTTTASLTIITNQSSYNTTAMPTSTPTNPPTTS
GETLPVLLCKDPDVICVFNPDENAAGSFPIDPRLGTTPSAALQSAQPIMGYSGQISSLDS
AVRFPRERRSVNRKIMTVKESFKKLNYIDPPKHQIRKRQSCRCVPAGTCASGGAGMIDFR
IVTPVNACPAGQVYCCGDTTAVTVRCGVVQAAPSTGVTPAAGEANFGEYPWQALVLTKQN
DYIAGGVLIDQLNVLTVTHRMMPYVVSGTAPNVKVRLGEWDAAGTNEPVPFQEYNVAKVF
SHPSYNANTLQYDIMVLRLSSSVPLTPMTGSTTTINRACLPPSSTATYTGLTCWVSGWGK
NMFGLQGQYQNILKKVDVPIVAPATCQSQLQAARLGPTYVLDTTSFICAGGESSKDSCTG
DGGSGLVCSINGQWIVVGLVAWGLGCASANVPAAYVNVAALLPWIQQQVATA