DPGLEAN19430 in OGS1.0

New model in OGS2.0  DPOGS204361
Genomic Position  scaffold4890:+ 2261-5106
CDS Length  876
Paired RNAseq reads  60
Single RNAseq reads  299
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA012923 (5e-20)
Best Drosophila hit  CG34350 (7e-10)
Best Human hit  chymotrypsin-C preproprotein (9e-11)
Best NR hit (blastp)  hypothetical protein Phum_PHUM609370 [Pediculus humanus corporis] (1e-39)
Best NR hit (blastx)  hemolymph proteinase 16 [Manduca sexta] (3e-63)
GeneOntology terms
GO:0005576 extracellular region
GO:0004252 serine-type endopeptidase activity
GO:0006508 proteolysis
GO:0008233 peptidase activity
InterPro families
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
IPR009003 Peptidase cysteine/serine, trypsin-like
IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site
IPR001314 Peptidase S1A, chymotrypsin-type
Orthology group  MCL12074

Nucleotide sequence:

ATGGAACCTGGTGTGGTGTTTGGATTTTATTATTTTGCACAAGAAACTGGATTTAATTTT
GTTGTAACTTCCTTCGTCCCTGGAATAGTTCCGTATATAACAAGCGTTATAATCAACAAT
GAGGAATATTGTCAATATCCAAATGTGGGATTCCTAAATCAATATGTTGTGGAAACGTTA
CTAAATGACGAAGAAAAGAACTGTGGTCGGCGTCAAGTACAAAGCACGCAGTTGATGGTT
AATGGCGCTAATACTAAACCCGGAGACTGGCCATGGCACGTCGCTATTTATAAACAAGAA
AGAAACATCATTAAGTATATCTGTGGTGGAACTCTTGTGTCTAAGAATTTCGTATTAACA
GCCGCTCATTGTGTGTCAGTGAGGGGTTCTGCCTTGTTGCCAGACACGATAAGTGTTGTC
CTTGGGAAATACAATTTATTTGGAGGTGATTTTGGGTCTGAAGAAAAGGAGGTTGTAGGA
TGGGGTTTTGATAACAGTGGAACTCTTTCGCGTACACTTAAACAAGCTAAGATGCCGATT
GTCTCGGATAACGTCTGTATCAGAAGTAAACCCTTATTTTATGCGAACATTTTGAATGGA
AATAAATTTTGTGCTGGATTTCATAACGGAACATCTGCTTGCAACGGTGACAGCGGTGGA
GCACTTGTGGTATTCGTACCAGATACGGCTGAGGATAATGACATAAGAGCTGAAGGAACT
TGGCATGTTAAAGGCATTGTATCGATGACACTCTCTCAAAAAGATGTACCCGTATGCGAT
CCTGAACAGTACGTTGTGTTTACAGACGTTGAGAAATACAGAGTTTGGATAAAAAGCTAC
ATCAAAGAAAAAGAAAGTGAAGATATTGATCAATAA

Protein sequence:

MEPGVVFGFYYFAQETGFNFVVTSFVPGIVPYITSVIINNEEYCQYPNVGFLNQYVVETL
LNDEEKNCGRRQVQSTQLMVNGANTKPGDWPWHVAIYKQERNIIKYICGGTLVSKNFVLT
AAHCVSVRGSALLPDTISVVLGKYNLFGGDFGSEEKEVVGWGFDNSGTLSRTLKQAKMPI
VSDNVCIRSKPLFYANILNGNKFCAGFHNGTSACNGDSGGALVVFVPDTAEDNDIRAEGT
WHVKGIVSMTLSQKDVPVCDPEQYVVFTDVEKYRVWIKSYIKEKESEDIDQ
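
Consistency check (illustrative sketch, not part of the original record):

The nucleotide and protein sequences above can be cross-checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which is the usual choice for a nuclear-encoded insect gene but is not stated in this record. The names `CODON_TABLE` and `translate_cds` are hypothetical helpers introduced here for illustration only. The two fragments used in the example are the first and last lines of the nucleotide sequence above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
# Assumption: the record does not say which table was used for translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    # Remove line breaks and spaces so a pasted multi-line sequence works.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 nt of the CDS (first line of the nucleotide sequence).
first_line = "ATGGAACCTGGTGTGGTGTTTGGATTTTATTATTTTGCACAAGAAACTGGATTTAATTTT"
# Final 36 nt of the CDS, ending in the TAA stop codon.
last_line = "ATCAAAGAAAAAGAAAGTGAAGATATTGATCAATAA"

print(translate_cds(first_line))  # MEPGVVFGFYYFAQETGFNF
print(translate_cds(last_line))   # IKEKESEDIDQ
```

Both fragments translate to the start and end of the protein sequence listed above. The full CDS is 876 nt, that is, 292 codons including the stop codon, which agrees with the 291-residue protein.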