DPGLEAN10810 in OGS1.0

New model in OGS2.0DPOGS206620 
Genomic Positionscaffold551:+ 89171-98661
See gene structure
CDS Length1191
Paired RNAseq reads  785
Single RNAseq reads  2142
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008515 (2e-27)
Best Drosophila hit  jonah 74E (3e-46)
Best Human hittransmembrane protease serine 3 isoform 1 (4e-28)
Best NR hit (blastp)  chymotrypsin [Helicoverpa armigera] (2e-71)
Best NR hit (blastx)  chymotrypsin [Helicoverpa armigera] (4e-73)
GeneOntology terms
  
GO:0004252 serine-type endopeptidase activity
GO:0006508 proteolysis
InterPro families


  
IPR009003 Peptidase cysteine/serine, trypsin-like
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site
IPR001314 Peptidase S1A, chymotrypsin-type
Orthology groupMCL16739

Nucleotide sequence:

ATGGGTTACAGTTCCTATACATCCATCTGGTGTGTTATTTTTACCACATTATATTTGCAG
GCTGGTCTCATAAGCGATATTCTTGGTAGCTCAGGACAAGAGGTATGTGGAGGTTCTCTA
CTTAGCACAAACCGAGTGTTAATGGCTGCTCATTGCTGGGACGATGGTAGAAGCAAAGCT
TGGAGATTCACTGTTGTTCTTGGTTCGGAAACGCTTTTCACAGGCGGTACTAGAATTCTT
ACCAGCGATGTGGTCACCCATCCACTGTGGAATCCTGCATTTGTCCATAATGACGTTGCC
ATGATTCGTTTACCCGAACATGTTGTTCTATCCGTTGACAGTATCAATAAAATATATTTT
GTGAATATGGCTGTCATCACTAACGAAATATGCAACGTCGCTGTCTTCGGTGACATTCAA
TCATCGAACATTTGCACGAACACCCTCGGTGATAAGAGCACTTGCCGTGGTGACTCTGGT
GGCCCACTTGTTGTCCAAAGACGTGTCAGGACTGTTCTGGCTGGTCTCATCAGTGATATA
ACGGGAATAGAAGGAAGAGGTGTTTGCGGAGGATCTCTCGTCTCGACAAACCGGGTAATT
ACAGCTGCTCATTGCTGGGACGATGGTAGAAACAAAGCCTGGAGATTCACTGTTGTTCTT
GGATCGGAAACGCTTTTCACAGGCGGTACTAGAATCCTTACCAGTGACGTCGTGATGCAC
CCACTGTGGATCCCAATACTGATCCTCAATGACATCGCTGTCATCCGTTTGCCAGAACCT
GTCGCTTTGTCCGATACCATCGGAACCGTTTCTTTGCCCACTGGATTAGAAATCTTTGAC
GACTTCAACGGTCAAGTCGCTATTGCCTCTGGCTATGGACTTACACAAGACGGCGGCAGT
ATCAGCAACAGCCAATTCTTAAGTTACGTCAATATGTCAGTCATTACAAATGAAGTTTGC
AACATCGCTTTCTTCGGTAACATTCGGCCTTCGAACATTTGCACCAGCACCCAAGGTGGT
AAGAGCACTTGCCGTGGTGACTCTGGTGGTCCACTTGTTGTCCAAAGACGTGACAGAACT
GTTCTGGTTGGCGTTACCTCATTTGGAATTGCTTTTGGTTGCGAAATCGGATGGCCAGCA
GCCTTTTCAAGAATTACATCATTCCTTGGATTTATTAATGACAATTTATAA

Protein sequence:

MGYSSYTSIWCVIFTTLYLQAGLISDILGSSGQEVCGGSLLSTNRVLMAAHCWDDGRSKA
WRFTVVLGSETLFTGGTRILTSDVVTHPLWNPAFVHNDVAMIRLPEHVVLSVDSINKIYF
VNMAVITNEICNVAVFGDIQSSNICTNTLGDKSTCRGDSGGPLVVQRRVRTVLAGLISDI
TGIEGRGVCGGSLVSTNRVITAAHCWDDGRNKAWRFTVVLGSETLFTGGTRILTSDVVMH
PLWIPILILNDIAVIRLPEPVALSDTIGTVSLPTGLEIFDDFNGQVAIASGYGLTQDGGS
ISNSQFLSYVNMSVITNEVCNIAFFGNIRPSNICTSTQGGKSTCRGDSGGPLVVQRRDRT
VLVGVTSFGIAFGCEIGWPAAFSRITSFLGFINDNL