DPGLEAN18265 in OGS1.0

New model in OGS2.0  DPOGS215100
Genomic Position  scaffold4222:+ 121-4616
CDS Length  1128
Paired RNAseq reads  689
Single RNAseq reads  2486
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA009610 (2e-35)
Best Drosophila hit  CG3700, isoform A (2e-33)
Best Human hit  transmembrane protease serine 2 isoform 2 (2e-28)
Best NR hit (blastp)  PREDICTED: similar to snake CG7996-PA [Apis mellifera] (3e-41)
Best NR hit (blastx)  PREDICTED: similar to snake CG7996-PA [Apis mellifera] (4e-43)
Gene Ontology terms
GO:0004252 serine-type endopeptidase activity
GO:0006508 proteolysis
InterPro families
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
IPR009003 Peptidase cysteine/serine, trypsin-like
IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site
IPR001314 Peptidase S1A, chymotrypsin-type
Orthology group  MCL23866

Nucleotide sequence:

ACATGTAAATTTCAAGGTGATCAAAGAATAGTCTGCTGTCCTGAGACCGACCTGTTCCAT
GAAACGGGAGTCTTCAAGCATCACTTCATTGGGCTTGCCAAAGGAGTCAAGAAATCTAAG
TACATGACCTGTCGCTACGATGGCTACCAGCCCTTGCAATGTTGTGAGAACGCTAAACCC
GTCACCATTCCACCAGAACCGGCAACTTGTCCAAGCCTCCCAAGACCCCTGCTGGCCAAA
AACCACATCGCCTGGACTAAATGCGTTGACTACCAGCGTTACATCCACAAGTGCGTGCCA
GTTGATCCAATCAACCAACCATACAAAATGCAAAGGGTAAACACTTGCGGCATCAGCAAC
TCCAATTTTAGGATATCCGGTGGCGTTGAAGCTAAACCCAGAGAGTTTCCGTTCATGGCC
GTCATCGGCTGCCACAATTCCCTGGACGTGGACGCCGACATCAAGTGGGTAGGCGGAGGC
TCGCTGATCAGTGAGAAGTTTATACTCACGGCTACTCACATATTGAGTGAACCGACTTAT
GGCCGCGTACGGTACGCCTTGCTTGGCACTTTGAATAAGACAGACATAAGGTCCGGAGTC
CTTTACAATATCGTGTCTATGATCGCGCACCCTGAATACGACATTCCCGTTAAAGCGAAT
GACATAGCGCTCCTGGAGCTAGACAGACAGGTCTTTTTCAATGAATTCATTCACCCCGTC
TGTCTCCCGGTGCCGGGCAGATATATTACAAATGACTATATTGTTGCCGGTTGGGGCGAA
AACAACAACAGATACAGCAGTGACGTGCTGTTAACTGCGAGACTGCGACCCAGCGATGAA
TGCAAGAGCAGAATAGTAAGAAAAGACTTCGTTTATTCGAATGAGAAGTATATCTGTGCT
AAAGGAGAGCTGGAAAAGGGCGTCTATCAAGACACCTGCAAGGGCGACAGCGGAGGCCCG
TTGTTGGCTCTGATGTTTAATATAAACTGCTCCTACTCCTTGGAGGGTATCGTCAGTTTT
GGACCCGAATGCGGCAAAGGCTTTCCAGCGGTTTACACCAAAGTATCCAACTATTTGGAT
TGGATAGTTGAAAACGTATGGCCCGATAAGGTCAACAAAAAGCAATAA

Protein sequence:

TCKFQGDQRIVCCPETDLFHETGVFKHHFIGLAKGVKKSKYMTCRYDGYQPLQCCENAKP
VTIPPEPATCPSLPRPLLAKNHIAWTKCVDYQRYIHKCVPVDPINQPYKMQRVNTCGISN
SNFRISGGVEAKPREFPFMAVIGCHNSLDVDADIKWVGGGSLISEKFILTATHILSEPTY
GRVRYALLGTLNKTDIRSGVLYNIVSMIAHPEYDIPVKANDIALLELDRQVFFNEFIHPV
CLPVPGRYITNDYIVAGWGENNNRYSSDVLLTARLRPSDECKSRIVRKDFVYSNEKYICA
KGELEKGVYQDTCKGDSGGPLLALMFNINCSYSLEGIVSFGPECGKGFPAVYTKVSNYLD
WIVENVWPDKVNKKQ
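
Consistency check (illustrative sketch, not part of the original record): the nucleotide and protein sequences above can be cross-checked by translating the CDS in frame 0 with the standard genetic code. The sketch below assumes the standard code (NCBI translation table 1) applies to this model. The names `translate`, `first_line`, and `last_line` are hypothetical helpers introduced here. Only the first and last lines of the nucleotide sequence are used; each 60-nt line holds 20 whole codons, so lines can be translated independently.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the standard code is the right one for this gene model.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; stop codons become '*', unknown codons 'X'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First and last lines of the nucleotide sequence in this record.
first_line = "ACATGTAAATTTCAAGGTGATCAAAGAATAGTCTGCTGTCCTGAGACCGACCTGTTCCAT"
last_line = "TGGATAGTTGAAAACGTATGGCCCGATAAGGTCAACAAAAAGCAATAA"

print(translate(first_line))  # TCKFQGDQRIVCCPETDLFH
print(translate(last_line))   # WIVENVWPDKVNKKQ*

# A CDS of 1128 nt is 376 codons: 375 residues plus the terminal stop codon.
expected_residues = 1128 // 3 - 1
print(expected_residues)      # 375
```

Both translated fragments match the start and end of the protein sequence listed above, and the 375 residues implied by the CDS length agree with the length of the listed protein.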