DPGLEAN19881 in OGS1.0

New model in OGS2.0  DPOGS213461
Genomic Position  scaffold4038:+ 7370-11324
CDS Length  1251
Paired RNAseq reads  246
Single RNAseq reads  727
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA004487 (2e-103)
Best Drosophila hit  CG13430, isoform B (9e-18)
Best Human hit  kallikrein-13 precursor (2e-15)
Best NR hit (blastp)  GE15163 [Drosophila yakuba] (1e-21)
Best NR hit (blastx)  trypsin 2 [Culex quinquefasciatus] (3e-18)
GeneOntology terms
GO:0004252 serine-type endopeptidase activity
GO:0006508 proteolysis
InterPro families
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
IPR009003 Peptidase cysteine/serine, trypsin-like
IPR001314 Peptidase S1A, chymotrypsin-type
IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site
Orthology group  MCL39860

Nucleotide sequence:

ATGAACTGGGTTCTCGTTTTTATGACTCTATCAATGTTTTATAGCTATGTTCTGAGCTAT
GGAGACCAGGCCTCAGTAGTTAAATTTAAATTCAATCTGGACCCGTATGGAGAGGCTCCG
CTCCAAGATGGCCGACGGAGACATCGAGATAACAAAAAATCATTAAGAGTCCGAAACGAC
TTCCTGTTCAGTTTAAACAAAGACGCTCTCAGGATCCGGGGGGGGAATGCCACGGATACG
ACCAACTATCCGTACATAGCGGCCATTATAATCAACGGCAGGTTATGGTGCGCCGGCACC
ATCGTCGACGTCAACTGGGTACTGACAGCGGCGCATTGTCTGAATTACGTGCTTCACGTA
GCGCCAATGAAGACCCTGGGGCAGTACGTGAAGGTCAGGGTCGGCAGCGCCCAGGCTCAC
GAAGGAGGTTTGCTGGTAGACGTCGCGGGGGCCGTGCGACACCCGAAATTCGAAGAGGAA
CCCGTGCCTCATGCTGATGTAGCTTTATTGAAACTGACTGAAAACCTTGAATTCTCAACT
CACATCAATCTGATTAAAATAAACGAAGATATGAGAGAGCCTTACGCGCAGAGTTTCGTG
TCTGTAACCGGCTGGGGAGCGACCCGTGGCACAGACACAGCCTTCAGAGAACACACGCCC
GACCTGATGACGGCTCGTCTCAAGGTTCGCACGGTCAACTACTGCAGAGACGCGTACCAA
CTGGTTAGCGGGTTTCAGTTCACCGCAGACTTCTTCTGCGCTTCGTTGAGAAACGGCACC
AGAGACGCGTGTTTGGGCACAGACACAGCCTTCAGAGAACACACGCCCGACCTGATGACG
GCTCGTCTGAAGGTTCGCACGGTCAACTACTGCAGAGACGCGTACCAACTGGTTAGCGGG
TTTCAGTTCACCGCAGACTTCTTCTGCGCTTCGTTAAGAAACGGCACCAGAGACGCGTGT
TTGTTCGACGCGGGCGCGCCAGCCACCCAACACAACAAATTAATGGGCGTCATGAGCTTC
GGGCCCGAGCGTTGCGGACACGAATACCAACCAGCGGTGTTCATTAAGGCTTTTTATTTC
AGGGATTTCGTGAAGCACACTATATCCTCATATAAGACTACAGCTGAACTTATAGAAGCC
ATGAAAGATATCGACAAAGTTATCAGACCACCCGTTCATGTGAAACAGGAACACGTGGTC
GTCGAGAAAGATGAGCAAGAGGTCACGGAACCAGATTATAAACACGATTGA

Protein sequence:

MNWVLVFMTLSMFYSYVLSYGDQASVVKFKFNLDPYGEAPLQDGRRRHRDNKKSLRVRND
FLFSLNKDALRIRGGNATDTTNYPYIAAIIINGRLWCAGTIVDVNWVLTAAHCLNYVLHV
APMKTLGQYVKVRVGSAQAHEGGLLVDVAGAVRHPKFEEEPVPHADVALLKLTENLEFST
HINLIKINEDMREPYAQSFVSVTGWGATRGTDTAFREHTPDLMTARLKVRTVNYCRDAYQ
LVSGFQFTADFFCASLRNGTRDACLGTDTAFREHTPDLMTARLKVRTVNYCRDAYQLVSG
FQFTADFFCASLRNGTRDACLFDAGAPATQHNKLMGVMSFGPERCGHEYQPAVFIKAFYF
RDFVKHTISSYKTTAELIEAMKDIDKVIRPPVHVKQEHVVVEKDEQEVTEPDYKHD