DPGLEAN18948 in OGS1.0

New model in OGS2.0DPOGS201115 
Genomic Positionscaffold258:+ 5659-30543
See gene structure
CDS Length2514
Paired RNAseq reads  1422
Single RNAseq reads  3234
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006181 (1e-50)
Best Drosophila hit  CG9059, isoform D (3e-108)
Best Human hitinactive dipeptidyl peptidase 10 isoform b (1e-63)
Best NR hit (blastp)  PREDICTED: similar to CG9059 CG9059-PB [Tribolium castaneum] (2e-165)
Best NR hit (blastx)  PREDICTED: similar to CG9059 CG9059-PB [Tribolium castaneum] (4e-148)
GeneOntology terms


  
GO:0008239 dipeptidyl-peptidase activity
GO:0008236 serine-type peptidase activity
GO:0016020 membrane
GO:0006508 proteolysis
InterPro families
  
IPR002469 Peptidase S9B, dipeptidylpeptidase IV N-terminal
IPR001375 Peptidase S9, prolyl oligopeptidase, catalytic domain
Orthology groupMCL18897

Nucleotide sequence:

ATGAAACCTCCATCAGTCTTCCTCGGAGCGCTCGACACGAAGCTGGACGGCGCTAGATTT
ATTTTAATAAATTACGAGGAGTTGGTGTTCCGCGACCGTTGGGGAGGATTGACGCTGTTC
AACGTCAAGAACTTAACAACCAGACTGCTTATGAACAATTCTACCTTTCCGAGACTCGTG
GACAAGGCCGTAGAGTTGGACGTAGTCGAGGGCGCGGACGTGGCTTTGGACGTAGTCAAG
GTGGAATGGGCAACAGCAACAAGGTTAATCGACGACAGAGAATCGACACATCTAGTAGTA
GCTCTAAAAAGGGAGCTGAATGCTGTAGACTTTAAAGTCTCTTCCGATCTGAAGTTCGTT
CTGCTGATATCCGACGTGCGACCGGGCTGGCGACACGCAAGACTAGCAAGATACCACGTG
TATGATGTTATAACTAGGAACAAAATCCCCATTTCGCCGATAGAGGACGACAGGTCTGCT
CCCTTGCTGCAGTATGCAGAATGGTCTCCTGTCGGCTCCGGGCTGGTGTTCGTATATGAC
AACGACATTTACTACAAGCCTAAGGTTTTAAAGGCCTTGGTTTGCAGAATCACTAGTAAC
GGAGTTCCAGGTGTAATCTTTAATGGAGTACCAGATTTCCTTTACGAGACCGAGGTGTTG
CGATTGGACCGCGCCCTGTGGTTCAGCCCCGACGGACAGACGCTCATGTACGTGACCTAC
AATGACAGCCTGGTCCAACAACACAAATATCCTTGGTATGGTTTGGATCAACAGGAACCG
CCCGCCTACCCTGCCATACGGACCCTGAGATATCCGAAGATGAACACTAATAACCCAGCA
GTAACGGTGTACGTGGTCAGTCTGAAGACTCCAAAGTTTCTGTTCCCACATGCTATACAG
TTTAATTCACCCTTTGACTCTGGCTGGTATGTTCGTTGGACAAGTTGGGTGTCTGAGCGT
CAGATAGCTGCTCTCCTTCTCAACAGACCTCAGAATCTATCCATCATCGCGACATGCAAC
GCTGTGTCTTACAACTGCCAAGATATCTATCGCGACGAATCTGACGGTTCGCGATGGTCA
GGTCTTGGGTCAGACCCGGAGGAAGAGTGCGGCTGGTGCGGGGGGGCAGCGCTCGTGGGG
GGGAGGAGCGGCATCTTCACGTCCATACCTGTCACCGACCAGGGAGGCGTGTGGAGACAC
GCTATACATCTCACCCAGGAGACCAGAACGACCATTACCCAGGGGAACTTCGAAATAACA
CAGCTGATTGGATGGGATGAGAAACGAAGACTGCTCTACGTGATCGGTACCGCCCCTGAC
AAAGCCGGAGAGCGTCACCTCTACCGCGTGTCGGTGCCTGTGGACGGCTGGCCTCCTCCG
CCCGTCTGCCTCACCTGCCCTGGACGGATGATTGAAGCCACCGCTGAACCCAGCACGGAG
GAACCGGAATACGACAACAGCACGTCGTCATTACCCGCGTGGCCGACGTCCACAGTTCTG
CCTCACGTGGCCGACGAGAACGACTCGCTTCCCACCGCCTGTCTCTACAACAGAGTTATA
TTTAGCAAAAATTTCTCGTACTACGTTCAGGAGTGCCTCGGTCCAGAGCCCCCGGCGATA
TTCCTTTGTACGTCAGCGGGGTCTCGCCGAGCCGTGCTGTGGGACGGGGCGCCGCTCAGA
CAAAAGTTCGCGGCCCTCGCTTCCCCCCAGGCCAAAGTGTTCAGAGTCGAGGTCCAAGCG
CAAAGATCAGCACGCGTAAGGTTGCTGCTGCCTCCGGGCTTGCGGGACACTGACGACTTG
CCGCTACCACTCGTATTGCATTTGTCTTCGGCTCCAGGTTCCCAGCTGGTGACTGAGCAG
TGGGCACCTGGTTGGGGCTGGTATCTCGCCGCCGCAAGGAACTTTATAATTGCAGAAATA
GACGCGAGAGGATCCGGCGGACAGGGAGAGGAGTTACGAACAGAGATATACCAGAAACTG
CTCTCAGTAGACGTCGAAGACCAAATAGCTGTTTTATCCTACCTCCGTGACAACCTGAAG
ATGGTGGATGGGAGCCGTACTGGTGCCTGGGGGAGCGGGTACGGCGCGGGCGCGGCGCTC
GCCCTCGCTGCGGGAGACGCCGCTAACCTCACCAGGTGCCTGGCCCTGCTGGCGCCCCTC
GCCGACTTACGACACCACAACTCGTTCTGGTCGGAGCGCTACTCCGGCCTGGGCGGCGGA
GCGTCGCTGGGTGTGTGGCGCCGGGCGTCCTCGGTGCCTCCTCGGCGCGTGTTGCTCGCG
CACGCCACCGCGGACGTGCGGGCGCCTCCGCCCCATGCTCTCGCTCTTGCACGAGCTCTC
ATACAAGCCAGAGCCGTGTACTCTCATCAGGTGTATCCTGATGAGGGTCACAACTTCGAG
CGCTCGTTCCTGCACGTGTACTCGACTATGGAGCAGTTCTTCGACGAATGTTTCGGGCCC
GTGGAGCTCGCGGACTGGGACAATCCCGGAGGACTGTTTCCATTTAGGGATTGA

Protein sequence:

MKPPSVFLGALDTKLDGARFILINYEELVFRDRWGGLTLFNVKNLTTRLLMNNSTFPRLV
DKAVELDVVEGADVALDVVKVEWATATRLIDDRESTHLVVALKRELNAVDFKVSSDLKFV
LLISDVRPGWRHARLARYHVYDVITRNKIPISPIEDDRSAPLLQYAEWSPVGSGLVFVYD
NDIYYKPKVLKALVCRITSNGVPGVIFNGVPDFLYETEVLRLDRALWFSPDGQTLMYVTY
NDSLVQQHKYPWYGLDQQEPPAYPAIRTLRYPKMNTNNPAVTVYVVSLKTPKFLFPHAIQ
FNSPFDSGWYVRWTSWVSERQIAALLLNRPQNLSIIATCNAVSYNCQDIYRDESDGSRWS
GLGSDPEEECGWCGGAALVGGRSGIFTSIPVTDQGGVWRHAIHLTQETRTTITQGNFEIT
QLIGWDEKRRLLYVIGTAPDKAGERHLYRVSVPVDGWPPPPVCLTCPGRMIEATAEPSTE
EPEYDNSTSSLPAWPTSTVLPHVADENDSLPTACLYNRVIFSKNFSYYVQECLGPEPPAI
FLCTSAGSRRAVLWDGAPLRQKFAALASPQAKVFRVEVQAQRSARVRLLLPPGLRDTDDL
PLPLVLHLSSAPGSQLVTEQWAPGWGWYLAAARNFIIAEIDARGSGGQGEELRTEIYQKL
LSVDVEDQIAVLSYLRDNLKMVDGSRTGAWGSGYGAGAALALAAGDAANLTRCLALLAPL
ADLRHHNSFWSERYSGLGGGASLGVWRRASSVPPRRVLLAHATADVRAPPPHALALARAL
IQARAVYSHQVYPDEGHNFERSFLHVYSTMEQFFDECFGPVELADWDNPGGLFPFRD