DPGLEAN12708 in OGS1.0

New model in OGS2.0DPOGS215364 
Genomic Positionscaffold3667:+ 625-9041
See gene structure
CDS Length1848
Paired RNAseq reads  2956
Single RNAseq reads  7404
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA009550 (1e-102)
Best Drosophila hit  CG3744, isoform B (4e-68)
Best Human hitdipeptidyl peptidase 8 isoform 3 (8e-97)
Best NR hit (blastp)  PREDICTED: similar to AGAP003138-PA [Tribolium castaneum] (4e-158)
Best NR hit (blastx)  PREDICTED: similar to AGAP003138-PA [Tribolium castaneum] (5e-144)
GeneOntology terms



  
GO:0005634 nucleus
GO:0005737 cytoplasm
GO:0006508 proteolysis
GO:0008236 serine-type peptidase activity
GO:0016020 membrane
InterPro families
  
IPR001375 Peptidase S9, prolyl oligopeptidase, catalytic domain
IPR002469 Peptidase S9B, dipeptidylpeptidase IV N-terminal
Orthology groupMCL11576

Nucleotide sequence:

ATGGCGATTAATGCTCTAACCTTCTCCATGCGAGAAGAGGCCTTTGCTCAGCAAATAGTG
TACGAGGAGGTGGATGAGGGTGAGGTGAAGATATACAGCTTCCCATCATCACAGAGCTCC
AGCGGGGAGGTCGAGGAGTTCAGGTTTCCCCGCGCCGGCACCCCTAATGCTAAATCAGTC
CTGAAAATGGTGACCTTCAGATTACAGAAAGCTCCCCCCACCACCGTCCTTGATTATTAC
CAAGAAGGGAACTCTAATACTGTTGCATCAGAGAGCCCCGGGAACAGTTCGGATCCCTTG
GAGGTGGTCGATGTAAGATGGTATGAACTGAGACATTCGCTGAAAGAGGTGTTCCCCTGG
TTTGAATACCTGGCCAGAGTCGGTTGGACCCCGTGCTCTCAATACGTTTGGGTCCAGGTG
TTGGACAGGAAGCAGCAGAGGTTAGAACTGGCCCTGGTGCCGGTTAGTGAGTTCAATGTC
CCAGTGAGGTATGAGCAGGGGTCTGATGGAGGAAGACTGGATGAGGAATCTCCAGCTTCA
GGGAGTAGACAGGGAGACAGGACACAGATCCAGGTGTTGGTGTCTGAGACGGCTCCCGAC
GCGTGGGTCAACGTCCACGACATACTGCACTTCCTGCCCTCAGAACCTGGTATTGTGAGG
TTCATCTGGGCTTCAGAGGAAACCGGACACCTGCACCTGTATCTCATCACCTGCGCTGTC
AACGGACAAAGGGCTATGACAGTAACTGATATAATGGCTGAGGATGAGTCAAATGCTGCA
GTCCCTCGGGTGATCAGCAAGGAACCCCTCACCGATGGGGACTGGGAGGTCATGGGAAGA
AAGATATGGGTGGACGAGCCGCGCGGTCTGGTGTATTTCGTAGGGCTCCGTGAGACGCCG
CTGGAGCGCCACCTGTACGTGGTGTCAATGTCCGCGCCCAGGCAGGTCGTCCTGCTCACT
AAGCCGGGACATTCACACAGTGTTGACATGGACGAGTCACCGGAACCTCGTTCGTTCAAC
GGTTCCTGGGACTGTCGTCCTGATGAGGAGGAGTCGCCCAGCACCCGCCCTCCCCCGGTG
CCCCCTCCACAGATACTATCGACTCGTCTGTCTTGCGGAGCCCTAGCATACTGCACACTA
TGGCGGAGCGCCGTCCCAGGGCGAAGGCCGACCGTCTTACACGTTTACGGAGGGCCCGAG
GTTCAAACGGTCACTAATAGTTACAAGGGTGTACGACAGTTGAGAATGCATATGCTGGCT
GCCCGAGGGTTCACAGTGGTGTCCGTGGACTCGAGGGGGTCTAAACACAGAGGGAGGTTG
TGGGAAGCAGCTATCAAAGGAAAGATGGGACAAGTGGAGCTGGACGATCAGGTGGAAGTT
CTCCAATGGCTGGCGAAAGAAACTGGCTGCATTGATATGGATCGAGTCGCTATACACGGG
TGGAGTTATGGTGGTTATCTGTCACTGCTGGGGCTGGCGACCCGTCCTAATACCTTCAAG
GTGTGCGTGGCGGGTGCTCCGGTGACGTGCTGGAGGCTCTACGACACGGCCTACACGGAG
CGCTACATGGGACTCCCGGCCTGCGCCCCTCATTCCTACAGCCGAGCCAGTGTGTTGGCC
CACGCTCCCTTCTTCCCTGATAGGGAGGGCCGTCTCCTTATAATCCACGGTTTAGCCGAC
GAGAACGTCCACTTCTGTCACACGGCTGCTCTCCTGGCCGAGCTGGTGAGGCTCGGGAAG
CCTCACAGAGTTCAGGTTTACCCGGGTGAGAGGCATTCGCTGCGAGCTATGCACGCGGCT
AAGCATTACGAGGCGACACTGCTGCACTTCCTACACGAGAACCTGTAG

Protein sequence:

MAINALTFSMREEAFAQQIVYEEVDEGEVKIYSFPSSQSSSGEVEEFRFPRAGTPNAKSV
LKMVTFRLQKAPPTTVLDYYQEGNSNTVASESPGNSSDPLEVVDVRWYELRHSLKEVFPW
FEYLARVGWTPCSQYVWVQVLDRKQQRLELALVPVSEFNVPVRYEQGSDGGRLDEESPAS
GSRQGDRTQIQVLVSETAPDAWVNVHDILHFLPSEPGIVRFIWASEETGHLHLYLITCAV
NGQRAMTVTDIMAEDESNAAVPRVISKEPLTDGDWEVMGRKIWVDEPRGLVYFVGLRETP
LERHLYVVSMSAPRQVVLLTKPGHSHSVDMDESPEPRSFNGSWDCRPDEEESPSTRPPPV
PPPQILSTRLSCGALAYCTLWRSAVPGRRPTVLHVYGGPEVQTVTNSYKGVRQLRMHMLA
ARGFTVVSVDSRGSKHRGRLWEAAIKGKMGQVELDDQVEVLQWLAKETGCIDMDRVAIHG
WSYGGYLSLLGLATRPNTFKVCVAGAPVTCWRLYDTAYTERYMGLPACAPHSYSRASVLA
HAPFFPDREGRLLIIHGLADENVHFCHTAALLAELVRLGKPHRVQVYPGERHSLRAMHAA
KHYEATLLHFLHENL