DPGLEAN13079 in OGS1.0

New model in OGS2.0DPOGS214132 
Genomic Positionscaffold241:- 88997-97161
See gene structure
CDS Length2307
Paired RNAseq reads  3517
Single RNAseq reads  10130
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006180 (0.0)
Best Drosophila hit  CG11034 (2e-113)
Best Human hitdipeptidyl peptidase 4 (5e-101)
Best NR hit (blastp)  dipeptidyl-peptidase [Aedes aegypti] (1e-137)
Best NR hit (blastx)  dipeptidyl-peptidase [Aedes aegypti] (1e-135)
GeneOntology terms


  
GO:0008239 dipeptidyl-peptidase activity
GO:0006508 proteolysis
GO:0016020 membrane
GO:0008236 serine-type peptidase activity
InterPro families
  
IPR002469 Peptidase S9B, dipeptidylpeptidase IV N-terminal
IPR001375 Peptidase S9, prolyl oligopeptidase, catalytic domain
Orthology groupMCL12154

Nucleotide sequence:

ATGTATGAACATGACTTGGGATTGCTTAACGGTGCTATCCTCTTATACAGTGACGAGCCC
ATTCATAATCGTAAAATGAAATATAATCTGGTAGTCCTAGCAACGGCATTGGTGTACTTC
CTTGCCGATTCTGTTGCTTCACCTCAAGGACTCCTGAAGACATTCACTTTGGAGGAACTG
GTGCCTTTGCAACATGAGTTCTTTCCTGACAGAGTGGCTGTACAATGGATATCAGATACG
GAATATATTATAGCGGAACCAGATTCAGTAAATAAATATGACGCCATTACCGACACACAC
AGCACAATACTGGATAAGAAGGAATTGCTCAACATGAGCCAGTTTTCGGTGTCCTCGTTC
TCGAACGATCAAAAATATGTACTACTAGTACTTACACCAAGTAGAAAGAAGATTTATAGA
TATTCAACATTGGCTGAATATTCGCTGTATGATCTTGAGAAGAATAAAATCGCTAACATC
GCCCACGGACCGCTCCAGGTTGTTGTATGGGGCAGTGACAAGTCTTTAGCCTACGTTGAA
GACAACAATGTGTACTATATACCTGACGTGGCTCAACCCGATGTTGTGACAGCACTGACG
AAAGATGGCGTCCCAGGAGAAATATATCATGGTGTGACAGATTGGATTTATGAAGAGGAA
GTGTTTAATGCAGCGGAAGCAATGTGGTTTTCACCTCATGGGACCTACTTGGCCGTAGCA
ACCTTCAATGACACTCAAGTGGAGTCCGCCTTATACCCCTACTACGGGGAACCGTCCGAT
TTTAATAGTCAATATCCTTTACTCGTTCATTTTAAATATCCTAAGGCAGGTCGCACGAAT
CCAGATGTGCAACTGCGTGTGTTCAATCTCAATGACACGTCCAGTGAGCCGATGATGATT
CCAGCCCCTGTAGATATTGTGGGCCTAGATCACATTTTGGGGAGGGTCAATTGGGCTACT
GATCAAAATCTCGTCGTTCTATGGCTTAACAGACGACAGAGTATTAGTGTTTTAGTGAAC
TGCAATCTAAAAGAGAACAAATGCAATATAGTGAAACAGCATAATGAACCCAATGGTTGG
ATTGATATTAACGAACCGTTTTTCGATAAAACAGGAAAGAAAATGTTAGAAATTCAACCC
ATGCATTACGAAGATCAGAGATTTATGCATGTAGCACATTTTGATTTCGAAACTCAAGAA
ACGACCGATTTGAGTCCAGGAAATTCCACAGTCACAGAAATATTGGGATGGGATCAGAAA
TCAGACATTGTTCTGTATATTGTATCCCCGGGAAATGAACCTTGGCAAAGACAACTGTGG
GGTGCCTCTAAAGGAATCAATAGATGCATTTCGTGCACCAAACCGACTTGTCACAACGTT
GACGGTATGTTTTCACCGGCAGGTAGCTATGGAATTGTATCGTGCAGTGCCGTAAATGTA
CCTCCAGTTACATACTTTTTCAAAAGCCAGAATAGAGGCTTTAAGATCATAACGGAAAAC
TCGAAATTGCTTGAAAAATTGAGTCGTTATAAAATGCCTTTGGTCTTATTTAACAAGATA
TCGTTAGAAGAGGATACGATGGCTCATATCAAGTTGTTGTTGCCACCTGAAATGAAACCA
GGGAAGAAGTATCCTATGATAGTGAGGTTATACGCTGGACCCGGAACAACTAGAGTCAAA
GACACCTATGATCTTGAATACTACAATCTTTATTTAAGCGGCAATCGTAGTTTCATAGTA
GCGTCGATCGATGTAAGGGGTTCGGGCGCGATGGGTGTGGAGGCGATGCACGCCCTCAAC
AACGCTCTTGGGACCGTTGAAATTACCGATACTTTAACAGCTATCAGACGACTTGTGAGT
ATGTATTCGTTCATTGATACCGACCGTATTGGAGCTTGGGGATGGAGTTATGGTGGTTAC
GCTACCACTATGATGTTGATCAGAGACCATGACAAGATAGTGACGTGTGGCGCTGCTGTC
GCTCCAGTTACTTCGTGGCTATATTATGATACAATTTACACGGAGAGGTATATGGATACA
CCTCAAAACAACCCAGTGGGCTATGAAAACTCAGACCTGATGATGCAAGCTGAAAAACTC
CGAGACCGCCGTTATCTTTTAGTACATGGCACTGGTGATGACAATGTTCACTACCAACAC
AGCTTGCAACTAGCCAAGGTGCTGCAAAGAGCTGACATTGCATTTGAACAAATGAGTTAT
ACTGATGAAAATCATTCTTTGCGAGGTGTGAGTCGACATTTCTACCATACATTGGATCAC
TTCTGGTCGCAATGTTTTAACTTATAA

Protein sequence:

MYEHDLGLLNGAILLYSDEPIHNRKMKYNLVVLATALVYFLADSVASPQGLLKTFTLEEL
VPLQHEFFPDRVAVQWISDTEYIIAEPDSVNKYDAITDTHSTILDKKELLNMSQFSVSSF
SNDQKYVLLVLTPSRKKIYRYSTLAEYSLYDLEKNKIANIAHGPLQVVVWGSDKSLAYVE
DNNVYYIPDVAQPDVVTALTKDGVPGEIYHGVTDWIYEEEVFNAAEAMWFSPHGTYLAVA
TFNDTQVESALYPYYGEPSDFNSQYPLLVHFKYPKAGRTNPDVQLRVFNLNDTSSEPMMI
PAPVDIVGLDHILGRVNWATDQNLVVLWLNRRQSISVLVNCNLKENKCNIVKQHNEPNGW
IDINEPFFDKTGKKMLEIQPMHYEDQRFMHVAHFDFETQETTDLSPGNSTVTEILGWDQK
SDIVLYIVSPGNEPWQRQLWGASKGINRCISCTKPTCHNVDGMFSPAGSYGIVSCSAVNV
PPVTYFFKSQNRGFKIITENSKLLEKLSRYKMPLVLFNKISLEEDTMAHIKLLLPPEMKP
GKKYPMIVRLYAGPGTTRVKDTYDLEYYNLYLSGNRSFIVASIDVRGSGAMGVEAMHALN
NALGTVEITDTLTAIRRLVSMYSFIDTDRIGAWGWSYGGYATTMMLIRDHDKIVTCGAAV
APVTSWLYYDTIYTERYMDTPQNNPVGYENSDLMMQAEKLRDRRYLLVHGTGDDNVHYQH
SLQLAKVLQRADIAFEQMSYTDENHSLRGVSRHFYHTLDHFWSQCFNL