New model in OGS2.0 | DPOGS214132  |
---|---|
Genomic Position | scaffold241:- 88997-97161 |
CDS Length | 2307 |
Paired RNAseq reads   | 3517 |
Single RNAseq reads   | 10130 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA006180 (0.0) |
Best Drosophila hit   | CG11034 (2e-113) |
Best Human hit | dipeptidyl peptidase 4 (5e-101) |
Best NR hit (blastp)   | dipeptidyl-peptidase [Aedes aegypti] (1e-137) |
Best NR hit (blastx)   | dipeptidyl-peptidase [Aedes aegypti] (1e-135) |
GeneOntology terms    | GO:0008239 dipeptidyl-peptidase activity; GO:0006508 proteolysis; GO:0016020 membrane; GO:0008236 serine-type peptidase activity |
InterPro families    | IPR002469 Peptidase S9B, dipeptidylpeptidase IV N-terminal; IPR001375 Peptidase S9, prolyl oligopeptidase, catalytic domain |
Orthology group | MCL12154 |
Nucleotide sequence:
ATGTATGAACATGACTTGGGATTGCTTAACGGTGCTATCCTCTTATACAGTGACGAGCCC
ATTCATAATCGTAAAATGAAATATAATCTGGTAGTCCTAGCAACGGCATTGGTGTACTTC
CTTGCCGATTCTGTTGCTTCACCTCAAGGACTCCTGAAGACATTCACTTTGGAGGAACTG
GTGCCTTTGCAACATGAGTTCTTTCCTGACAGAGTGGCTGTACAATGGATATCAGATACG
GAATATATTATAGCGGAACCAGATTCAGTAAATAAATATGACGCCATTACCGACACACAC
AGCACAATACTGGATAAGAAGGAATTGCTCAACATGAGCCAGTTTTCGGTGTCCTCGTTC
TCGAACGATCAAAAATATGTACTACTAGTACTTACACCAAGTAGAAAGAAGATTTATAGA
TATTCAACATTGGCTGAATATTCGCTGTATGATCTTGAGAAGAATAAAATCGCTAACATC
GCCCACGGACCGCTCCAGGTTGTTGTATGGGGCAGTGACAAGTCTTTAGCCTACGTTGAA
GACAACAATGTGTACTATATACCTGACGTGGCTCAACCCGATGTTGTGACAGCACTGACG
AAAGATGGCGTCCCAGGAGAAATATATCATGGTGTGACAGATTGGATTTATGAAGAGGAA
GTGTTTAATGCAGCGGAAGCAATGTGGTTTTCACCTCATGGGACCTACTTGGCCGTAGCA
ACCTTCAATGACACTCAAGTGGAGTCCGCCTTATACCCCTACTACGGGGAACCGTCCGAT
TTTAATAGTCAATATCCTTTACTCGTTCATTTTAAATATCCTAAGGCAGGTCGCACGAAT
CCAGATGTGCAACTGCGTGTGTTCAATCTCAATGACACGTCCAGTGAGCCGATGATGATT
CCAGCCCCTGTAGATATTGTGGGCCTAGATCACATTTTGGGGAGGGTCAATTGGGCTACT
GATCAAAATCTCGTCGTTCTATGGCTTAACAGACGACAGAGTATTAGTGTTTTAGTGAAC
TGCAATCTAAAAGAGAACAAATGCAATATAGTGAAACAGCATAATGAACCCAATGGTTGG
ATTGATATTAACGAACCGTTTTTCGATAAAACAGGAAAGAAAATGTTAGAAATTCAACCC
ATGCATTACGAAGATCAGAGATTTATGCATGTAGCACATTTTGATTTCGAAACTCAAGAA
ACGACCGATTTGAGTCCAGGAAATTCCACAGTCACAGAAATATTGGGATGGGATCAGAAA
TCAGACATTGTTCTGTATATTGTATCCCCGGGAAATGAACCTTGGCAAAGACAACTGTGG
GGTGCCTCTAAAGGAATCAATAGATGCATTTCGTGCACCAAACCGACTTGTCACAACGTT
GACGGTATGTTTTCACCGGCAGGTAGCTATGGAATTGTATCGTGCAGTGCCGTAAATGTA
CCTCCAGTTACATACTTTTTCAAAAGCCAGAATAGAGGCTTTAAGATCATAACGGAAAAC
TCGAAATTGCTTGAAAAATTGAGTCGTTATAAAATGCCTTTGGTCTTATTTAACAAGATA
TCGTTAGAAGAGGATACGATGGCTCATATCAAGTTGTTGTTGCCACCTGAAATGAAACCA
GGGAAGAAGTATCCTATGATAGTGAGGTTATACGCTGGACCCGGAACAACTAGAGTCAAA
GACACCTATGATCTTGAATACTACAATCTTTATTTAAGCGGCAATCGTAGTTTCATAGTA
GCGTCGATCGATGTAAGGGGTTCGGGCGCGATGGGTGTGGAGGCGATGCACGCCCTCAAC
AACGCTCTTGGGACCGTTGAAATTACCGATACTTTAACAGCTATCAGACGACTTGTGAGT
ATGTATTCGTTCATTGATACCGACCGTATTGGAGCTTGGGGATGGAGTTATGGTGGTTAC
GCTACCACTATGATGTTGATCAGAGACCATGACAAGATAGTGACGTGTGGCGCTGCTGTC
GCTCCAGTTACTTCGTGGCTATATTATGATACAATTTACACGGAGAGGTATATGGATACA
CCTCAAAACAACCCAGTGGGCTATGAAAACTCAGACCTGATGATGCAAGCTGAAAAACTC
CGAGACCGCCGTTATCTTTTAGTACATGGCACTGGTGATGACAATGTTCACTACCAACAC
AGCTTGCAACTAGCCAAGGTGCTGCAAAGAGCTGACATTGCATTTGAACAAATGAGTTAT
ACTGATGAAAATCATTCTTTGCGAGGTGTGAGTCGACATTTCTACCATACATTGGATCAC
TTCTGGTCGCAATGTTTTAACTTATAA
Protein sequence:
MYEHDLGLLNGAILLYSDEPIHNRKMKYNLVVLATALVYFLADSVASPQGLLKTFTLEEL
VPLQHEFFPDRVAVQWISDTEYIIAEPDSVNKYDAITDTHSTILDKKELLNMSQFSVSSF
SNDQKYVLLVLTPSRKKIYRYSTLAEYSLYDLEKNKIANIAHGPLQVVVWGSDKSLAYVE
DNNVYYIPDVAQPDVVTALTKDGVPGEIYHGVTDWIYEEEVFNAAEAMWFSPHGTYLAVA
TFNDTQVESALYPYYGEPSDFNSQYPLLVHFKYPKAGRTNPDVQLRVFNLNDTSSEPMMI
PAPVDIVGLDHILGRVNWATDQNLVVLWLNRRQSISVLVNCNLKENKCNIVKQHNEPNGW
IDINEPFFDKTGKKMLEIQPMHYEDQRFMHVAHFDFETQETTDLSPGNSTVTEILGWDQK
SDIVLYIVSPGNEPWQRQLWGASKGINRCISCTKPTCHNVDGMFSPAGSYGIVSCSAVNV
PPVTYFFKSQNRGFKIITENSKLLEKLSRYKMPLVLFNKISLEEDTMAHIKLLLPPEMKP
GKKYPMIVRLYAGPGTTRVKDTYDLEYYNLYLSGNRSFIVASIDVRGSGAMGVEAMHALN
NALGTVEITDTLTAIRRLVSMYSFIDTDRIGAWGWSYGGYATTMMLIRDHDKIVTCGAAV
APVTSWLYYDTIYTERYMDTPQNNPVGYENSDLMMQAEKLRDRRYLLVHGTGDDNVHYQH
SLQLAKVLQRADIAFEQMSYTDENHSLRGVSRHFYHTLDHFWSQCFNL
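
The record's CDS length (2307 nt) and the two sequences above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene. The names `CODON_TABLE` and `translate` are illustrative and not part of the database. It translates only the first and last lines of the nucleotide sequence and compares the lengths (768 residues counted in the protein sequence above, plus one stop codon).

```python
from itertools import product

# Standard genetic code, built in TCAG order (assumption: no alternative code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nt: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    if len(nt) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, len(nt), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGTATGAACATGACTTGGGATTGCTTAACGGTGCTATCCTCTTATACAGTGACGAGCCC"
last_line = "TTCTGGTCGCAATGTTTTAACTTATAA"

cds_length = 2307      # "CDS Length" field of the record
protein_length = 768   # residues counted in the protein sequence

# The CDS should cover every residue plus one terminal stop codon.
assert cds_length == 3 * (protein_length + 1)
assert translate(first_line) == "MYEHDLGLLNGAILLYSDEP"
assert translate(last_line) == "FWSQCFNL*"
```

Translating the full CDS the same way would require joining all nucleotide lines into one string before calling `translate`.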