New model in OGS2.0 | DPOGS201115  |
---|---|
Genomic Position | scaffold258:+ 5659-30543 |
CDS Length | 2514 bp |
Paired RNAseq reads   | 1422 |
Single RNAseq reads   | 3234 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA006181 (1e-50) |
Best Drosophila hit   | CG9059, isoform D (3e-108) |
Best Human hit | inactive dipeptidyl peptidase 10 isoform b (1e-63) |
Best NR hit (blastp)   | PREDICTED: similar to CG9059 CG9059-PB [Tribolium castaneum] (2e-165) |
Best NR hit (blastx)   | PREDICTED: similar to CG9059 CG9059-PB [Tribolium castaneum] (4e-148) |
GeneOntology terms    | GO:0008239 dipeptidyl-peptidase activity; GO:0008236 serine-type peptidase activity; GO:0016020 membrane; GO:0006508 proteolysis |
InterPro families    | IPR002469 Peptidase S9B, dipeptidylpeptidase IV N-terminal; IPR001375 Peptidase S9, prolyl oligopeptidase, catalytic domain |
Orthology group | MCL18897 |
Nucleotide sequence:
ATGAAACCTCCATCAGTCTTCCTCGGAGCGCTCGACACGAAGCTGGACGGCGCTAGATTT
ATTTTAATAAATTACGAGGAGTTGGTGTTCCGCGACCGTTGGGGAGGATTGACGCTGTTC
AACGTCAAGAACTTAACAACCAGACTGCTTATGAACAATTCTACCTTTCCGAGACTCGTG
GACAAGGCCGTAGAGTTGGACGTAGTCGAGGGCGCGGACGTGGCTTTGGACGTAGTCAAG
GTGGAATGGGCAACAGCAACAAGGTTAATCGACGACAGAGAATCGACACATCTAGTAGTA
GCTCTAAAAAGGGAGCTGAATGCTGTAGACTTTAAAGTCTCTTCCGATCTGAAGTTCGTT
CTGCTGATATCCGACGTGCGACCGGGCTGGCGACACGCAAGACTAGCAAGATACCACGTG
TATGATGTTATAACTAGGAACAAAATCCCCATTTCGCCGATAGAGGACGACAGGTCTGCT
CCCTTGCTGCAGTATGCAGAATGGTCTCCTGTCGGCTCCGGGCTGGTGTTCGTATATGAC
AACGACATTTACTACAAGCCTAAGGTTTTAAAGGCCTTGGTTTGCAGAATCACTAGTAAC
GGAGTTCCAGGTGTAATCTTTAATGGAGTACCAGATTTCCTTTACGAGACCGAGGTGTTG
CGATTGGACCGCGCCCTGTGGTTCAGCCCCGACGGACAGACGCTCATGTACGTGACCTAC
AATGACAGCCTGGTCCAACAACACAAATATCCTTGGTATGGTTTGGATCAACAGGAACCG
CCCGCCTACCCTGCCATACGGACCCTGAGATATCCGAAGATGAACACTAATAACCCAGCA
GTAACGGTGTACGTGGTCAGTCTGAAGACTCCAAAGTTTCTGTTCCCACATGCTATACAG
TTTAATTCACCCTTTGACTCTGGCTGGTATGTTCGTTGGACAAGTTGGGTGTCTGAGCGT
CAGATAGCTGCTCTCCTTCTCAACAGACCTCAGAATCTATCCATCATCGCGACATGCAAC
GCTGTGTCTTACAACTGCCAAGATATCTATCGCGACGAATCTGACGGTTCGCGATGGTCA
GGTCTTGGGTCAGACCCGGAGGAAGAGTGCGGCTGGTGCGGGGGGGCAGCGCTCGTGGGG
GGGAGGAGCGGCATCTTCACGTCCATACCTGTCACCGACCAGGGAGGCGTGTGGAGACAC
GCTATACATCTCACCCAGGAGACCAGAACGACCATTACCCAGGGGAACTTCGAAATAACA
CAGCTGATTGGATGGGATGAGAAACGAAGACTGCTCTACGTGATCGGTACCGCCCCTGAC
AAAGCCGGAGAGCGTCACCTCTACCGCGTGTCGGTGCCTGTGGACGGCTGGCCTCCTCCG
CCCGTCTGCCTCACCTGCCCTGGACGGATGATTGAAGCCACCGCTGAACCCAGCACGGAG
GAACCGGAATACGACAACAGCACGTCGTCATTACCCGCGTGGCCGACGTCCACAGTTCTG
CCTCACGTGGCCGACGAGAACGACTCGCTTCCCACCGCCTGTCTCTACAACAGAGTTATA
TTTAGCAAAAATTTCTCGTACTACGTTCAGGAGTGCCTCGGTCCAGAGCCCCCGGCGATA
TTCCTTTGTACGTCAGCGGGGTCTCGCCGAGCCGTGCTGTGGGACGGGGCGCCGCTCAGA
CAAAAGTTCGCGGCCCTCGCTTCCCCCCAGGCCAAAGTGTTCAGAGTCGAGGTCCAAGCG
CAAAGATCAGCACGCGTAAGGTTGCTGCTGCCTCCGGGCTTGCGGGACACTGACGACTTG
CCGCTACCACTCGTATTGCATTTGTCTTCGGCTCCAGGTTCCCAGCTGGTGACTGAGCAG
TGGGCACCTGGTTGGGGCTGGTATCTCGCCGCCGCAAGGAACTTTATAATTGCAGAAATA
GACGCGAGAGGATCCGGCGGACAGGGAGAGGAGTTACGAACAGAGATATACCAGAAACTG
CTCTCAGTAGACGTCGAAGACCAAATAGCTGTTTTATCCTACCTCCGTGACAACCTGAAG
ATGGTGGATGGGAGCCGTACTGGTGCCTGGGGGAGCGGGTACGGCGCGGGCGCGGCGCTC
GCCCTCGCTGCGGGAGACGCCGCTAACCTCACCAGGTGCCTGGCCCTGCTGGCGCCCCTC
GCCGACTTACGACACCACAACTCGTTCTGGTCGGAGCGCTACTCCGGCCTGGGCGGCGGA
GCGTCGCTGGGTGTGTGGCGCCGGGCGTCCTCGGTGCCTCCTCGGCGCGTGTTGCTCGCG
CACGCCACCGCGGACGTGCGGGCGCCTCCGCCCCATGCTCTCGCTCTTGCACGAGCTCTC
ATACAAGCCAGAGCCGTGTACTCTCATCAGGTGTATCCTGATGAGGGTCACAACTTCGAG
CGCTCGTTCCTGCACGTGTACTCGACTATGGAGCAGTTCTTCGACGAATGTTTCGGGCCC
GTGGAGCTCGCGGACTGGGACAATCCCGGAGGACTGTTTCCATTTAGGGATTGA
Protein sequence:
MKPPSVFLGALDTKLDGARFILINYEELVFRDRWGGLTLFNVKNLTTRLLMNNSTFPRLV
DKAVELDVVEGADVALDVVKVEWATATRLIDDRESTHLVVALKRELNAVDFKVSSDLKFV
LLISDVRPGWRHARLARYHVYDVITRNKIPISPIEDDRSAPLLQYAEWSPVGSGLVFVYD
NDIYYKPKVLKALVCRITSNGVPGVIFNGVPDFLYETEVLRLDRALWFSPDGQTLMYVTY
NDSLVQQHKYPWYGLDQQEPPAYPAIRTLRYPKMNTNNPAVTVYVVSLKTPKFLFPHAIQ
FNSPFDSGWYVRWTSWVSERQIAALLLNRPQNLSIIATCNAVSYNCQDIYRDESDGSRWS
GLGSDPEEECGWCGGAALVGGRSGIFTSIPVTDQGGVWRHAIHLTQETRTTITQGNFEIT
QLIGWDEKRRLLYVIGTAPDKAGERHLYRVSVPVDGWPPPPVCLTCPGRMIEATAEPSTE
EPEYDNSTSSLPAWPTSTVLPHVADENDSLPTACLYNRVIFSKNFSYYVQECLGPEPPAI
FLCTSAGSRRAVLWDGAPLRQKFAALASPQAKVFRVEVQAQRSARVRLLLPPGLRDTDDL
PLPLVLHLSSAPGSQLVTEQWAPGWGWYLAAARNFIIAEIDARGSGGQGEELRTEIYQKL
LSVDVEDQIAVLSYLRDNLKMVDGSRTGAWGSGYGAGAALALAAGDAANLTRCLALLAPL
ADLRHHNSFWSERYSGLGGGASLGVWRRASSVPPRRVLLAHATADVRAPPPHALALARAL
IQARAVYSHQVYPDEGHNFERSFLHVYSTMEQFFDECFGPVELADWDNPGGLFPFRD
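
The nucleotide and protein sequences above can be checked against each other: a 2514-nt CDS holds 838 codons, which is 837 residues plus a stop codon. The sketch below is one way to run that check in plain Python. It assumes the standard genetic code (NCBI translation table 1); `CODON_TABLE` and `translate` are illustrative names, not part of this record or of any database tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGAAACCTCCATCAGTCTTCCTCGGAGCGCTCGACACGAAGCTGGACGGCGCTAGATTT"
last_line = "GTGGAGCTCGCGGACTGGGACAATCCCGGAGGACTGTTTCCATTTAGGGATTGA"

print(translate(first_line))  # MKPPSVFLGALDTKLDGARF
print(translate(last_line))   # VELADWDNPGGLFPFRD*
print(2514 // 3 - 1)          # 837 residues expected in the protein
```

Passing the full nucleotide sequence to `translate` and dropping the trailing `*` should give the protein sequence listed above, under the same standard-code assumption.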