New model in OGS2.0 | DPOGS214134  |
---|---|
Genomic Position | scaffold241:- 18476-30038 |
CDS Length | 2682 |
Paired RNAseq reads   | 1675 |
Single RNAseq reads   | 4833 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA006181 (0.0) |
Best Drosophila hit   | CG40072 (0.0) |
Best Human hit | inactive dipeptidyl peptidase 10 isoform b (2e-71) |
Best NR hit (blastp)   | AGAP005043-PB [Anopheles gambiae str. PEST] (0.0) |
Best NR hit (blastx)   | AGAP005043-PB [Anopheles gambiae str. PEST] (0.0) |
GeneOntology terms    | GO:0008239 dipeptidyl-peptidase activity; GO:0006508 proteolysis; GO:0016020 membrane; GO:0008236 serine-type peptidase activity |
InterPro families    | IPR002469 Peptidase S9B, dipeptidylpeptidase IV N-terminal; IPR001375 Peptidase S9, prolyl oligopeptidase, catalytic domain |
Orthology group | MCL17037 |
Nucleotide sequence:
ATGCTCTGTTGTCGGATTGAGGTTTATGATGGGGCTGTCCGTGTTCCGGTGTGTCGCGAG
TGTCGCGTCTGTGGCGTCCGTGGCGTCTGTGGCGTTTGTGGCGCGCGTGTCAGTGTCGCG
GGCGAGGTGGTGTGCGGGCGCGTAGGGCTGCGCGGTGGCGGGTGCGCGCGCGGCTCGGCC
GGCTCGGGCGGCGGCGCATTATCTGCGAAGGAAGAGGACCTGTACCCCGGCGATGGACAC
AATTGGAGAAGCATCATATTCTCATTGATGGTCATCAGTTTTGTGATAGCAGGCATTGTC
ACAGCGATTTACTTATTGGGATATGTGGACGAGCTGCTGTACTGGTCGGGGCGTCGTATG
AGGCTGGACGAGTTCCTGCGAGGAGACCTGACTGGCGAACGTCTGCCGACCACGTGGGTC
AGCTCACACCAGCTCGTGTACCAGGCGGACGACGGAGGTCTACTCGCCCTGGACACCTTC
AACAATACGCTATACGTGCTCGTCACAAACCACACTCTCAGGCAGCTCAATGTGCGAGGG
TATCAATGCTCGCCAAATCTACGCTTCGTGCTGTTCCAACACAACATCAAAGAGGTTTAC
CGACGGACTTTTACTGCGCATTACACGGTTTATGACGTCACAAATGACCACCATATCCCC
CTCTTCGGCGAGGGTCAGAGCAGTTGGGAGTGGCAGCACGCGGCTTGGCTCGGTACAGAA
GGTGCTATCGTACTGGCTGCAGACAACGAGGTCTTGGCGAGACCAGCACCTCCTGCACGA
AGAGCTCCCTTACTTCGACTTACAAATGATGCTGTCGCGGGCAGCGTTTATAACGGGGTG
TCGGATTGGTTGTACCAAGAGGAGGTGACAAAAGAATCATCAGCGACTTGGGGATCATCA
GACGGGGCTTTCGTGTTATATGTCCAGTACGATGACAGGAAGGTGTCTCAGATGAGGTTC
CCACATATTTCCTCCGGGATAGGTGGCGCTGGTGCCTCCAGATCAGGGTTTTTGCTTCCC
GCTTTCAACAACAGTAACCCAACCATTTTCCCTGATCACGTAACGATCAGATATCCCACG
CCCGGTAGTTCAATACCTCTCGTTAAGTTGTGGATAGTTGCGGTACAAAACGTAACATCT
CCACCCAGGTGGGAAGTAAAACCTCCAAGTACATTAGATGGCATGGAATATTATCTCATA
TCAGCCCAATGGGTGGGCAAAGAAAATTCTCATATAGGAGTAGTTTGGATGAACCGTGCA
CAGAATCTAACTGTGTACAGCAGCTGTTATGCTCCAAATTGGACATGCACTGAGACCCAT
TCAGAAAAAGCCACTGATGAGCCTTGGTTAGAAGTTCATCAGCGTCCGGTTTACTCAGAA
GACGGCAGTGCTTTTCTACTTCTAGCAGCAGTCCAAGAAGGCGGAGGCCAATACTATACT
CATATTAAGCATGTTGATGTCCTCCGTCAACGCATAGCTGTTCTATCGCACGGTAAAGTG
GAGGTGGCGAAGATCCTGGCGTGGGACCAGGAAAACAATTTAGTTTATTATTTAGGTAGC
GCAGATAGACCGGGCCAGAGGCAGGTGTACGTGGTACGTGATCCAAGCTACGGAGGAGCT
AGCAACTCCGTCAGAGCTAGAGCTGAACGCGAGGAACCACGTTGCCTTACGTGTGAGTTG
GCTGTATGGCCTGCTCGGCTTCATTACGCCAACTGCACTTTCTGGAGCGCGACGTTCCCA
CCTCCTAAGCCGAAGCGTGGTATAACTCATTACGTTCTGGAGTGCAGAGGTCCTGGGCCT
CCACTCGCAGGTCTTCACGATGCCAAGACTCATAAGTTAGAGAGAATTTTATACGATACG
AGGCCTTATAGATCTGTACGATTACGTGAGTTGGCATTACCTTCTCGTAGATCATTTGAT
GTACAATTGAGTAGTGGCTCTAAAGCCCGTGTACAGCTTCTGCTGCCGCCGTCTTGGAGA
GAAGAACTCCGTGACGCAGCATTTCCTGTACTAGTTCACGTAGACGGTCGCCCTGGCAGT
CAACAAGTGACAGATGAATTCCTGGTAGACTGGGGAACGTATATGTCCTCACGTAACGAC
GTAGTTTACGTTAAATTAGATGTAGCAGGCGCTAAAGGGCTACCCCGAGCGCTGTTACGA
GGTCGCCTCGGTGGGGTCGAGGTGGCCGATCAATTGGCTGTTATTAGATATTTATTAGAA
ACATTTAAATTCTTGGATGTAACTCGTGTTGCTGTTTGGGGATGGGGTTATGGTGGATAT
GTAACGTCAATGTTGTTGGGGTCTCAGCAGTCTACTTTAAAGTGTGGTATAGCGGTGTCA
CCGATCACAGACTGGCTGTATTACAACGCAGCATTCACGGAGCGTATCCTGGGCCAACCG
TCAGTTAATTATAAAGGGTATGTGGAGGCTGATGCGTCCCAGCGCGCGCACCACGTGCCG
CCGCACGCGTTGTACCTCGTGCACGGGATGGCAGACATGAGCGCGCCGTACCCTCACGCT
CTGCAGTTGGCTAGGGCCTTGACTGATGCTGGAGCGTATGCTGATGAAGGACACGACCTT
GAAGGTGTTATCGAGCATGTTTACCGGTCAATGGAAGATTACCTCCTAGAGTGCTTGTCC
CTCGACCCAGAAGACACCAAGCTGCCTCCGCCAGATAGATAA
Protein sequence:
MLCCRIEVYDGAVRVPVCRECRVCGVRGVCGVCGARVSVAGEVVCGRVGLRGGGCARGSA
GSGGGALSAKEEDLYPGDGHNWRSIIFSLMVISFVIAGIVTAIYLLGYVDELLYWSGRRM
RLDEFLRGDLTGERLPTTWVSSHQLVYQADDGGLLALDTFNNTLYVLVTNHTLRQLNVRG
YQCSPNLRFVLFQHNIKEVYRRTFTAHYTVYDVTNDHHIPLFGEGQSSWEWQHAAWLGTE
GAIVLAADNEVLARPAPPARRAPLLRLTNDAVAGSVYNGVSDWLYQEEVTKESSATWGSS
DGAFVLYVQYDDRKVSQMRFPHISSGIGGAGASRSGFLLPAFNNSNPTIFPDHVTIRYPT
PGSSIPLVKLWIVAVQNVTSPPRWEVKPPSTLDGMEYYLISAQWVGKENSHIGVVWMNRA
QNLTVYSSCYAPNWTCTETHSEKATDEPWLEVHQRPVYSEDGSAFLLLAAVQEGGGQYYT
HIKHVDVLRQRIAVLSHGKVEVAKILAWDQENNLVYYLGSADRPGQRQVYVVRDPSYGGA
SNSVRARAEREEPRCLTCELAVWPARLHYANCTFWSATFPPPKPKRGITHYVLECRGPGP
PLAGLHDAKTHKLERILYDTRPYRSVRLRELALPSRRSFDVQLSSGSKARVQLLLPPSWR
EELRDAAFPVLVHVDGRPGSQQVTDEFLVDWGTYMSSRNDVVYVKLDVAGAKGLPRALLR
GRLGGVEVADQLAVIRYLLETFKFLDVTRVAVWGWGYGGYVTSMLLGSQQSTLKCGIAVS
PITDWLYYNAAFTERILGQPSVNYKGYVEADASQRAHHVPPHALYLVHGMADMSAPYPHA
LQLARALTDAGAYADEGHDLEGVIEHVYRSMEDYLLECLSLDPEDTKLPPPDR
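
The record lists a CDS length of 2682 nt, and the protein above can be checked against it. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this model and that the CDS length includes the terminal stop codon. The names `translate` and `expected_protein_length` are hypothetical helpers written for this illustration; they are not part of any database tool.

```python
# Standard genetic code (assumed: NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Codons containing ambiguous bases (e.g. N) are reported as 'X'.
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int, includes_stop: bool = True) -> int:
    """Number of residues implied by a CDS length in nucleotides."""
    codons, remainder = divmod(cds_length, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if includes_stop else codons


# First and last lines of the nucleotide sequence in this record.
# Every full line is 60 nt, so each line starts in frame.
FIRST_LINE = "ATGCTCTGTTGTCGGATTGAGGTTTATGATGGGGCTGTCCGTGTTCCGGTGTGTCGCGAG"
LAST_LINE = "CTCGACCCAGAAGACACCAAGCTGCCTCCGCCAGATAGATAA"

print(translate(FIRST_LINE))          # MLCCRIEVYDGAVRVPVCRE
print(translate(LAST_LINE))           # LDPEDTKLPPPDR*
print(expected_protein_length(2682))  # 893
```

The translated ends agree with the start and end of the protein sequence listed above, and 2682 nt corresponds to 893 residues plus one stop codon.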