New model in OGS2.0 | DPOGS215364  |
---|---|
Genomic Position | scaffold3667:+ 625-9041 |
See gene structure | |
CDS Length | 1848 |
Paired RNAseq reads   | 2956 |
Single RNAseq reads   | 7404 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA009550 (1e-102) |
Best Drosophila hit   | CG3744, isoform B (4e-68) |
Best Human hit | dipeptidyl peptidase 8 isoform 3 (8e-97) |
Best NR hit (blastp)   | PREDICTED: similar to AGAP003138-PA [Tribolium castaneum] (4e-158) |
Best NR hit (blastx)   | PREDICTED: similar to AGAP003138-PA [Tribolium castaneum] (5e-144) |
GeneOntology terms    | GO:0005634 nucleus GO:0005737 cytoplasm GO:0006508 proteolysis GO:0008236 serine-type peptidase activity GO:0016020 membrane |
InterPro families    | IPR001375 Peptidase S9, prolyl oligopeptidase, catalytic domain IPR002469 Peptidase S9B, dipeptidylpeptidase IV N-terminal |
Orthology group | MCL11576 |
Nucleotide sequence:
ATGGCGATTAATGCTCTAACCTTCTCCATGCGAGAAGAGGCCTTTGCTCAGCAAATAGTG
TACGAGGAGGTGGATGAGGGTGAGGTGAAGATATACAGCTTCCCATCATCACAGAGCTCC
AGCGGGGAGGTCGAGGAGTTCAGGTTTCCCCGCGCCGGCACCCCTAATGCTAAATCAGTC
CTGAAAATGGTGACCTTCAGATTACAGAAAGCTCCCCCCACCACCGTCCTTGATTATTAC
CAAGAAGGGAACTCTAATACTGTTGCATCAGAGAGCCCCGGGAACAGTTCGGATCCCTTG
GAGGTGGTCGATGTAAGATGGTATGAACTGAGACATTCGCTGAAAGAGGTGTTCCCCTGG
TTTGAATACCTGGCCAGAGTCGGTTGGACCCCGTGCTCTCAATACGTTTGGGTCCAGGTG
TTGGACAGGAAGCAGCAGAGGTTAGAACTGGCCCTGGTGCCGGTTAGTGAGTTCAATGTC
CCAGTGAGGTATGAGCAGGGGTCTGATGGAGGAAGACTGGATGAGGAATCTCCAGCTTCA
GGGAGTAGACAGGGAGACAGGACACAGATCCAGGTGTTGGTGTCTGAGACGGCTCCCGAC
GCGTGGGTCAACGTCCACGACATACTGCACTTCCTGCCCTCAGAACCTGGTATTGTGAGG
TTCATCTGGGCTTCAGAGGAAACCGGACACCTGCACCTGTATCTCATCACCTGCGCTGTC
AACGGACAAAGGGCTATGACAGTAACTGATATAATGGCTGAGGATGAGTCAAATGCTGCA
GTCCCTCGGGTGATCAGCAAGGAACCCCTCACCGATGGGGACTGGGAGGTCATGGGAAGA
AAGATATGGGTGGACGAGCCGCGCGGTCTGGTGTATTTCGTAGGGCTCCGTGAGACGCCG
CTGGAGCGCCACCTGTACGTGGTGTCAATGTCCGCGCCCAGGCAGGTCGTCCTGCTCACT
AAGCCGGGACATTCACACAGTGTTGACATGGACGAGTCACCGGAACCTCGTTCGTTCAAC
GGTTCCTGGGACTGTCGTCCTGATGAGGAGGAGTCGCCCAGCACCCGCCCTCCCCCGGTG
CCCCCTCCACAGATACTATCGACTCGTCTGTCTTGCGGAGCCCTAGCATACTGCACACTA
TGGCGGAGCGCCGTCCCAGGGCGAAGGCCGACCGTCTTACACGTTTACGGAGGGCCCGAG
GTTCAAACGGTCACTAATAGTTACAAGGGTGTACGACAGTTGAGAATGCATATGCTGGCT
GCCCGAGGGTTCACAGTGGTGTCCGTGGACTCGAGGGGGTCTAAACACAGAGGGAGGTTG
TGGGAAGCAGCTATCAAAGGAAAGATGGGACAAGTGGAGCTGGACGATCAGGTGGAAGTT
CTCCAATGGCTGGCGAAAGAAACTGGCTGCATTGATATGGATCGAGTCGCTATACACGGG
TGGAGTTATGGTGGTTATCTGTCACTGCTGGGGCTGGCGACCCGTCCTAATACCTTCAAG
GTGTGCGTGGCGGGTGCTCCGGTGACGTGCTGGAGGCTCTACGACACGGCCTACACGGAG
CGCTACATGGGACTCCCGGCCTGCGCCCCTCATTCCTACAGCCGAGCCAGTGTGTTGGCC
CACGCTCCCTTCTTCCCTGATAGGGAGGGCCGTCTCCTTATAATCCACGGTTTAGCCGAC
GAGAACGTCCACTTCTGTCACACGGCTGCTCTCCTGGCCGAGCTGGTGAGGCTCGGGAAG
CCTCACAGAGTTCAGGTTTACCCGGGTGAGAGGCATTCGCTGCGAGCTATGCACGCGGCT
AAGCATTACGAGGCGACACTGCTGCACTTCCTACACGAGAACCTGTAG
Protein sequence:
MAINALTFSMREEAFAQQIVYEEVDEGEVKIYSFPSSQSSSGEVEEFRFPRAGTPNAKSV
LKMVTFRLQKAPPTTVLDYYQEGNSNTVASESPGNSSDPLEVVDVRWYELRHSLKEVFPW
FEYLARVGWTPCSQYVWVQVLDRKQQRLELALVPVSEFNVPVRYEQGSDGGRLDEESPAS
GSRQGDRTQIQVLVSETAPDAWVNVHDILHFLPSEPGIVRFIWASEETGHLHLYLITCAV
NGQRAMTVTDIMAEDESNAAVPRVISKEPLTDGDWEVMGRKIWVDEPRGLVYFVGLRETP
LERHLYVVSMSAPRQVVLLTKPGHSHSVDMDESPEPRSFNGSWDCRPDEEESPSTRPPPV
PPPQILSTRLSCGALAYCTLWRSAVPGRRPTVLHVYGGPEVQTVTNSYKGVRQLRMHMLA
ARGFTVVSVDSRGSKHRGRLWEAAIKGKMGQVELDDQVEVLQWLAKETGCIDMDRVAIHG
WSYGGYLSLLGLATRPNTFKVCVAGAPVTCWRLYDTAYTERYMGLPACAPHSYSRASVLA
HAPFFPDREGRLLIIHGLADENVHFCHTAALLAELVRLGKPHRVQVYPGERHSLRAMHAA
KHYEATLLHFLHENL