New model in OGS2.0 | DPOGS203390  |
---|---|
Genomic Position | scaffold6:+ 508131-528776 |
See gene structure | |
CDS Length | 1314 |
Paired RNAseq reads   | 403 |
Single RNAseq reads   | 982 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA011789 (8e-106) |
Best Drosophila hit   | CG42400 (7e-115) |
Best Human hit | dipeptidase 2 precursor (6e-86) |
Best NR hit (blastp)   | PREDICTED: similar to CG6154-PA, isoform A [Apis mellifera] (1e-145) |
Best NR hit (blastx)   | PREDICTED: similar to CG6154-PA, isoform A [Apis mellifera] (8e-130) |
GeneOntology terms    | GO:0016805 dipeptidase activity GO:0006508 proteolysis GO:0008235 metalloexopeptidase activity GO:0008239 dipeptidyl-peptidase activity |
InterPro families    | IPR000180 Peptidase M19, renal dipeptidase, active site IPR008257 Peptidase M19, renal dipeptidase |
Orthology group | MCL10515 |
Nucleotide sequence:
ATGTCACTCGCGAATAACGGCGTTCGCAAACACGACCACGTTCGTCCGCCCAGGAGGATT
AAGTGGCCAAATTTCTTAGCGGGAGTCAGGATGGTGAATCGCAATGAGAGACGTGCTGTG
GCCTGCATATTGCTGACGGTGATCGCCGTCGTTGCGGCCGCTTCATATGATCGCGAACGA
CTTGAGATTGCGAAGCAAATTCTCGAGGAAGTACCTCTCACCGACGGACACAACGATTTG
CCCTGGAACATCCGGAAGTTTCTTCGCAATCAAATCAACGATTTCGAACTGGACACCGAT
TTAACCCAAGTGGAACCTTGGTCCAAATCAAAATATTCACATACCGATCTTCCTAGACTT
AGACAGGGCATGGTCGGAGCTCAGTTCTGGTCAGCCTTCGTACCGTGTGCAGCTCAAAAT
AAGGACGCTGTTCAGTTGACCCTTGAACAGATCGATGTCATTCGTCGCCTGGTAGCCAAA
TATCCTCACCAGTTTCAACTCGCTACGTCTGTTAGTGATATCCTCGAAGCTCATAGTGCT
AGACCTCGTAAAATCGCTTCTTTGATCGGCATTGAAGGTGGACACTCTATTGGCAACTCC
TTAGGCATTCTTCGCAGCTACTATCAACTCGGAGTACGCTACATGACTCTAACCCATACA
TGCAACACTCCATGGGCTGATTCTGCCAACGAAGCACCAGTCGCTAACGGACTCACGGAA
TTTGGAGAGAAAGTTGTCCGTGAGATGAACCGTCTTGGCATGCTGATTGATTTATCTCAC
GTGGGAGAGAACACTACTAGAGCAGCCATACGTCTCTCGAAAGCACCGGTCATTTTCAGT
CATTCTTCAGTCTACAGTTTATGTCCTCACAAACGAAATGTCCCCGATGACATCATACAA
TCCCTGAAAGTTAATGGTGGAATTATCATGGTTAACTTTTTTCCTGATTTTGTGAAATGT
GCGCCAAACGCTACCATATCCGATGTTGCTGAACATTTCCATTACCTGAAGAGGATGATC
GGAGCTGATTATGTTGGAGTTGGCGGTGACTTCGACGGCGTTAATAGAGTTCCCCGCGGC
TTGGAAGACGTTTCCAAATATCCCGAATTGTTTGCTGAATTACTGCGAAGTGGTCAGTGG
AGTGTTCAGGAACTGAAGAACCTTGCCGGCTTGAATATACTACGAGTTATGCGCCAAGTT
GAAAAGATCCGTGACGACATGCGAACCAATGGCTCCGAGCCTGAGGAACACCCCGATTCT
CCTAACGACAACGGCAGCTGCACCAGCAATGCTTTCTATTCAGACGACGTTTAA
Protein sequence:
MSLANNGVRKHDHVRPPRRIKWPNFLAGVRMVNRNERRAVACILLTVIAVVAAASYDRER
LEIAKQILEEVPLTDGHNDLPWNIRKFLRNQINDFELDTDLTQVEPWSKSKYSHTDLPRL
RQGMVGAQFWSAFVPCAAQNKDAVQLTLEQIDVIRRLVAKYPHQFQLATSVSDILEAHSA
RPRKIASLIGIEGGHSIGNSLGILRSYYQLGVRYMTLTHTCNTPWADSANEAPVANGLTE
FGEKVVREMNRLGMLIDLSHVGENTTRAAIRLSKAPVIFSHSSVYSLCPHKRNVPDDIIQ
SLKVNGGIIMVNFFPDFVKCAPNATISDVAEHFHYLKRMIGADYVGVGGDFDGVNRVPRG
LEDVSKYPELFAELLRSGQWSVQELKNLAGLNILRVMRQVEKIRDDMRTNGSEPEEHPDS
PNDNGSCTSNAFYSDDV