DPGLEAN08811 in OGS1.0

New model in OGS2.0DPOGS203390 
Genomic Positionscaffold6:+ 508131-528776
See gene structure
CDS Length1314
Paired RNAseq reads  403
Single RNAseq reads  982
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011789 (8e-106)
Best Drosophila hit  CG42400 (7e-115)
Best Human hitdipeptidase 2 precursor (6e-86)
Best NR hit (blastp)  PREDICTED: similar to CG6154-PA, isoform A [Apis mellifera] (1e-145)
Best NR hit (blastx)  PREDICTED: similar to CG6154-PA, isoform A [Apis mellifera] (8e-130)
GeneOntology terms


  
GO:0016805 dipeptidase activity
GO:0006508 proteolysis
GO:0008235 metalloexopeptidase activity
GO:0008239 dipeptidyl-peptidase activity
InterPro families
  
IPR000180 Peptidase M19, renal dipeptidase, active site
IPR008257 Peptidase M19, renal dipeptidase
Orthology groupMCL10515

Nucleotide sequence:

ATGTCACTCGCGAATAACGGCGTTCGCAAACACGACCACGTTCGTCCGCCCAGGAGGATT
AAGTGGCCAAATTTCTTAGCGGGAGTCAGGATGGTGAATCGCAATGAGAGACGTGCTGTG
GCCTGCATATTGCTGACGGTGATCGCCGTCGTTGCGGCCGCTTCATATGATCGCGAACGA
CTTGAGATTGCGAAGCAAATTCTCGAGGAAGTACCTCTCACCGACGGACACAACGATTTG
CCCTGGAACATCCGGAAGTTTCTTCGCAATCAAATCAACGATTTCGAACTGGACACCGAT
TTAACCCAAGTGGAACCTTGGTCCAAATCAAAATATTCACATACCGATCTTCCTAGACTT
AGACAGGGCATGGTCGGAGCTCAGTTCTGGTCAGCCTTCGTACCGTGTGCAGCTCAAAAT
AAGGACGCTGTTCAGTTGACCCTTGAACAGATCGATGTCATTCGTCGCCTGGTAGCCAAA
TATCCTCACCAGTTTCAACTCGCTACGTCTGTTAGTGATATCCTCGAAGCTCATAGTGCT
AGACCTCGTAAAATCGCTTCTTTGATCGGCATTGAAGGTGGACACTCTATTGGCAACTCC
TTAGGCATTCTTCGCAGCTACTATCAACTCGGAGTACGCTACATGACTCTAACCCATACA
TGCAACACTCCATGGGCTGATTCTGCCAACGAAGCACCAGTCGCTAACGGACTCACGGAA
TTTGGAGAGAAAGTTGTCCGTGAGATGAACCGTCTTGGCATGCTGATTGATTTATCTCAC
GTGGGAGAGAACACTACTAGAGCAGCCATACGTCTCTCGAAAGCACCGGTCATTTTCAGT
CATTCTTCAGTCTACAGTTTATGTCCTCACAAACGAAATGTCCCCGATGACATCATACAA
TCCCTGAAAGTTAATGGTGGAATTATCATGGTTAACTTTTTTCCTGATTTTGTGAAATGT
GCGCCAAACGCTACCATATCCGATGTTGCTGAACATTTCCATTACCTGAAGAGGATGATC
GGAGCTGATTATGTTGGAGTTGGCGGTGACTTCGACGGCGTTAATAGAGTTCCCCGCGGC
TTGGAAGACGTTTCCAAATATCCCGAATTGTTTGCTGAATTACTGCGAAGTGGTCAGTGG
AGTGTTCAGGAACTGAAGAACCTTGCCGGCTTGAATATACTACGAGTTATGCGCCAAGTT
GAAAAGATCCGTGACGACATGCGAACCAATGGCTCCGAGCCTGAGGAACACCCCGATTCT
CCTAACGACAACGGCAGCTGCACCAGCAATGCTTTCTATTCAGACGACGTTTAA

Protein sequence:

MSLANNGVRKHDHVRPPRRIKWPNFLAGVRMVNRNERRAVACILLTVIAVVAAASYDRER
LEIAKQILEEVPLTDGHNDLPWNIRKFLRNQINDFELDTDLTQVEPWSKSKYSHTDLPRL
RQGMVGAQFWSAFVPCAAQNKDAVQLTLEQIDVIRRLVAKYPHQFQLATSVSDILEAHSA
RPRKIASLIGIEGGHSIGNSLGILRSYYQLGVRYMTLTHTCNTPWADSANEAPVANGLTE
FGEKVVREMNRLGMLIDLSHVGENTTRAAIRLSKAPVIFSHSSVYSLCPHKRNVPDDIIQ
SLKVNGGIIMVNFFPDFVKCAPNATISDVAEHFHYLKRMIGADYVGVGGDFDGVNRVPRG
LEDVSKYPELFAELLRSGQWSVQELKNLAGLNILRVMRQVEKIRDDMRTNGSEPEEHPDS
PNDNGSCTSNAFYSDDV