DPGLEAN02737 in OGS1.0

New model in OGS2.0DPOGS213621 
Genomic Positionscaffold1176:+ 47136-59506
See gene structure
CDS Length1428
Paired RNAseq reads  188
Single RNAseq reads  577
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011789 (0.0)
Best Drosophila hit  CG6154, isoform C (2e-116)
Best Human hitdipeptidase 1 precursor (2e-79)
Best NR hit (blastp)  PREDICTED: similar to CG6154-PA, isoform A [Apis mellifera] (9e-150)
Best NR hit (blastx)  PREDICTED: similar to CG6154-PA, isoform A [Apis mellifera] (8e-144)
GeneOntology terms


  
GO:0016805 dipeptidase activity
GO:0006508 proteolysis
GO:0008235 metalloexopeptidase activity
GO:0008239 dipeptidyl-peptidase activity
InterPro families  IPR008257 Peptidase M19, renal dipeptidase
Orthology groupMCL10515

Nucleotide sequence:

ATGGGCTACGGGAACTACATGGACTACCAAGTGGCACCCGCTCACGCCTGCCTGTGCCGC
GCGATCACGTCTCCTCACGCCCACGCGGCTGTACCAGAGGGCATGACATTTCCTGGTTCG
TTACCAGATGTGGCAGAAAGATGCGGTCGCTGGGCGCCAGATTCATTATCAGCATCCGCA
TCTGGGAGCTCTGACGAAAAGCCAACGCCGCGTCGACCGCGGTGGCGTATCGCACTCGCT
GCTCTCGTTGTATTGGCCGCACTCGGCGCTGCACTAGCCCTCCCCCTCGCCCTGGGAGGT
GGCGGTCGTGCTACCCCTGAGCAGCGATTAACAACTATACGGAGAATGTTACGTGATTCT
CCACTTATAGACGGTCATAATGACCTAGCGTGGAATGTTAGAAAATTTCTTCATAATAAA
ATAGGTGACTTTAATTTGAGTGCTGGCTTGGAGGGCTTAGAGCCGTGGGCCCGATCGCGC
TGGTCGCACACGGATATTCCGCGGCTACGACTCGGTCAGATTGGGGCACAGTTTTGGTCA
GCATACGTTCCATGTGGTGCGCGAGATAAAGACGCTGTGCAGTTGGCCATTGAACAAATG
GATGTCATCAAACGGATAGTTGATATGAACGCAGCTCATCTTGCTTTAGTTACCGGCGCG
TCGGACCTCTTAGATGCCCATCGAGATGGTCGGATTGCTTCTTTGATTGGAGTTGAAGGT
GGCCATGCACTCGGTGATTCCCTAGCGGTGCTTCGCGCGTTTTATAACCTAGGTGCTCGA
TATCTTACTGTTACTCATACATGTGATACGCGATGGGCGCGTGCCGCTGGCACCTCTGGT
GGGCTCACTGAGTTCGGACGCGCCGTTGTTCGAGAAATGAATCGTCTTGGTATGATAGTT
GATCTGTCGCATGCAGGTGAAGAGACAGCCCGCGACGCTCTTGAAACTTCACAGGCACCC
GTAGTATTCTCTCATTCTGGAGCTGCAGCAATATGTAATTCGTCTAGAAATGTGCCAGAT
GATCTGCTTCGCATGATCGCTGCGAATGGTGGCGTAGTTATGATTAATTTCTATGCTAAA
CTTGTAACATGCAGCGAGCGAGCGACAATCGAAGATGTTATTGCACACATAAACCACGTG
CGAAGGGTAGCCGGAGTGGAGCACGTAGGTCTGGGAGCTGGTTATGATGGTATAGACGCA
CCGCCCGTGGGTTTAGAGGATGTTTCACGTTACCCCCACCTGTTAGCCGAGTTACTTCGC
GATCCGGATTGGAGTGAAGAAGACGTTCGTAAGCTGGCCGGTATGAATGTTGTCCGCGTA
CTGCAGCACGTGGAGCGCGTGCGAGATCAATGGAAACGTGCCGCCGTTTTTCCTGGCGAA
GAAACACCTGGTGCGCGACGCAGCGAGTGCGTGTACGGAACCGCGTGA

Protein sequence:

MGYGNYMDYQVAPAHACLCRAITSPHAHAAVPEGMTFPGSLPDVAERCGRWAPDSLSASA
SGSSDEKPTPRRPRWRIALAALVVLAALGAALALPLALGGGGRATPEQRLTTIRRMLRDS
PLIDGHNDLAWNVRKFLHNKIGDFNLSAGLEGLEPWARSRWSHTDIPRLRLGQIGAQFWS
AYVPCGARDKDAVQLAIEQMDVIKRIVDMNAAHLALVTGASDLLDAHRDGRIASLIGVEG
GHALGDSLAVLRAFYNLGARYLTVTHTCDTRWARAAGTSGGLTEFGRAVVREMNRLGMIV
DLSHAGEETARDALETSQAPVVFSHSGAAAICNSSRNVPDDLLRMIAANGGVVMINFYAK
LVTCSERATIEDVIAHINHVRRVAGVEHVGLGAGYDGIDAPPVGLEDVSRYPHLLAELLR
DPDWSEEDVRKLAGMNVVRVLQHVERVRDQWKRAAVFPGEETPGARRSECVYGTA