New model in OGS2.0 | DPOGS213621  |
---|---|
Genomic Position | scaffold1176:+ 47136-59506 |
CDS Length | 1428 |
Paired RNAseq reads   | 188 |
Single RNAseq reads   | 577 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA011789 (0.0) |
Best Drosophila hit   | CG6154, isoform C (2e-116) |
Best Human hit | dipeptidase 1 precursor (2e-79) |
Best NR hit (blastp)   | PREDICTED: similar to CG6154-PA, isoform A [Apis mellifera] (9e-150) |
Best NR hit (blastx)   | PREDICTED: similar to CG6154-PA, isoform A [Apis mellifera] (8e-144) |
GeneOntology terms    | GO:0016805 dipeptidase activity; GO:0006508 proteolysis; GO:0008235 metalloexopeptidase activity; GO:0008239 dipeptidyl-peptidase activity |
InterPro families   | IPR008257 Peptidase M19, renal dipeptidase |
Orthology group | MCL10515 |
Nucleotide sequence:
ATGGGCTACGGGAACTACATGGACTACCAAGTGGCACCCGCTCACGCCTGCCTGTGCCGC
GCGATCACGTCTCCTCACGCCCACGCGGCTGTACCAGAGGGCATGACATTTCCTGGTTCG
TTACCAGATGTGGCAGAAAGATGCGGTCGCTGGGCGCCAGATTCATTATCAGCATCCGCA
TCTGGGAGCTCTGACGAAAAGCCAACGCCGCGTCGACCGCGGTGGCGTATCGCACTCGCT
GCTCTCGTTGTATTGGCCGCACTCGGCGCTGCACTAGCCCTCCCCCTCGCCCTGGGAGGT
GGCGGTCGTGCTACCCCTGAGCAGCGATTAACAACTATACGGAGAATGTTACGTGATTCT
CCACTTATAGACGGTCATAATGACCTAGCGTGGAATGTTAGAAAATTTCTTCATAATAAA
ATAGGTGACTTTAATTTGAGTGCTGGCTTGGAGGGCTTAGAGCCGTGGGCCCGATCGCGC
TGGTCGCACACGGATATTCCGCGGCTACGACTCGGTCAGATTGGGGCACAGTTTTGGTCA
GCATACGTTCCATGTGGTGCGCGAGATAAAGACGCTGTGCAGTTGGCCATTGAACAAATG
GATGTCATCAAACGGATAGTTGATATGAACGCAGCTCATCTTGCTTTAGTTACCGGCGCG
TCGGACCTCTTAGATGCCCATCGAGATGGTCGGATTGCTTCTTTGATTGGAGTTGAAGGT
GGCCATGCACTCGGTGATTCCCTAGCGGTGCTTCGCGCGTTTTATAACCTAGGTGCTCGA
TATCTTACTGTTACTCATACATGTGATACGCGATGGGCGCGTGCCGCTGGCACCTCTGGT
GGGCTCACTGAGTTCGGACGCGCCGTTGTTCGAGAAATGAATCGTCTTGGTATGATAGTT
GATCTGTCGCATGCAGGTGAAGAGACAGCCCGCGACGCTCTTGAAACTTCACAGGCACCC
GTAGTATTCTCTCATTCTGGAGCTGCAGCAATATGTAATTCGTCTAGAAATGTGCCAGAT
GATCTGCTTCGCATGATCGCTGCGAATGGTGGCGTAGTTATGATTAATTTCTATGCTAAA
CTTGTAACATGCAGCGAGCGAGCGACAATCGAAGATGTTATTGCACACATAAACCACGTG
CGAAGGGTAGCCGGAGTGGAGCACGTAGGTCTGGGAGCTGGTTATGATGGTATAGACGCA
CCGCCCGTGGGTTTAGAGGATGTTTCACGTTACCCCCACCTGTTAGCCGAGTTACTTCGC
GATCCGGATTGGAGTGAAGAAGACGTTCGTAAGCTGGCCGGTATGAATGTTGTCCGCGTA
CTGCAGCACGTGGAGCGCGTGCGAGATCAATGGAAACGTGCCGCCGTTTTTCCTGGCGAA
GAAACACCTGGTGCGCGACGCAGCGAGTGCGTGTACGGAACCGCGTGA
Protein sequence:
MGYGNYMDYQVAPAHACLCRAITSPHAHAAVPEGMTFPGSLPDVAERCGRWAPDSLSASA
SGSSDEKPTPRRPRWRIALAALVVLAALGAALALPLALGGGGRATPEQRLTTIRRMLRDS
PLIDGHNDLAWNVRKFLHNKIGDFNLSAGLEGLEPWARSRWSHTDIPRLRLGQIGAQFWS
AYVPCGARDKDAVQLAIEQMDVIKRIVDMNAAHLALVTGASDLLDAHRDGRIASLIGVEG
GHALGDSLAVLRAFYNLGARYLTVTHTCDTRWARAAGTSGGLTEFGRAVVREMNRLGMIV
DLSHAGEETARDALETSQAPVVFSHSGAAAICNSSRNVPDDLLRMIAANGGVVMINFYAK
LVTCSERATIEDVIAHINHVRRVAGVEHVGLGAGYDGIDAPPVGLEDVSRYPHLLAELLR
DPDWSEEDVRKLAGMNVVRVLQHVERVRDQWKRAAVFPGEETPGARRSECVYGTA
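
The protein sequence above appears to be the direct translation of the nucleotide sequence. The following is a minimal sketch for checking that, assuming the standard genetic code (NCBI translation table 1); the record itself does not state which table was used, and the `translate` helper and variable names are illustrative, not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order. Assumption: the record does not name its table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGGCTACGGGAACTACATGGACTACCAAGTGGCACCCGCTCACGCCTGCCTGTGCCGC"
last_line = "GAAACACCTGGTGCGCGACGCAGCGAGTGCGTGTACGGAACCGCGTGA"

print(translate(first_line))  # MGYGNYMDYQVAPAHACLCR
print(translate(last_line))   # ETPGARRSECVYGTA*
```

Both fragments agree with the start and end of the protein sequence. The lengths are also consistent: 1428 nt is 476 codons, and dropping the terminal TGA stop codon leaves the 475 residues listed.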