DPGLEAN03031 in OGS1.0

New model in OGS2.0  DPOGS206546
Genomic Position  scaffold74:+ 70099-81695
CDS Length  2649
Paired RNAseq reads  12
Single RNAseq reads  30
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA014035 (3e-133)
Best Drosophila hit  CG42400 (2e-139)
Best Human hit  dipeptidase 1 precursor (1e-85)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC014432 [Tribolium castaneum] (2e-166)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC014432 [Tribolium castaneum] (9e-153)
GeneOntology terms
GO:0016805 dipeptidase activity
GO:0006508 proteolysis
GO:0008235 metalloexopeptidase activity
GO:0008239 dipeptidyl-peptidase activity
InterPro families
IPR000180 Peptidase M19, renal dipeptidase, active site
IPR008257 Peptidase M19, renal dipeptidase
Orthology group  MCL16798
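
The genomic position and CDS length listed above can be compared with a little arithmetic. The sketch below is illustrative only: the helper name `parse_position` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record does not state. Under that assumption the gene model spans more bases than the CDS contains, and the difference would correspond to non-coding sequence (introns and possibly untranslated regions) inside the model.

```python
import re

def parse_position(pos):
    """Split a 'scaffold:strand start-end' string into its parts.

    Hypothetical helper; the field layout is inferred from this record only.
    """
    m = re.fullmatch(r"(\S+):([+-])\s*(\d+)-(\d+)", pos.strip())
    if m is None:
        raise ValueError(f"unrecognised position string: {pos!r}")
    scaffold, strand, start, end = m.groups()
    return scaffold, strand, int(start), int(end)

scaffold, strand, start, end = parse_position("scaffold74:+ 70099-81695")

# Assumption: coordinates are 1-based and inclusive.
genomic_span = end - start + 1
cds_length = 2649  # value of the "CDS Length" field

non_coding = genomic_span - cds_length
print(scaffold, strand, genomic_span, non_coding)
# prints: scaffold74 + 11597 8948
```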

Nucleotide sequence:

ATGATTCAAGTGGATGGTTTCAAACGCTCGTCTCCGTCACCATCGCGATGCTCAGTTCGG
AGTGACAGTCGGGTCTTCAGACAGCCTCGCGTCATTCAAGTTAACCGAGACGCAGCTACA
ATGATGGACGATGAAGCCAGTGTCAGCGGCATGGGGAATGTCACTGAACCTCCTTGCTCC
TCAGATTTCGAATCTAACGTTAATCAGTCAACACATTTTCGATCACGGACATTTAGTGAC
GGCTTCATTAGAAACACAAGCCTCGAATTGTTACTTGGTGTTGATTGCGAGGCCTGTCTG
GCGCTCGCTGCACAACAACATTGCATCAATGAACCTATAACGGAACAGGATTTTGATTTG
GTATGCAATTGTAATGTTCGCCAACATAATTACTTGAACTATTACCGTGACTCCGTCAAA
CAGAAGGGGCATACTCTGAATATTTATGATATAGAAAGAATTAAGAGAGACACGGGAAGA
AGTAGTCTAGCTGTCGGTTCACATCGTAGTCTTGATAGAAAACATTACAAAAGTAAAACC
ATAGGCGTTTCAAGCTACATATCCAAGCAGTTAATATCATATGACCCGACTAATACACTT
GTACAAAAACTCAAAAGAAAATTGTCTAATCTGAAAAAAGTGAGGGGTGAAAATATTGTG
AAAAAACCACATAATGGCCTTCAACTTGCTGAAATGGCATCCACTTCTACTAGCTCCCTA
AAAAGGCTATCAATGTTATCACTTGTTGATAAATCGCAAAAAAAAATGGACAATCAAAAT
TGCATTTTTTCTTCAAACGGTGTGCAATCGAAAGATGATTTCAAAAACAATGTTACCACA
TATGAAAATTTTTCTTCCAATATTGACCAATGCTATGACCGGCTTCGTACATCCAACATC
GATACATCACAAAAGAACGCCAAAGGAAGAAATTTTGATATCAACGACGAACTAATTCCG
AGGAACCCGCCTTTGGTGAAATATTCAACACTTGATAATCGAAGCAGGTGTCGAAGTATA
AAGAGACAGCTCTCGTGTAACGATTTTCCCAGCGACAGGTGTAGGGAACGTGTGCCCGAG
ACATGGTCGCAAGACGAGAACTCTTTTGGTTTGAGAAATTCGAGGAATGAAGTAGATGAC
GAACAGACATTCATAGAAGCTCACAAAGTATTAAGAGGTGCTCCAAGAAGTGCTCATGCA
CATTGTACCTGTAACTCCATTCAAGACAATAGAATACCGTCTATATTTTATGACACCCAT
GGTAAAAAGCGTTCAGCCCCCGCCCCTGACGTTACGGATCGCTCCCAGGGAAGATCTTGC
AGCGACTGCAAACCTGCACAGTGGTACCCCCCGAGCACATCCGCGACCAGCCACAGTTCT
ACGGTTACTGCTAATTCATCACGTCTGACAGCATGGCACCGTCGATGGTGTTGCGTGGCT
ATACTTGTGTTGGTAGCGGGCACAGCCTGCGTTGCCGGACCGCTGGCTCTTAGGGCTCCA
CCTGGTGCACCCCTACACGAAAGACTCCGCCTCGCAGAAAGACTACTGCACGACACACCA
CTTATTGATGGACATAATGATTTACCTTGGAATATACGAAAATTTTTACACAATAAAATC
AAAGACTTTAGATTTGATGAAGACCTTCGAACTATATCTCCCTGGGCTACGAGCTCGTGG
AGTCATACAGATTTACTTCGTCTTAAGCATGGAAGAGTAGCCGCTCAGTTTTGGGCCGCA
TACGTGCCTTGCGACGCGCAACATCGGGATGCAGTGCAATTGACCTTCGAACAAATAGAC
CTAATCCAGAGACTCACAGACAAGTACCATCCACAACTAACATTCTGTACCTCTGCCGAC
GATATATTATCGGCTCACGTAAACCACCGGCTGTGCTCACTGGTGGGTGTGGAAGGTGGG
CATGCAATTGGAGGTTCCTTAGGTGTACTAAGGACGTTGTATCAAGTTGGAGTTCGGTAT
CTAACTCTAACTTCGACTTGCGATACGCCTTGGGCTGAATGTGCTTCCACCGATCGACCT
GAATCCGCACAAAGGGGAGGATTAACGCCTTTTGGTAAAGTGGTGGTTAAAGAAATGAAT
AGATTGGGCATGCTGGTTGATCTATCACATGTTTCTGAGCGAACCATGCGGGATGCCCTT
TCGGTTTCACGAGCGCCAGTGCTTTTCTCACATTCCTCGGCCCGAGCGCTTTGTAACGTA
ACTCGAAATGTACCAGACAGCGTGCTTCGACTCTTAGCAGCTAATAAAGGACTGATAATG
GTCAACTTCTACACTTCTTTTCTCACTTGTAGAGATACGGCTACCGTTCAGGATGCTATA
GAACACATAAACCATATCCGCGACATCGCTGGTGTCGACAGCGTTGGCTTAGGAGCAGGA
TACGATGGAATAAATTACACACCTCATGGGTTAGAAGATGTCTCGTCATATCCATTATTA
TTTGCTGAACTGATGGAAGACGGATGGAGCATAGAAGATTTGAAAAAATTGGCTGGCCTG
AATTTATTACGTGTAATGAACGCAGCAGAACGTGTATCTAGAGAATTATCATCAGCCCAT
GTCACTCCTTACGAAGAAGTTGGACCCAGAGTGTTAGACTCGCACAATTGTTCCAGTCAG
GACGTTTAA

Protein sequence:

MIQVDGFKRSSPSPSRCSVRSDSRVFRQPRVIQVNRDAATMMDDEASVSGMGNVTEPPCS
SDFESNVNQSTHFRSRTFSDGFIRNTSLELLLGVDCEACLALAAQQHCINEPITEQDFDL
VCNCNVRQHNYLNYYRDSVKQKGHTLNIYDIERIKRDTGRSSLAVGSHRSLDRKHYKSKT
IGVSSYISKQLISYDPTNTLVQKLKRKLSNLKKVRGENIVKKPHNGLQLAEMASTSTSSL
KRLSMLSLVDKSQKKMDNQNCIFSSNGVQSKDDFKNNVTTYENFSSNIDQCYDRLRTSNI
DTSQKNAKGRNFDINDELIPRNPPLVKYSTLDNRSRCRSIKRQLSCNDFPSDRCRERVPE
TWSQDENSFGLRNSRNEVDDEQTFIEAHKVLRGAPRSAHAHCTCNSIQDNRIPSIFYDTH
GKKRSAPAPDVTDRSQGRSCSDCKPAQWYPPSTSATSHSSTVTANSSRLTAWHRRWCCVA
ILVLVAGTACVAGPLALRAPPGAPLHERLRLAERLLHDTPLIDGHNDLPWNIRKFLHNKI
KDFRFDEDLRTISPWATSSWSHTDLLRLKHGRVAAQFWAAYVPCDAQHRDAVQLTFEQID
LIQRLTDKYHPQLTFCTSADDILSAHVNHRLCSLVGVEGGHAIGGSLGVLRTLYQVGVRY
LTLTSTCDTPWAECASTDRPESAQRGGLTPFGKVVVKEMNRLGMLVDLSHVSERTMRDAL
SVSRAPVLFSHSSARALCNVTRNVPDSVLRLLAANKGLIMVNFYTSFLTCRDTATVQDAI
EHINHIRDIAGVDSVGLGAGYDGINYTPHGLEDVSSYPLLFAELMEDGWSIEDLKKLAGL
NLLRVMNAAERVSRELSSAHVTPYEEVGPRVLDSHNCSSQDV
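
The nucleotide and protein sequences above can be cross-checked by translating the CDS. The sketch below is a minimal example that assumes the standard genetic code (NCBI translation table 1); the function names are hypothetical and not part of any database tooling. It translates only the first and last codons of the CDS shown above and checks that the CDS length is consistent with the protein length.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Codons containing characters outside T/C/A/G are reported as 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def expected_protein_length(cds_length):
    """Residue count for a CDS that ends in exactly one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1

print(translate("ATGATTCAAGTGGATGGT"))  # first 18 nt of the CDS -> MIQVDG
print(translate("GACGTTTAA"))           # last 9 nt of the CDS  -> DV
print(expected_protein_length(2649))    # -> 882
```

The two translated fragments match the start (MIQVDG) and end (DV) of the protein sequence listed above, and 882 residues agrees with the length of that sequence.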