DPGLEAN19089 in OGS1.0

New model in OGS2.0  DPOGS206688
Genomic Position  scaffold1413:+ 8716-14808
CDS Length  1365
Paired RNAseq reads  1861
Single RNAseq reads  5325
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA008521 (0.0)
Best Drosophila hit  dipeptidase C (1e-132)
Best Human hit  xaa-Pro dipeptidase isoform 1 (2e-113)
Best NR hit (blastp)  xaa-pro dipeptidase [Culex quinquefasciatus] (6e-162)
Best NR hit (blastx)  PREDICTED: similar to xaa-pro dipeptidase pepd/pepq(e.coli) [Nasonia vitripennis] (1e-141)
GeneOntology terms
GO:0006508 proteolysis
GO:0008239 dipeptidyl-peptidase activity
GO:0016805 dipeptidase activity
GO:0008235 metalloexopeptidase activity
GO:0009987 cellular process
GO:0030145 manganese ion binding
GO:0004177 aminopeptidase activity
InterPro families
IPR001131 Peptidase M24B, X-Pro dipeptidase/aminopeptidase P, conserved site
IPR000994 Peptidase M24, structural domain
IPR007865 Peptidase M24B, X-Pro dipeptidase/aminopeptidase P N-terminal
Orthology group  MCL11155

Nucleotide sequence:

ATGGCTGGTGTTTGGTCTATGGGTCCTGGTACATATGAAGTTCCATTGTCCTTGTTTGCT
AAGAATAGAGATAGACTTGCAGAAAAGTTGAAGAGTGGCCAAGTAGTTGTTCTGCAAGGT
GGAGATGATATAAATCTCTATGATACTGACATCCAATATGTCTTCCGACAGGAAGCATAT
TTTACATGGGCTTGTGGCGTACGAGAGCCAGGCTGCTATTTTGCTCTTGATGTAAAAACC
AAGAAAAGCATTGTCTTTGTGCCTCGTCTGCCAGATGAGTATGAAATTTGGATGGGCAAA
CTACTTAGTTGTCAAGATTACACCAATATGTATGGGGTTGATGAAGTCCGCTATGTTGAT
GAGATCTGTGATGTATTAAAATCACTCGAACCTGATACTTTGCTAACACTTGTAATGGAC
AATGAAACACTATTTCCGATTATTGCTGAACTGCGCGTCATCAAAACGCCAGAGGAGATA
GAAGTAATGCGTTACATATGCAAAGTATCGTCCGATGCTCACAAACAGGTTATGCTCTAC
GCTAAGCCCGGCCTTCTGGAGTATCAATGCGAATCAGTATTCCTCGATCATTGTTACCGT
GTGGGCGGGTGTCGCCACGTGTCCTATACATGTATATGCGGCTCGGGTGACAATTCTGCC
ATTTTGCACTACGGACACGCCGCAGCTCCGAATAATAAGATGTTAAAGGATGGGGATATA
TGTTTATTCGACATGGGTGGCAACTATGCTGGGTACGCCGCAGACATCACATGCTCTTTC
CCTGCTAATGGAAAGTTCACTGAAGATCAGAAGCTCATATATGAAGCTGTGCTCGCTGCA
AGAGATGCGGTTATTAGACAAGGAAAACCGGGAGTCAAATGGACGGACATGCATCTAGCT
GCGAATAGAGCCATGTTGGAACATCTCAAGAGAGGTGGACTCTTGAAGGGAGAAGTGGAG
AAAATGATTGCGTTTGGTGTGAATGGCATCCTTCAACCTCATGGCCTCGGTCACTTGTTG
GGTCTAGATGTGCATGATGTAGGGGGTTACCTCAAGCACTGCCCTCCCAGACCCAGCGGG
CCCCTTGGAAGACTAAGAACTGCTCGGATCTTGGAAGCCGGCATGATCCTCACTATTGAA
CCCGGATGTTACTTCATACCAAAGTTGTTGGATGCAGCTAAACGTACCCAGAAACTAGCG
CAGTTCTTTAACTGGGATGTAATGGATAGATTCAGAGGCTTTGGCGGAGTTCGCATAGAA
GACGACGTGCTCATCACAGACAAGGGCGTCGAAAATCTCACATTCGTGCCAAGAACTGTT
GCGGAAATAGAAGAGTTCATGGCCAATGGCGCAAACTTCAAGTAA

Protein sequence:

MAGVWSMGPGTYEVPLSLFAKNRDRLAEKLKSGQVVVLQGGDDINLYDTDIQYVFRQEAY
FTWACGVREPGCYFALDVKTKKSIVFVPRLPDEYEIWMGKLLSCQDYTNMYGVDEVRYVD
EICDVLKSLEPDTLLTLVMDNETLFPIIAELRVIKTPEEIEVMRYICKVSSDAHKQVMLY
AKPGLLEYQCESVFLDHCYRVGGCRHVSYTCICGSGDNSAILHYGHAAAPNNKMLKDGDI
CLFDMGGNYAGYAADITCSFPANGKFTEDQKLIYEAVLAARDAVIRQGKPGVKWTDMHLA
ANRAMLEHLKRGGLLKGEVEKMIAFGVNGILQPHGLGHLLGLDVHDVGGYLKHCPPRPSG
PLGRLRTARILEAGMILTIEPGCYFIPKLLDAAKRTQKLAQFFNWDVMDRFRGFGGVRIE
DDVLITDKGVENLTFVPRTVAEIEEFMANGANFK
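
The nucleotide and protein listings above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the function name `translate_cds` and the short test fragments are illustrative and not part of the database record. A CDS of 1365 nt corresponds to 455 codons, that is, 454 residues plus the terminal stop codon.

```python
# Standard genetic code (NCBI translation table 1), laid out in TCAG order.
# Assumption: this table is appropriate for the nuclear CDS shown above.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(cds: str) -> str:
    """Translate a CDS (hypothetical helper), dropping one trailing stop codon.

    Whitespace and line breaks are ignored, so the 60-column listing can be
    pasted in directly. Ambiguous bases such as N raise a KeyError.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First five codons and last four codons of the nucleotide listing.
print(translate_cds("ATGGCTGGTGTTTGG"))  # MAGVW
print(translate_cds("AACTTCAAGTAA"))     # NFK
```

Pasting the full nucleotide listing into `translate_cds` should reproduce the protein listing if the two are consistent; the fragments above match its first five and last three residues.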