DPGLEAN02682 in OGS1.0

New model in OGS2.0  DPOGS207580
Genomic Position  scaffold4787:- 618-4901
CDS Length  1857
Paired RNAseq reads  3114
Single RNAseq reads  7407
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA004688 (0.0)
Best Drosophila hit  dipeptidyl aminopeptidase III, isoform C (0.0)
Best Human hit  dipeptidyl peptidase 3 (1e-175)
Best NR hit (blastp)  PREDICTED: similar to dipeptidyl peptidase iii [Nasonia vitripennis] (0.0)
Best NR hit (blastx)  PREDICTED: similar to dipeptidyl peptidase iii [Nasonia vitripennis] (0.0)
Gene Ontology terms
GO:0008239 dipeptidyl-peptidase activity
GO:0006508 proteolysis
GO:0016020 membrane
GO:0005829 cytosol
InterPro families
IPR016526 Peptidase M49, dipeptidyl-peptidase 3, eukaryotic
IPR005317 Peptidase M49, dipeptidyl-peptidase III
Orthology group  MCL12893

Nucleotide sequence:

ATGGAGGATAAGTCGATCTTTCTTTTGCCAAACAGTCAGAAATTTGTTGAACTAGATAGT
TCACAGGCATTTACAAAATTAACTAAACAAGAAAAGTTGTATGCTCACTATTTGAGTCAG
GCTGCTTGGAATGGTGGTTTAATTGTTCTTCTACAAACAAGTCCAGAATCACCAAGGATT
TTTTCACTTTTGCACAGAATTTTTAAATCAGAAGGATTAGCTGATTTAAAAAAAGTTTCC
CTTGGAGCTGGTGTATCCGAGGATGATTTTCAGGCCTTCTTAGTTTATGCGGGTGGATTA
TTTGCTAACAGCGGTAATTACAAAGGCTTTGGTGATACAAAATTCATTCCTAACTTGCCA
AAAGAATGTTTTGAAGTTATCGTTAAATCATCAAAAGCATTTAAAAATGATGAAGCACAT
ATAAGTAAACTGTGGGAGAACACTAAAAATGCTATGTACAGTACTGCACCCAGATTAGCC
AGCCTTGGTTTAGCCGATAAGGGTATAACAACATATTTCTCAAGTAACTGTACAGAGGCG
GACTCGACTCTTGTAAATGACTGGATGAAAACAAAACGCATTGAAGCATACATTTGTAGA
ACTTTCAAGACAACCGCTGACGATGGATTACCTTTGTATACGATACACCTGGCCAGTGTC
GAGAAAAGCTCAAAGCCGCCCCTTACTATGGATAAAGAAAAATACAAAAATGCGTACTTC
CAAGTGACTCGGGGAGATTATTCGCCATTATTGAGTTTGGTCAACGAAAATCTTGCAAAA
GCTATGGAGTATGCGGCAAATGAGAATGAAAAGAATATGATTAAACATTACATTAACAGT
TTTAAAGAGGGAGATTTAAGTGAACATAAAGAAGGCAGCAGGTTCTGGGTGAAGGACAAA
GGACCGATTATAGAGACATATCAAGGCTTCATAGAGACATACCGCGATCCCAGCGGACAA
AGAGGTGAATTCGAGGGTTTTGTGGCTATGGTCAATAAAGATATGTCAAAAAAGTTTGGG
GAACTCGTCCATGGTGCTGAAAACTTCATAAAGCTGTTACCGTGGGGGGAGGGGCTTGAG
AAGGATTCCTTCCTCCGACCGGACTTCACTAGTCTAGACGTACTGACGTTCTCAGGGAGC
GGTATACCAGCCGGAATTAACATACCTAACTATGATGAGATCCGACAAAATGAAGGCTTT
AAGAACGTGTCCCTGGGTAACGTGTTCCCCGCCGCTTATAAGGAGTCCGTTATACCATTC
CTCTCTGATAGTGATAAAGTTCTTTTAGAAAAATACAGGGTTGCTGCATTTGAGGTTCAA
GTAGGACTTCATGAACTGCTGGGTCATGGCAGCGGGAAGCTTCTCAGACAAAACGCAGAC
GGGACATTCAACTTCGACAAGGAGAAAGTTAAAAATCCTCTAACTGGCAAGGAGATCGAG
TCGTGGTATTCAGAAGGCGAGAATTACGACAGCAAGTTCACCACTTTGGGATCCGCCTTC
GAGGAATGCCGGGCGGAGGCTGTTGGATTGTATCTGTCGTTACGACCTGAGATACTCAAA
ATCTTCGGTTACGAGGGTCAGGAAGCAGAGGACGTGATGTACGTCAACTGGCTCAGTCTA
CTGTGGAACGGAGCCGCCAAGGCCACGGAAATGTACCAGCCGGCTACGAAAACGTGGCTA
CAGGCCCACGCGAGAGCTCGTTTTGTTTTAATGAGACTGTTGGAATTGGAAGGTAACGGA
ATACTAACAGTCACCGAGGTTGATCCCGGCAAGAACCTGTTGCTTACTTTAGACAGGAAA
CGTTTGGCTACTGACGGAAAACGAATTGTCGGTAGGAATGACTTGTTATTAATTTAA

Protein sequence:

MEDKSIFLLPNSQKFVELDSSQAFTKLTKQEKLYAHYLSQAAWNGGLIVLLQTSPESPRI
FSLLHRIFKSEGLADLKKVSLGAGVSEDDFQAFLVYAGGLFANSGNYKGFGDTKFIPNLP
KECFEVIVKSSKAFKNDEAHISKLWENTKNAMYSTAPRLASLGLADKGITTYFSSNCTEA
DSTLVNDWMKTKRIEAYICRTFKTTADDGLPLYTIHLASVEKSSKPPLTMDKEKYKNAYF
QVTRGDYSPLLSLVNENLAKAMEYAANENEKNMIKHYINSFKEGDLSEHKEGSRFWVKDK
GPIIETYQGFIETYRDPSGQRGEFEGFVAMVNKDMSKKFGELVHGAENFIKLLPWGEGLE
KDSFLRPDFTSLDVLTFSGSGIPAGINIPNYDEIRQNEGFKNVSLGNVFPAAYKESVIPF
LSDSDKVLLEKYRVAAFEVQVGLHELLGHGSGKLLRQNADGTFNFDKEKVKNPLTGKEIE
SWYSEGENYDSKFTTLGSAFEECRAEAVGLYLSLRPEILKIFGYEGQEAEDVMYVNWLSL
LWNGAAKATEMYQPATKTWLQAHARARFVLMRLLELEGNGILTVTEVDPGKNLLLTLDRK
RLATDGKRIVGRNDLLLI
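
The CDS length and the two sequences in this record can be cross-checked against each other. The following is a minimal sketch of that check, not part of the original record. It assumes the standard genetic code and a complete CDS ending in a single stop codon. The helper names (`expected_protein_length`, `translate`) and the partial codon table are hypothetical and cover only the codons used in the two excerpts. The residue count of 618 is counted from the protein sequence above rather than stated by the database.

```python
# Values taken from this record; PROTEIN_LENGTH is counted from the
# protein sequence (10 full lines of 60 residues plus a final line of 18).
CDS_LENGTH = 1857
PROTEIN_LENGTH = 618

# Partial standard-code table: only the codons that occur in the excerpts below.
CODONS = {
    "ATG": "M", "GAG": "E", "GAT": "D", "AAG": "K", "TCG": "S",
    "ATC": "I", "ATT": "I", "TTT": "F", "CTT": "L", "TTG": "L",
    "TTA": "L", "CCA": "P", "TAA": "*",
}


def expected_protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS that ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon ('*' marks a stop)."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 and last 15 nucleotides of the nucleotide sequence above.
cds_start = "ATGGAGGATAAGTCGATCTTTCTTTTGCCA"
cds_end = "TTGTTATTAATTTAA"

print(expected_protein_length(CDS_LENGTH) == PROTEIN_LENGTH)
print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

For a full translation, a complete codon table (or a library such as Biopython) would be needed; the partial table here raises `KeyError` on any codon it does not list.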