DPGLEAN14779 in OGS1.0

New model in OGS2.0DPOGS213433 
Genomic Positionscaffold3039:+ 5613-22532
See gene structure
CDS Length2259
Paired RNAseq reads  107
Single RNAseq reads  290
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004470 (9e-122)
Best Drosophila hit  CG11319 (4e-90)
Best Human hitdipeptidyl aminopeptidase-like protein 6 isoform 1 (3e-62)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC010074 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC010074 [Tribolium castaneum] (0.0)
GeneOntology terms


  
GO:0008239 dipeptidyl-peptidase activity
GO:0008236 serine-type peptidase activity
GO:0016020 membrane
GO:0006508 proteolysis
InterPro families
  
IPR002469 Peptidase S9B, dipeptidylpeptidase IV N-terminal
IPR001375 Peptidase S9, prolyl oligopeptidase, catalytic domain
Orthology groupMCL11245

Nucleotide sequence:

ATGGAATTTCCACAGGGCTCAGGGCTAGGTGACGAGTTCGTGTATAGAGATGCGTGGGGT
GGAATAAGTCTTTTATTTGCTGCCAACCAGACCTCAAAGACTTTAATGCCTAACAGTACA
TTTCGTGTTCTACAACCGGCGTCTTATTCACTGTCAGCAGACCGACGATTCCTGTTGTTG
GCGCACGCACCGCGCAAGTTACATCAGTATTCATATCTAGCGCGATACTCGGTCTATGAT
ATTCTCACAACCGAGTCTTATCCACTGACCCCCCTCCCAGATGACATAGGAGGGGGGGTG
ATCAGCGAAGGACCTCTCCTCTTACTTGCGATGTGGACTCCGAAAGGGCATGGGCTCATC
ACAGTCAAGGATTATGATATTTATTATCGCCCGGCACCACGCTCATCTACCGGGTATAGA
GTAACTGACACCGGTATACCGGGAAGAATAAATAATGGTGTTCCGGATTGGTTGTATGAA
GTTGAAATCCTCAAATCCCGGTCCGCGTTGTGGATGTCAGCGGATGGCCACATGGTTTTA
TATGCCACCTTTAATGATAGCTTGGTTCACGAACAAAAATTTCCGTGGTATGGAGCTGCT
TTGGATACTGATGATCCTGCCAAGACTTATCCGGAAATAAGAAGCGTCAGATACCCCAAG
CCGGGGACTAACAATCCGGTTGTAAAGTTGACGGTGGCCGATATTGCTGACCCTAAACAC
ATACGTTCCCACCATCTTACACCCCCTAAGGTTTATAGAATATTCTCAGGCAAAGAAGCG
TGGGTAGAAGCTGCCCCTGCGCCGTTATGGTCAGCGGGCGGCGCCGCGTTAGTGACACTA
GCCCCTGTCCGAGACGGCCCCGCGGGTCTGTTCCGTCACATAATACGAGCTGAACATAAT
TCTCATGGACCCAGAGCATTGCCACTCACACACGGAAGCTTCGATGTAGTTAAATTGCTA
GCATGGGATCACGCAAACCAACACATTTATTATTTGGGTATACCAGAGGGCAAACCAGGA
CAGCAGCACTTATACAGAGTGTCATCTGAAGCGCCACGACCTGGCACCCCCCAGAAGCTC
CCTTATTGCGTCACTTGTAACTCCCAGCCATCACCATCAATAAACCTGGAGTTCTATGGT
AATTTGGCTTCTTCTGGAGACAGCTCCTGGGACAGTGATTGGGAAGAGATCCCTACATCA
CCATCACCTCCTAAGAAGAAAAAAAAGAAGCAACCAGAACAAGTCCAACAAAACCAGCCG
TGCCTTTATCACGATGCTCATTTTAGTCCATCATCTACATATTTTGTACTGGAGTGCCTG
GGACCAGGAGTCCCCACGTCGAGCTTACACAAGATAGCTTTACCTGAGCCAAGACTTTTG
ATGCATCTAGAAAACAATACAGCTGTTAAGGAAAAGTTAGCAGCAATAGCTCTACCCACG
CCGCGGACATTTTCAGTACAACTGTCAAGTGGACACGCGGCAAGAGTGAGACTTTTACTT
CCTCCAGGACTAAGGGAAGATGAAGTTACTAAATACCCTTTGGTCATGAAAGTACACGGG
GCTCCGGGAACACAGCTGGTTACGGAGCAATGGTCGTTAGACTGGGGTTCTCTGGCAGCT
GGGGCCGGAGCTATACTAGCATCCGTGGATGCAAGAGGAGCTGGAGGAAGAGGATTGGCT
GCCCATCACACTCTGCATAAGAGACTGGGGACTGTTGAACTGAAAGATCAACTGGAGGTG
GCTGAATATCTGCGGGACTCACTTCACTTCATAGATGCACGTCGAGTGGCGGTGTGGGGA
CGAGCGCATGGAGGTTTTCTGGCAGCGTTGGCGCTTGCCTCCCCCTTGAGAGTGTTCCAT
TGCGGGATCACAATTACACCTATTGTGCGCTGGAGATATTACGCCTCGGCCTACGCAGAG
CGGTACATGGGTCTGCCAAACGCGACAGGTAATTACCGAGGCTACGCGGATGCTGATGTC
ACGAAGCAAGCTTCAGCATTACAAGATAAGATGATCCTGGTGGTTCACGGAACTGCTGAT
GATGATGTACATATTCAGCAGACCATGTCGCTAGCCAGGGCGTTGTCAGACCACGGGAGT
ACATTTAGGCAACAGATATATCCGGACGAAGGCCATAGTTTGGAGGGTGTAAAACATCAT
TTGTATCGAACGATGTCTTCGTTCCTTGATGATTGTTTTAAAAAGCAAGTACCACCGGAG
ACGAAGGCGGGGCTTCGGAACGGTGGAAACCTTGACTGA

Protein sequence:

MEFPQGSGLGDEFVYRDAWGGISLLFAANQTSKTLMPNSTFRVLQPASYSLSADRRFLLL
AHAPRKLHQYSYLARYSVYDILTTESYPLTPLPDDIGGGVISEGPLLLLAMWTPKGHGLI
TVKDYDIYYRPAPRSSTGYRVTDTGIPGRINNGVPDWLYEVEILKSRSALWMSADGHMVL
YATFNDSLVHEQKFPWYGAALDTDDPAKTYPEIRSVRYPKPGTNNPVVKLTVADIADPKH
IRSHHLTPPKVYRIFSGKEAWVEAAPAPLWSAGGAALVTLAPVRDGPAGLFRHIIRAEHN
SHGPRALPLTHGSFDVVKLLAWDHANQHIYYLGIPEGKPGQQHLYRVSSEAPRPGTPQKL
PYCVTCNSQPSPSINLEFYGNLASSGDSSWDSDWEEIPTSPSPPKKKKKKQPEQVQQNQP
CLYHDAHFSPSSTYFVLECLGPGVPTSSLHKIALPEPRLLMHLENNTAVKEKLAAIALPT
PRTFSVQLSSGHAARVRLLLPPGLREDEVTKYPLVMKVHGAPGTQLVTEQWSLDWGSLAA
GAGAILASVDARGAGGRGLAAHHTLHKRLGTVELKDQLEVAEYLRDSLHFIDARRVAVWG
RAHGGFLAALALASPLRVFHCGITITPIVRWRYYASAYAERYMGLPNATGNYRGYADADV
TKQASALQDKMILVVHGTADDDVHIQQTMSLARALSDHGSTFRQQIYPDEGHSLEGVKHH
LYRTMSSFLDDCFKKQVPPETKAGLRNGGNLD