DPGLEAN02648 in OGS1.0

New model in OGS2.0  DPOGS210662
Genomic Position  scaffold635:+ 71528-73054
CDS Length  1527
Paired RNAseq reads  317
Single RNAseq reads  799
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA001803 (0.0)
Best Drosophila hit  dipeptidase B, isoform B (3e-99)
Best Human hit  probable aminopeptidase NPEPL1 (5e-21)
Best NR hit (blastp)  PREDICTED: similar to Sb:cb283 protein [Tribolium castaneum] (4e-113)
Best NR hit (blastx)  PREDICTED: similar to Sb:cb283 protein [Tribolium castaneum] (5e-109)
GeneOntology terms
  
GO:0008240 tripeptidyl-peptidase activity
GO:0006508 proteolysis
GO:0008239 dipeptidyl-peptidase activity
GO:0004177 aminopeptidase activity
GO:0005622 intracellular
InterPro families
  
IPR000819 Peptidase M17, leucyl aminopeptidase, C-terminal
IPR011356 Peptidase M17
Orthology group  MCL16297

Nucleotide sequence:

ATGTCGCCGTTTAAATTGTACGAGAATATTTTTATTGAGACGAATCTTTTATCCTCGGAC
TATGACGGCGTTATCCTCATACTGTATCCCAGGGACATGAATGTGGCGTTGCCCAGGCAT
GTGTCGAGCTTCATAGACAAAATCTTTATCCTGGATAAGAGTATTTACAAGACGCCCAGC
GTTTGGAACTGTGATTACGTTTCTGGAGGGCGGTTGGTGCTGTCGCCGGTAGGTAATGTA
ACTCCATACCATGACGTTACCGTGGTGAGAGAAGCCGCGAAGAGGGGAATGCTGCGAGCA
ATGGAAGCCGGTATGACCAAACCGTTGCTGATCGTTGAAAACGTAGTCCATTACCCCGAC
GGGCAATTAGTCTGCATTCTGGGGGCTCTGGAATCCTTATATGTTCCGATACAGATAAGG
GAGATGAAACCCCAGAAACAGGTATACAGAATCGGTCTGCATGCTGAGGAAAAAGCAACT
GAGTCATTTGAAAAGATAGTTAGAAACGCTATCGCCTTGGAGCGAGCTAGGATCGTAGCT
AGAGACATCGCTGGCGGGGATCCCGAGAGAATGGCTCCCGGGAGGATAGCTGATTATGTA
GTCAAAGTGTTCGCCGAAGATCCTTGTGTATCCATCAAAATTATTGACAACGATGATATT
ATAGCGCAGAAATATCCACTGCTGGCAGCTGTATCGCGGGCAGCGAATAACGTGGAGAGA
CACAAGGCTAGAGTTGTTTTACTGGAGTACAATTCATCTAACCCGGTCAGGGTGACAGAA
ACCATAATGTTGGTTGGCAAAGGGGTGACGTACGACACTGGCGGCGCTGATATAAAGATA
TCTGGCAAGATGGCCGGCATGTCCAGGGATAAATGCGGGGCAGCGGCTGTAGCTGGGTTT
TTGAAGGCCTGCTCCATACTGAAACCTCCACATCTGAAGGTCATTGGGGTTATGTGTTTG
TGTCGCAATTCTATCGGCTCAGATTCCTATGTGTCTGATGAATTGCTAACATCCAGTAGC
GGAAAACTGGTCAGGGTTACCAACACGGATGCGGAGGGTAGGCTAGCTATGGCAGATTCT
CTTTACATGCTGGCCAATATGGCGGAAAAAGAGCTCAACCCACATCTCTACACCATAGCG
ACTTTGACCGGACACGCCAGAGCCTGCTACGGTAATTATACAGCAGCTATGGACAATCAC
AGCGCCAAGGGCACCAACCACTCGAGCAAATTGCAGTTCAGCGGGTCAAGACTCGGAGAA
GGATTCGAGATATCTACCGTGAGGGCCGAGGATTTGGCTGTAAATGATGGGAAATGTAGC
GGAGATGATCTCGTTCAATATGACACTGACGCGAAATGCCGCAACCACCAGCTAGCTGCA
GGGTTTCTGATCAGGGTTGCCGGTTTGGAAGACAAGAATATAAAATACACGCATCTCGAT
ATAGCTGGAGCGGCGGGATGTCCTCCGGAAAAGCCCACAGCGACGCCCGTCTTATCTTTG
TGTCACTTACACAAAGTCTTATTGTAA

Protein sequence:

MSPFKLYENIFIETNLLSSDYDGVILILYPRDMNVALPRHVSSFIDKIFILDKSIYKTPS
VWNCDYVSGGRLVLSPVGNVTPYHDVTVVREAAKRGMLRAMEAGMTKPLLIVENVVHYPD
GQLVCILGALESLYVPIQIREMKPQKQVYRIGLHAEEKATESFEKIVRNAIALERARIVA
RDIAGGDPERMAPGRIADYVVKVFAEDPCVSIKIIDNDDIIAQKYPLLAAVSRAANNVER
HKARVVLLEYNSSNPVRVTETIMLVGKGVTYDTGGADIKISGKMAGMSRDKCGAAAVAGF
LKACSILKPPHLKVIGVMCLCRNSIGSDSYVSDELLTSSSGKLVRVTNTDAEGRLAMADS
LYMLANMAEKELNPHLYTIATLTGHARACYGNYTAAMDNHSAKGTNHSSKLQFSGSRLGE
GFEISTVRAEDLAVNDGKCSGDDLVQYDTDAKCRNHQLAAGFLIRVAGLEDKNIKYTHLD
IAGAAGCPPEKPTATPVLSLCHLHKVLL