DPGLEAN04056 in OGS1.0

New model in OGS2.0DPOGS210564 
Genomic Positionscaffold18247:- 557-5554
See gene structure
CDS Length1182
Paired RNAseq reads  1234
Single RNAseq reads  3068
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA009941 (2e-32)
Best Drosophila hit  CG12082 (3e-94)
Best Human hitubiquitin carboxyl-terminal hydrolase 5 isoform 2 (1e-87)
Best NR hit (blastp)  ubiquitin carboxyl-terminal hydrolase 5 [Culex quinquefasciatus] (4e-115)
Best NR hit (blastx)  PREDICTED: similar to ubiquitin carboxyl-terminal hydrolase 5 [Tribolium castaneum] (2e-107)
GeneOntology terms


  
GO:0016579 protein deubiquitination
GO:0004221 ubiquitin thiolesterase activity
GO:0006511 ubiquitin-dependent protein catabolic process
GO:0008270 zinc ion binding
InterPro families


  
IPR001607 Zinc finger, UBP-type
IPR001394 Peptidase C19, ubiquitin carboxyl-terminal hydrolase 2
IPR018200 Peptidase C19, ubiquitin carboxyl-terminal hydrolase 2, conserved site
IPR013083 Zinc finger, RING/FYVE/PHD-type
Orthology groupMCL11167

Nucleotide sequence:

ATGTCGGAAATCCCGGAGCCAGAACAAACAGGAGATGGTCCAGAGAAGAAAATAACTCGC
CTGGCGATCGGTGTGGAGGGTGGCTTCGACCCTGACTGTGGCAAACCAAAGTACACTTAC
ACAGAACACTACAGCGTTGTGGTGCTGCCGGGGTTTCACACATTCCCCTGGCCTAATGAC
GCTTTACCTGACGTGGTAAAGAAATCTGTTCAGGCTGTGCTAGATGCGGATTCTCCATTC
AAGCTCGCTGAGGCGGAGGCTTTACACGGCACCTGGGATGGGGAGAAGCGAGAAGTATCC
GTCCACTCGGTTAACTTGAAGCAGTTAGATAACGGCGTTAAAATACCACCTTCCGGCTGG
AAATGTGCCAAGTGTGATCTGACGAACAACTTGTGGTTGAATCTGACCGACGGGTCCATA
TTGTGTGGGAGGAGATTCTTCGATGGCTCCGGCGGAAACGATCACGCGGTGGAGCATTTC
CGCGCGACCGGATATCCGCTCGCTGTGAAGCTTGGCACGATAACAGCTGACGGTACTGGC
GACGTGTACTCGTACGCCGAAGACGATATGGTCGAGGACCCCTACCTGGCGGAACACCTC
AAACACTTCGGCATCAACGTCCAGCAGTTACAGAAGACGGAGAAGTCGATGGTGGAGTTG
GAGCTGGAACTGAACCGCCGTACGGGCGAGTGGAACACCATCCAGGAGTCTGGAAGTGAG
CTGCGACCGCTGCACGGACCAGCACTCACAGGTGTCAACAACCTCGGCAACTCCTGTTAC
ATCAATAGTGTGGTCCAGGTGCTCTTCCGTATGCCGGACTTCATACGTCGCTACGTGGAA
GGCGCGCCAGAGATATTCTCGACCTTCCCCGAGGATCCTGCTAACGATTTCAACGTGCAG
ACAGATCCGTCCGAAGTGGTCCGTCCCCTGATACCGTTTCAAGCGTGTTTAGACGCGTTC
ATGAAGGAGGAACTCATTGAACAGTTCTTTAGTTCAGCTCTCAATAAGAAAGTTACTGCT
CGCAAAATAACCCGGCTGGCGACTTTCCCCGATTACCTTTGGATCCAGTTAAAGAAATTC
ACTATCAAAGAAGATTGGACACCCGCCAAGCTAGATGTGGCCGTGGACATGCCGTGGGAG
GTCGGTGTCATTGTCATCGTCCCAAAACAAACGTTTTTTTAA

Protein sequence:

MSEIPEPEQTGDGPEKKITRLAIGVEGGFDPDCGKPKYTYTEHYSVVVLPGFHTFPWPND
ALPDVVKKSVQAVLDADSPFKLAEAEALHGTWDGEKREVSVHSVNLKQLDNGVKIPPSGW
KCAKCDLTNNLWLNLTDGSILCGRRFFDGSGGNDHAVEHFRATGYPLAVKLGTITADGTG
DVYSYAEDDMVEDPYLAEHLKHFGINVQQLQKTEKSMVELELELNRRTGEWNTIQESGSE
LRPLHGPALTGVNNLGNSCYINSVVQVLFRMPDFIRRYVEGAPEIFSTFPEDPANDFNVQ
TDPSEVVRPLIPFQACLDAFMKEELIEQFFSSALNKKVTARKITRLATFPDYLWIQLKKF
TIKEDWTPAKLDVAVDMPWEVGVIVIVPKQTFF