DPGLEAN02365 in OGS1.0

New model in OGS2.0  DPOGS215176
Genomic Position  scaffold255:+ 23421-29512
CDS Length  1839
Paired RNAseq reads  749
Single RNAseq reads  2403
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA008660 (1e-09)
Best Drosophila hit  CG5798 (3e-39)
Best Human hit  ubiquitin carboxyl-terminal hydrolase 8 (3e-40)
Best NR hit (blastp)  GG24410 [Drosophila erecta] (6e-53)
Best NR hit (blastx)  PREDICTED: similar to Ubiquitin carboxyl-terminal hydrolase 8 (Ubiquitin thiolesterase 8) (Ubiquitin-specific processing protease 8) (Deubiquitinating enzyme 8) (hUBPy) isoform 3 [Canis familiaris] (6e-50)
Gene Ontology terms
GO:0004221 ubiquitin thiolesterase activity
GO:0004843 ubiquitin-specific protease activity
GO:0005515 protein binding
GO:0005622 intracellular
GO:0005769 early endosome
GO:0005829 cytosol
GO:0006511 ubiquitin-dependent protein catabolic process
GO:0007032 endosome organization
GO:0007265 Ras protein signal transduction
GO:0008233 peptidase activity
GO:0008234 cysteine-type peptidase activity
GO:0019897 extrinsic to plasma membrane
GO:0070536 protein K63-linked deubiquitination
GO:0071108 protein K48-linked deubiquitination
InterPro families
IPR001763 Rhodanese-like
IPR015063 Domain of unknown function DUF1873
Orthology group  MCL24111

Nucleotide sequence:

ATGACGGAGACGCGAAGATTACAATTACATTTAGGAAAATGTATAGAGGATTTGGACAAA
TTATACAATGTACCTGACTTGAAATCTAAAAGAGCAACGATGTTATGCAAAACAGCTCAA
AAACTCTTCGAGTCTGCAGAGGAGGCTCGCGAGAAAGGTGACGAAGAGTATTCCTATGTT
CAGTACATGAAGTACTTACGTATCATCGCTTACATAAGCAAAGACAAAGACTACTTAAAG
GACAAAACATACTTCAACAGTATGCTCGGTTCCAAAAACCCTAATAAGGCCTTGGACGCT
GCTGAAAAATTAAAGAATAGTTTAATAATAAGATATGAGAAAGAACAACAAGTAAATCGT
CTGAACGACATCCAAGAAAACGAGCTGATCAAGCAGAAGATGGAAGACAACAGGAAGAAG
GCTATCGAGGCTATGGTGGTCGCAGAACCAACACATCAGGGACTACCAGGTCCGGATGAA
GTGTCCATAAAGTCTGAACAGTTATATGTCCTGTTGAAGAGTAGCAAGCTCAAGATTATG
ATTCTGGACGCTCGGCCCAGTCAGCATTACCAGGAATCGCATATCAACCATCCGGTGTGC
ATCAATGTCCCCGAGGAGTGCATTTCACCCGGTCAGTCGGCCAACATGCTGGAACAGAAG
TTACCGCAGGTGTCCAGGAGCGTGTGGGCCGAGCGCGCTTCCATGGAACTCATCGTGATG
ATGGACTGGAACAGTATCACCGTCATACCGGGACAGAAGCTACATCTGCTCAAAACCATA
CTACTAAAGTGGGACGTGAAGGTCCACTACGCTCGCCAACCGGTGTGGCTGGTCGGCGGC
TACGAGGACTGGCTTCTCAAGTACCCCGCCTTCACCACCAACCCTCGAGCGATCCCACCC
ACCAGAGAACAGGACGTGGACGACATGCTGGATGAGATCGAGTACCCGGCCTGGTCGGAT
CTGAGTCCTCCTCCGCTGGCTGTCAATAGATCCTCCAAGCCGTCGGAGCCTCTCGTAGAT
AGAAGCAGTAAGCTGGCCGCTGTCCAGCTGTATGAAGAGCGAGCTCGCGGCGTCCAGAAC
ATCCTGGACCAGCAGGAGAGGATCGCCGACACCTCGCTCACACTGGAGATGCAGCCCGAC
CTTCGACTTGACTGGGAGAAAGTTAGATCTCAGAGGGAGGGGGAGCAGAGGGACGAGATG
AGGGCCATGTACAAGCTGCGAGAGCAGGAGATCATATCGCAGCTGATGCAACTCGAGAGC
AAACAGCTCCGCGAGCAGCTGGAGGAGTATCAGCGGAGGGAGAGGGAGGAGTCGGACCGA
CTGGAGGGGGGCGAGGACGAGCACCACGACGGAGACGCTATAGCAGAGCGCGCGCGGCAG
GCTGTGAGGGATGTGGCGGCCAAGAGAGCGAGGATAGCGGCCGTCACGGCTCAGAGGGAG
CGGCTCGACAGGAGGCGAGAGGTGCTGGAGATGGAGCGGAAGAAGAAACTAGCGGAGGCG
CGGGCGGCGAGGAAACCCGGGGACAAAGAGGAAGACGAGGCTCGTCCTGACAGTCCGGCG
CTGCCGCGGTCGCAGTCCTCGCCGAACATCGCCAAGGTGTCGTCTGACGAGGAGGAGGTC
ACCAGCCCCGTGTTTGACCGGAGCACGAAGCCGGCCAAGATGGCGCCCTCCAGTGACATG
CATCACAGAGACTTCCTACCCGTGTGGGGTGACGTGGTCAGTATAGTGTACATGTATACA
AACTCGCCTTTAATAAAAGGCAATCCTGTTCATAATACCAACTTTAAACAAGCAATGCGT
GTGTACACACACACAAGATTATCTTTAACATCAAAGTAA

Protein sequence:

MTETRRLQLHLGKCIEDLDKLYNVPDLKSKRATMLCKTAQKLFESAEEAREKGDEEYSYV
QYMKYLRIIAYISKDKDYLKDKTYFNSMLGSKNPNKALDAAEKLKNSLIIRYEKEQQVNR
LNDIQENELIKQKMEDNRKKAIEAMVVAEPTHQGLPGPDEVSIKSEQLYVLLKSSKLKIM
ILDARPSQHYQESHINHPVCINVPEECISPGQSANMLEQKLPQVSRSVWAERASMELIVM
MDWNSITVIPGQKLHLLKTILLKWDVKVHYARQPVWLVGGYEDWLLKYPAFTTNPRAIPP
TREQDVDDMLDEIEYPAWSDLSPPPLAVNRSSKPSEPLVDRSSKLAAVQLYEERARGVQN
ILDQQERIADTSLTLEMQPDLRLDWEKVRSQREGEQRDEMRAMYKLREQEIISQLMQLES
KQLREQLEEYQRREREESDRLEGGEDEHHDGDAIAERARQAVRDVAAKRARIAAVTAQRE
RLDRRREVLEMERKKKLAEARAARKPGDKEEDEARPDSPALPRSQSSPNIAKVSSDEEEV
TSPVFDRSTKPAKMAPSSDMHHRDFLPVWGDVVSIVYMYTNSPLIKGNPVHNTNFKQAMR
VYTHTRLSLTSK
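
The record lists a CDS length of 1839 nt and a 612-residue protein, which agree if the CDS carries one terminal stop codon (3 x (612 + 1) = 1839). The following is a minimal sketch of how that consistency could be checked, assuming the standard genetic code (NCBI translation table 1). The function names `translate` and `matches_protein` are hypothetical helpers introduced for illustration and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered by
# base T, C, A, G at each position; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS (whitespace and line breaks ignored) codon by codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def matches_protein(cds: str, protein: str) -> bool:
    """True if the CDS translates to the protein plus one terminal stop."""
    protein = "".join(protein.split()).upper()
    translated = translate(cds)
    return translated.endswith("*") and translated[:-1] == protein


# Last line of the nucleotide sequence against the last line of the protein.
print(matches_protein("GTGTACACACACACAAGATTATCTTTAACATCAAAGTAA", "VYTHTRLSLTSK"))  # True
```

Passing the complete nucleotide and protein sequences from this record, with line breaks left in, applies the same check to the whole model.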