DPGLEAN00737 in OGS1.0

New model in OGS2.0DPOGS208930 
Genomic Positionscaffold3263:- 5206-10339
See gene structure
CDS Length1869
Paired RNAseq reads  2345
Single RNAseq reads  5667
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA002507 (0.0)
Best Drosophila hit  CG15817, isoform C (1e-35)
Best Human hitubiquitin carboxyl-terminal hydrolase 1 (1e-16)
Best NR hit (blastp)  PREDICTED: similar to ubiquitin specific protease [Tribolium castaneum] (6e-105)
Best NR hit (blastx)  PREDICTED: similar to ubiquitin specific protease [Tribolium castaneum] (2e-91)
GeneOntology terms
  
GO:0006511 ubiquitin-dependent protein catabolic process
GO:0004221 ubiquitin thiolesterase activity
InterPro families
  
IPR018200 Peptidase C19, ubiquitin carboxyl-terminal hydrolase 2, conserved site
IPR001394 Peptidase C19, ubiquitin carboxyl-terminal hydrolase 2
Orthology groupMCL14536

Nucleotide sequence:

ATGCCAGTTAACTCTTTATCAGAAACGACAAAAGGTGTTGCGAGAACTAAATTATCCCTC
TCTCTTAGAAGAAATAATGCAACAAATCCGATCAACGCAAATCAGAAACCAGGAGATTTA
AAGGACAGCTCTTACAAGGGATCAAATGACAATAAAGAGAATAAACCAGTGAAGAGACCT
ATAACAGGCACATATCTTAATACATTGAATGCTGCTAAAAAGCTTAAGCCTTTGACTGAA
ACGTCGAAACCGAAAGAAACAGTTAGTGAGCCACAGCTAGTTGTATTTGAACCATCAATA
AGTAACATTGAAATGTTGAATGGCCACCATCCTTCGGACAACCAGTATGGAGGGAGTACA
CAGTGGAAAGCCCCCATTGCCACTCTGTCTAATCTTGGAAATACTTGTTTCCTCAATAGT
GTACTATATACCCTACGATATGCTCCACAATTTGTGCACAACCTCCATCATCTAGTGTCC
GATCTCACTAGAGTGGAACAGAAATTGGGCAGCATCAGGTTAAAAAGCTCATCTCTGGGA
CGAAGTGCAGCTGGGCTTGCATCTTCTGGTACTAGATCATGGAGCAGTAAAGATTTGTTA
TCTCTGGGACAATCGGATAACACCACAGGAAAAAGTAAAATACAGATAGCCACAGAGAAG
CTTCATGAAACATATCTTAGCCTACGGGCCGCGGAAAGCAAGTGTATAAACAGTGGTGCT
GCTGATGCCAGCCCGGAACCATATGCTGCTGATGCATTTCTAGCGGCATTGAGAGAAGTC
AACTCTACATTTGAAGGTAATCGGCAACAAGATGCACATGAGCTTCTTGTTTGTATCTTG
GACAGCATTAGAGAAACATGCAGAGCTCTAAGTGCAAGAGCATCCCGTCTTCAATTACAT
GAAAATGGCGACAGCAATGGCCTCGGTCGCCAACCCAGCCTTGACGGTGATGGCAGTAAG
ACGTTAGGACATCTCCGTAAGTCGTGGAAAAAACGCAAGGAAACAAAAACCACTGACAAA
AGAAGCTCACCCTCAGAAGAACGTCCGCCTTCACCGGATCCTGAGAAGGATGAACGCTCG
AGGCCTGGCTGGGACTTTGTTGCTGATGATTTTGAAGGTACCATGGTTGTTCGAACTATG
TGCCTGGAGTGTGAGGCGGTGACGGAGAAAGCTCAAGCTGTGTGTGAGCTCTGTGTGCCA
GTAGGTGATGATGATACAAATGAAGAACCATTTAGAGCTGCATGTCTCTCTAGTGAATAT
TTGAGGGATCAGAATAAGTATTGGTGCGAGCGCTGCCTGCGCTACAACGAGGCTAGACGC
AGTGTCGCGTATTCGCGACTGCCACGGCTGTTAGTGTTGCAGCTCAAGCGCTTCAGCGGC
GGCATGGAAAAGATCACAAGACACGCGCCCACGCCACTTCTCATGCCTTGCTTCTGTGAG
CCATGTGCCAAACGGCCACCTGATCATCCACCCACACACAGATACATCCTATGGGCGGTG
ATAATGCACCTTGGTCAGGCGTTGACCGGTGGCCACTATGTAGCGTACGCGAGAGATCGT
TCCAACGGTGACGGCGAGGTGGCCAGCAAATGTGAGAGAACTGGCGGTGGTGACGCAGCA
TCTAACAACAGCGGCTCAAGCTTCATGCGAACTCTATTTAATCGCCCGAGAGCACAACCA
TCTGGCTGCGCTGCCAATGATTGCTGTGTACCGCGCCCTCGGCTAGACACCTGCTGGCTG
GCCTGCGACGACGACCTCGTCAAACCCATATCAAATGAAGAGTTCGAGGATCTATTATCC
GCCGAGCCGAAAATGCGCTCCGCAGCAACACCATACTTACTGTTCTATGTGAAGAGCGAA
GTCGGTTAA

Protein sequence:

MPVNSLSETTKGVARTKLSLSLRRNNATNPINANQKPGDLKDSSYKGSNDNKENKPVKRP
ITGTYLNTLNAAKKLKPLTETSKPKETVSEPQLVVFEPSISNIEMLNGHHPSDNQYGGST
QWKAPIATLSNLGNTCFLNSVLYTLRYAPQFVHNLHHLVSDLTRVEQKLGSIRLKSSSLG
RSAAGLASSGTRSWSSKDLLSLGQSDNTTGKSKIQIATEKLHETYLSLRAAESKCINSGA
ADASPEPYAADAFLAALREVNSTFEGNRQQDAHELLVCILDSIRETCRALSARASRLQLH
ENGDSNGLGRQPSLDGDGSKTLGHLRKSWKKRKETKTTDKRSSPSEERPPSPDPEKDERS
RPGWDFVADDFEGTMVVRTMCLECEAVTEKAQAVCELCVPVGDDDTNEEPFRAACLSSEY
LRDQNKYWCERCLRYNEARRSVAYSRLPRLLVLQLKRFSGGMEKITRHAPTPLLMPCFCE
PCAKRPPDHPPTHRYILWAVIMHLGQALTGGHYVAYARDRSNGDGEVASKCERTGGGDAA
SNNSGSSFMRTLFNRPRAQPSGCAANDCCVPRPRLDTCWLACDDDLVKPISNEEFEDLLS
AEPKMRSAATPYLLFYVKSEVG