DPGLEAN11106 in OGS1.0

New model in OGS2.0DPOGS204226 
Genomic Positionscaffold1204:+ 411-7770
See gene structure
CDS Length3300
Paired RNAseq reads  3144
Single RNAseq reads  7713
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA007507 (6e-166)
Best Drosophila hit  fat facets, isoform C (0.0)
Best Human hitprobable ubiquitin carboxyl-terminal hydrolase FAF-X isoform 3 (0.0)
Best NR hit (blastp)  PREDICTED: similar to ubiquitin specific peptidase 9 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to Probable ubiquitin carboxyl-terminal hydrolase FAF-X (Ubiquitin thioesterase FAF-X) (Ubiquitin-specific-processing protease FAF-X) (Deubiquitinating enzyme FAF-X) (Fat facets protein-related, X-linked) (Ubiquitin-specific protease 9, X c... [Apis mellifera] (0.0)
GeneOntology terms











  
GO:0007097 nuclear migration
GO:0009790 embryo development
GO:0048749 compound eye development
GO:0008583 mystery cell fate differentiation
GO:0007349 cellularization
GO:0005737 cytoplasm
GO:0006511 ubiquitin-dependent protein catabolic process
GO:0016579 protein deubiquitination
GO:0004843 ubiquitin-specific protease activity
GO:0006897 endocytosis
GO:0045861 negative regulation of proteolysis
GO:0008354 germ cell migration
GO:0004221 ubiquitin thiolesterase activity
InterPro families
  
IPR018200 Peptidase C19, ubiquitin carboxyl-terminal hydrolase 2, conserved site
IPR001394 Peptidase C19, ubiquitin carboxyl-terminal hydrolase 2
Orthology groupMCL10749

Nucleotide sequence:

GTGGAGTACTTGCGTGCCGTCCGCGATAATCCTGCAACTTTAGATGACACTTTATTGGAA
GGTCATCTCAATCTTACCAAAGAATTATTCACCCACGTCCCGGCTCAGGTGAAGTTCCAA
TACGGCGCACATCCTGACCACAAAGACTCGGGCCTCATTAGGGAAGTGACAACTGAGTTC
CTGTGGCCGTATTCGTGGGCGTGGTGTCATATGAGTAACGAACTGTCTGGCGACGAGGAC
GGCACTAGCGAGGAGGAGTGTGAGGGTGTAGCCCCGCTCTGCCGTACACCCGGCGCCGCA
GCCGCTGCAACCGATCTACTGCTGGCTCTGGTGCATGCTTGCGTTCCTAATATGGCTGCG
CTTGCCAACTTACTTGAACAGATGTTCTATAGTGATAAAAACATGGGTCTATCCGAGTGG
GAGTACATGCCTTGCGTGGGACCACGGCCGCCGGCCGGGCTGGTGGGTCTGAAGAACGCG
GGCGCCACGTGTTATATGAACTCCGTGTTACAACAACTGTACTGCGTGCGAGCTGTACGG
GACGTGCTCCTAACAGTACAGGGGGCTGCTACTGACCCAAATGAAGATTTTTCCGGCGAA
ACGCATCATCACAGTATTTTGGAAAATAACACAGAGAATAACGCAGACTATAATATCACT
ATACTTAAACAAGTGCAGGCCATATTTGCCCACTTACATTATAGCAAACTTCAGTATTAT
GTACCCAGGGGGCTTTGGGCACACTTCAGGTTACAAGGTGAACCCGTGAATTTACGTGAA
CAGCAAGATGCTGTGGAGTTTTTCATGTCTCTGGTGGAATCCCTCGATGAGGCTTTAAAA
TCGTTGGGACAAGAACAACTTATGGCGAAAACGATGGGAGGAACATACTCCGATCAGAAA
ATTTGTAAGGGATGTCCACACAGGTATTGCAAAGAAGAACCATTCAGTGTAGTTTCCCTG
GACATAAGAAATATGTCCCGCTTGCAAGAATCTCTGGAAGCGTACGTCAGGGGAGAGCTT
CTGGAGGGGGCCGATGCTTATTATTGCGATAAGTGTAACAAGAAGGTTGTAACCGTGAAA
CGTCTGTGTCTCAACAAATTACCGCCAGTCCTAGTCATACAGCTGAAGAGATTCGAATAC
GACTTCGAGAAAGTTTGTGCAATAAAATTCAACGATTACTTTGAATTTCCGCGAGAGTTG
GATGTAGAGCCGTACACGGCGTGGGGTCTGGCACGAGCCGAGGGCGACGCGTCCCTGTGG
GAGGGCGGAGAACGTACGGAGACTCACTACCAGCTCAGCGGGATCGTGGTTCACTCCGGC
CAGGCCTCCGGAGGACACTACTATTCATATGTACTACTTAGAGACAACGCCGGTGACGCG
GGTCGATGGGTGAAGCTGGACGATGGCGAGGTGTCGGAGTGCGCCATGCATGATGACGAC
GAAATGAAGGCTCAGTGCTTCGGCGGAGAGTACATGGGAGAGGTATTTGATTCGACCATA
AAGAGGGTGTCATACAAGAGACAGAAGAGATGGTGGAACGCTTACATGCTGTTTTACACA
CGAAAGGACATGATTGATACATCCGGCCTCGAGAGAATCATGCAAAACGTAACACTCAAG
GAAAGTGCCATACCCAAACCTATCTGGAATTCGGTTCGTCGCAGTAATATCGCTTTCTCA
CACAACCAGGACCAGTTCAGTTTGGAACATTTTAATTTTATGAAGAAGCTATGTTGTATG
CGTATGCAAGTGTTACCCGGCTCACAGAGCGCGGTATGGGGCCCAGAGCACGAGGAAATG
TCGATGTTAGCTGTACAGTTAGCAGCTAAATTCTTGTTCCAAGTTGGTTTCCATACAAAG
AAAACACTACGTGGACCCGCTGCAGACTGGCAAGACATACTCTGCCAGCATCTGAGATGT
TCTCAGGCAGTCAGAACTTGGTTTGCGACCGACCTGTTCAAACATTCTCACAGGTTATGT
GACTACTTACTGTCGTGTCCATCGGCTGAAGTAAGAGTTGTATTTATGAAAATCATCGTG
TTTTTGGCTCATTTCTCCATACAGGACTCGCCAGTGAGCTGCGGCTACGGGACGTGGTGT
TCACGCGAGGAGGCCACCTCTCTATCGGATCAGGTTATATGTGCTGCTCGTGCGTTGGCG
GTGCCTCACGCACACGCACACGACCACCGACACCTGCCGCTGCTGTTCAACCTATTCCAC
GCGTACGCTATGCTGGGATTGGGGGAGAGACACCAGCTGCTTAGGCTCAAGATACTCGAC
ATAGTACTGACAGTCTGTTTGGAAGATTCGTCCTCGTCGCTTGGAAAATATCAGTATCCG
GAATCTGCTAAAATACATCAGGTTGTCTGTGCATTGGTGCGTTGTTGTGACGTGAGCGCT
CGTTGTCAGTCGGCTAACGCCAGTGAGGGCGTCCTGCCGCTGGCGAACCCGTACGCGGAC
GCGGCGCACGCGCACTCCCCGCGACCCGCACTGTCCGCAGCCGCAGCTGACGTGCTCTAC
AACCGCACCGGGTCTTACATGAAGAAACTAACTGAGGAGTGCTGCGGGTGTGAAGAGGGT
ATCCGACTGCTTCAGTTCATGTGCTGGGAGCATGCTGGCTGGTCTCGCATGGCGCTGGCC
GAGCTGCTGTGGCAGATGGCGTACGCGTTCTGTCATGAACTGCGGAGACACGCGGACGCA
CTCACGGCACTCCTTCTAATGGAAGATAGTTGGCAGCATCACAGGATACACAACGCTATC
AAGGGCGTGTCAGAGGAGCGTCCCGGGCTTCTTGAGACAGCATTGCGGGCGCGAAGTCAC
TACCAGAAACGAGCGTATGCTTGTGTGAAGTTGGTTGTAGGAGTAATGTGTCGCACTCCG
CTCGCTGTACGAGCTGTGCACGCGCAGTCCGACGCGCGAAGACGTTGGAGACAACTTCTA
GCGTGGCTACAGGACGAGCTCGAACGCAAGTATGGATCCGGAGGTTACGGGTCATACGGT
ACTTGGTCTCCCCCCAGCCTGTCCAACGAAACGTCAAGTGGATATTTCCTGGAGCGTAGC
AACTCCGCAAGAAAGACTCTCGAAAAGGCCTATCAACTCTGTCCGGAGGAGGAAGAAGAA
GAAGAGGAGTCGCGAGATGCCGGCTCGGGCTCGGGCTCCGGCGAGGCCGGTGACGAAAGC
GGTGACGACGATGCCCCCGATGACGAGGAGCCCGCGCCGCGCCGCCTGCAGCTTGCGCCA
CCCGCCCCGCCTGCGCCTCCAGCACCTCCGGCCCCTCCCGCGCCGCCCGCCGGCCCGTGA

Protein sequence:

VEYLRAVRDNPATLDDTLLEGHLNLTKELFTHVPAQVKFQYGAHPDHKDSGLIREVTTEF
LWPYSWAWCHMSNELSGDEDGTSEEECEGVAPLCRTPGAAAAATDLLLALVHACVPNMAA
LANLLEQMFYSDKNMGLSEWEYMPCVGPRPPAGLVGLKNAGATCYMNSVLQQLYCVRAVR
DVLLTVQGAATDPNEDFSGETHHHSILENNTENNADYNITILKQVQAIFAHLHYSKLQYY
VPRGLWAHFRLQGEPVNLREQQDAVEFFMSLVESLDEALKSLGQEQLMAKTMGGTYSDQK
ICKGCPHRYCKEEPFSVVSLDIRNMSRLQESLEAYVRGELLEGADAYYCDKCNKKVVTVK
RLCLNKLPPVLVIQLKRFEYDFEKVCAIKFNDYFEFPRELDVEPYTAWGLARAEGDASLW
EGGERTETHYQLSGIVVHSGQASGGHYYSYVLLRDNAGDAGRWVKLDDGEVSECAMHDDD
EMKAQCFGGEYMGEVFDSTIKRVSYKRQKRWWNAYMLFYTRKDMIDTSGLERIMQNVTLK
ESAIPKPIWNSVRRSNIAFSHNQDQFSLEHFNFMKKLCCMRMQVLPGSQSAVWGPEHEEM
SMLAVQLAAKFLFQVGFHTKKTLRGPAADWQDILCQHLRCSQAVRTWFATDLFKHSHRLC
DYLLSCPSAEVRVVFMKIIVFLAHFSIQDSPVSCGYGTWCSREEATSLSDQVICAARALA
VPHAHAHDHRHLPLLFNLFHAYAMLGLGERHQLLRLKILDIVLTVCLEDSSSSLGKYQYP
ESAKIHQVVCALVRCCDVSARCQSANASEGVLPLANPYADAAHAHSPRPALSAAAADVLY
NRTGSYMKKLTEECCGCEEGIRLLQFMCWEHAGWSRMALAELLWQMAYAFCHELRRHADA
LTALLLMEDSWQHHRIHNAIKGVSEERPGLLETALRARSHYQKRAYACVKLVVGVMCRTP
LAVRAVHAQSDARRRWRQLLAWLQDELERKYGSGGYGSYGTWSPPSLSNETSSGYFLERS
NSARKTLEKAYQLCPEEEEEEEESRDAGSGSGSGEAGDESGDDDAPDDEEPAPRRLQLAP
PAPPAPPAPPAPPAPPAGP