New model in OGS2.0 | DPOGS204226  |
---|---|
Genomic Position | scaffold1204:+ 411-7770 |
CDS Length | 3300 |
Paired RNAseq reads   | 3144 |
Single RNAseq reads   | 7713 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA007507 (6e-166) |
Best Drosophila hit   | fat facets, isoform C (0.0) |
Best Human hit | probable ubiquitin carboxyl-terminal hydrolase FAF-X isoform 3 (0.0) |
Best NR hit (blastp)   | PREDICTED: similar to ubiquitin specific peptidase 9 [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to Probable ubiquitin carboxyl-terminal hydrolase FAF-X (Ubiquitin thioesterase FAF-X) (Ubiquitin-specific-processing protease FAF-X) (Deubiquitinating enzyme FAF-X) (Fat facets protein-related, X-linked) (Ubiquitin-specific protease 9, X c... [Apis mellifera] (0.0) |
GeneOntology terms | GO:0007097 nuclear migration; GO:0009790 embryo development; GO:0048749 compound eye development; GO:0008583 mystery cell fate differentiation; GO:0007349 cellularization; GO:0005737 cytoplasm; GO:0006511 ubiquitin-dependent protein catabolic process; GO:0016579 protein deubiquitination; GO:0004843 ubiquitin-specific protease activity; GO:0006897 endocytosis; GO:0045861 negative regulation of proteolysis; GO:0008354 germ cell migration; GO:0004221 ubiquitin thiolesterase activity |
InterPro families | IPR018200 Peptidase C19, ubiquitin carboxyl-terminal hydrolase 2, conserved site; IPR001394 Peptidase C19, ubiquitin carboxyl-terminal hydrolase 2 |
Orthology group | MCL10749 |
Nucleotide sequence:
GTGGAGTACTTGCGTGCCGTCCGCGATAATCCTGCAACTTTAGATGACACTTTATTGGAA
GGTCATCTCAATCTTACCAAAGAATTATTCACCCACGTCCCGGCTCAGGTGAAGTTCCAA
TACGGCGCACATCCTGACCACAAAGACTCGGGCCTCATTAGGGAAGTGACAACTGAGTTC
CTGTGGCCGTATTCGTGGGCGTGGTGTCATATGAGTAACGAACTGTCTGGCGACGAGGAC
GGCACTAGCGAGGAGGAGTGTGAGGGTGTAGCCCCGCTCTGCCGTACACCCGGCGCCGCA
GCCGCTGCAACCGATCTACTGCTGGCTCTGGTGCATGCTTGCGTTCCTAATATGGCTGCG
CTTGCCAACTTACTTGAACAGATGTTCTATAGTGATAAAAACATGGGTCTATCCGAGTGG
GAGTACATGCCTTGCGTGGGACCACGGCCGCCGGCCGGGCTGGTGGGTCTGAAGAACGCG
GGCGCCACGTGTTATATGAACTCCGTGTTACAACAACTGTACTGCGTGCGAGCTGTACGG
GACGTGCTCCTAACAGTACAGGGGGCTGCTACTGACCCAAATGAAGATTTTTCCGGCGAA
ACGCATCATCACAGTATTTTGGAAAATAACACAGAGAATAACGCAGACTATAATATCACT
ATACTTAAACAAGTGCAGGCCATATTTGCCCACTTACATTATAGCAAACTTCAGTATTAT
GTACCCAGGGGGCTTTGGGCACACTTCAGGTTACAAGGTGAACCCGTGAATTTACGTGAA
CAGCAAGATGCTGTGGAGTTTTTCATGTCTCTGGTGGAATCCCTCGATGAGGCTTTAAAA
TCGTTGGGACAAGAACAACTTATGGCGAAAACGATGGGAGGAACATACTCCGATCAGAAA
ATTTGTAAGGGATGTCCACACAGGTATTGCAAAGAAGAACCATTCAGTGTAGTTTCCCTG
GACATAAGAAATATGTCCCGCTTGCAAGAATCTCTGGAAGCGTACGTCAGGGGAGAGCTT
CTGGAGGGGGCCGATGCTTATTATTGCGATAAGTGTAACAAGAAGGTTGTAACCGTGAAA
CGTCTGTGTCTCAACAAATTACCGCCAGTCCTAGTCATACAGCTGAAGAGATTCGAATAC
GACTTCGAGAAAGTTTGTGCAATAAAATTCAACGATTACTTTGAATTTCCGCGAGAGTTG
GATGTAGAGCCGTACACGGCGTGGGGTCTGGCACGAGCCGAGGGCGACGCGTCCCTGTGG
GAGGGCGGAGAACGTACGGAGACTCACTACCAGCTCAGCGGGATCGTGGTTCACTCCGGC
CAGGCCTCCGGAGGACACTACTATTCATATGTACTACTTAGAGACAACGCCGGTGACGCG
GGTCGATGGGTGAAGCTGGACGATGGCGAGGTGTCGGAGTGCGCCATGCATGATGACGAC
GAAATGAAGGCTCAGTGCTTCGGCGGAGAGTACATGGGAGAGGTATTTGATTCGACCATA
AAGAGGGTGTCATACAAGAGACAGAAGAGATGGTGGAACGCTTACATGCTGTTTTACACA
CGAAAGGACATGATTGATACATCCGGCCTCGAGAGAATCATGCAAAACGTAACACTCAAG
GAAAGTGCCATACCCAAACCTATCTGGAATTCGGTTCGTCGCAGTAATATCGCTTTCTCA
CACAACCAGGACCAGTTCAGTTTGGAACATTTTAATTTTATGAAGAAGCTATGTTGTATG
CGTATGCAAGTGTTACCCGGCTCACAGAGCGCGGTATGGGGCCCAGAGCACGAGGAAATG
TCGATGTTAGCTGTACAGTTAGCAGCTAAATTCTTGTTCCAAGTTGGTTTCCATACAAAG
AAAACACTACGTGGACCCGCTGCAGACTGGCAAGACATACTCTGCCAGCATCTGAGATGT
TCTCAGGCAGTCAGAACTTGGTTTGCGACCGACCTGTTCAAACATTCTCACAGGTTATGT
GACTACTTACTGTCGTGTCCATCGGCTGAAGTAAGAGTTGTATTTATGAAAATCATCGTG
TTTTTGGCTCATTTCTCCATACAGGACTCGCCAGTGAGCTGCGGCTACGGGACGTGGTGT
TCACGCGAGGAGGCCACCTCTCTATCGGATCAGGTTATATGTGCTGCTCGTGCGTTGGCG
GTGCCTCACGCACACGCACACGACCACCGACACCTGCCGCTGCTGTTCAACCTATTCCAC
GCGTACGCTATGCTGGGATTGGGGGAGAGACACCAGCTGCTTAGGCTCAAGATACTCGAC
ATAGTACTGACAGTCTGTTTGGAAGATTCGTCCTCGTCGCTTGGAAAATATCAGTATCCG
GAATCTGCTAAAATACATCAGGTTGTCTGTGCATTGGTGCGTTGTTGTGACGTGAGCGCT
CGTTGTCAGTCGGCTAACGCCAGTGAGGGCGTCCTGCCGCTGGCGAACCCGTACGCGGAC
GCGGCGCACGCGCACTCCCCGCGACCCGCACTGTCCGCAGCCGCAGCTGACGTGCTCTAC
AACCGCACCGGGTCTTACATGAAGAAACTAACTGAGGAGTGCTGCGGGTGTGAAGAGGGT
ATCCGACTGCTTCAGTTCATGTGCTGGGAGCATGCTGGCTGGTCTCGCATGGCGCTGGCC
GAGCTGCTGTGGCAGATGGCGTACGCGTTCTGTCATGAACTGCGGAGACACGCGGACGCA
CTCACGGCACTCCTTCTAATGGAAGATAGTTGGCAGCATCACAGGATACACAACGCTATC
AAGGGCGTGTCAGAGGAGCGTCCCGGGCTTCTTGAGACAGCATTGCGGGCGCGAAGTCAC
TACCAGAAACGAGCGTATGCTTGTGTGAAGTTGGTTGTAGGAGTAATGTGTCGCACTCCG
CTCGCTGTACGAGCTGTGCACGCGCAGTCCGACGCGCGAAGACGTTGGAGACAACTTCTA
GCGTGGCTACAGGACGAGCTCGAACGCAAGTATGGATCCGGAGGTTACGGGTCATACGGT
ACTTGGTCTCCCCCCAGCCTGTCCAACGAAACGTCAAGTGGATATTTCCTGGAGCGTAGC
AACTCCGCAAGAAAGACTCTCGAAAAGGCCTATCAACTCTGTCCGGAGGAGGAAGAAGAA
GAAGAGGAGTCGCGAGATGCCGGCTCGGGCTCGGGCTCCGGCGAGGCCGGTGACGAAAGC
GGTGACGACGATGCCCCCGATGACGAGGAGCCCGCGCCGCGCCGCCTGCAGCTTGCGCCA
CCCGCCCCGCCTGCGCCTCCAGCACCTCCGGCCCCTCCCGCGCCGCCCGCCGGCCCGTGA
Protein sequence:
VEYLRAVRDNPATLDDTLLEGHLNLTKELFTHVPAQVKFQYGAHPDHKDSGLIREVTTEF
LWPYSWAWCHMSNELSGDEDGTSEEECEGVAPLCRTPGAAAAATDLLLALVHACVPNMAA
LANLLEQMFYSDKNMGLSEWEYMPCVGPRPPAGLVGLKNAGATCYMNSVLQQLYCVRAVR
DVLLTVQGAATDPNEDFSGETHHHSILENNTENNADYNITILKQVQAIFAHLHYSKLQYY
VPRGLWAHFRLQGEPVNLREQQDAVEFFMSLVESLDEALKSLGQEQLMAKTMGGTYSDQK
ICKGCPHRYCKEEPFSVVSLDIRNMSRLQESLEAYVRGELLEGADAYYCDKCNKKVVTVK
RLCLNKLPPVLVIQLKRFEYDFEKVCAIKFNDYFEFPRELDVEPYTAWGLARAEGDASLW
EGGERTETHYQLSGIVVHSGQASGGHYYSYVLLRDNAGDAGRWVKLDDGEVSECAMHDDD
EMKAQCFGGEYMGEVFDSTIKRVSYKRQKRWWNAYMLFYTRKDMIDTSGLERIMQNVTLK
ESAIPKPIWNSVRRSNIAFSHNQDQFSLEHFNFMKKLCCMRMQVLPGSQSAVWGPEHEEM
SMLAVQLAAKFLFQVGFHTKKTLRGPAADWQDILCQHLRCSQAVRTWFATDLFKHSHRLC
DYLLSCPSAEVRVVFMKIIVFLAHFSIQDSPVSCGYGTWCSREEATSLSDQVICAARALA
VPHAHAHDHRHLPLLFNLFHAYAMLGLGERHQLLRLKILDIVLTVCLEDSSSSLGKYQYP
ESAKIHQVVCALVRCCDVSARCQSANASEGVLPLANPYADAAHAHSPRPALSAAAADVLY
NRTGSYMKKLTEECCGCEEGIRLLQFMCWEHAGWSRMALAELLWQMAYAFCHELRRHADA
LTALLLMEDSWQHHRIHNAIKGVSEERPGLLETALRARSHYQKRAYACVKLVVGVMCRTP
LAVRAVHAQSDARRRWRQLLAWLQDELERKYGSGGYGSYGTWSPPSLSNETSSGYFLERS
NSARKTLEKAYQLCPEEEEEEEESRDAGSGSGSGEAGDESGDDDAPDDEEPAPRRLQLAP
PAPPAPPAPPAPPAPPAGP
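
The nucleotide and protein sequences above can be cross-checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and are not part of the database record.

```python
# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
# Assumption: the record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the 60-column
    sequence block can be pasted in directly.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        # Codons containing ambiguous bases (e.g. N) are reported as X.
        residue = CODON_TABLE.get(cds[i:i + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First six codons of the nucleotide sequence above.
print(translate("GTGGAGTACTTGCGTGCC"))  # VEYLRA
```

The stated CDS length of 3300 nt corresponds to 1100 codons, that is, 1099 residues plus a terminal stop codon (the nucleotide sequence ends in `TGA`, and the protein ends in `AGP`).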