New model in OGS2.0 | DPOGS210851  |
---|---|
Genomic Position | scaffold908:+ 110640-114876 |
See gene structure | |
CDS Length | 1980 |
Paired RNAseq reads   | 839 |
Single RNAseq reads   | 1949 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA006979 (5e-139) |
Best Drosophila hit   | ND |
Best Human hit | histone H2A deubiquitinase MYSM1 (2e-15) |
Best NR hit (blastp)   | PREDICTED: similar to myb-like, SWIRM and MPN domains 1 [Tribolium castaneum] (2e-36) |
Best NR hit (blastx)   | PREDICTED: similar to myb-like, SWIRM and MPN domains 1 [Tribolium castaneum] (2e-23) |
GeneOntology terms    | GO:0003677 DNA binding GO:0003713 transcription coactivator activity GO:0004221 ubiquitin thiolesterase activity GO:0004843 ubiquitin-specific protease activity GO:0005634 nucleus GO:0008237 metallopeptidase activity GO:0016578 histone deubiquitination GO:0016585 chromatin remodeling complex GO:0042393 histone binding GO:0045449 regulation of transcription GO:0045893 positive regulation of transcription, DNA-dependent |
InterPro families   | IPR000555 Mov34/MPN/PAD-1 |
Orthology group | MCL30997 |
Nucleotide sequence:
ATGGCCGACGACGACGAGATTGACATTCTTGGTGATTTTTCATTTAATTCTTGTTTTGCC
CAAAATAATCAGGGAATTCCTTCTTGCTCCAACAGAGAAGACACCGTGCACCCTCAATGG
CTTCTGGATTCCCCTCCAACAAATTGGTATGATACACAGAATAAAGATAAAAGTTATAGG
CCCAAAGATGGACCATCAAGGAAGCTATCAGGAACAACAGCAAATTATCAGCATACAACG
GTCCATACATCCTGGACTCAGAAGGAAAGAGATTTGCTGGCACAAGAAATGGCCAGGTAT
GGGAGAAATGTGAACAAAATATCCAAAGCACTAAAAACAAAAAGTGAATTAGAAATTCAA
GCTCTCATAGAAGCAGAGCACGGCATTCTATTGGAGACGGAAAATATTAAAACACCCGCA
GTGAAACCTGACAACATACCCACAGTAGCACAGGAGGAAAAAATATCTAACTGTGGTAAT
GTGGATCTTGTGGTTAATAACAACACAGAAGAATGTGAAACAGCACCTGTGCCAAGAAAA
TGTTCAAAAATGAAGAAATCACACAAAAATATCAAAGAAATTGATAGCACCATTGAAACA
AATCCACTGATTGGCTCCGAAATATTCTATGACGATGATTTAATTATAGGATCGACAGAG
TCCATCGGTTCCGAGTTAGATGTGACAGATGTTGTAGCAACGAGTCTTACCAAGCAGCAA
AGAGACAAAACGAAAGTGTTAAGGAAGAATGGAAACCACAGAAGAAAAGTGTCCAGAAAC
TTCGATAGAAATAGGAGCAAGGATTTTCTTAAATCACCACATAGAAGAAAAAAAGATTCC
AGCTTGTCAGATGATAGTGTGAAAAGTCCAAAGATGCAGATTGTTCTGGGCTCTGGGCTG
GCTCTGCCTGTGTCAGAAGGTGAAGAAGTGATAAAAATAGAGAAGAAGCCCGACTTAGAT
GGTGAAAGTGATATAGAAGTGGATGTAGGCAGTGATTCTGATAAAGATATATATATACCA
AAAAATAAAACAGTCAAAGAGGTTGTTCACGAAGAGGTTCCAGTTGCTGTGCCATTGAGA
AAATTTGAACCCATGCCCAGAAGAAATCGGAAAATTAACTTAGACGGCGGTGGTGGTTAC
ACGATAATGCACACGGAAGCTGGTGACATGTATGAGATAGGTCAAGAACCTCGGAAAGAG
AGACAGCAAAGAAAACAAGCGGTCCAACTTATACCGTTGCATGTTTATAACTCTGAGAAA
CCGGCGCCGTGTGCCGTGCACATGTTCGTGTCGGTGTTAGTGAGTATGGACGTGCAGGCT
CACTGCAGCAGGGCGGAGGTGATGGGTCTGACGGGAGGCAGCTGGGAGCCCGGACCACGA
ACACTCACGCTGCAGCTGTACAGGACTGTGCGGGCCGCCGCCGCACACACGCACTGCGAC
ATGGACCCGGTGTCCCAGTCGTCGTCGGCGGAGTCCCTCCGGTGTCGTGGTGTGAGTGTG
TGTGGTTGGCATCACTCCCACCCCCAGTTCCCGCCCTCTCCGTCCGTGAGAGACCTCGTC
AGTCAACGCTCGCTCCAGAGCCTCGCCTGGGGTCTGCCGTGTGTGGCGCTGGTCACCTCC
CAGCACTGGCCTCCCGGACGCAGAGCCTCGCAACTCAGATGTTTCCGTGTAGAAGAGGAC
GACAAGCTTGACACTCCGGAGGTCCCCGCGGGCTACCAGCTCAATGTGAAGTTGGAGCGT
GACCTGGACCGGAGCACCCTCGACCAGTACTTGGAGGAGCTCCGTGTCCTGGCACACGAC
ACGCTCGCACACGTGGAGCTGCCCGTGGACGTGACACGGGACGTGTGTCCTCAGGCCGGC
ATCACTTACATGGAGAAGTGTCTTTCAAGTGTGAGTCACCACATGCGGTCGGCCGGCTAC
GAAGACGAGGATCCCATAGTCGCTCGGCTGTTACAAGGAATTAGAGATATATTCAGATAG
Protein sequence:
MADDDEIDILGDFSFNSCFAQNNQGIPSCSNREDTVHPQWLLDSPPTNWYDTQNKDKSYR
PKDGPSRKLSGTTANYQHTTVHTSWTQKERDLLAQEMARYGRNVNKISKALKTKSELEIQ
ALIEAEHGILLETENIKTPAVKPDNIPTVAQEEKISNCGNVDLVVNNNTEECETAPVPRK
CSKMKKSHKNIKEIDSTIETNPLIGSEIFYDDDLIIGSTESIGSELDVTDVVATSLTKQQ
RDKTKVLRKNGNHRRKVSRNFDRNRSKDFLKSPHRRKKDSSLSDDSVKSPKMQIVLGSGL
ALPVSEGEEVIKIEKKPDLDGESDIEVDVGSDSDKDIYIPKNKTVKEVVHEEVPVAVPLR
KFEPMPRRNRKINLDGGGGYTIMHTEAGDMYEIGQEPRKERQQRKQAVQLIPLHVYNSEK
PAPCAVHMFVSVLVSMDVQAHCSRAEVMGLTGGSWEPGPRTLTLQLYRTVRAAAAHTHCD
MDPVSQSSSAESLRCRGVSVCGWHHSHPQFPPSPSVRDLVSQRSLQSLAWGLPCVALVTS
QHWPPGRRASQLRCFRVEEDDKLDTPEVPAGYQLNVKLERDLDRSTLDQYLEELRVLAHD
TLAHVELPVDVTRDVCPQAGITYMEKCLSSVSHHMRSAGYEDEDPIVARLLQGIRDIFR