New model in OGS2.0 | DPOGS205923  |
---|---|
Genomic Position | scaffold3744:- 26198-30148 |
See gene structure | |
CDS Length | 1431 |
Paired RNAseq reads   | 777 |
Single RNAseq reads   | 1947 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA005894 (0.0) |
Best Drosophila hit   | CG5585 (5e-175) |
Best Human hit | retinoblastoma-binding protein 5 isoform 1 (1e-158) |
Best NR hit (blastp)   | AGAP010575-PA [Anopheles gambiae str. PEST] (0.0) |
Best NR hit (blastx)   | GE22415 [Drosophila yakuba] (5e-180) |
GeneOntology terms    | GO:0045449 regulation of transcription GO:0006350 transcription GO:0016568 chromatin modification GO:0005515 protein binding GO:0006974 response to DNA damage stimulus GO:0035097 histone methyltransferase complex GO:0005634 nucleus |
InterPro families    | IPR019775 WD40 repeat, conserved site IPR011046 WD40 repeat-like-containing domain IPR019782 WD40 repeat 2 IPR017986 WD40-repeat-containing domain IPR015943 WD40/YVTN repeat-like-containing domain IPR019781 WD40 repeat, subgroup IPR001680 WD40 repeat |
Orthology group | MCL14841 |
Nucleotide sequence:
ATGAACTTAGAATTACTTGAATCTTTTGGTCAAAATTATCCAGAAGAGTTTGATGGCACA
TTAGATTCCATTTCCCTTGCTGCGACGTGCGCCTTCAACCGTCGCGGCACTCTTCTGGCA
GTGGGTTGTAACGATGGAAGAATATTCATCTGGGACTTTCTTACTAGAGGCATTGCTAAA
TCTATATCGGCACATGTTTATCCAGTGTGTAGCTTGAGTTGGTCAAGAAACTGTAAAAAG
TTACTCTCAGCATCCACAGACCACAATGTATGTATTTGGGATGTTCTCTCTGGTGAATGC
GAACAACGGTACAGATTCCCAACACCGATATTACGTGTACAGTTTGATCCTAGAAATGAT
AAGAGATTCCTTGTATGTCCAATGAGACACGCCGCTTTGTTGGTTGATACCGACGGTGAA
CATAAAATCCTGCCCATTGATGAAGATGGTGATACGAATGTGATAGCTTCATTTGACAGG
CGAGGTGACTATGTGTATACAGGGAATGCGAAGGGTAAAATATTAATACTCGACTCACAA
CAGTTAACAGTGAAGGCGAGCTTCAAAATTACCGTTGGTACTTCAAGCACTACCGGCATA
AAGAGTATTGAGTTTGCCCGACGAGGAGACTGTTTCCTTGTAAATACATCCGACAGAGTG
ATCAGAGTTTACGACGCCAACACGGTTGTGAAGTGTGGTGTGAACGGGGAACCGGAACCG
ATACAAAAGCTACAAGATCTCGTCAATAAAACTACATGGAAGAAGTGCTGTTTTTCTGGA
GACGGTGAATACATCTGCGCGGGATCAGCTCGCCAGCACGCGCTTTACATATGGGAGAAG
TCCATAGGGAACCTCGTCAAGATACTCCACGGAACTAAAGGAGAACTGCTCTTGGATGTT
GTGTGGCATCCCGTCAGACCCATCATAGCTAGCATAAGCGCTGGCGTGGTGTCAATATGG
GCTCAGAATCAAGTGGAAAATTGGTCTGCGTTCGCTCCGGACTTCAAGGAGTTGGATGAG
AATGTGGAGTATGAGGAGAGGGAGAGCGAGTTCGATGTGGAAGACGAGGACCGCTCCGTG
GACCAGGGCGCGGACAGCAGAGACGACGAGGAAGTAGAGGTGGATGTGACCACGTGTTCC
GCTGTGGCCGCCTTCTGCAGCTCAGACGAAGACGCCGAGGACGACACGCTGCTGGCCTTC
CTGCCGATAGCACCGGAAATAGAGGATCCAGAGGACGGTTGGGCGGCGACTCAGGAGACG
GTGACCCCGAGCGAGACCCCCGAGAAACTGGAACCGGCGGCGAAGAGACCCAAGAGTAAG
ACCTACGACATATCGCTGAAGATAGCGCCCCAGGAACAACCGCTCAGCTTCGGCGGCAAG
AACAAGCAGGCGGCGGGGAACAAGAAGCTCGCGGGCAGACCAAGGAAATAA
Protein sequence:
MNLELLESFGQNYPEEFDGTLDSISLAATCAFNRRGTLLAVGCNDGRIFIWDFLTRGIAK
SISAHVYPVCSLSWSRNCKKLLSASTDHNVCIWDVLSGECEQRYRFPTPILRVQFDPRND
KRFLVCPMRHAALLVDTDGEHKILPIDEDGDTNVIASFDRRGDYVYTGNAKGKILILDSQ
QLTVKASFKITVGTSSTTGIKSIEFARRGDCFLVNTSDRVIRVYDANTVVKCGVNGEPEP
IQKLQDLVNKTTWKKCCFSGDGEYICAGSARQHALYIWEKSIGNLVKILHGTKGELLLDV
VWHPVRPIIASISAGVVSIWAQNQVENWSAFAPDFKELDENVEYEERESEFDVEDEDRSV
DQGADSRDDEEVEVDVTTCSAVAAFCSSDEDAEDDTLLAFLPIAPEIEDPEDGWAATQET
VTPSETPEKLEPAAKRPKSKTYDISLKIAPQEQPLSFGGKNKQAAGNKKLAGRPRK