DPGLEAN16511 in OGS1.0

New model in OGS2.0  DPOGS212870
Genomic Position  scaffold344:- 55329-65578
CDS Length  4299
Paired RNAseq reads  2744
Single RNAseq reads  6380
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA000814 (2e-85)
Best Drosophila hit  ATP-dependent chromatin assembly factor large subunit (4e-105)
Best Human hit  bromodomain adjacent to zinc finger domain protein 1A isoform b (4e-33)
Best NR hit (blastp)  PREDICTED: similar to zinc finger protein [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to zinc finger protein [Tribolium castaneum] (0.0)
GeneOntology terms
GO:0006355 regulation of transcription, DNA-dependent
GO:0006334 nucleosome assembly
GO:0016584 nucleosome positioning
GO:0016590 ACF complex
GO:0006333 chromatin assembly or disassembly
GO:0005678 chromatin assembly complex
GO:0004402 histone acetyltransferase activity
GO:0016589 NURF complex
GO:0008623 chromatin accessibility complex
GO:0042766 nucleosome mobilization
GO:0043462 regulation of ATPase activity
GO:0003677 DNA binding
GO:0031497 chromatin assembly
GO:0005515 protein binding
GO:0008270 zinc ion binding
GO:0048666 neuron development
GO:0008544 epidermis development
GO:0007517 muscle organ development
GO:0048813 dendrite morphogenesis
GO:0007399 nervous system development
GO:0032582 negative regulation of gene-specific transcription
InterPro families
IPR013136 WSTF/Acf1/Cbp146
IPR001487 Bromodomain
IPR019787 Zinc finger, PHD-finger
IPR004022 DDT domain
IPR018501 DDT domain superfamily
IPR011011 Zinc finger, FYVE/PHD-type
IPR019786 Zinc finger, PHD-type, conserved site
IPR018500 DDT domain, subgroup
IPR001965 Zinc finger, PHD-type
IPR013083 Zinc finger, RING/FYVE/PHD-type
Orthology group  MCL12590

Nucleotide sequence:

ATGCCTCTATTGAAAAGAAAAGCCTTTGAGAAATCGAATGTGTCGGAATATCTAAGGGAC
GACGATGAGGTGTTTCACTGCGAGATCACCGACGAAATATTCAAGGATTATGAAGAATAC
TGCGAAAGGATCATTCTGGTCAATTCAATGGTGTGGTCATGTGAGATGACGGGAAAGAAT
AACCTCACGTATGCAGAGGCTTTGGAAAGTGAAAAAGCTGCCCGAAGATCACTAAAGGAT
TTCCCAATGGAACTCCGAATACCAATATTATATTTGGCTGCAAGAACCAAAAGGAGTTCA
TTTGCAGACATGTCAGAGGATGTATTTAATTTTGTAAGAGTGAGGTTCTTTGTGGGTGAA
ACGGTTGAGGCCTGCCTAGAGGGTGACATCTGGCGAGAGGCTCATATACTGTCAGTTACA
GCCCCCAAACAACATCCGGACAGCCAGGCAATGCTCCCAGCATCCTCTTACTGTTATGAG
GTGGAGCAGTTCAGTGAGGATCCGGAGACTGCGGGGCAGATCGGCACCGCGCCCTACGAC
AGAGTTAGGCGACGTAAGGGCGTCTACACACGGGACAAAAACAGACTGTTCTTGAAACAG
TTTGTCACACCCGGAGCCGTCATTGGCATTAAGAAATCAGCAATAGAAAAATACAACATA
GGAAAAGTGAACTTTGACCAGATATTCACAGGGAATCCCCCAGAATTCCCGTCATCCAAG
AAGCTGTCAAAGAACAGTATGTCCCCACCATCCACTTCAAGGCTACCACACACTCCTGGC
TCAGGGTCGAGATCAGCCAAAAAACTCAGTCCTGATAAGAAGGGACGGCAGGAGTCTATG
GATAAATTTGTTAAGAAAACTGATGGTAAATCAAAAACCCCCATTGACCCTGAAGCCAGA
AAAGTAGCCCAAGAAATGGCTGACAAAATGAGGAAAGCGGAGGACTTATTGAGGCAACAT
AAAGAAGAAGAGAAAGCGAGAAAGAAAGAGAAAAATGCCAGACTCATGGCATTTATGAAG
GAATGGCATAAGGTTAAAGAAGACCAGGAGCTAGAAGATCACAAGATTATACCAAAGGGC
ACTCCGGTAGAGATAGAAGGAATACCACATAAAAATTTTGGTGACTTCCTCTCAGTGTTA
GAATTTGTTCACAATTTCTCAGAACTACTCAAACTCAAAGACGTGTTTCACACCGAGTTT
GACCTTGAAACATTTCGGAAATCATTGACCTCTAAAGAACACGCGTGGATATTCAGCGAG
TTGGTACAGATGCTACTATCAGCAATATTCTCTTTGCAAGAAGATGAGGCAGAAGAATAT
AATGAGGGCAAAGGAATTCAGGAATCAGTAGAGGAGCCTCCGGCCATCTCGACAGGTATC
GCGCGCGCCATAGAGCAAGCGACGCGAGCCGGAAAGTGGGCGCACACCTACCTCGGGACA
CCACTCAGCAAGCTGCCCCTGGACCCCACCACTGTGTCTGAAGTGCTGAGGTTGCACCTG
CTGTCGTCGGGTGGCGTGGCTGGTTCCCGCTGCCTGGCGTGGCGGCTGCACCAGCGCGGT
GGGTACTCCAGCGCCGATGACCCAGCACTGCGCCTGCGCACGTCACGTCCACATACTTTA
AGAGCCTTACGGACGCAGCACGTCGCCGATCTACCCCTCGATGACAGACTCGCGATCCTT
CAGTGCCTCATGAATCAAATCCTGAGTTACTCGACGGTGCGCGAGCAGGTCGAGGAGAAG
ATAGAGGAATATAAGAACTTAAAACAAGCCTTGCGAATATTACAAATAAACGAGCGCAAA
CGCGAACCGCAACTGTCGACCGCGCGGTCGGAGCTGAAGAGAGAGGCGGCTCAGAAGAAG
GAGGAACTCAAGTTGACGGGGGATCGCGCGCGGGCTGCGGACGAACAGCTGAAGGCGGCC
ATAGACAAGCTGAACAAGGAGAGCGACGCCAAGAGACTGGAGTTCGAGAAGAAGTTGAAG
GAGCTGCAGGCTCAGCTCTTTGACTACACCACTTACCTTGGGTCGGATCGCGCGTTCCGT
CGTTATTGGATCAGTAGGCGGGTGGCGGGACTATTCGTGGAAGCGGGGCCGGAGCCCCGA
GGTCCTTGCCGCAACAAGCCCCTGCCCCCGCCGCCCGCTCCCCGCGACGACATCTTGTCT
TACGTCACTGAATTATTCCACTCAGAGAGGGAGAGGGAGAGACAGAAAGAACAAGCTGGC
AGCGACAAAGAGAACGAGTCCGGCGCCAACTCCCGCGGTGCCTCGCCAAAGAAACCTCTA
ACAAACCTCAACGGTTTGACCCAAGACAAGAAACCCATGGAGAGCACGGTGCAGCACTGC
CGGGACATGCTGCTGTGCACTGCCGATGTCAACACGTGCTATGTACACGGGAAGGGTGAT
TACCGACCGCAATGGTGGGTGTACCACACCCGCGACCAGCTGCAGGCACTCATCGCGTCA
CTCAACAAGCGAGGCTTGAGAGAGAGTGAACTGAAGCAGGCCTTGGAGGTGGACAAGGAA
CACATAGCTGACTACATCACCAAGTGCCCTCTGAACCTCCTGACACCCGGCCCGGCTCCG
CCGACGCCGAACGTGCCGTCGACGCGCCAGCGCCGCTTCCAGCCGTCTCTCACGGTACTA
CCGGACTGTTCGCTGGCGGATGCTCTGGAACTGACACTCAGAGACCACATCCTGGAACTC
GAGGAGAAGATCTTCCACGGATGTCTGGGAGCATTGAAAGTTAAGGAGCGGTCGGCGTGG
CGCGGGACTCTCATGGTGCGCGGCTACGACAAGCAGGCGCGGTCATTGACCTGGGGCCCG
GACGGCCGGTTCAGGGACGACTGCCATTTACCAGACGGATTGCTGAAATTACCGCCAGAT
TTAGATGAGACCGAGTTGGAGGGCATCGTTGAGAACAGATACCGCGACCCGGGGCACTGT
CTGGAGCCGCCCAGGGTGAATGGCATCAAGATAGAGAACGGAGGGGGAGAGGCGGCCGGG
GCCGAGGACGCTGGGGTCGTCCGCTCACTAGCCAGCGCTCTGCTGCAGGTGGCGCAGGCC
ATACACCACAAGTACCTGAAGAGGCCGCTCGGCCTCGACGAGAAGGAGCGCAAGGATCGC
GAAGCCAAGAACAAATCTCTGGAGCTGGAGGCGCTGCAGCGGTGGGAGGTGTCGCTGATG
GAGTGCCGCAGCTTCGCCAGCGTGGCGCTGCACCTGCTGACGCTGGACAGCAGCGTGTGC
TGGTCGGCCAGCGTGCTGCACGCCAGCTGCCGCCTGTGCCGCCGCCGCACCGACCCCGAC
AACATGCTGCTCTGCGACAGCTGCAACAAAGGACACCATCTCTACTGCCTCAAGCCAAAG
CTCACGAAGGTGCCGGAGGGGGACTGGTTCTGTGATCAATGCAAACCGACAGAGAAGACG
CCCAAGAAGCGAAGAAAACTATACACCGACCCCGACGACACGCTCGACGACAGCTCGGAG
TCGTGTTCGAGCGCGCCGGTGGAGCTGTGCGCGTTGTGCGGCAGCGGCGGGCGGCTGGCG
GCCTCGTGTCGCTCGTGCGGGAGACGCTTCCACGCCGAGTGCGCGCCTTCCGGGGGGCGG
AGGGCCGTGTGCGGGGACTGCGCTAAACCAAACAGAGATTCCGAGGACAGCGAATATAAC
ACGGCGCTGGTCAAACTGAAGACACGGCAGCAGAGAACCGAGGAACCCGCCAGGAGAGGC
AGGAAATCTAAAGAAGTCGTCAACGGAAGTACAAACAGGAGATCGAAGTCATTCATGAAT
GGCGTCAATGGTGATGTAGTGTCGAGTCGTAAGCGCGGCCGCCAGGAGGAGGAGTTACTG
CACGTGGAGTCGCTCACACAGCTGTTGAAGGAGTGCGGCAAACATCGGGATTGCTGGCCC
TTTGATGAGCCGGTCTCGACGGAGGATGTGCCGGACTATCTCAGCGTGATCGAGCAGCCG
ATGGACTTCTACACAATCCGCGGCAAGCTGGAGAAAGGTTCCTACACCACCGACCAACAG
ATGCTGGACGACGTCGCGCTCATCTTCAAAAACTGCTACACCTACAACCAAGACACACAC
CCTGTGGCCAAAGCGGGAGCGCGACTCGAAAAGTATATCATAAAGCGCTGTTCGGAACTC
AATCTACCCGCGTTGCCCGCCACCTCGCTCGAAGACAACGAGGCTGAAGCGACGCAAGAG
AATAACGAGCGAGAAGAGGTCGCGGAAGCTGCCGGGGAAGAGCTCGACTCGGACGACGAG
GTTCTCGCGCCTCGAGCCAAGCGACCCAAGATACATTGA

Protein sequence:

MPLLKRKAFEKSNVSEYLRDDDEVFHCEITDEIFKDYEEYCERIILVNSMVWSCEMTGKN
NLTYAEALESEKAARRSLKDFPMELRIPILYLAARTKRSSFADMSEDVFNFVRVRFFVGE
TVEACLEGDIWREAHILSVTAPKQHPDSQAMLPASSYCYEVEQFSEDPETAGQIGTAPYD
RVRRRKGVYTRDKNRLFLKQFVTPGAVIGIKKSAIEKYNIGKVNFDQIFTGNPPEFPSSK
KLSKNSMSPPSTSRLPHTPGSGSRSAKKLSPDKKGRQESMDKFVKKTDGKSKTPIDPEAR
KVAQEMADKMRKAEDLLRQHKEEEKARKKEKNARLMAFMKEWHKVKEDQELEDHKIIPKG
TPVEIEGIPHKNFGDFLSVLEFVHNFSELLKLKDVFHTEFDLETFRKSLTSKEHAWIFSE
LVQMLLSAIFSLQEDEAEEYNEGKGIQESVEEPPAISTGIARAIEQATRAGKWAHTYLGT
PLSKLPLDPTTVSEVLRLHLLSSGGVAGSRCLAWRLHQRGGYSSADDPALRLRTSRPHTL
RALRTQHVADLPLDDRLAILQCLMNQILSYSTVREQVEEKIEEYKNLKQALRILQINERK
REPQLSTARSELKREAAQKKEELKLTGDRARAADEQLKAAIDKLNKESDAKRLEFEKKLK
ELQAQLFDYTTYLGSDRAFRRYWISRRVAGLFVEAGPEPRGPCRNKPLPPPPAPRDDILS
YVTELFHSERERERQKEQAGSDKENESGANSRGASPKKPLTNLNGLTQDKKPMESTVQHC
RDMLLCTADVNTCYVHGKGDYRPQWWVYHTRDQLQALIASLNKRGLRESELKQALEVDKE
HIADYITKCPLNLLTPGPAPPTPNVPSTRQRRFQPSLTVLPDCSLADALELTLRDHILEL
EEKIFHGCLGALKVKERSAWRGTLMVRGYDKQARSLTWGPDGRFRDDCHLPDGLLKLPPD
LDETELEGIVENRYRDPGHCLEPPRVNGIKIENGGGEAAGAEDAGVVRSLASALLQVAQA
IHHKYLKRPLGLDEKERKDREAKNKSLELEALQRWEVSLMECRSFASVALHLLTLDSSVC
WSASVLHASCRLCRRRTDPDNMLLCDSCNKGHHLYCLKPKLTKVPEGDWFCDQCKPTEKT
PKKRRKLYTDPDDTLDDSSESCSSAPVELCALCGSGGRLAASCRSCGRRFHAECAPSGGR
RAVCGDCAKPNRDSEDSEYNTALVKLKTRQQRTEEPARRGRKSKEVVNGSTNRRSKSFMN
GVNGDVVSSRKRGRQEEELLHVESLTQLLKECGKHRDCWPFDEPVSTEDVPDYLSVIEQP
MDFYTIRGKLEKGSYTTDQQMLDDVALIFKNCYTYNQDTHPVAKAGARLEKYIIKRCSEL
NLPALPATSLEDNEAEATQENNEREEVAEAAGEELDSDDEVLAPRAKRPKIH
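
Consistency check (illustrative sketch):

The nucleotide and protein sequences above can be checked against each other by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record itself does not state; the helper name `translate` and the variable names are hypothetical. It translates only the first and last lines of the nucleotide sequence as copied from this record, and compares the listed CDS length of 4299 with the implied protein length.

```python
from itertools import product

# Standard genetic code (assumed here; the record does not name a table).
# Codons are enumerated with bases in T, C, A, G order, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame coding sequence; '*' marks a stop codon.

    Whitespace and line breaks are ignored. A codon containing anything
    other than A, C, G or T raises KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence, copied from the record.
first_line = "ATGCCTCTATTGAAAAGAAAAGCCTTTGAGAAATCGAATGTGTCGGAATATCTAAGGGAC"
last_line = "GTTCTCGCGCCTCGAGCCAAGCGACCCAAGATACATTGA"

print(translate(first_line))  # MPLLKRKAFEKSNVSEYLRD
print(translate(last_line))   # VLAPRAKRPKIH*

# CDS length 4299 = 1433 codons; dropping the stop codon leaves 1432 residues.
cds_length = 4299
print(cds_length // 3 - 1)    # 1432
```

The two translated fragments match the start and end of the protein sequence listed above, and 1432 residues agrees with the length of that protein sequence (23 lines of 60 residues plus a final line of 52). Pasting the full nucleotide sequence into `translate` would extend the same check to the whole CDS.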