DPGLEAN19454 in OGS1.0

New model in OGS2.0  DPOGS211014
Genomic Position  scaffold180:- 49605-53219
CDS Length  1866
Paired RNAseq reads  373
Single RNAseq reads  1109
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA006496 (0.0)
Best Drosophila hit  will decrease acetylation (2e-47)
Best Human hit  TAF5-like RNA polymerase II p300/CBP-associated factor-associated factor 65 kDa subunit 5L isoform a (2e-70)
Best NR hit (blastp)  WD-repeat protein, putative [Pediculus humanus corporis] (8e-118)
Best NR hit (blastx)  WD-repeat protein, putative [Pediculus humanus corporis] (1e-107)
Gene Ontology terms
GO:0005634 nucleus
GO:0030528 transcription regulator activity
GO:0045449 regulation of transcription
InterPro families
IPR001680 WD40 repeat
IPR011046 WD40 repeat-like-containing domain
IPR019775 WD40 repeat, conserved site
IPR015943 WD40/YVTN repeat-like-containing domain
IPR019781 WD40 repeat, subgroup
IPR007582 TFIID subunit, WD40-associated region
IPR020472 G-protein beta WD-40 repeat
IPR019782 WD40 repeat 2
IPR017986 WD40-repeat-containing domain
Orthology group  MCL14288

Nucleotide sequence:

ATGAAAAGGACTAGAAATGATGCAGTTAAAGCAGCCGTTACATCCTACTTAGAGCGAAGG
AATTATCCCGACATTGATTTCTTTAACCACAATAATTGTACTAGTCGCAGTGCTGAAGAA
ATGGCAGTGGCTACAATTGTACAATGTGAAGCTAGCCGTGCCAATTCGATTATGTTTTCT
CTTATTAATAATGACCCTGGAAATTATGACGCTCAGTACACAAAACTGGTTACTTTTATA
AAGGAAATAAATATTGAAAAGGTTAAAAATGAGTTACTTGGTTTACTTACACCATTGTTA
TGCCACTTATATTTGGAAATGCTGCGCGGTGGCCATGGTGGTGCAGCTCAAATGTTTCTA
AAGAGACACTCTGCTACATTACCACAAAAGGAGTTATCATACCACCAACCAATAGACGGC
AATCTTCCATCAGCACTCTACCGACCAAACAGCCTTGAGCAATTGTTCAATTCTCTACAA
AACGGTACTATAGATAATGAGACCCCAGAGAAAGACTATATGAATCAACTGTTAGATGAC
ATTGGAACTATATATACCTTGCAGGAAGTTGAATCTCGACCTTCCATAGCTGCTTTTCGG
TCCTGCAAATATGACATATATCTATCACAAGACTCTCTAAACCTTCTGAAGACTTTTCTG
GCTAGACATGGACATGTTTTAATCATACAAGTTCTACAAACATGGTTCCATATTGATATT
AATAATGACAATAAGAAAACTTCTGAAGACGATGACGAAGAACACAATGAAGATGAAAAT
CATATGAATGTTGCGAATTGTGATGAAAAACCAACTGATAAAAATGATGTGTTTTCCAAA
TGTAATGGTCACACAGAACATCAATCTGTAGACAAAGAATTGAGAGATCTGCAAGATGCT
ATTAAAGGTGTTAGAGAAACAATTGCACCACTCAAACTGTACAAAATTGCAACTCCTGAT
AGCCATCTGATATGCGGTAAAACAGACCAATACTGTAATGTGTTATGTGGAGGATTTGAA
AACTCAGAAATAAGACTTTGGGATCTTGGACAGAATAATGTTAAGAAAAAGATTAACAGA
AACATATCGGAAGTGGAAATTGCTTGTTGTATACCAGCCGAACCCGAAACTTCATTAGAC
AATACCTTTCAAATAGGAACAGGTTTACCACTTAGGGGTCATTCTGGTCCAATTCAAGCT
ATCAGTATTCTAGCTCAAGAGCAACTAGTGTTGTCCGCATCCCACGATAATACCATGAGG
GCATGGAAATTGTCAGATTATTCATGTGCTTCTATATACCGAGGTCACAATTATCCGATA
TGGTGCATGGACGTATCCAAAAATGGTTTATTTATTGTAACGGGATCTCATGATAGAACT
GCAAAACTATGGTCATTGGATCGCACATTTCCAGTTAGGATTTTTGTGGGACATTTATCT
GATGTTACCTGCGTAAAATTTCATCCCAACGAGGCGTACCTGGCGTCAGGAGGCGCGGAT
CGCACGGTTCGAATGTGGAGTGTATGTGACGCTAGACTTGTTCGTGTATTGTGTGGACAT
CGCGCTCCACCACGAGCACTGGCCTTCTCACCCTCAGGGAAACATTTGGCTAGTGCAGGT
GATGATAAAAAAATTAAAGTGTGGGATCTAGCCGCTTGCAACTGTATTCATGAATACAGA
GGACATCATAGTAAAGTGACGTCATTAGATTGGTCAGCGGTCGGAAAGGCTAGCTTAACT
AACAGAATATCGTCAGATCCTAATGACACAAATGCAGATAATTCAATATTATGCTCCGCT
GGTATGGATGGCATAGTAAAGGTTTTTTATGACACAATGAGTTTTTTGTTCACTCATGAT
TCATAG

Protein sequence:

MKRTRNDAVKAAVTSYLERRNYPDIDFFNHNNCTSRSAEEMAVATIVQCEASRANSIMFS
LINNDPGNYDAQYTKLVTFIKEINIEKVKNELLGLLTPLLCHLYLEMLRGGHGGAAQMFL
KRHSATLPQKELSYHQPIDGNLPSALYRPNSLEQLFNSLQNGTIDNETPEKDYMNQLLDD
IGTIYTLQEVESRPSIAAFRSCKYDIYLSQDSLNLLKTFLARHGHVLIIQVLQTWFHIDI
NNDNKKTSEDDDEEHNEDENHMNVANCDEKPTDKNDVFSKCNGHTEHQSVDKELRDLQDA
IKGVRETIAPLKLYKIATPDSHLICGKTDQYCNVLCGGFENSEIRLWDLGQNNVKKKINR
NISEVEIACCIPAEPETSLDNTFQIGTGLPLRGHSGPIQAISILAQEQLVLSASHDNTMR
AWKLSDYSCASIYRGHNYPIWCMDVSKNGLFIVTGSHDRTAKLWSLDRTFPVRIFVGHLS
DVTCVKFHPNEAYLASGGADRTVRMWSVCDARLVRVLCGHRAPPRALAFSPSGKHLASAG
DDKKIKVWDLAACNCIHEYRGHHSKVTSLDWSAVGKASLTNRISSDPNDTNADNSILCSA
GMDGIVKVFYDTMSFLFTHDS
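
Consistency check (illustrative):

The CDS length of 1866 corresponds to 622 codons, that is, 621 amino acids plus one stop codon, which matches the length of the protein sequence above. The sketch below shows one way to check this kind of agreement. It assumes the standard genetic code (NCBI translation table 1); the function names `translate` and `expected_protein_length` are hypothetical and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order. This table choice is an assumption of the sketch.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def expected_protein_length(cds_length: int) -> int:
    """Protein length implied by a CDS that ends with one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    # First 21 nucleotides of the CDS in this record.
    print(translate("ATGAAAAGGACTAGAAATGAT"))  # MKRTRND
    # Last 6 nucleotides: one serine codon followed by the TAG stop.
    print(translate("TCATAG"))  # S
    print(expected_protein_length(1866))  # 621
```

Passing the full nucleotide sequence to `translate` and comparing the result with the protein sequence is a direct way to confirm that the two listings belong to the same gene model.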