DPGLEAN03032 in OGS1.0

New model in OGS2.0  DPOGS206547
Genomic Position  scaffold74:+ 85167-89558
CDS Length  1956
Paired RNAseq reads  952
Single RNAseq reads  2447
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA014036 (0.0)
Best Drosophila hit  TBP-associated factor 5 (1e-157)
Best Human hit  transcription initiation factor TFIID subunit 5 (1e-159)
Best NR hit (blastp)  wd-repeat protein [Aedes aegypti] (0.0)
Best NR hit (blastx)  transcription initiation factor TFIID subunit 5 [Culex quinquefasciatus] (0.0)
Gene Ontology terms
GO:0030528 transcription regulator activity
GO:0005634 nucleus
InterPro families
IPR019775 WD40 repeat, conserved site
IPR020472 G-protein beta WD-40 repeat
IPR019781 WD40 repeat, subgroup
IPR007582 TFIID subunit, WD40-associated region
IPR015943 WD40/YVTN repeat-like-containing domain
IPR019782 WD40 repeat 2
IPR017986 WD40-repeat-containing domain
IPR001680 WD40 repeat
IPR011046 WD40 repeat-like-containing domain
Orthology group  MCL14952

Nucleotide sequence:

ATGGGAGATAAGTCCACCCCACTTTTAGCTGTACTGCAACTTCTCCGAAAATATGATTTG
AAAGGCACAGAAGAACTTTTAAGGAAAGAAGCAAACTTAGGAGATGTTGAATACGACAAC
TTAGATTTGCCAGAAGTAGAACTTGCCAGTATACTTACTGCTCATCACACAGAAAGTGAT
CCATATAGCTATGAGTTTGCTTATGATAGTCTTAAAAAGTTCGTTGAAAATTCTCTAGAC
ATTTACAAGTATGAATTATCAACACTTCTTTACCCAGTGTTTGTGCATATGTATTTGGCA
CTAATATTGTATGACCACAATGAACATGCTTATAAGTTCTTTGAAAAATTTGGCCTAGAA
CAGGAAGATTATTATCAAGAGGATTTAACTCGTCTCTCAATTGTGAAACATAAAGATCAA
ATTAAAGGAAATGAAATTGCAGAAATATACAGCTCAAACAAATTTCAAGTTCAAGTATCT
CGAGATGCATCTACACAGCTTAAGAGATATTTACATGAACAGAAGAGTTCACCGGTGATT
ATAAATATCTTAAATAATCACATACAAATTGATATACATGACGGACCAGGTCGAACACAA
GCACAAGTGAGAGCTACTATTGGGGGGCTACTAGGTGAAGCTTCTAGAAACGAAAATCGC
ACAAAAGTGTATTATGGATTATTAAAAGAACCTGACATACAAGTCCTTCCACCACCCATT
GAAGATGAAGAGGAAGCAGAGGAGACCCCAGATAAACCCAAAAAAAAGAAGGCCAAAAAG
GACAATATTTTTCTCAAAAAACCTAAATCTGATCCCAATGCACCACCAAATGACAGAATT
CCCTTGCCAGAGTTAAAAGAAACAGACAAGCTGGAAAAGGGTAAAGCCATAAGAGAAGCT
GCAAAACGTGTTCAACTTGGACCAGAGAGTCTACCATCTATTTGCTTTTACACTCTCTTA
AACAGCGGCCATACAGCCATATGTGCTGATATCTGTGATGACTCAACACTGCTTGCCGTT
GGCTTCAACAACTCTTATATTAAGGTTTGGACTTTGACTACAATAAGATTAAGAGGAATG
AAATCAGCTGAAAAGTTGCAAGACATCGACAGAGAAGCTGGTGACGTCTTAGTGAGGATG
ATGGAAGAGAAGGACAGAGATACATGCCGTACACTATACGGCCACTCGGGATCAGTATTC
AAAGTGGCATTCGATCCTTTCAAAACTTTGTTACTATCATGCTCTGAAGATTCCACAGTC
CGGCTATGGTCCCTGCAGTGTTGGTCGTGCCTAGTGGCGTACCGTGGCCACGCGTGGTCG
GTTTGGGACGTACGTTGGTCGCCTCATGGCCACTACTTCGCCAGCGCCGGGCACGACCGG
ACCGCACGCCTCTGGGCCACCGACCACCATCAACCGCTCCGAATATTCGCTGGACATCTT
TCTGACGTCGATTGTGTTCAATTCCATCCAAACTCGAATTACATAGCAACGGGATCCAGC
GACCGCACCGTAAGACTATGGGACTGTTTGACGGGAACGCAAGTACGGATCATGACCGGT
CACAAGACAACTCCATATACCGTTGCGTTTTCTGTATGTGGTCGTTGGATAGCTTCAGGC
GGCGCGGGGGGAGAGATAGTTGTATGGGATATTTCTACCGGTCTACCAATGAGCACTCTG
CCTCCAATGCATGTAGCGCCTGTTCACGCCTTAGCCTTCAGTCGAGACGGCACTATCTTA
TCTTCAGGTTCATTGGACTCCACAATCAAACTGTGGGATTTCACATTGATTACGGATGAA
AGTCTGACGGAAGAGACAGGTTCAAGTACTGTCACACAAAAAGAAGAGAAGGTTCTCTTG
CGTTCGTTTGCGACGAAAAATTCGCCAATCAAGCATTTGCATTTCACCCGTCGCAACCTT
TTGTTGGCTGTAGGGTCATATGAAGGAAGTTCCTAA

Protein sequence:

MGDKSTPLLAVLQLLRKYDLKGTEELLRKEANLGDVEYDNLDLPEVELASILTAHHTESD
PYSYEFAYDSLKKFVENSLDIYKYELSTLLYPVFVHMYLALILYDHNEHAYKFFEKFGLE
QEDYYQEDLTRLSIVKHKDQIKGNEIAEIYSSNKFQVQVSRDASTQLKRYLHEQKSSPVI
INILNNHIQIDIHDGPGRTQAQVRATIGGLLGEASRNENRTKVYYGLLKEPDIQVLPPPI
EDEEEAEETPDKPKKKKAKKDNIFLKKPKSDPNAPPNDRIPLPELKETDKLEKGKAIREA
AKRVQLGPESLPSICFYTLLNSGHTAICADICDDSTLLAVGFNNSYIKVWTLTTIRLRGM
KSAEKLQDIDREAGDVLVRMMEEKDRDTCRTLYGHSGSVFKVAFDPFKTLLLSCSEDSTV
RLWSLQCWSCLVAYRGHAWSVWDVRWSPHGHYFASAGHDRTARLWATDHHQPLRIFAGHL
SDVDCVQFHPNSNYIATGSSDRTVRLWDCLTGTQVRIMTGHKTTPYTVAFSVCGRWIASG
GAGGEIVVWDISTGLPMSTLPPMHVAPVHALAFSRDGTILSSGSLDSTIKLWDFTLITDE
SLTEETGSSTVTQKEEKVLLRSFATKNSPIKHLHFTRRNLLLAVGSYEGSS
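
Consistency check (illustrative sketch):

The nucleotide and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the original record. It assumes the standard code (NCBI translation table 1) applies to this nuclear gene, and it uses only the first and last lines of the nucleotide sequence as excerpts. The names `translate`, `FIRST_LINE` and `LAST_LINE` are hypothetical and exist only for this example.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order. Assumed here to be the appropriate table for this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the record: the first and last lines of the CDS.
# Both start in frame because every full line holds 60 bases (20 codons).
FIRST_LINE = "ATGGGAGATAAGTCCACCCCACTTTTAGCTGTACTGCAACTTCTCCGAAAATATGATTTG"
LAST_LINE = "TTGTTGGCTGTAGGGTCATATGAAGGAAGTTCCTAA"

print(translate(FIRST_LINE))  # first 20 residues of the protein
print(translate(LAST_LINE))   # last 11 residues, stop codon excluded

# Length check: 1956 bases = 652 codons, one of which is the stop codon.
print(1956 // 3 - 1)
```

Under these assumptions the two excerpts translate to the start (MGDKSTPLLAVLQLLRKYDL) and end (LLAVGSYEGSS) of the listed protein, and the CDS length of 1956 corresponds to the 651 residues shown. Pasting the full nucleotide sequence into `translate` would extend the same check to the whole record.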