New model in OGS2.0 | DPOGS206547  |
---|---|
Genomic Position | scaffold74:+ 85167-89558 |
See gene structure | |
CDS Length | 1956 |
Paired RNAseq reads   | 952 |
Single RNAseq reads   | 2447 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA014036 (0.0) |
Best Drosophila hit   | TBP-associated factor 5 (1e-157) |
Best Human hit | transcription initiation factor TFIID subunit 5 (1e-159) |
Best NR hit (blastp)   | wd-repeat protein [Aedes aegypti] (0.0) |
Best NR hit (blastx)   | transcription initiation factor TFIID subunit 5 [Culex quinquefasciatus] (0.0) |
GeneOntology terms    | GO:0030528 transcription regulator activity GO:0005634 nucleus |
InterPro families    | IPR019775 WD40 repeat, conserved site IPR020472 G-protein beta WD-40 repeat IPR019781 WD40 repeat, subgroup IPR007582 TFIID subunit, WD40-associated region IPR015943 WD40/YVTN repeat-like-containing domain IPR019782 WD40 repeat 2 IPR017986 WD40-repeat-containing domain IPR001680 WD40 repeat IPR011046 WD40 repeat-like-containing domain |
Orthology group | MCL14952 |
Nucleotide sequence:
ATGGGAGATAAGTCCACCCCACTTTTAGCTGTACTGCAACTTCTCCGAAAATATGATTTG
AAAGGCACAGAAGAACTTTTAAGGAAAGAAGCAAACTTAGGAGATGTTGAATACGACAAC
TTAGATTTGCCAGAAGTAGAACTTGCCAGTATACTTACTGCTCATCACACAGAAAGTGAT
CCATATAGCTATGAGTTTGCTTATGATAGTCTTAAAAAGTTCGTTGAAAATTCTCTAGAC
ATTTACAAGTATGAATTATCAACACTTCTTTACCCAGTGTTTGTGCATATGTATTTGGCA
CTAATATTGTATGACCACAATGAACATGCTTATAAGTTCTTTGAAAAATTTGGCCTAGAA
CAGGAAGATTATTATCAAGAGGATTTAACTCGTCTCTCAATTGTGAAACATAAAGATCAA
ATTAAAGGAAATGAAATTGCAGAAATATACAGCTCAAACAAATTTCAAGTTCAAGTATCT
CGAGATGCATCTACACAGCTTAAGAGATATTTACATGAACAGAAGAGTTCACCGGTGATT
ATAAATATCTTAAATAATCACATACAAATTGATATACATGACGGACCAGGTCGAACACAA
GCACAAGTGAGAGCTACTATTGGGGGGCTACTAGGTGAAGCTTCTAGAAACGAAAATCGC
ACAAAAGTGTATTATGGATTATTAAAAGAACCTGACATACAAGTCCTTCCACCACCCATT
GAAGATGAAGAGGAAGCAGAGGAGACCCCAGATAAACCCAAAAAAAAGAAGGCCAAAAAG
GACAATATTTTTCTCAAAAAACCTAAATCTGATCCCAATGCACCACCAAATGACAGAATT
CCCTTGCCAGAGTTAAAAGAAACAGACAAGCTGGAAAAGGGTAAAGCCATAAGAGAAGCT
GCAAAACGTGTTCAACTTGGACCAGAGAGTCTACCATCTATTTGCTTTTACACTCTCTTA
AACAGCGGCCATACAGCCATATGTGCTGATATCTGTGATGACTCAACACTGCTTGCCGTT
GGCTTCAACAACTCTTATATTAAGGTTTGGACTTTGACTACAATAAGATTAAGAGGAATG
AAATCAGCTGAAAAGTTGCAAGACATCGACAGAGAAGCTGGTGACGTCTTAGTGAGGATG
ATGGAAGAGAAGGACAGAGATACATGCCGTACACTATACGGCCACTCGGGATCAGTATTC
AAAGTGGCATTCGATCCTTTCAAAACTTTGTTACTATCATGCTCTGAAGATTCCACAGTC
CGGCTATGGTCCCTGCAGTGTTGGTCGTGCCTAGTGGCGTACCGTGGCCACGCGTGGTCG
GTTTGGGACGTACGTTGGTCGCCTCATGGCCACTACTTCGCCAGCGCCGGGCACGACCGG
ACCGCACGCCTCTGGGCCACCGACCACCATCAACCGCTCCGAATATTCGCTGGACATCTT
TCTGACGTCGATTGTGTTCAATTCCATCCAAACTCGAATTACATAGCAACGGGATCCAGC
GACCGCACCGTAAGACTATGGGACTGTTTGACGGGAACGCAAGTACGGATCATGACCGGT
CACAAGACAACTCCATATACCGTTGCGTTTTCTGTATGTGGTCGTTGGATAGCTTCAGGC
GGCGCGGGGGGAGAGATAGTTGTATGGGATATTTCTACCGGTCTACCAATGAGCACTCTG
CCTCCAATGCATGTAGCGCCTGTTCACGCCTTAGCCTTCAGTCGAGACGGCACTATCTTA
TCTTCAGGTTCATTGGACTCCACAATCAAACTGTGGGATTTCACATTGATTACGGATGAA
AGTCTGACGGAAGAGACAGGTTCAAGTACTGTCACACAAAAAGAAGAGAAGGTTCTCTTG
CGTTCGTTTGCGACGAAAAATTCGCCAATCAAGCATTTGCATTTCACCCGTCGCAACCTT
TTGTTGGCTGTAGGGTCATATGAAGGAAGTTCCTAA
Protein sequence:
MGDKSTPLLAVLQLLRKYDLKGTEELLRKEANLGDVEYDNLDLPEVELASILTAHHTESD
PYSYEFAYDSLKKFVENSLDIYKYELSTLLYPVFVHMYLALILYDHNEHAYKFFEKFGLE
QEDYYQEDLTRLSIVKHKDQIKGNEIAEIYSSNKFQVQVSRDASTQLKRYLHEQKSSPVI
INILNNHIQIDIHDGPGRTQAQVRATIGGLLGEASRNENRTKVYYGLLKEPDIQVLPPPI
EDEEEAEETPDKPKKKKAKKDNIFLKKPKSDPNAPPNDRIPLPELKETDKLEKGKAIREA
AKRVQLGPESLPSICFYTLLNSGHTAICADICDDSTLLAVGFNNSYIKVWTLTTIRLRGM
KSAEKLQDIDREAGDVLVRMMEEKDRDTCRTLYGHSGSVFKVAFDPFKTLLLSCSEDSTV
RLWSLQCWSCLVAYRGHAWSVWDVRWSPHGHYFASAGHDRTARLWATDHHQPLRIFAGHL
SDVDCVQFHPNSNYIATGSSDRTVRLWDCLTGTQVRIMTGHKTTPYTVAFSVCGRWIASG
GAGGEIVVWDISTGLPMSTLPPMHVAPVHALAFSRDGTILSSGSLDSTIKLWDFTLITDE
SLTEETGSSTVTQKEEKVLLRSFATKNSPIKHLHFTRRNLLLAVGSYEGSS