DPGLEAN22284 in OGS1.0

New model in OGS2.0DPOGS204303 
Genomic Positionscaffold777:- 26692-33137
See gene structure
CDS Length1740
Paired RNAseq reads  464
Single RNAseq reads  1557
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA007579 (0.0)
Best Drosophila hit  TBP-associated factor 6 (4e-155)
Best Human hittranscription initiation factor TFIID subunit 6 isoform epsilon (2e-122)
Best NR hit (blastp)  transcription initiation factor TFIID subunit 6 [Culex quinquefasciatus] (0.0)
Best NR hit (blastx)  transcription initiation factor TFIID subunit 6 [Culex quinquefasciatus] (4e-174)
GeneOntology terms






  
GO:0005634 nucleus
GO:0006355 regulation of transcription, DNA-dependent
GO:0016251 general RNA polymerase II transcription factor activity
GO:0005669 transcription factor TFIID complex
GO:0006367 transcription initiation from RNA polymerase II promoter
GO:0051090 regulation of transcription factor activity
GO:0016986 transcription initiation factor activity
GO:0003677 DNA binding
InterPro families  IPR011442 Domain of unknown function DUF1546
Orthology groupMCL14342

Nucleotide sequence:

ATGCATCATTCAAAAAGACAAAAACTTTCTATTACAGATATAGACAATGCTTTAAAAATA
AAAAACACTGAAGCCCAGTATGGTTTTGTGCAGCCTGACTCATTGCCATTCAGGTTTGCA
TCTGGTGGTGGTAGAGAGCTACATTTTATAGAGGAGAAAGAAATTGATTTATCAGAAATA
CTGTCAGCTCCACCCCCAAAAATTCCTCTTGATGTGTCCTTGAGAGCACACTGGCTTAGT
GTTGATGGGGTTCAGCCAACTGTTCCCGAGAATCCGCCACCATTATCCAAAGAGGCACAA
AAATTGGAGTCAGTTGATCCTGTTTCTAAATTAAGCAAGCCTGCCAATAAAGATTCAGCA
GGAAAACCAGTTAGTGGTAAAGCAGCTAGACTTAAAGCCTCAGAGTCTGTCCATGTTAAA
CAACTTGCAACACACGAGCTTAGTGTGGAACAACAGCTATATTATAAGGAAATCACAGAA
GCAGGTGTGGGCAGTGATGAGGGACGGAGAGCTGAAGCCCTGCAATCACTGGCATGTGAT
CCTGGCCTACATGAGATGTTGCCAAGGATGTGTACATTTATATCAGAGGGTGTAAAAGTC
AATGTTGTCCAGAATAACTTGGCTCTCCTTATTTATTTGATGAGAATGGTGAAAGCAATG
TTGGACAATCAATCACTTTATTTAGAAAAATATCTTCATGAATTGATTCCATCAGTCTCA
ACGTGTATAGTGTCCCGACAGCTTTGTACGCGGCCAGAAGTTGACAACCACTGGGCGCTC
CGAGACTTCGCCGCTCGACTAATGGCCCAGCTGTGCAAAACATTTAATACTTCTACTAAT
AATCTACAAACAAGAGTTACAAGGTTGTTTGCAAAAGCCCTGCAATGTCCATCACAAACA
AACAACGAAAGTGGACCGTCAATGGTTGCTTCTATGAAGGAATCTGAGAAGACTCCTTTA
GCCTCGCTCTATGGAGCAGTCCAAGGTTTAGCTGAGTTGGGTCCTGAGGTGGTGAAGGTA
TTTATCCTGCCTCGTGTGCGATGGTTAGGCGAGCGTGTGGAGGGTGCGCTAGGTGGGGCT
GCGGGCGCAGACCGTGTAGCTGCGAGCAACCTTAAACACCAGTTACTCAAGGTGTTGGCT
CCAGTAGTGCGACAGCTTCGTCAACCGCCCGACCTTCCTGATGACTACAAACGCGAGTAC
GGCTACCTCGGTCCGAGTCTACAGCAAGCTGTGAGTAAGCTGCGGTCGTCTCCGACAGGC
GGCGGCGGCGGCGGCGCGGTGGCCGTGTTGCCGTGTACCCCGCCTCTGCTACCTCACCCG
CCATCACCCGCACCACACGCCAAGTCCATCGGCGCCCCCTCCCCCGCGCCCTCAACACCT
CCGCCGCAGAAATTCGTCATAGTAGCCTCGCAACAGAAGACGCAACAGAACCAACCAGCC
AGTGGGTCCGGTCACATAGTAGTTCATAGCTCGCAGCCTACCATCGTCCGCAGTCAAAAT
GTACAGTCGGTGGTGGTGACGAGCGGGCCGGCGGGAGCACAGCCGCCGCAGAAGCTGGTG
GTGGTGGGGATGAACCCCCTGCACACAGCACACTCGCAACACTCACCGCTGCAGGCCACC
ACCACGGTGTCGGGCGTCAGTCAGGCGCCCGTGTCAGTGGTCGCCAAGCCGGTGTTCGCT
CGCGGCGGCTCGGCCCCGCAGCCTCCGCCGGAGCTGGACGACCTGTCGCACCTCGCTTGA

Protein sequence:

MHHSKRQKLSITDIDNALKIKNTEAQYGFVQPDSLPFRFASGGGRELHFIEEKEIDLSEI
LSAPPPKIPLDVSLRAHWLSVDGVQPTVPENPPPLSKEAQKLESVDPVSKLSKPANKDSA
GKPVSGKAARLKASESVHVKQLATHELSVEQQLYYKEITEAGVGSDEGRRAEALQSLACD
PGLHEMLPRMCTFISEGVKVNVVQNNLALLIYLMRMVKAMLDNQSLYLEKYLHELIPSVS
TCIVSRQLCTRPEVDNHWALRDFAARLMAQLCKTFNTSTNNLQTRVTRLFAKALQCPSQT
NNESGPSMVASMKESEKTPLASLYGAVQGLAELGPEVVKVFILPRVRWLGERVEGALGGA
AGADRVAASNLKHQLLKVLAPVVRQLRQPPDLPDDYKREYGYLGPSLQQAVSKLRSSPTG
GGGGGAVAVLPCTPPLLPHPPSPAPHAKSIGAPSPAPSTPPPQKFVIVASQQKTQQNQPA
SGSGHIVVHSSQPTIVRSQNVQSVVVTSGPAGAQPPQKLVVVGMNPLHTAHSQHSPLQAT
TTVSGVSQAPVSVVAKPVFARGGSAPQPPPELDDLSHLA