DPGLEAN08785 in OGS1.0

New model in OGS2.0  DPOGS203366
Genomic Position  scaffold6:+ 30869-40689
CDS Length  2799
Paired RNAseq reads  1515
Single RNAseq reads  4052
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA003891 (0.0)
Best Drosophila hit  TBP-associated factor 4, isoform E (2e-80)
Best Human hit  transcription initiation factor TFIID subunit 4 (1e-60)
Best NR hit (blastp)  transcription initiation factor [Aedes aegypti] (2e-100)
Best NR hit (blastx)  transcription initiation factor TFIID subunit, putative [Pediculus humanus corporis] (2e-145)
GeneOntology terms
GO:0006355 regulation of transcription, DNA-dependent
GO:0006367 transcription initiation from RNA polymerase II promoter
GO:0016251 general RNA polymerase II transcription factor activity
GO:0005669 transcription factor TFIID complex
GO:0005634 nucleus
GO:0016986 transcription initiation factor activity
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0007517 muscle organ development
GO:0048813 dendrite morphogenesis
GO:0045944 positive regulation of transcription from RNA polymerase II promoter
InterPro families
IPR003894 TAFH/NHR1
IPR009072 Histone-fold
IPR007900 Transcription initiation factor TFIID component TAF4
Orthology group  MCL14702

Nucleotide sequence:

ATGGCGTCAGCGGAGTTTTTAGAGCAAGCTTTGTCTACAGACGTTGATGAGAATGCAGTT
AACGCGATAGTAGGTTCCCTAGAAAATCATTTAGTGACGTCCGTTCCGTCAATATCGTCA
CAGAACAATTTGTTGACTGTTATTCCAAGTCAACTTAGCCTTGCAACAAGTGAAAATACC
ATTATTGGACAAAAATATAACAAAGAGAATAGTGATGGCGATATAGGGAGTGTAAACTTT
AGACCAAATATTGTTTCAAGTTCCTCGTTTAGTTTACCCTCAACTTTTATTAACCAAACA
AGCCTGTCTCAAAACATTTCAAATGGTACTGATTTGGTAAAAGTTATAAGTTCTCAACCG
CTAACTTTATCCGTCTCTGATAATAGTGTTGTGTTCTCAGCGCCATCATACGCAAACGGT
TGTCCTTCTTTGCCGTTATCCCAAGCTCAGATAATTCAGACTGTACAAGGAAGTAGTGCA
ATAAATCAGCCAATTAATAAATCTATTACTATGCAAAATCCTCCTTTGGTTATAAAACAG
GGAACGACTTCTGGTCAAGTCAGTATGCAAGCCAATATGGTACCAATGACAGTGAATTCT
AGCATGCCGGGTTCTATTTCTAACGTGATGACTATTAATAAGCCAGGAGGGCAGAACGTC
GTTGTCACAACACAGAATCTCGGTACAGGCCAACCTGCTATATTGCCCAATGTTCAAATT
TTAAACATGAGGCCGGGTGCGCCTGCGGTGGCGGCTCAAAAATCGGTCGCAACTGTGTCC
CCGCGCGTTGTTATCGGAACTCCTCAGGTTGTTGGACAGAGAGCAGCTGCCCCTGGAATA
ACGCTGCAAACACTACAAAGTCTACAACAGGGGCAGCAGGGTCATTTGTTATTAAAGACT
GAGAACGGTCATTACCAGCTGTTAAGAGTGGGTCCAGGGCCCGGGGCCAGCACGCTGGCG
CCGCAGCAACAGACGATGCGACTGTCCACAGTGCCGGCACATCCCGGGGTGTCAACGGTG
TCCACGAGCGTGCCGGCCCCGGTACAGATACCTGGTCAGATGCCGCAGGGGCCGGTCGCT
ACCCCGGTGCCAGCGGCATCTGTGACCGTGCCTCTGCCTTCGCCACAATCACTCCAACCC
ACAGTCACTACTCAGAAGCCGTTGGACAACACTAAGGAGAAGTGTCGCAACTTCCTGGCC
AACCTGCTGGACCTGTCTAGCAAGGAGCCGAAGTCCGTGGAGAGGAGTGTCAGGAACCTC
ATACAGGAACTGATCGACGCTCAGGTGGAACCGGAGGAGTTCTGTGATAGGTTGGAAAGA
CTCCTGAACGCCAGCCCACAACCCTGCCTCATCGGCTTCCTGAAGAAAAGTCTACCGTTG
CTGCGTCAGTCCCTCGTCACCAAGGAGCTGGTGATAGAGGGCATCAACCCGCCGTCTCCG
CACGTGGCGTTCTCAGCGATATCGCCGCAAGCACCCAACACGGCGGTAGCCACCAGCAAC
ATACAGATGCCGGGCCTAACGTTAGTGGTTCGGCAGCCAGATGACGAAGGCAGCTCAAGT
CCGACCCTAACGCCCCTCCTGCCTCCCGTGATGCCGGTCATCCCGCCCCAACCACCATCA
CCGAAACAGATTAACATAGTCGCCGTGCAGTCGGCGCAGGTTTGTCGCTCGCGTTACACC
AGCCTTAGCACCAGTAGAGTCATCGGAATCATTCGCTACGGTGGTTCCACACGTGATGCG
GCTCTTTATATAACTTATATGTCAATCCGTCTCCATCATTGCGCTCTTCCACATCAGCCG
AAGCCTCAGCCCAAGTCTGGTGGTACGATAGCGGTGCTCCAGAACATTCCAGTGCATCCG
AAGATCAACGTCAGCAAAGTGGGCAAGACTATGACGGTGAACAGTAAGGCTGGCTTCACG
CGACCCACGGGCTCCGCTAACACGGGCCTCTCGACTGTGCTCACGGCGGGGAAGTCTCTG
CTGCGGGACAGGGAGAGGAGATCAGCTCAGTTCTCGCAGAGCTTCGTGGACGACAAGATG
GCCGGCGATGACGACATCAATGACGTAGCAGCCATGGGAGGAGTAAATCTCGCTGAGGAG
AGCCAGCGGATATTGGGCTCCACGGAAATGATCGGAGCACAGATCAGATCCTGTAAAGAC
GAGACCTTAGTACCAATGGCGGTGATGCAGGCCAGGATACGTGCGGTGTCTCTGAGACAC
GGCCTGGAGGAGCCCCCGGCGGAGGTTGGGGCCTTACTGAGCCACGCGCTGCAAGAACGA
CTCAAATCGCTACTAGAGAAGCTAGCGGTCATATCACAGCACCGGATAGACACGCATGTC
AAGATGGATTCGCGTTATGAAGTGACTCAGGATGTGAAGGGCCAGCTGAAGTTCCTTGAA
GAACTGGACAGAGTGGATAAGAAGAGACGAGAGGACTCAGAGAGGGAGATGTTGTTGAGA
GCAGCCAAGTCGCGATCCAAGAACGAGGACCCTGAACAGGCCAAGCTTAAGGCGAAGGCC
AAGGAGATGCAGCGCGCTGAGTTAGAGGAGCTGAGGCAGCGGGAGGCGAATCTGACAGCA
TTACAGGCGATCGGGCCAAGGAAGAAGCCGCGAACTGACGGCTCAGCAGCTGGAGATAAT
CTGGGATCCAGCGGTCAGAGTACTGGACCTTCCGGCCGAGGTCAGCTCCCACAGCGAACC
CGTCTGAAGAGAGTCAACATGCGAGATATGCTGTTCATGATGGAACAAGAGCCTGAATAT
AGACACTCGGCGCTACTATACCGTGCCTACCTCAAGTAA

Protein sequence:

MASAEFLEQALSTDVDENAVNAIVGSLENHLVTSVPSISSQNNLLTVIPSQLSLATSENT
IIGQKYNKENSDGDIGSVNFRPNIVSSSSFSLPSTFINQTSLSQNISNGTDLVKVISSQP
LTLSVSDNSVVFSAPSYANGCPSLPLSQAQIIQTVQGSSAINQPINKSITMQNPPLVIKQ
GTTSGQVSMQANMVPMTVNSSMPGSISNVMTINKPGGQNVVVTTQNLGTGQPAILPNVQI
LNMRPGAPAVAAQKSVATVSPRVVIGTPQVVGQRAAAPGITLQTLQSLQQGQQGHLLLKT
ENGHYQLLRVGPGPGASTLAPQQQTMRLSTVPAHPGVSTVSTSVPAPVQIPGQMPQGPVA
TPVPAASVTVPLPSPQSLQPTVTTQKPLDNTKEKCRNFLANLLDLSSKEPKSVERSVRNL
IQELIDAQVEPEEFCDRLERLLNASPQPCLIGFLKKSLPLLRQSLVTKELVIEGINPPSP
HVAFSAISPQAPNTAVATSNIQMPGLTLVVRQPDDEGSSSPTLTPLLPPVMPVIPPQPPS
PKQINIVAVQSAQVCRSRYTSLSTSRVIGIIRYGGSTRDAALYITYMSIRLHHCALPHQP
KPQPKSGGTIAVLQNIPVHPKINVSKVGKTMTVNSKAGFTRPTGSANTGLSTVLTAGKSL
LRDRERRSAQFSQSFVDDKMAGDDDINDVAAMGGVNLAEESQRILGSTEMIGAQIRSCKD
ETLVPMAVMQARIRAVSLRHGLEEPPAEVGALLSHALQERLKSLLEKLAVISQHRIDTHV
KMDSRYEVTQDVKGQLKFLEELDRVDKKRREDSEREMLLRAAKSRSKNEDPEQAKLKAKA
KEMQRAELEELRQREANLTALQAIGPRKKPRTDGSAAGDNLGSSGQSTGPSGRGQLPQRT
RLKRVNMRDMLFMMEQEPEYRHSALLYRAYLK
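
The nucleotide and protein sequences above can be cross-checked against each other and against the CDS Length field. A CDS of 2799 nt is 933 codons, that is, 932 amino acids plus one stop codon. The following is a minimal Python sketch of such a check, not the pipeline that produced this record. It assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the names `CODON_TABLE` and `translate` are illustrative only and do not come from the source database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), listed in TCAG order.
# Assumption: the standard table applies to this nuclear gene model.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the sequence block can be
    pasted in as listed. Raises ValueError if the cleaned sequence length
    is not a multiple of three, and KeyError on non-ACGT characters.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError(f"length {len(seq)} is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 24 nt and last 12 nt of the nucleotide sequence listed above.
    print(translate("ATGGCGTCAGCGGAGTTTTTAGAG"))  # MASAEFLE
    print(translate("TACCTCAAGTAA"))              # YLK
```

Passing the full nucleotide block to `translate` and comparing the result with the protein block (after removing line breaks) checks the whole record at once. The two short calls above reproduce the first eight and last three residues of the listed protein.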