DPGLEAN10828 in OGS1.0

New model in OGS2.0DPOGS202147 
Genomic Positionscaffold669:+ 11759-17040
See gene structure
CDS Length1386
Paired RNAseq reads  614
Single RNAseq reads  1513
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003421 (2e-180)
Best Drosophila hit  CG6751 (3e-90)
Best Human hitperiodic tryptophan protein 1 homolog (4e-78)
Best NR hit (blastp)  wd-repeat protein [Aedes aegypti] (8e-115)
Best NR hit (blastx)  wd-repeat protein [Aedes aegypti] (2e-115)
GeneOntology terms
  
GO:0016251 general RNA polymerase II transcription factor activity
GO:0005669 transcription factor TFIID complex
InterPro families






  
IPR011046 WD40 repeat-like-containing domain
IPR001680 WD40 repeat
IPR019781 WD40 repeat, subgroup
IPR019782 WD40 repeat 2
IPR017986 WD40-repeat-containing domain
IPR019775 WD40 repeat, conserved site
IPR020472 G-protein beta WD-40 repeat
IPR015943 WD40/YVTN repeat-like-containing domain
Orthology groupMCL14286

Nucleotide sequence:

ATGGAAGAGGAGGGTACTCCCACTGTAAGTTTAGTATCATGTATGCATTTTGTGAGACGG
GGAATAGCGAAACCAGTGCCAGAAAAGATTGAATTGACAGAAAATGAATTAGAAAAAATT
ATAAAGCAGACTGCTGAAGATCTTCGTTTAACAGAAGCAGGAGATGATCAAAGTGGAGAA
GAAGATGAAGCTGCACAGAGTATCAGGGAGCCTCCAGCAAACCCCAATGATGAGTTTGAC
TTTGAACATTATGACCAAGAAGATTCGAGTAACCCTGTAGGTATAGGGACTATAGCAACT
CTACCTAACTTAGGTGATCTCAGTGAAAACATACAAATCAGAACAGAAGGTCCAGATAGT
GATGAAGAAGATGACATCATTAAGCCAGATGACAACTTATTACTTGTAGGACATGTTGAA
ACAGATGCCAGTGTCTTAGAAGTTTATATTTTCAACAAAGAAGAGGGATCATTCTACGTC
CATCATGACATAATACTGCCCTGGTTTCCGCTGTGTATAGAGTGGCTCAATCATGACCCC
TCAGATCCACAACCAGGCAATCTTTGCGCTCTCGGTGGCATGGACCCAGTGATACAAGTG
TGGGATTTGGATATTGAAAACTGTTTGGAACCGGCTTTCAAGCTCGGCAGGAAACCAAAT
AAGAAGAAAAAGACAAAAAGAATTGGTCACAAGGATGCTGTTCTGGATCTGTCTTGGAAC
ACGAACTTTTCTCACGTCTTAGCGAGTGGCTCGGCGGACAACACTGTACTACTGTGGGAT
CTCGATCAAGGCTTACCACACACTAAACTAACCTACTTCGAAGACAAAGTCCAATCACTA
TCGTTCCACCCCCTGGAAGCCCAGACCCTCCTGTCTGGTTGTTGTGACGGCCGAGCGCGT
GTGTCGGACTGTCGGGACGAGGCCGCCTTCCGCACGTGGGTGCTCCCCACTGAGATAGAG
CGAGTGCACTGGGATAGGAACCAACCGTTCTGTTTCGCGATGAGCAACAATATCGGTAAA
GTGGCGTACGTGGACGTCAGACAGGAAGAACCGTTGTGGACCATCGACGCTCATCAGAAG
GAAGTCACAGGACTCATTTTAAGTGAAAAGGTTCCAGGGCTGATGATAACTGTCGGCTCG
GATGAAAAACTCAAATGCTGGGATATCACGGGCCCTACTCCGCTACAAATAAACGAGCGC
ACCAACAGGGTCGGACAGGCCTTATGCGCCGCTCAGTGCCCGGAGGCGCCGTTCGCCGTA
GCGGTGGGCGGAGACAACAAAGAGTGCTACATCGAAATGGTAGACCTCAGCAACAACGAT
GAAGTTATGAACCGTTTCGGCCAGCGCGTCACGACCGAATCCAACGCTGAAGCTATGGAC
GCGTAA

Protein sequence:

MEEEGTPTVSLVSCMHFVRRGIAKPVPEKIELTENELEKIIKQTAEDLRLTEAGDDQSGE
EDEAAQSIREPPANPNDEFDFEHYDQEDSSNPVGIGTIATLPNLGDLSENIQIRTEGPDS
DEEDDIIKPDDNLLLVGHVETDASVLEVYIFNKEEGSFYVHHDIILPWFPLCIEWLNHDP
SDPQPGNLCALGGMDPVIQVWDLDIENCLEPAFKLGRKPNKKKKTKRIGHKDAVLDLSWN
TNFSHVLASGSADNTVLLWDLDQGLPHTKLTYFEDKVQSLSFHPLEAQTLLSGCCDGRAR
VSDCRDEAAFRTWVLPTEIERVHWDRNQPFCFAMSNNIGKVAYVDVRQEEPLWTIDAHQK
EVTGLILSEKVPGLMITVGSDEKLKCWDITGPTPLQINERTNRVGQALCAAQCPEAPFAV
AVGGDNKECYIEMVDLSNNDEVMNRFGQRVTTESNAEAMDA