DPGLEAN16732 in OGS1.0

New model in OGS2.0  DPOGS208661
Genomic Position  scaffold594:- 19842-29050
CDS Length  4032
Paired RNAseq reads  1170
Single RNAseq reads  2865
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA007759 (0.0)
Best Drosophila hit  CG9727 (7e-46)
Best Human hit  DNA-binding protein RFX7 (2e-31)
Best NR hit (blastp)  GF16654 [Drosophila ananassae] (6e-54)
Best NR hit (blastx)  GF16654 [Drosophila ananassae] (2e-52)
GeneOntology terms
  
GO:0003677 DNA binding
GO:0006355 regulation of transcription, DNA-dependent
InterPro families
  
IPR011991 Winged helix-turn-helix transcription repressor DNA-binding
IPR003150 DNA-binding RFX
Orthology group  MCL18929

Nucleotide sequence:

ATGGATAGTTCAACTCCGTTTCAACCTTGGTCTAGTGACGATAATTCAAAGAATGTTAAA
TCTGAGCGAGAATTAGTAGATGACGCGAAACTTCATCGGGAAAATGTCAATCATCCAAAA
CATAGGCAAACTGCGTCTTACAATAAAGAAAGTGTCGTCGATCCTAGCGTCGTTCCTGGA
CCATCGAGTGCCACGGACACCGATGTCCATCCACTCGGCGGTAAGGAGTCAAAAATGGAC
TCCAGCAAAATAGCTTCCATGCAACAAATAGTCGAGAACACACTCAGTCAAGAGGGCCGT
CAAAGAGTGTCACAATTGTTGGAAGCTGTAGAGGGTCTCAGTGGTGCGGAGAGATTGTTA
CTTTATCTCCGTCTACCAACCGGTGTACCTCCACACGATCCCCTCAAACAGCCAGTCAAT
CCGCTGGGCTCCAGAGCCGAACTACAGCAGACTGTAACGTGGATACAAACACACTTGGAG
GTTGATCCTGACGTTTCGTTGCCAAAACAAGATGTTTACGATGAATACATAGCTCATTGT
ATGACCAGCAATATGAAACCACTATCGACCGCTGATTTCGGCAAAGTCATGAAGCAGGTG
TATCCTAGTGTGCGCCCGCGCCGGTTGGGAACGCGCGGCAATTCAAGATACTGTTACGCT
GGTCTCCGGAAGAAAGTTAAACTTGAAGTGCCACAGTTGCCAAATTTGGGTGAATCGACC
AAGGAACCAAGTGTGCCTTCAAGAGAAAATGAAAGAATCATTTGTGATTGGGCTGAATCT
AAATTGGGCGTTAAGTTCATGAACATATCGGAGTTATCTCGTCACCTGTTAAGCGCGATG
CGCGCCCCGCCCGGTCCGACCAGCGCCACACCGCCGCCGCAAACGCATGGATCTGATGAA
CCACCGGGACCACAGTTAATGAAACAACAGTTAAAAAGAAAGCTACAGACGCAGGGTACG
GTGGGTCGACCGAAAAAGAACAAGGGGCAAGAAGTCGCCGGGGAATCGCCGCCGTCCACT
TCATACGTAAGCCATCCCACAGTGAAACATGAGAGCGAACTGGTGCCCGAATCGTACGGC
TATCAGCCGGCTTACATGCCCGTTTATGACGTACGGCCGGCCTTTCCCTATGCGGCGCCA
CCTCGTGCTCACCCCCCCCATCCTCTTCCCCCACACCCTCATCCCCCACCACCGCACCCG
CCCGCCATCCCAGTACACGATTACCGGCCCGATCCTTACGTCTTCGAACCGCCATATGCA
CCACGCGACCTTACTCTTCCGGACACGGCCCACAACGTACCGATTAACCTCAGCAGTGAT
GTTTCCCTCGACCTGTCCACCGAGCGCACAGAGTGGCAACGTCGCCGTCCCCCGGACCCT
CCGGCTCCGACCGCCCGTCTTCCGCTGCCTGGGAAGAAGTTAATCTTGGAGACGTATCAG
AGCGAGACTCAAGCTAGCTCCCCTCGCTCGTCACAAACTGACACGCGAGCCGTCCACCAG
CCGCCAGAAGAATTTCCGCGCACGGAATACTTGCCTAAGAAGATGCGTGCCGCTGAGATA
CTGGGCGGTAAGTTGGCGGCACCGAGACAGGTCGCCGCTGCGGACGCGTCGACGTCATCA
TCGACAGGGGCGGAATCGGCACGTTCCGAGGTTGCTTTTTTAAAGGATCCAAAAAACTTA
ACGAACCGGTCCAAAAGCACCGCCGCGGCGCTGGCCCGGGAGGAACAGGAGGCGGCGAGT
CCGACACGGTCGGTGAAGGCTCCGACGCCTAAACACAATCGAACTAAACTGAGAAAGTCG
TCACACATGAGAACGAAGGGGTCGTCTCCGGAGAGGAACGAGCACGACCATTCCGCGGCG
CCCGATACCTGTTGTGGTATTAATGGCATCAATATTATTACGGGAATGATCTGTGGGGAA
AAACATTCTGAGATACTGAACCGTGAACGAGTTATTAGCATCTGTAATATCGATAAACAC
GATTTAGACGATTATCTCAATGAGGGCAATAGCCAGGAACACGAGGAAGAATTAATGCAA
TACTTCCATCATCGCGATGGTGACACAGATATACCAGCAAAATCTAATCAGAACGAAAAT
ACGACTTTTTTAGAAACAAGTCAGGACCCACACGACGAACATAGTCAAGGGAAAAGTGAA
AAAATATCGCAACTGCGAGAACTATTAGCTAAGAATTTGAAAAGTGGGTCCAATACACAG
AATTTATTATTAAATCAAGAAAAACAAACTCCGATTAATAATAATCACGAAACACATGTT
AATAAGAGTTCAATATCGGACGGAGCATTTAGGCCACTAAACAACGTTATGGAAATGATA
AACGGCTCCAATGGCTCAAACGAAAATGAAAATAATGTAAGGAAAAAATGTGATAATATT
CCAAGTGAAAACGGAATTTCCGCTTGTGCCCCAGAGAATTCACACACTCAGGTGATGACA
ACAGCGACTGGTTCTCATCTTTACAATGGTAACAGTCATACGGAACCACAAAGTCCAACT
ACTAGAACACAACAATACGATTTCGTACCTATATCCGATGGATGTCATTCTCCTGGGAAT
TTTAATTCAAAGTCTCCTTTAGGTTTTGGACAACGAGGGCACAGTCCAACTAAAACGAAT
AAACCTATTATAATGGGTGGTTCCTTGCAAACATCTCCTATTTCACATAGCATGGCAGCA
AGTCCTTTTGTCAGTCCAAGAAATACTCCGGTGCCAAGGTCACGCTATTGTTCTAGACCA
ATACCTAAACACAACAATAGAAAGAGACGTACCATTTTGTCACTCGGTGTAAATGAGACT
GGCACTAAACAATTCGCAATACCAAATGATCAGAAATTCCTGCCGAAGTCTTCTTTGGGT
GGGCAGAATATAAAATATTGCCAACCAATGTCAGCACCTCCATCGCCCAACCTCCTACCA
CACTTTAAACAACAAATGATACAAAATTCCGTGATACCAAATGGCACCAGTAATATGTGC
TTTGTGTCAAATACGTTTCGAGAGAATGTTAATGGTGAAATGCCACAACCATTATCAGCA
GATCCGTTGTCCAGTGAAGTGAGTCAATTTTTTCAGGAGCCTGTAACGGGTTATAGAATA
GCTCACGATACATCTTTTAGGTCACAATCGGTACCATTGAAACAGGCAACCATAAACATT
GGCTTGTTGAGCTACAACAATACACCAGTCGGTTCCGTCCCACCTACACCAGTACCGAAT
GAGTTTTGCGATTTTGGTTCTTTAGCTGACACATGTGATATATCTAGGGCAGGACTCAAT
CCTGAGACTCTAGATAAAATATACGACGCTATTGATAGCAGCAATGACGTACTTAACGGT
GGCACGAGTAATAACATATTAAACTCCAGCGACACCCTAACTAACGGTTGTGATCCTTTA
CCCGAACAACAAATTCTGATAGATGAACAGTTAAGTTCACTTATACCTAGCGGGGAAGAA
TTGCTCGATAGGAGCTCCCTTATGGCCAGCGGAGAAGACTTACTCGAAAGGGGTGAACAA
TTCAATGAGTCGTTTCCCAATGACTCGACGGCTTCAGAGAATGTGGAAGAGTTCTTGAAA
CGTACAAACAGCATAGAATTTGATTTATCTGATTTGGTAACAGAAAAAAATAAATACTAC
GCATCACGTTCCGTACCAAGTACGCCATTGCCTTATAAGCGAACGGCTTCGAACTTACAA
ATAGATCCACGGCACGCTAGAGATCTCTTCGCTACCGAAAATTATTCCAGTGTATCGAAC
GGTATATCTTCAAAATCCGTGCCATCTACACCACAATTAGCGGAAGATCGCAGTGTTTTT
AGTTACACCAACAGAGACTTTCTCATTAACGGAAACTCGGTTGACATGTGCTCTAATCAG
ATTAGGCAACCGGTTGAAAATGACCAAGCCTTGACGTCCCCACTTGACGAAATACTAGGT
CCTCTAACGCCAGCCGCAGATCTATTGGCTGACCTCGACAAAATAGATACTGCTCCATAT
GTTGACCTCTAG

Protein sequence:

MDSSTPFQPWSSDDNSKNVKSERELVDDAKLHRENVNHPKHRQTASYNKESVVDPSVVPG
PSSATDTDVHPLGGKESKMDSSKIASMQQIVENTLSQEGRQRVSQLLEAVEGLSGAERLL
LYLRLPTGVPPHDPLKQPVNPLGSRAELQQTVTWIQTHLEVDPDVSLPKQDVYDEYIAHC
MTSNMKPLSTADFGKVMKQVYPSVRPRRLGTRGNSRYCYAGLRKKVKLEVPQLPNLGEST
KEPSVPSRENERIICDWAESKLGVKFMNISELSRHLLSAMRAPPGPTSATPPPQTHGSDE
PPGPQLMKQQLKRKLQTQGTVGRPKKNKGQEVAGESPPSTSYVSHPTVKHESELVPESYG
YQPAYMPVYDVRPAFPYAAPPRAHPPHPLPPHPHPPPPHPPAIPVHDYRPDPYVFEPPYA
PRDLTLPDTAHNVPINLSSDVSLDLSTERTEWQRRRPPDPPAPTARLPLPGKKLILETYQ
SETQASSPRSSQTDTRAVHQPPEEFPRTEYLPKKMRAAEILGGKLAAPRQVAAADASTSS
STGAESARSEVAFLKDPKNLTNRSKSTAAALAREEQEAASPTRSVKAPTPKHNRTKLRKS
SHMRTKGSSPERNEHDHSAAPDTCCGINGINIITGMICGEKHSEILNRERVISICNIDKH
DLDDYLNEGNSQEHEEELMQYFHHRDGDTDIPAKSNQNENTTFLETSQDPHDEHSQGKSE
KISQLRELLAKNLKSGSNTQNLLLNQEKQTPINNNHETHVNKSSISDGAFRPLNNVMEMI
NGSNGSNENENNVRKKCDNIPSENGISACAPENSHTQVMTTATGSHLYNGNSHTEPQSPT
TRTQQYDFVPISDGCHSPGNFNSKSPLGFGQRGHSPTKTNKPIIMGGSLQTSPISHSMAA
SPFVSPRNTPVPRSRYCSRPIPKHNNRKRRTILSLGVNETGTKQFAIPNDQKFLPKSSLG
GQNIKYCQPMSAPPSPNLLPHFKQQMIQNSVIPNGTSNMCFVSNTFRENVNGEMPQPLSA
DPLSSEVSQFFQEPVTGYRIAHDTSFRSQSVPLKQATINIGLLSYNNTPVGSVPPTPVPN
EFCDFGSLADTCDISRAGLNPETLDKIYDAIDSSNDVLNGGTSNNILNSSDTLTNGCDPL
PEQQILIDEQLSSLIPSGEELLDRSSLMASGEDLLERGEQFNESFPNDSTASENVEEFLK
RTNSIEFDLSDLVTEKNKYYASRSVPSTPLPYKRTASNLQIDPRHARDLFATENYSSVSN
GISSKSVPSTPQLAEDRSVFSYTNRDFLINGNSVDMCSNQIRQPVENDQALTSPLDEILG
PLTPAADLLADLDKIDTAPYVDL