DPGLEAN21854 in OGS1.0

New model in OGS2.0DPOGS209925 
Genomic Positionscaffold1866:- 37277-41353
See gene structure
CDS Length1215
Paired RNAseq reads  1024
Single RNAseq reads  2497
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA010929 (1e-101)
Best Drosophila hit  clockwork orange (2e-29)
Best Human hithairy/enhancer-of-split related with YRPW motif protein 2 (2e-10)
Best NR hit (blastp)  PREDICTED: similar to class b basic helix-loop-helix protein (bhlhb) (differentially expressed in chondrocytes) (mdec) (sharp) [Tribolium castaneum] (1e-46)
Best NR hit (blastx)  PREDICTED: similar to class b basic helix-loop-helix protein (bhlhb) (differentially expressed in chondrocytes) (mdec) (sharp) [Tribolium castaneum] (1e-35)
GeneOntology terms





  
GO:0003702 RNA polymerase II transcription factor activity
GO:0005634 nucleus
GO:0045449 regulation of transcription
GO:0006355 regulation of transcription, DNA-dependent
GO:0003677 DNA binding
GO:0016564 transcription repressor activity
GO:0042752 regulation of circadian rhythm
InterPro families
  
IPR011598 Helix-loop-helix DNA-binding
IPR001092 Helix-loop-helix DNA-binding domain
Orthology groupMCL17144

Nucleotide sequence:

ATGAAGCCACGCACGGTAACCTACTCCAATGAGGAGTTCGCGCGCGAGCCGCTGAGCTTC
GCGCCGCCGTCCGAGGACGAGGCGGAGTACCCTCCAGGGTACAAGAAGGGGAAGGTCTCG
AGGCAAGATCCAATGTCGCACCGCATCATCGAGAAGCGGCGGCGGGACCGGATGAACAAC
TGCCTGGCGGACCTGTCCCGCCTCATACCACCCGAGTACCTCAAGAAGGGCCGCGGCCGC
GTGGAGAAGACGGAGATCATAGAGATGGCCATACGACACCTCAAGTATTTACAGGACAGA
GTCCACGTTCTGGAGCGGGGGTCGGAGTTCCTCGCGGGGTACCAGAGGGCGGGGGCGGAG
GCGGTGCGGTTCGTGGAGCTCCAGGGCTCCCGTGACGGCCTGGCGGATCAGCTCGCTGCA
CACCTACACTCACACGCTGACATGATGGCCAAAGAAGCTGTACACGAAAAGCGTGTGTAT
CCGAACTCCTCGTCGGAGACGACCAGCTCGTCGAGCAGTTCCCAGGGCTTCGCCGTGAAG
GTGATCCAGCGGCCGGAGCCCCCGCCCGCGTTCCCGGAGCCCTACGAGCCTGACAGACAG
GAACATTTCGCGGACTGCGAGAGATTGTCGGTGCACCAACCAGCTGTAATGGAGCCGTTG
GAAGGCGAGCCTCTCCCGCTGGACGGACGAGTGAAGAAGGAAGTGACGCTGAGGAAGATT
AGGAAGCCAGAACACGAGGACTACTTGCACTCGTACAAGTTCAAGAACTCCATAGAGAGG
AGGTTCTCCAGGTCGCAGGACTCCGAGGGTGACGCTTGGAGCGCCGGGCCGACAGCGAAG
GCCTACAATCATAAACGTCGGAGGCCTACCAAGCCCGCGCCGCCCTCCACGTCCACTTCC
GCCTCGGGCTCCACCGAGGAAGCGCGCGACACAAGCCCACAAGACACGTGCAGCGACTCC
CCCCACCACCACTCGTTCGACAAGCCGCCGCCGCCCGCGCAGTACGTGCCCGTGTTCGCC
TTGAACGCGCTCGGCAAGTACTACGTGCCGCTGAGCGTGGAGTACGGCTGCGTGTCTCGT
CAGCTGGGCGCGGGCGTGACGTCACTGGAGGCGGCCGAGGCGCGCGCGCTTCACCCCGTC
ACCATACACGTGAACTTCCAACCCTGCATCGACTACCTCAAGCGGGAGCCCGACCCTCAC
TGGCGCCCGCTCTAA

Protein sequence:

MKPRTVTYSNEEFAREPLSFAPPSEDEAEYPPGYKKGKVSRQDPMSHRIIEKRRRDRMNN
CLADLSRLIPPEYLKKGRGRVEKTEIIEMAIRHLKYLQDRVHVLERGSEFLAGYQRAGAE
AVRFVELQGSRDGLADQLAAHLHSHADMMAKEAVHEKRVYPNSSSETTSSSSSSQGFAVK
VIQRPEPPPAFPEPYEPDRQEHFADCERLSVHQPAVMEPLEGEPLPLDGRVKKEVTLRKI
RKPEHEDYLHSYKFKNSIERRFSRSQDSEGDAWSAGPTAKAYNHKRRRPTKPAPPSTSTS
ASGSTEEARDTSPQDTCSDSPHHHSFDKPPPPAQYVPVFALNALGKYYVPLSVEYGCVSR
QLGAGVTSLEAAEARALHPVTIHVNFQPCIDYLKREPDPHWRPL