DPGLEAN18972 in OGS1.0

New model in OGS2.0DPOGS202501 
Genomic Positionscaffold189:- 67646-108190
See gene structure
CDS Length2115
Paired RNAseq reads  3424
Single RNAseq reads  9437
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012449 (4e-35)
Best Drosophila hit  groucho, isoform C (7e-165)
Best Human hittransducin-like enhancer protein 4 (2e-165)
Best NR hit (blastp)  groucho [Bombyx mori] (0.0)
Best NR hit (blastx)  groucho [Bombyx mori] (0.0)
GeneOntology terms



  
GO:0005634 nucleus
GO:0016055 Wnt receptor signaling pathway
GO:0045449 regulation of transcription
GO:0003674 molecular_function
GO:0008150 biological_process
InterPro families







  
IPR019775 WD40 repeat, conserved site
IPR011046 WD40 repeat-like-containing domain
IPR009146 Groucho/transducin-like enhancer
IPR019782 WD40 repeat 2
IPR017986 WD40-repeat-containing domain
IPR015943 WD40/YVTN repeat-like-containing domain
IPR005617 Groucho/TLE, N-terminal Q-rich domain
IPR019781 WD40 repeat, subgroup
IPR001680 WD40 repeat
Orthology groupMCL10171

Nucleotide sequence:

ATGAACGCCGCCGCTGCCGCCGCTGCCGTGGCGGCCGCCAGGCATCCGGGCCCCCCGCAG
CCCGGGCAACCCATCAAGTTCACGGTGGGCGAGTCATGTGACAGGATTAAGGAGGAATTT
AATTTCTTACAAGCTCAATATCATAATTTAAAATTAGAATGCGAGAAACTGGCTAGTGAA
AAAATTGAAATACAGAGGCATTATGTTATGTACTATGAAATGTCATACGGGCTCAACGTG
GAAATGCACAAACAGACGGAGATCGCTAAGAGATTAAATGCTATAATAGCTCAAATATTG
CCATTCCTCTCTCAAGAGCATCAGCAGCAAGTGGCGTCGGCAGTGGAGAGGGCGAAGCAG
GTCACGATGACAGAACTGAACGCTATTATTGGGCAACAGCGACCAGACCTGCCGCGTCTC
TTGCAGCAGATGCATGCAGCACATTTGCCGGCACACGGAGCTCCGCCACTGCCTCTTCTC
AGCCAAGGAGCCCTGCCGCCCGCGGGGTTACTGGGCCTCGGAGTACCCCACCATCCTCTG
TCAGTGCTCGCCAAACCCCCCGACATACATCGTCCTGATGATAAGGGCAATGGTATCAGC
TCGGCGGAAGAGCGACACAGAAATTCAATATCCCCGGGCGAGAGAGAGAAATATAGAACA
AGGAGTCCCGCTGAACCAGATCACAAGAAACTAAAAAAGGAGGAAAAAGATATGGGACAT
GAATTAGTTGTAGACGACGCCAGCGAAGAACCCACATCACCTCACAACGGGGCGCCTTCA
CCCAGAGAGAACGGTCTGGACAAACTTCAACCCAAGAAAGAACATCCCCCTCACAGTCCG
CGGTCTGGAACGTCCAGTAACGCATCGACGCCTTCGACAAAAAAGTTAGACGAGAAACCC
AGCACGCCGATTTCAAAACCGGTGACGCCGACTTCCGGCGCTAGTGGCGTCGGCTCGGCG
GGGCCACCTATGAAGGCGGCGGTGAAGCCCCCGGCGTTACAGTACCCCTACCTAGGTAAC
GGGGCCCACGACGCATACGGACTTGCCGGATATTCAGCCAGAGCGGCGATGGCGTACGAG
CCACTACGTCCCCCAATAGGACCAGCGGCTCTGGCACCCATACCTGGCGGAAAACCAGCG
TACTCGTTCCACGTATCGGCCGAGGGCCAGATGCAACCGGTCCCATTCCCCCCGGACGCC
CTCATGGGGCCGGGGATCCCCCGCCACGCGCGGCAGGTGTCCGCCCTCGCCCACGGGGAA
GTGGTGTGCGCGGTGACAGTCTCCTCGCCAACCAAGTACGTGTACACCGGCGGTAAGGGC
TGCGTCAAGGTGTGGGACATCAGCCAGCCGAGCAAAGCGCCCGTCAGCCAGCTGGATTGT
TTGCAACGTGATAATTACATCCGGTCGGTGAAGTTACTTCCTGACGGCCGGACCTTGATT
GTCGGCGGGGAAGCCTCCAACTTGTCTATATGGGACCTCGCTTCTCCGACTCCCCGCATT
AAGGCGGAACTGACGTCATCAGCGCCCGCTTGTTACGCGCTGGCTATTAGCCCAGACTCT
AAGGTGTGCTTCAGTTGTTGTTCCGACGGCAACATCGCGGTGTGGGACCTCCACAACCAG
ACCCTGGTGAGACAGTTCCAGGGACACACGGACGGAGCCTCATGCATCGACATCTCCGCT
GACGGCACCAAGCTTTGGACGGGCGGACTTGATAATACTGTCAGATCCTGGGATTTAAGA
GAAGGAAGACAATTACAACAGCACGACTTCAGCTCACAGATATTCTCACTGGGATACTGT
CCGACGGGTGAATGGCTCGCAGTGGGCATGGAGAACAGCAACGTGGAGGTGTTGCACGCC
GTGAAGCCTGACAAGTACCAACTGCACCTGCACGAGTCCTGTGTACTTTCCCTCAGGTTC
GCCTCCTGCGGGAAGTGGTTCGTCTCCACGGGGAAGGACAACCTGCTCAACGCCTGGCGC
ACGCCCTACGGGGCGAGCATCTTCCAGTCTAAGGAGTCGTCGTCGGTGCTGAGCTGCGAC
ATCTCATCGGACGACAAGTACATAGTGACCGGGTCAGGCGACAAGAAGGCCACAGTGTAC
GAAGTGATCTACTAA

Protein sequence:

MNAAAAAAAVAAARHPGPPQPGQPIKFTVGESCDRIKEEFNFLQAQYHNLKLECEKLASE
KIEIQRHYVMYYEMSYGLNVEMHKQTEIAKRLNAIIAQILPFLSQEHQQQVASAVERAKQ
VTMTELNAIIGQQRPDLPRLLQQMHAAHLPAHGAPPLPLLSQGALPPAGLLGLGVPHHPL
SVLAKPPDIHRPDDKGNGISSAEERHRNSISPGEREKYRTRSPAEPDHKKLKKEEKDMGH
ELVVDDASEEPTSPHNGAPSPRENGLDKLQPKKEHPPHSPRSGTSSNASTPSTKKLDEKP
STPISKPVTPTSGASGVGSAGPPMKAAVKPPALQYPYLGNGAHDAYGLAGYSARAAMAYE
PLRPPIGPAALAPIPGGKPAYSFHVSAEGQMQPVPFPPDALMGPGIPRHARQVSALAHGE
VVCAVTVSSPTKYVYTGGKGCVKVWDISQPSKAPVSQLDCLQRDNYIRSVKLLPDGRTLI
VGGEASNLSIWDLASPTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQ
TLVRQFQGHTDGASCIDISADGTKLWTGGLDNTVRSWDLREGRQLQQHDFSSQIFSLGYC
PTGEWLAVGMENSNVEVLHAVKPDKYQLHLHESCVLSLRFASCGKWFVSTGKDNLLNAWR
TPYGASIFQSKESSSVLSCDISSDDKYIVTGSGDKKATVYEVIY