New model in OGS2.0 | DPOGS202501  |
---|---|
Genomic Position | scaffold189:- 67646-108190 |
See gene structure | |
CDS Length | 2115 |
Paired RNAseq reads   | 3424 |
Single RNAseq reads   | 9437 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012449 (4e-35) |
Best Drosophila hit   | groucho, isoform C (7e-165) |
Best Human hit | transducin-like enhancer protein 4 (2e-165) |
Best NR hit (blastp)   | groucho [Bombyx mori] (0.0) |
Best NR hit (blastx)   | groucho [Bombyx mori] (0.0) |
GeneOntology terms    | GO:0005634 nucleus GO:0016055 Wnt receptor signaling pathway GO:0045449 regulation of transcription GO:0003674 molecular_function GO:0008150 biological_process |
InterPro families    | IPR019775 WD40 repeat, conserved site IPR011046 WD40 repeat-like-containing domain IPR009146 Groucho/transducin-like enhancer IPR019782 WD40 repeat 2 IPR017986 WD40-repeat-containing domain IPR015943 WD40/YVTN repeat-like-containing domain IPR005617 Groucho/TLE, N-terminal Q-rich domain IPR019781 WD40 repeat, subgroup IPR001680 WD40 repeat |
Orthology group | MCL10171 |
Nucleotide sequence:
ATGAACGCCGCCGCTGCCGCCGCTGCCGTGGCGGCCGCCAGGCATCCGGGCCCCCCGCAG
CCCGGGCAACCCATCAAGTTCACGGTGGGCGAGTCATGTGACAGGATTAAGGAGGAATTT
AATTTCTTACAAGCTCAATATCATAATTTAAAATTAGAATGCGAGAAACTGGCTAGTGAA
AAAATTGAAATACAGAGGCATTATGTTATGTACTATGAAATGTCATACGGGCTCAACGTG
GAAATGCACAAACAGACGGAGATCGCTAAGAGATTAAATGCTATAATAGCTCAAATATTG
CCATTCCTCTCTCAAGAGCATCAGCAGCAAGTGGCGTCGGCAGTGGAGAGGGCGAAGCAG
GTCACGATGACAGAACTGAACGCTATTATTGGGCAACAGCGACCAGACCTGCCGCGTCTC
TTGCAGCAGATGCATGCAGCACATTTGCCGGCACACGGAGCTCCGCCACTGCCTCTTCTC
AGCCAAGGAGCCCTGCCGCCCGCGGGGTTACTGGGCCTCGGAGTACCCCACCATCCTCTG
TCAGTGCTCGCCAAACCCCCCGACATACATCGTCCTGATGATAAGGGCAATGGTATCAGC
TCGGCGGAAGAGCGACACAGAAATTCAATATCCCCGGGCGAGAGAGAGAAATATAGAACA
AGGAGTCCCGCTGAACCAGATCACAAGAAACTAAAAAAGGAGGAAAAAGATATGGGACAT
GAATTAGTTGTAGACGACGCCAGCGAAGAACCCACATCACCTCACAACGGGGCGCCTTCA
CCCAGAGAGAACGGTCTGGACAAACTTCAACCCAAGAAAGAACATCCCCCTCACAGTCCG
CGGTCTGGAACGTCCAGTAACGCATCGACGCCTTCGACAAAAAAGTTAGACGAGAAACCC
AGCACGCCGATTTCAAAACCGGTGACGCCGACTTCCGGCGCTAGTGGCGTCGGCTCGGCG
GGGCCACCTATGAAGGCGGCGGTGAAGCCCCCGGCGTTACAGTACCCCTACCTAGGTAAC
GGGGCCCACGACGCATACGGACTTGCCGGATATTCAGCCAGAGCGGCGATGGCGTACGAG
CCACTACGTCCCCCAATAGGACCAGCGGCTCTGGCACCCATACCTGGCGGAAAACCAGCG
TACTCGTTCCACGTATCGGCCGAGGGCCAGATGCAACCGGTCCCATTCCCCCCGGACGCC
CTCATGGGGCCGGGGATCCCCCGCCACGCGCGGCAGGTGTCCGCCCTCGCCCACGGGGAA
GTGGTGTGCGCGGTGACAGTCTCCTCGCCAACCAAGTACGTGTACACCGGCGGTAAGGGC
TGCGTCAAGGTGTGGGACATCAGCCAGCCGAGCAAAGCGCCCGTCAGCCAGCTGGATTGT
TTGCAACGTGATAATTACATCCGGTCGGTGAAGTTACTTCCTGACGGCCGGACCTTGATT
GTCGGCGGGGAAGCCTCCAACTTGTCTATATGGGACCTCGCTTCTCCGACTCCCCGCATT
AAGGCGGAACTGACGTCATCAGCGCCCGCTTGTTACGCGCTGGCTATTAGCCCAGACTCT
AAGGTGTGCTTCAGTTGTTGTTCCGACGGCAACATCGCGGTGTGGGACCTCCACAACCAG
ACCCTGGTGAGACAGTTCCAGGGACACACGGACGGAGCCTCATGCATCGACATCTCCGCT
GACGGCACCAAGCTTTGGACGGGCGGACTTGATAATACTGTCAGATCCTGGGATTTAAGA
GAAGGAAGACAATTACAACAGCACGACTTCAGCTCACAGATATTCTCACTGGGATACTGT
CCGACGGGTGAATGGCTCGCAGTGGGCATGGAGAACAGCAACGTGGAGGTGTTGCACGCC
GTGAAGCCTGACAAGTACCAACTGCACCTGCACGAGTCCTGTGTACTTTCCCTCAGGTTC
GCCTCCTGCGGGAAGTGGTTCGTCTCCACGGGGAAGGACAACCTGCTCAACGCCTGGCGC
ACGCCCTACGGGGCGAGCATCTTCCAGTCTAAGGAGTCGTCGTCGGTGCTGAGCTGCGAC
ATCTCATCGGACGACAAGTACATAGTGACCGGGTCAGGCGACAAGAAGGCCACAGTGTAC
GAAGTGATCTACTAA
Protein sequence:
MNAAAAAAAVAAARHPGPPQPGQPIKFTVGESCDRIKEEFNFLQAQYHNLKLECEKLASE
KIEIQRHYVMYYEMSYGLNVEMHKQTEIAKRLNAIIAQILPFLSQEHQQQVASAVERAKQ
VTMTELNAIIGQQRPDLPRLLQQMHAAHLPAHGAPPLPLLSQGALPPAGLLGLGVPHHPL
SVLAKPPDIHRPDDKGNGISSAEERHRNSISPGEREKYRTRSPAEPDHKKLKKEEKDMGH
ELVVDDASEEPTSPHNGAPSPRENGLDKLQPKKEHPPHSPRSGTSSNASTPSTKKLDEKP
STPISKPVTPTSGASGVGSAGPPMKAAVKPPALQYPYLGNGAHDAYGLAGYSARAAMAYE
PLRPPIGPAALAPIPGGKPAYSFHVSAEGQMQPVPFPPDALMGPGIPRHARQVSALAHGE
VVCAVTVSSPTKYVYTGGKGCVKVWDISQPSKAPVSQLDCLQRDNYIRSVKLLPDGRTLI
VGGEASNLSIWDLASPTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQ
TLVRQFQGHTDGASCIDISADGTKLWTGGLDNTVRSWDLREGRQLQQHDFSSQIFSLGYC
PTGEWLAVGMENSNVEVLHAVKPDKYQLHLHESCVLSLRFASCGKWFVSTGKDNLLNAWR
TPYGASIFQSKESSSVLSCDISSDDKYIVTGSGDKKATVYEVIY