DPGLEAN00875 in OGS1.0

New model in OGS2.0DPOGS204080 
Genomic Positionscaffold2858:+ 8384-15758
See gene structure
CDS Length1629
Paired RNAseq reads  2138
Single RNAseq reads  4966
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA010815 (7e-111)
Best Drosophila hit  Eip93F, isoform B (3e-25)
Best Human hitligand-dependent corepressor isoform 1 (9e-11)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC012464 [Tribolium castaneum] (1e-48)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC012464 [Tribolium castaneum] (6e-39)
GeneOntology terms



  
GO:0005634 nucleus
GO:0007613 memory
GO:0042803 protein homodimerization activity
GO:0043565 sequence-specific DNA binding
GO:0045893 positive regulation of transcription, DNA-dependent
InterPro families

  
IPR009057 Homeodomain-like
IPR007889 Helix-turn-helix, Psq
IPR011526 Helix-turn-helix, Psq-like
Orthology groupMCL18016

Nucleotide sequence:

ATGCTGCGCCGACAATCTTTACAAGCGGAGGACAAGCCGGGGAGTTCCCCGACCACGGAG
CAGCCCCTCGACCTGAGCGCCAAATCCACCTCCAGCACGAGTGGGACCCCACCACCGGAA
AAGGTGACCGACACCAGATTAAAACGTCCTCCGCTCGAGGCTTCGGGGAACAGCACGCGG
CGGACGTACACCGAGGATGAGCTGCAGTCAGCTCTCAGAGACATCCAGTCCGGGAGGCTG
GGCACGAGGCGGGCGGCCGTCGTGTACGGCATCCCGCGCTCCACGCTCAGGAACAAGTTC
AACAAGTTCGGGCTGGCGGCGGAAGCGGGCCCGGACTCCGAGCCGGACTCCGAGTCCGAC
AAGCCCGACTCGCCGCCATCCGTCATACTTAAGATCCCGACCTTCCCGCCGCCGGACGAG
AAGAGCCCCTCCCCGGCCGCCCCCGTCACCCCCCTCACTCCGCTCCTGCCTCCCCAGCCT
CCCCTCAACCCTCCGTCGCAGCTGCTCCTGTCTCCATCCGTGTTCGCGGACCCCTCCACC
CCCCGGCATCTCTTCACGTCGCTGAACGACGTCATCGCCAAGAGCATCAGCCAGAAGTTC
CAGCAGCCCTTAGAGCGCCCGCCCCCCGCCGACCTGCAATACCTCCGCCCGCCCGACAGA
CACGTGTCCGTCATCAAGACGCCCCCCGACAACCAGCGGTACTCCGCGCCCGGGAACTCC
AAGAACAACGGCCAGCCGCCGGCCGGGGGGAAGGGCACCAGACCCAAGAGGGGCAAATAC
AGGAACTACGACCGGGATAGTCTGGTGGAGGCCGTGAAGGCGGTGCAGAGGGGGGAGATG
TCCGTGCACCGCGCCGGCTCCTACTACGGAGTGCCCCACTCCACGCTGGAGTACAAGGTC
AAGGAGAGGCATCTCATGAGACCCCGGAAGCGCGAGCCCAAGCCGCCGCCCGACGTCAAG
CAGCCGCCGCCCAAGCCCTCGAAGCCGCCGACGAAGCCCTTCACGAACGGCCTCAACGGC
CCCGAGGCCTACCCGGCCGGCTACCCGTTCTGGTCGGGCGCTCCGCCCTTCAGACACGCG
CCCGACCTGTACGCCTCGCACATGATGCGGCGCTTGCAGGAGGAGGCGCCGCCGCCCGCC
AACGGCTCCTTCCTCGAGGGCATCATCAGGTCCAGCCTGGAGCGGCCCGGCGCCGCCCTG
CTGCAGCGGCTGGGAGCACCCGCCTCGCCCCCCGCCCCCGGCCCCGGTCCTGGCCCCGGA
CCCGGCCCCGGCGCGGCGGGTGACGCACTGCGCCGGCGACCGGACGCCGCGGACGAGTCG
GCCGCCAAGAGACCGCGCCTCGACACCGACCACCAGCTGGCGGCCGACATGAGGGAGGCG
GTGCAGAGGCTGCGGGCGGACAAGATGAGGCCCAGGAACGGCACCCCCCCCCGCCCCCCG
CACCGCCCGCCCAGGAGAGGGTCTAGCCTCACCGGGCACCGACATCACAATAGTTACCTT
ACACACACACACACAGAGACGACACACCGGACGGAGGCGCCCGGCGGACCAGCACCGGCA
CACTCCCCCCCCGACCCCCGACCGGCCCGGCCCGGCCCGGAGACCCTCGGTGCAGACCTC
GTTATGTAA

Protein sequence:

MLRRQSLQAEDKPGSSPTTEQPLDLSAKSTSSTSGTPPPEKVTDTRLKRPPLEASGNSTR
RTYTEDELQSALRDIQSGRLGTRRAAVVYGIPRSTLRNKFNKFGLAAEAGPDSEPDSESD
KPDSPPSVILKIPTFPPPDEKSPSPAAPVTPLTPLLPPQPPLNPPSQLLLSPSVFADPST
PRHLFTSLNDVIAKSISQKFQQPLERPPPADLQYLRPPDRHVSVIKTPPDNQRYSAPGNS
KNNGQPPAGGKGTRPKRGKYRNYDRDSLVEAVKAVQRGEMSVHRAGSYYGVPHSTLEYKV
KERHLMRPRKREPKPPPDVKQPPPKPSKPPTKPFTNGLNGPEAYPAGYPFWSGAPPFRHA
PDLYASHMMRRLQEEAPPPANGSFLEGIIRSSLERPGAALLQRLGAPASPPAPGPGPGPG
PGPGAAGDALRRRPDAADESAAKRPRLDTDHQLAADMREAVQRLRADKMRPRNGTPPRPP
HRPPRRGSSLTGHRHHNSYLTHTHTETTHRTEAPGGPAPAHSPPDPRPARPGPETLGADL
VM