DPGLEAN15291 in OGS1.0

New model in OGS2.0DPOGS215998 
Genomic Positionscaffold370:+ 34222-62100
See gene structure
CDS Length1476
Paired RNAseq reads  408
Single RNAseq reads  1334
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA000933 (1e-21)
Best Drosophila hit  twin of eyg (3e-38)
Best Human hitpaired box protein Pax-8 isoform PAX8E (9e-21)
Best NR hit (blastp)  GL24956 [Drosophila persimilis] (9e-99)
Best NR hit (blastx)  eyegone [Tribolium castaneum] (2e-42)
GeneOntology terms




  
GO:0005667 transcription factor complex
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0005634 nucleus
GO:0045449 regulation of transcription
GO:0006355 regulation of transcription, DNA-dependent
GO:0043565 sequence-specific DNA binding
InterPro families




  
IPR009057 Homeodomain-like
IPR011991 Winged helix-turn-helix transcription repressor DNA-binding
IPR012287 Homeodomain-related
IPR001523 Paired box protein, N-terminal
IPR001356 Homeobox
IPR017970 Homeobox, conserved site
Orthology groupMCL17339

Nucleotide sequence:

ATGTTGATATCTGCGGGCAACGGCCCCATACCCGAATTCTCTCCTTCCTCGTGCTGCGCG
TCCTGGAAGATGGAGCCAGCGCCTGCTGCCTCCATAACACAGCCATTGAATATATCGCCC
AACGTGCCAGTGTCACCATCATCATCAGGAGGTACAAACACGCTGTACGCGGGCGCGTCA
CACCCATTGTCGTCTTTGTTGTCACAACAGCGGTTACTCGAACTGTCTCGCTTCGGTTTG
CGTCACTACGATATAGCTCATCACGTGCTGTCTCAGCAAGGTGCTGTTACGAAGCTGCTT
GGTACACTCAGGCCACCTGGTCTCATCGGTGGCAGCAAGCCAAAGGTCGCTACGCCGGCA
GTAGTTTCAAAGATAGAACAATACAAAAGAGAGAACCCTACCATTTTTGCTTGGGAGATA
CGAGAGCGGCTTATTTCTGAAGGCGTCTGTACTAATGCAACAGCACCGAGTGTATCTTCA
ATCAATCGGATACTCAGAAACAGAGCCGCGGAAAGGGCTGCAGCAGAGTTTGCTAGAGCG
GCGGGATATGGTCTGTATGCTGCACCACCTCCATATGGCGGGTTCCCGTGGGCCAGCGGT
GGTGGTGTATGGCCACCTGGAAGCCTACCCCTACCTCCAGGAGTACCGCCTTCTTCAGTT
GGAGTACCTCATCCGGATGCTGTTAAACAAGGCTTTTTATCATCGTCGGGTCGTAGCTTG
ATTGATGTGGATGGAGACGATTCGGGATCACTAGATGGGGAACAACCAAAGTTTAGACGA
AATCGAACAACATTTAGTCCTGATCAACTAGAAGAGTTAGAAAAAGAATTTGAAAAATCA
CATTATCCCTGCGTTTCAACACGAGAACGATTAGCCTCTAAGACTTCTTTATCCGAGGCA
AGGGTTCAGGTTTGGTTTTCCAACAGACGAGCCAAATGGAGGCGCCACCAGCGCATGAAT
CTCCTCAAACGCGGTGGCTCTCCTTCGCACCGCCTCCCCCACTCTCCCTCCCGTTCCCGT
TCACGTTCTTTATCCCCCACCCGAATTCCCTACCACGCTCCTCAAATGGGCGGTGAAAAT
AGTGCTTTCAAAGCTTTAGGCCATCAAGATACTAATACGCTGAAAGCCCTCACGCATCAA
AGCCAATTCGAAACGAACGCACTAAAAGCTTTATCCCAACAGACAACATTTGATAGCAAT
CCTTTTAAATCACATCCAGCTCTAGAGAATAGTGCATTTAAAGCGCTCGTACCAAACTCA
GCGGCAGCGGCGTTATTGGCCGCACAATCGATACAATTAGCCCGCGGATATGAATCTCAT
TCGGATTCCGATGAGGAAATAAACGTTCATGATGAGAGTGAAGACGAGGCCGAGAAACAG
ATAAATGCGATGAGATCTAGATCACCGAGCCCGAGCCGACATAGAATGACGACAACCAAT
GACGTGCCGCTGCAATTAACTAAGCATGACCGTTGA

Protein sequence:

MLISAGNGPIPEFSPSSCCASWKMEPAPAASITQPLNISPNVPVSPSSSGGTNTLYAGAS
HPLSSLLSQQRLLELSRFGLRHYDIAHHVLSQQGAVTKLLGTLRPPGLIGGSKPKVATPA
VVSKIEQYKRENPTIFAWEIRERLISEGVCTNATAPSVSSINRILRNRAAERAAAEFARA
AGYGLYAAPPPYGGFPWASGGGVWPPGSLPLPPGVPPSSVGVPHPDAVKQGFLSSSGRSL
IDVDGDDSGSLDGEQPKFRRNRTTFSPDQLEELEKEFEKSHYPCVSTRERLASKTSLSEA
RVQVWFSNRRAKWRRHQRMNLLKRGGSPSHRLPHSPSRSRSRSLSPTRIPYHAPQMGGEN
SAFKALGHQDTNTLKALTHQSQFETNALKALSQQTTFDSNPFKSHPALENSAFKALVPNS
AAAALLAAQSIQLARGYESHSDSDEEINVHDESEDEAEKQINAMRSRSPSPSRHRMTTTN
DVPLQLTKHDR