DPGLEAN21935 in OGS1.0

New model in OGS2.0  DPOGS203265
Genomic Position  scaffold261:+ 11503-15300
CDS Length  1683
Paired RNAseq reads  888
Single RNAseq reads  2311
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA000445 (2e-10)
Best Drosophila hit  jumeau (7e-61)
Best Human hit  forkhead box protein N1 (3e-48)
Best NR hit (blastp)  conserved hypothetical protein [Culex quinquefasciatus] (5e-87)
Best NR hit (blastx)  conserved hypothetical protein [Culex quinquefasciatus] (6e-78)
GeneOntology terms
GO:0003682 chromatin binding
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0005634 nucleus
GO:0006350 transcription
GO:0008105 asymmetric protein localization
GO:0007400 neuroblast fate determination
GO:0005730 nucleolus
GO:0005701 polytene chromosome chromocenter
GO:0048749 compound eye development
GO:0005700 polytene chromosome
GO:0007476 imaginal disc-derived wing morphogenesis
GO:0006325 chromatin organization
GO:0007391 dorsal closure
GO:0043565 sequence-specific DNA binding
GO:0006355 regulation of transcription, DNA-dependent
GO:0048813 dendrite morphogenesis
GO:0048666 neuron development
GO:0007517 muscle organ development
GO:0000785 chromatin
InterPro families
IPR001766 Transcription factor, fork head
IPR011991 Winged helix-turn-helix transcription repressor DNA-binding
IPR018122 Transcription factor, fork head, conserved site
Orthology group  MCL19496

Nucleotide sequence:

ATGGATCTGTACATCACTGACTCACTCCAGGACATGCTGGATATGGACATCAAAAACGAA
ATAGCGACAGACTTGAGCAGTATAACAGATTTTTCTGATTCATTAGGATTAAATTTCTCA
GAAATGCCACCGTTATTAGACATGGAAACAGATAATTCTGTGACGTGGCTGAATAACTCT
TCGAGTTTCGTACACAATCTCGATCTGTATGGATCGGAAGCGAACGCTGTCATGGTCAAT
CCGAACTCCGTAATGCCGTCGACGTTCGCTGAAACTCCAGTCAAAAGTATTGTAAAAGAA
GAGGCTTCACATTTGCTGCTCACTTCAGCTGCGAATAACGATCTCACCAATAACACGTCG
CTATCGAGTCCTAAAGAGGAAAAAAGTCATTTAACATTCTCACCGAACGCCATCAAGGTG
GCTAAAGTACAGGAATCGGAAGACACAAAAAACAAGAAGCCGATGGAGGAGGCGACGCAG
ATGGTCATTTATGTGCGTAAACAAGACAAGACTGTAGTCAAAGATTTATTAAAGGATTTA
GACACGAATAAAACTAAGAGTTCAACCTTAACGCCCACCGTCAGAATAAAGTCGAGTCAA
CAGGAAGTATTAAAAATAAATAATAAGAACTGTTCAGTTTTAAACACGAACCAAAAACTA
TCACAGTCTTTAGGCACGAAAACTATTATATCCGGTAATATACACATATTGGACGCACAG
CAATCTAGGACAATTTTAGCTAATGGTAACAAACAAGCTACGATATTAATTGATAATTCA
TCACTAAACAATAGCAGGCAAATAATTAAGACCTCAGTAACTGGTGCATTTACAGTAGAC
ACTAGCCAAGCTAAGTACGTAAATAATTCAAGTAAAACAGTCGCGGGAGAGTTTCCAAAG
CCGGCCTATTCGTATTCATGTTTAATAGCAATGGCGCTAAAAAACTCGAGAACGGGAAGC
TTACCAGTGTCAGAGATTTATAATTTTATGTGTCAACATTTCCCCTATTTCAAAACCGCA
CCAAACGGTTGGAAGAATTCCGTAAGGCATAATCTAAGCTTAAACAAATGTTTCGAAAAG
ATTGAAAAACCATCGACGAATGGAAGTCAACGGAAAGGCTGTCTATGGGCTATGAATCCA
TCGAAAGTCGGCAAAATGGATGAAGAAGTCCAGAAATGGTCCAGGAAGGATCCTCAGGCA
ATCAAGAAAGCTATGATTTATCCAGAGACCTTGGAAGCGTTGGAACGCGGAGAGATGAAG
TACAGCGGGTTCGGCAGCGACAATGACGCGGACGAAGATAATGATAACGATAATGACACA
GAGGACTTGGACTTGGAGATAGACCCTGAGGTCAAAGATGAAGAGCAGGAGGAACATGTG
CATGAGGAATCAGATCAAGAATTGGAAGTAGAAGAAGTGGAAGGAACGGGCATGGTTGGG
GCTTACCGCGTACTGGCTCCAGGGCTATATGGAGACCTGAGTGATGTTGAGGTATTGGAT
CAGTCGTACGAAGAGATCGACATCGATACTAAACCAGTAAAATTAGACCTATCTGTTACC
GAAAATTATACGATCCATTCCGCTAAACGGGCAAAGACGAGCTTCATATACCAGCCGGTG
ACGTCACAGACGCACACGAGTCGAAGAAAGACGCCGCTCGTCAACAGAATAGCGTTAGTT
TAA

Protein sequence:

MDLYITDSLQDMLDMDIKNEIATDLSSITDFSDSLGLNFSEMPPLLDMETDNSVTWLNNS
SSFVHNLDLYGSEANAVMVNPNSVMPSTFAETPVKSIVKEEASHLLLTSAANNDLTNNTS
LSSPKEEKSHLTFSPNAIKVAKVQESEDTKNKKPMEEATQMVIYVRKQDKTVVKDLLKDL
DTNKTKSSTLTPTVRIKSSQQEVLKINNKNCSVLNTNQKLSQSLGTKTIISGNIHILDAQ
QSRTILANGNKQATILIDNSSLNNSRQIIKTSVTGAFTVDTSQAKYVNNSSKTVAGEFPK
PAYSYSCLIAMALKNSRTGSLPVSEIYNFMCQHFPYFKTAPNGWKNSVRHNLSLNKCFEK
IEKPSTNGSQRKGCLWAMNPSKVGKMDEEVQKWSRKDPQAIKKAMIYPETLEALERGEMK
YSGFGSDNDADEDNDNDNDTEDLDLEIDPEVKDEEQEEHVHEESDQELEVEEVEGTGMVG
AYRVLAPGLYGDLSDVEVLDQSYEEIDIDTKPVKLDLSVTENYTIHSAKRAKTSFIYQPV
TSQTHTSRRKTPLVNRIALV
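
Consistency check (illustrative sketch):

The CDS Length field and the two sequence blocks above can be cross-checked with simple arithmetic. The sketch below assumes the standard stop codons (TAA, TAG, TGA) and that the listed coding sequence includes its stop codon. The function name `cds_summary` is hypothetical and not part of any database tooling. Under those assumptions, a 1683 nt CDS corresponds to 561 codons, that is, 560 residues plus the stop.

```python
# Minimal sketch: summarize a coding sequence pasted from a record like this one.
# Assumption: standard stop codons; the helper name is illustrative only.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cds_summary(nucleotides: str) -> dict:
    """Return basic consistency facts for a CDS given as (possibly wrapped) text."""
    # Remove line breaks and spaces left over from fixed-width wrapping.
    seq = "".join(nucleotides.split()).upper()
    ends_with_stop = len(seq) >= 3 and seq[-3:] in STOP_CODONS
    codons = len(seq) // 3
    return {
        "length": len(seq),
        "in_frame": len(seq) % 3 == 0,
        "starts_with_atg": seq.startswith("ATG"),
        "ends_with_stop": ends_with_stop,
        # The stop codon does not encode a residue, so subtract it if present.
        "expected_residues": codons - 1 if ends_with_stop else codons,
    }


# Toy input: the first three codons of the record followed by its stop codon.
toy = cds_summary("ATGGATCTG\nTAA")
print(toy)
```

Passing the full nucleotide block from this record in place of the toy string would report its length and the residue count to compare against the protein sequence.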