DPGLEAN14103 in OGS1.0

New model in OGS2.0DPOGS205508 
Genomic Positionscaffold699:+ 28668-43384
See gene structure
CDS Length2844
Paired RNAseq reads  1223
Single RNAseq reads  3679
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA000140 (6e-35)
Best Drosophila hit  CG7839 (1e-82)
Best Human hitCCAAT/enhancer-binding protein zeta (5e-112)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC002139 [Tribolium castaneum] (5e-179)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC002139 [Tribolium castaneum] (2e-167)
GeneOntology terms



  
GO:0006366 transcription from RNA polymerase II promoter
GO:0003677 DNA binding
GO:0005488 binding
GO:0045449 regulation of transcription
GO:0005634 nucleus
InterPro families

  
IPR016024 Armadillo-type fold
IPR011989 Armadillo-like helical
IPR005612 CCAAT-binding factor
Orthology groupMCL12474

Nucleotide sequence:

ATGAAGCACAAAAGATTCGAAGAGAACATTTTAGTAGACGAAGCGTATACGAATTACGGT
AAAGTTGATAAAAATGCTGATGAAAAAAAAGGTTATGGTATAGCAAAACATTTCGCTGAT
ACTCTAGAATACGCGGACAAAAAGAAATGGTATCAGCAGCTACCAGAAGAGCCTTCGACA
CAAGGATCGATAACGCAAGAAAAAATTGAAGAACTCCGTAAAGAAGCGAGTAGTGCTCTA
CATGGTGACACATTAGCTTATGAAACAAAATCAAGTAAAAGTGGTTCATCCGACCAACAG
TGGGCTCGTACACTATTAACCAAAGGTACAATTGGAGACAGAGTGGCCGCAGCAACAATA
CTAATACAGGACAATCCGCTGTATAATTTGACGGCATTGAGGAATTTAATCAATAATGTG
AAGCCAGCTAAGAAAAAGGATGGAATAGTTATAATAGATGCTGTATCGGAACTGTTAGTA
TCAGAGCTGCTCATCCCTGACACCAAGCTGCGCACCTTCCAGCAGCATCCTCTACATGTC
ATTGATGAAATCACATCGGGCAACAAACAGGCAAGACGGAATGTTCTAAAACTATGGTAC
TATGAGGATCAACTTAAAGAGTTGTACGGAACTTATGTGGAAGCATTAAACAAGTTTGCT
CATGACACAGTGGACGCTAATAAGGAAAAGTCTATCAGTGCCATGTCTTACCTCTTGATG
CATCATCCGGAGAGAGAAAAGATGCTCCTAACAAATATAATAAATAAACTCGGTGATCCG
AGCCAGACGGTGGCATCGAAAGTGATCTACCATCTCTGTCAACTTCTATACAACCATCCG
AATATGAAATCTGTTGTTTTAGCTGAGATAGAGAAAATGTTGTTCAGATCTAACATATCC
CCACGCGCTCAATACTATGGAGTGTGTTTCCTGAACCAGTTCTTCCTTGGTAAGGATGAC
AGTCGGATAGCTGAGAACTTGATCAGAATATACTTCTCATTCTTCAAGGCTTCTATCAAG
AAGGGTGAAATAGATTCTCGTCTCATGTCAGCTATCCTGACGGGTGTGAAGCGAGCCTAT
CCCTTTGCTGACAGGGAGCGGTTGGTTGAGGCCTCCCAGCATGTAGACGCTGTACACCGA
CTGGTCCACCTGGCCAACATCAACGTGGCGATCCATGCACTGGCCTTGCTGTATCACATC
AGTGATGCTAACAAAGGGACATCCGACAGATACTACACAGCCTTGTACCGGAAACTGACA
AATTCCAATATATTCAATACTACCCACTCTGCATTGTTCTTCTCTCTCATATACAAGTCG
TTGAAGCAGGACAAGGATATAGACCGGGTGACGTCATTCATCAAGAGATTATTACAGCTG
TCCTGCTACATGAGCCCTGGCCAGGCTTGCGGAATGCTCTTCCTCATCTCGCAAGTATTG
AAGAGTGATGATAAGAGAGAGGCTGTAAAACTGGTCTTCAGTGAGATTAAAGAGGAAATT
AAAGAAGAAAATGAAACTAAAAATAATGATGAAAATCCAGAAGAATTAATGCATTCAGAA
GTTGAATTAGATGAGAGTAAGGAAGATGCTGAGGAAAATGTCAAACAGAAAAAAATTGAT
CTCTTAATAGGAGATAAGAAAGATTTATTAATGGATGATGAAGAAGAGACATATGTTGAC
CTCAAAATAGACGATGAAGGTAACATAAAGCCTAAGAAGAGGAATACGAACTCTGTGACT
GGGTGGTTTCATGCTAGAGTTGACAAGAAAGATGTACAAGAAAAAAACGTTGAGAAACAG
TTGAAGAAAGCTATTAATATTGGAAAGACGATAACCAGTTATAGTCCACTGTGCCGTGAC
CCTCGTTTCACCGGAGCACACCTGACGGCGATGGCTGAACTGACAATGCTGATGAAACAT
CATCATCCGAGTGTCAAGATGTTTGCTGAAAAATTACTGAATAATCAAATAATCCAATAT
GGCGGCGATCCTTTGAAGGACTTTTCCGGTATCCGTTTCCTGGATAGATTCGTGTTCAAG
AATCCAAAGAAACGTGCCGAGGTCACTGATGGGGAGGTCAAAAAGGTTAAGGGGTCACAT
CCGAAGTTCGCTGTTAGAAAGAACTATACAGCTAAAGGCATCAGAAGTATCGCTGTCAAT
TCATCGGCATATTTGAATGAGGATGTCAAGAAAATTCCTGTCGATGAAAGATTCCTATAT
GATTTCCTTCAAAAGCGCCGAGCGGCTGCTGATAGTGATGAGGAGAGTGACAACGACTCG
GTGACCAGCGAAGATTTTGAGACCTATTTGGATTCAGTCACTGGAACCAAAGCACAGGAA
TCCGATGAGGAGTTAGATTATTTGGGTGAATTGGAGTCGAGTAAACAGAAACGACCGAAG
GAAGTTGATGATGAGAAAGATGAGGTGATGAGCGATGATCAAGATGAAGACGATGATAGC
GATGGCGAACTCAATATATCCGGTGATGAAGACGAGCCAGTACTATCCGGAGACGAGGAC
GAACTAATGTTAGAAGACAGCGAAGAAGAAGACCAGATAGATATACCAGGAAAGAAGTCC
AAAAAGGATGCTATTAAATTAAAAGCTGCGAGTGACATTGACGTGGTGGAATCACGTAGG
CGCCCCCTCGGTTATACACAGTACAGTTATATTGGATTTTTAAATCGGTCGGATATTGTC
GCAGCGGCGCCCTCTGCAGACTGGAGTCACTTGAACTTTGCACAAGTCCACTGCAGGTTT
GAGTATGGTCGCGAATTATATTACGTGTCGTGTTTACTTTCGATACATTTGAAACAATTC
AAACTAAACAACGGTACTCGCTAG

Protein sequence:

MKHKRFEENILVDEAYTNYGKVDKNADEKKGYGIAKHFADTLEYADKKKWYQQLPEEPST
QGSITQEKIEELRKEASSALHGDTLAYETKSSKSGSSDQQWARTLLTKGTIGDRVAAATI
LIQDNPLYNLTALRNLINNVKPAKKKDGIVIIDAVSELLVSELLIPDTKLRTFQQHPLHV
IDEITSGNKQARRNVLKLWYYEDQLKELYGTYVEALNKFAHDTVDANKEKSISAMSYLLM
HHPEREKMLLTNIINKLGDPSQTVASKVIYHLCQLLYNHPNMKSVVLAEIEKMLFRSNIS
PRAQYYGVCFLNQFFLGKDDSRIAENLIRIYFSFFKASIKKGEIDSRLMSAILTGVKRAY
PFADRERLVEASQHVDAVHRLVHLANINVAIHALALLYHISDANKGTSDRYYTALYRKLT
NSNIFNTTHSALFFSLIYKSLKQDKDIDRVTSFIKRLLQLSCYMSPGQACGMLFLISQVL
KSDDKREAVKLVFSEIKEEIKEENETKNNDENPEELMHSEVELDESKEDAEENVKQKKID
LLIGDKKDLLMDDEEETYVDLKIDDEGNIKPKKRNTNSVTGWFHARVDKKDVQEKNVEKQ
LKKAINIGKTITSYSPLCRDPRFTGAHLTAMAELTMLMKHHHPSVKMFAEKLLNNQIIQY
GGDPLKDFSGIRFLDRFVFKNPKKRAEVTDGEVKKVKGSHPKFAVRKNYTAKGIRSIAVN
SSAYLNEDVKKIPVDERFLYDFLQKRRAAADSDEESDNDSVTSEDFETYLDSVTGTKAQE
SDEELDYLGELESSKQKRPKEVDDEKDEVMSDDQDEDDDSDGELNISGDEDEPVLSGDED
ELMLEDSEEEDQIDIPGKKSKKDAIKLKAASDIDVVESRRRPLGYTQYSYIGFLNRSDIV
AAAPSADWSHLNFAQVHCRFEYGRELYYVSCLLSIHLKQFKLNNGTR