DPGLEAN13459 in OGS1.0

New model in OGS2.0DPOGS216202 
Genomic Positionscaffold63:+ 110859-128636
See gene structure
CDS Length4779
Paired RNAseq reads  900
Single RNAseq reads  2309
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004549 (8e-06)
Best Drosophila hit  castor (4e-76)
Best Human hitzinc finger protein castor homolog 1 isoform b (8e-77)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC015058 [Tribolium castaneum] (8e-143)
Best NR hit (blastx)  PREDICTED: similar to castor CG2102-PA [Apis mellifera] (6e-161)
GeneOntology terms












  
GO:0006355 regulation of transcription, DNA-dependent
GO:0005634 nucleus
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0007417 central nervous system development
GO:0045892 negative regulation of transcription, DNA-dependent
GO:0003677 DNA binding
GO:0006357 regulation of transcription from RNA polymerase II promoter
GO:0003702 RNA polymerase II transcription factor activity
GO:0016319 mushroom body development
GO:0009791 post-embryonic development
GO:0007402 ganglion mother cell fate determination
GO:0007419 ventral cord development
GO:0008270 zinc ion binding
GO:0040034 regulation of development, heterochronic
InterPro families
  
IPR015880 Zinc finger, C2H2-like
IPR007087 Zinc finger, C2H2-type
Orthology groupMCL15845

Nucleotide sequence:

ATGACTACTGCGATGGCACATCAGGAATCCATACACGAGAGACTGCAGTCGGGACCCGAG
GAACCGCTGTACCTGCACCATGGAGCATCCTGGAACTCAGAGACGTCGGAACACCAAGAG
GATCACAGCACAGACATGTCGCCCTACTACAACATGCCAAAAAGAAATAAGAGAAAGAAT
TTCAAACCACGATGTGCTCCCAACGCTACGACGTTCTCTGACGACTCCAACGGAGGCAAC
TGCGAGAGCAGCCTCACAAACGGAAATGACAGAACTACACCTGAACAGGCTGATGAGGAA
ACGGATGGCTACAATAAAATTGGTGTTAAGAACTTCGGTCTGATAAAAGGTGGAGCTGTC
GACCTCAGCCGGCGATTGAGCACTGATTCCAATGAAAACATAGAATCCACCTACCAGAAC
ATCACGGACGAGAAGAAAATAGACTCGTTCCAGCGCTCGTACATACACAGCCTGAAGGTG
AACCAGCCGGCCGAGAGAGATCACACCGTGCTTAACTTTAGTAATTTACAAAACAAAACA
TTCTGTGCCAAAAATCCTTTCGCCATATCCCAGTTAATAAAAACCAATTCGCCTCCACGG
AGAAGGGAATCCGATTCCGACGATGAGGCTAAGGGTCAGGATATAAGTGCTAATGACAGG
TTGGAGAACCAGGTTATAGGGGAGTCTAATGAAGGTCAACAAAACGGTGATCCGTCTAAC
GCTACAAATAGAATCTTGAGGGATTACGCTATGAACACCATGAAAGAGCTATTGGGTATC
TATGGCTTAAGAGCTTTGGAGGCTCTTCAAGGTAAACCCTTCACTCAGTTACCTTTGAGT
CTGAAGCGACGTCATGATTCAGGGGATGGAACTGATAGAAGCATCCGAAGATCCTCTTCA
GCGACTGAAGATGAAGGATCCACCAACATAACCCCTCCAAAAACACCAGACAACGACATA
GAAGCCCAAAGACAAAGGGCCCTTCGGATAATAAATCAGCAATCAATGCGTCTGTTTCGT
TCTGGGCGAGAGGAAGACAGTTCTGATGATGGAGACCAGACCCTCGACCTCAGTGTGTCC
AGAGATACAGACGGACAGGAAATGTCTACAACTTCGAGCGCCGGTAACAGCGTGTCGGAG
TACGATCAGAACATGATGATGAGGAAGAACCTCGAGGACCTGGCGAATTTACAAATGCAG
AATTTCTTCCAAAAACAATCCATGATGGATCCCGAAGACTCTCAGGAACCGAAACTCCAA
GCCAGTGGACTGCTGTCAGCCTTGAATCATCTGGATCAGGCTCGCAAGAACTCCATGCAA
ACGACCGTGGATTACTCCAGATACGTCAAAAGGTACAGTTCAACACTAGAGTGTGGTTCA
TCATATTGTAAGGATTTGGGATATCGAGAACATTTTCATTGCATGGATTGCACTGCGAGG
GTCTTTTGCAAAAAGGAAGAAATGATAAGACATTTCAAGTGGCACAAAAAACGAGACGAA
TCTTTAGCGCATGGTTTTATGAGATATTCTCCCCTCGACGACTGCTCCGAGAGGTTCAGT
GACTGTCCTCATAATCGAAGTCAAACACATTACCACTGCATACAGGATGGCTGTGATAAG
GTGTACATATCTACATCGGATGTCCAGATGCATGCGAACTATCACCGCAAGGACTCTGCC
ATACTTCAAGAAGGGTTCCAAAGGTTCCGCGCCACTGAGAGCTGTTCCGCTCCACATTGT
ATGTTCGCCGGTCAACGGACCACGCACTTCCACTGTCGAAGGCCTGGATGCACCTACACA
TTTAAAAACAAAGCCGACATGGAGAAACACAAGACTTACCACATAAAAGACGAGGCGTTA
TCAAAGGACGGTTTCAAAAAGTATATGAAGAGCGAAGCCTGCCCGTATAGAGATTGCAGG
TTTAGTAGGACCTGCAACCACATACACTGCATCAGACCGCACTGCAACTACGTACTGCAC
TCCAGCAGCCAGATCTTCACACACAAACGGAAACACGAGAGGAAGGAACAGGAAATGAGT
TTTGGGTTGCCAACGTCTGTGATCCAAAATGCTTTAATGCAAGGAGATCTGTCATATGCC
ATGCATGATGACGACATATCCGTTGAAGATTACACCCAAGCTTATAACCAACCCTTGGTG
GATGAAGTAGCACGCAAGTATATTGAAACGTTCAACGGCGCGAAGGAAGCTGAAGACAAG
TGCAGCAAATGTAGCGCCTTCAACAAACACTACCACTGTTTGGCTGAAGACTGCAAGATG
GCGCATAGTTGTCACAACACAATGGTAAAGCACGTCATGGAACACGAACAACAGAGCCAA
GTAACGGAAGCTTACTTCGTGACGTATACGAGAAATAATCCTTGTTCCAATTCGGCGTGC
CAGCATATCAAAGACATCACCCATTATCACTGTATTTGGGAGAACTGTGGAGCTGTCATA
CTATCTTCAGAGGAACATCCCTTCAGACGCTTGGAGCACTATCGCCAACACGCCAATCTC
AGTCCCATGTCGCCAAACTTGGCAGCCAGATTTGCGCCCACGTTCACTCCGCAACTATCG
GCTCAATTACACCCAAACATGGCGGCCCAACTTGGCCAGGTGACAAATTTACCCAATCTT
CCCCCTAATTTGGCTCAGTTGGCCCAAATGGCACCAAGTCTAACGTCCCAATTGACGCCG
AACTTACAAGCTCAATTGAATCTTGGGCCTAATCCCACGATTGCACCACAACAAGTGAAC
CCATCGAGTTCGTTGGACGAGATGTTTAGCCGGAAGAGGGGGAGACCTCCAAAGAATCGT
GTTGTAGAGGTCTGGACAGATAATGTGACTCCTCAAGCTATATTTACATCGTTCAAACTG
CCGAAGAGCAACCAACTGCCGCCTCTAGTACAATCACCAATTATTGCTACAGTCGAAGAA
TTCCCAAGAATCAGATTCCAGAATTTCGAAGTGTTTACAACACCTGATTCGTGCATTCAG
TATTATGCGAATTGTGCATTAAGATCAACAAGGTTGCATTTCCATTGTTTCAACTGTAGC
TTCACGGGTGAAACCCACGTGACAATGGAAACTCACATGAGGGAATCCCATTCGATAGCC
CCGCTATTGGAAGGCTTCGACTATTTCTCCTCCTTCGACCAATGCGGTGGAAAGAGCTGT
TTTAAGAACCAATTAAGATCGCACTTCCATTGCGCCCAAGGAAATAATTGTCCGGCGATT
TTAACCCAATATTCAGCATTGGCCACCCATAAGCACGGAAGGGGTACTCCTGTTATTAAG
AGTGAATCAGAACAAGAGAAATCTTATGAAGAAGCCAAAGATTATTCTATTAAAGAACGA
GCTTCAGCCGACAGCGTTGCTTCCATGAAAGATCAAGCTGAATCATTTACACCCACAAAT
TTCTCTATCAAAAGCGAATTCGAAAGAGAACAGGATAATATTTCGGTAGTAAAAGCGACA
GGAACTTTCTACCCGTCATCTCATCTCAGGAACTCGTCTCCATCCCCAAGAAGTATCTAC
GACGCCTCCCCGTCTAAGGATATTAAGTACGAAAAGAAATTCGAGCCAAAGTTAACTGGG
CCAACAATACCGCCGTCTTGTCACGATCAAAACTGCCCGTTGAACAAAAACGCTGGCGCA
ATGAACTTGCACTACCACTGCCCGCACTGCAGCCAAGCCTATGTGGATTTGAAGCTGTTA
TTCAGTCACATGGTCAAAAAACATAGCAACGCCATCGAACCAGGGCCAGTCGGATTGGCC
GCTACAAAGGAAACGCTGGAGAGAGAGTACCCAGAAATTTCAATTCTACCGTCAACAGCG
ACATCTTCAAATGCCCAAACACCACAACAAAGTCATCGACCTCCGAATCCTGCTGAACAG
GTGCAAGCTGTTCAATCCCTACTGCTGCAACAGTACTTGGGTTCAGGTCGCAAATCGCTC
CAGGATCAATTGAAGATGCAGCAGTACTCCTCACTGGCAGGGCTGCCTGGATTAGCTCAA
GTGGCGTTATTCTCTCAGGGTGGTTCCGCATTCCCTATGTATCCGACTATGTTGTATCCG
CCCGAGCTTCTTCTTGAGCAGAGTCTTCTTCAAAATCACGGTCTGCCGCCAGGCCTGGAC
AAAGAGGCGGAAATGATCGCTAAATCCAGGAGAAGTACTGGAGCTAGAGGACCTCATATG
AGGGTTTTAAAGGATGAACCCATACCGGATGGATACTTGCGTTTCCGGTTTAATGAGGAT
TGCGCCTATCAGCAGTGCGGATATAGGGAACATCAAACTCACTTCCACTGCACAAGAAAG
GATTGCGGTTACTCATTCTGCGATAAAACAAGATTCGTCCAACACACGGCTAGACATGAA
CGTTTGGACACTCTAATGGGTGGGGACTTCCAACAGTACCGAGCGAACGTGTACTGCCAG
CGACCAGAGTGCCCTCACGCCTCCACATTCGGCACGGGACAGAACAAAGCTTCCCATTTT
CACTGTCTCAAGTGTGACTTCGTATGTACGGACACCAATAAGGTTGTTGCTCATCGCCGA
CAACATCAGAAGCTCGACTCCATACAGGCCGCTGGCTTCCAGAAATTTCCTCCAAGCAAA
GCGTGTGGTTACGAACCCCAGTGCATTCACAGCAAGAAACAGACCCACTACCACTGTTTG
CAATGCGGTTTCGCTGTTCTTGGGTTATCGCAGATGACGTCACACAAGTACAAGCACCAG
GAGGCGAGCCTCGGACCGTCGACCAGCTCGACCAACTGA

Protein sequence:

MTTAMAHQESIHERLQSGPEEPLYLHHGASWNSETSEHQEDHSTDMSPYYNMPKRNKRKN
FKPRCAPNATTFSDDSNGGNCESSLTNGNDRTTPEQADEETDGYNKIGVKNFGLIKGGAV
DLSRRLSTDSNENIESTYQNITDEKKIDSFQRSYIHSLKVNQPAERDHTVLNFSNLQNKT
FCAKNPFAISQLIKTNSPPRRRESDSDDEAKGQDISANDRLENQVIGESNEGQQNGDPSN
ATNRILRDYAMNTMKELLGIYGLRALEALQGKPFTQLPLSLKRRHDSGDGTDRSIRRSSS
ATEDEGSTNITPPKTPDNDIEAQRQRALRIINQQSMRLFRSGREEDSSDDGDQTLDLSVS
RDTDGQEMSTTSSAGNSVSEYDQNMMMRKNLEDLANLQMQNFFQKQSMMDPEDSQEPKLQ
ASGLLSALNHLDQARKNSMQTTVDYSRYVKRYSSTLECGSSYCKDLGYREHFHCMDCTAR
VFCKKEEMIRHFKWHKKRDESLAHGFMRYSPLDDCSERFSDCPHNRSQTHYHCIQDGCDK
VYISTSDVQMHANYHRKDSAILQEGFQRFRATESCSAPHCMFAGQRTTHFHCRRPGCTYT
FKNKADMEKHKTYHIKDEALSKDGFKKYMKSEACPYRDCRFSRTCNHIHCIRPHCNYVLH
SSSQIFTHKRKHERKEQEMSFGLPTSVIQNALMQGDLSYAMHDDDISVEDYTQAYNQPLV
DEVARKYIETFNGAKEAEDKCSKCSAFNKHYHCLAEDCKMAHSCHNTMVKHVMEHEQQSQ
VTEAYFVTYTRNNPCSNSACQHIKDITHYHCIWENCGAVILSSEEHPFRRLEHYRQHANL
SPMSPNLAARFAPTFTPQLSAQLHPNMAAQLGQVTNLPNLPPNLAQLAQMAPSLTSQLTP
NLQAQLNLGPNPTIAPQQVNPSSSLDEMFSRKRGRPPKNRVVEVWTDNVTPQAIFTSFKL
PKSNQLPPLVQSPIIATVEEFPRIRFQNFEVFTTPDSCIQYYANCALRSTRLHFHCFNCS
FTGETHVTMETHMRESHSIAPLLEGFDYFSSFDQCGGKSCFKNQLRSHFHCAQGNNCPAI
LTQYSALATHKHGRGTPVIKSESEQEKSYEEAKDYSIKERASADSVASMKDQAESFTPTN
FSIKSEFEREQDNISVVKATGTFYPSSHLRNSSPSPRSIYDASPSKDIKYEKKFEPKLTG
PTIPPSCHDQNCPLNKNAGAMNLHYHCPHCSQAYVDLKLLFSHMVKKHSNAIEPGPVGLA
ATKETLEREYPEISILPSTATSSNAQTPQQSHRPPNPAEQVQAVQSLLLQQYLGSGRKSL
QDQLKMQQYSSLAGLPGLAQVALFSQGGSAFPMYPTMLYPPELLLEQSLLQNHGLPPGLD
KEAEMIAKSRRSTGARGPHMRVLKDEPIPDGYLRFRFNEDCAYQQCGYREHQTHFHCTRK
DCGYSFCDKTRFVQHTARHERLDTLMGGDFQQYRANVYCQRPECPHASTFGTGQNKASHF
HCLKCDFVCTDTNKVVAHRRQHQKLDSIQAAGFQKFPPSKACGYEPQCIHSKKQTHYHCL
QCGFAVLGLSQMTSHKYKHQEASLGPSTSSTN