New model in OGS2.0 | DPOGS216202  |
---|---|
Genomic Position | scaffold63:+ 110859-128636 |
CDS Length | 4779 |
Paired RNAseq reads   | 900 |
Single RNAseq reads   | 2309 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA004549 (8e-06) |
Best Drosophila hit   | castor (4e-76) |
Best Human hit | zinc finger protein castor homolog 1 isoform b (8e-77) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC015058 [Tribolium castaneum] (8e-143) |
Best NR hit (blastx)   | PREDICTED: similar to castor CG2102-PA [Apis mellifera] (6e-161) |
GeneOntology terms | GO:0006355 regulation of transcription, DNA-dependent; GO:0005634 nucleus; GO:0003700 sequence-specific DNA binding transcription factor activity; GO:0007417 central nervous system development; GO:0045892 negative regulation of transcription, DNA-dependent; GO:0003677 DNA binding; GO:0006357 regulation of transcription from RNA polymerase II promoter; GO:0003702 RNA polymerase II transcription factor activity; GO:0016319 mushroom body development; GO:0009791 post-embryonic development; GO:0007402 ganglion mother cell fate determination; GO:0007419 ventral cord development; GO:0008270 zinc ion binding; GO:0040034 regulation of development, heterochronic |
InterPro families | IPR015880 Zinc finger, C2H2-like; IPR007087 Zinc finger, C2H2-type |
Orthology group | MCL15845 |
Nucleotide sequence:
ATGACTACTGCGATGGCACATCAGGAATCCATACACGAGAGACTGCAGTCGGGACCCGAG
GAACCGCTGTACCTGCACCATGGAGCATCCTGGAACTCAGAGACGTCGGAACACCAAGAG
GATCACAGCACAGACATGTCGCCCTACTACAACATGCCAAAAAGAAATAAGAGAAAGAAT
TTCAAACCACGATGTGCTCCCAACGCTACGACGTTCTCTGACGACTCCAACGGAGGCAAC
TGCGAGAGCAGCCTCACAAACGGAAATGACAGAACTACACCTGAACAGGCTGATGAGGAA
ACGGATGGCTACAATAAAATTGGTGTTAAGAACTTCGGTCTGATAAAAGGTGGAGCTGTC
GACCTCAGCCGGCGATTGAGCACTGATTCCAATGAAAACATAGAATCCACCTACCAGAAC
ATCACGGACGAGAAGAAAATAGACTCGTTCCAGCGCTCGTACATACACAGCCTGAAGGTG
AACCAGCCGGCCGAGAGAGATCACACCGTGCTTAACTTTAGTAATTTACAAAACAAAACA
TTCTGTGCCAAAAATCCTTTCGCCATATCCCAGTTAATAAAAACCAATTCGCCTCCACGG
AGAAGGGAATCCGATTCCGACGATGAGGCTAAGGGTCAGGATATAAGTGCTAATGACAGG
TTGGAGAACCAGGTTATAGGGGAGTCTAATGAAGGTCAACAAAACGGTGATCCGTCTAAC
GCTACAAATAGAATCTTGAGGGATTACGCTATGAACACCATGAAAGAGCTATTGGGTATC
TATGGCTTAAGAGCTTTGGAGGCTCTTCAAGGTAAACCCTTCACTCAGTTACCTTTGAGT
CTGAAGCGACGTCATGATTCAGGGGATGGAACTGATAGAAGCATCCGAAGATCCTCTTCA
GCGACTGAAGATGAAGGATCCACCAACATAACCCCTCCAAAAACACCAGACAACGACATA
GAAGCCCAAAGACAAAGGGCCCTTCGGATAATAAATCAGCAATCAATGCGTCTGTTTCGT
TCTGGGCGAGAGGAAGACAGTTCTGATGATGGAGACCAGACCCTCGACCTCAGTGTGTCC
AGAGATACAGACGGACAGGAAATGTCTACAACTTCGAGCGCCGGTAACAGCGTGTCGGAG
TACGATCAGAACATGATGATGAGGAAGAACCTCGAGGACCTGGCGAATTTACAAATGCAG
AATTTCTTCCAAAAACAATCCATGATGGATCCCGAAGACTCTCAGGAACCGAAACTCCAA
GCCAGTGGACTGCTGTCAGCCTTGAATCATCTGGATCAGGCTCGCAAGAACTCCATGCAA
ACGACCGTGGATTACTCCAGATACGTCAAAAGGTACAGTTCAACACTAGAGTGTGGTTCA
TCATATTGTAAGGATTTGGGATATCGAGAACATTTTCATTGCATGGATTGCACTGCGAGG
GTCTTTTGCAAAAAGGAAGAAATGATAAGACATTTCAAGTGGCACAAAAAACGAGACGAA
TCTTTAGCGCATGGTTTTATGAGATATTCTCCCCTCGACGACTGCTCCGAGAGGTTCAGT
GACTGTCCTCATAATCGAAGTCAAACACATTACCACTGCATACAGGATGGCTGTGATAAG
GTGTACATATCTACATCGGATGTCCAGATGCATGCGAACTATCACCGCAAGGACTCTGCC
ATACTTCAAGAAGGGTTCCAAAGGTTCCGCGCCACTGAGAGCTGTTCCGCTCCACATTGT
ATGTTCGCCGGTCAACGGACCACGCACTTCCACTGTCGAAGGCCTGGATGCACCTACACA
TTTAAAAACAAAGCCGACATGGAGAAACACAAGACTTACCACATAAAAGACGAGGCGTTA
TCAAAGGACGGTTTCAAAAAGTATATGAAGAGCGAAGCCTGCCCGTATAGAGATTGCAGG
TTTAGTAGGACCTGCAACCACATACACTGCATCAGACCGCACTGCAACTACGTACTGCAC
TCCAGCAGCCAGATCTTCACACACAAACGGAAACACGAGAGGAAGGAACAGGAAATGAGT
TTTGGGTTGCCAACGTCTGTGATCCAAAATGCTTTAATGCAAGGAGATCTGTCATATGCC
ATGCATGATGACGACATATCCGTTGAAGATTACACCCAAGCTTATAACCAACCCTTGGTG
GATGAAGTAGCACGCAAGTATATTGAAACGTTCAACGGCGCGAAGGAAGCTGAAGACAAG
TGCAGCAAATGTAGCGCCTTCAACAAACACTACCACTGTTTGGCTGAAGACTGCAAGATG
GCGCATAGTTGTCACAACACAATGGTAAAGCACGTCATGGAACACGAACAACAGAGCCAA
GTAACGGAAGCTTACTTCGTGACGTATACGAGAAATAATCCTTGTTCCAATTCGGCGTGC
CAGCATATCAAAGACATCACCCATTATCACTGTATTTGGGAGAACTGTGGAGCTGTCATA
CTATCTTCAGAGGAACATCCCTTCAGACGCTTGGAGCACTATCGCCAACACGCCAATCTC
AGTCCCATGTCGCCAAACTTGGCAGCCAGATTTGCGCCCACGTTCACTCCGCAACTATCG
GCTCAATTACACCCAAACATGGCGGCCCAACTTGGCCAGGTGACAAATTTACCCAATCTT
CCCCCTAATTTGGCTCAGTTGGCCCAAATGGCACCAAGTCTAACGTCCCAATTGACGCCG
AACTTACAAGCTCAATTGAATCTTGGGCCTAATCCCACGATTGCACCACAACAAGTGAAC
CCATCGAGTTCGTTGGACGAGATGTTTAGCCGGAAGAGGGGGAGACCTCCAAAGAATCGT
GTTGTAGAGGTCTGGACAGATAATGTGACTCCTCAAGCTATATTTACATCGTTCAAACTG
CCGAAGAGCAACCAACTGCCGCCTCTAGTACAATCACCAATTATTGCTACAGTCGAAGAA
TTCCCAAGAATCAGATTCCAGAATTTCGAAGTGTTTACAACACCTGATTCGTGCATTCAG
TATTATGCGAATTGTGCATTAAGATCAACAAGGTTGCATTTCCATTGTTTCAACTGTAGC
TTCACGGGTGAAACCCACGTGACAATGGAAACTCACATGAGGGAATCCCATTCGATAGCC
CCGCTATTGGAAGGCTTCGACTATTTCTCCTCCTTCGACCAATGCGGTGGAAAGAGCTGT
TTTAAGAACCAATTAAGATCGCACTTCCATTGCGCCCAAGGAAATAATTGTCCGGCGATT
TTAACCCAATATTCAGCATTGGCCACCCATAAGCACGGAAGGGGTACTCCTGTTATTAAG
AGTGAATCAGAACAAGAGAAATCTTATGAAGAAGCCAAAGATTATTCTATTAAAGAACGA
GCTTCAGCCGACAGCGTTGCTTCCATGAAAGATCAAGCTGAATCATTTACACCCACAAAT
TTCTCTATCAAAAGCGAATTCGAAAGAGAACAGGATAATATTTCGGTAGTAAAAGCGACA
GGAACTTTCTACCCGTCATCTCATCTCAGGAACTCGTCTCCATCCCCAAGAAGTATCTAC
GACGCCTCCCCGTCTAAGGATATTAAGTACGAAAAGAAATTCGAGCCAAAGTTAACTGGG
CCAACAATACCGCCGTCTTGTCACGATCAAAACTGCCCGTTGAACAAAAACGCTGGCGCA
ATGAACTTGCACTACCACTGCCCGCACTGCAGCCAAGCCTATGTGGATTTGAAGCTGTTA
TTCAGTCACATGGTCAAAAAACATAGCAACGCCATCGAACCAGGGCCAGTCGGATTGGCC
GCTACAAAGGAAACGCTGGAGAGAGAGTACCCAGAAATTTCAATTCTACCGTCAACAGCG
ACATCTTCAAATGCCCAAACACCACAACAAAGTCATCGACCTCCGAATCCTGCTGAACAG
GTGCAAGCTGTTCAATCCCTACTGCTGCAACAGTACTTGGGTTCAGGTCGCAAATCGCTC
CAGGATCAATTGAAGATGCAGCAGTACTCCTCACTGGCAGGGCTGCCTGGATTAGCTCAA
GTGGCGTTATTCTCTCAGGGTGGTTCCGCATTCCCTATGTATCCGACTATGTTGTATCCG
CCCGAGCTTCTTCTTGAGCAGAGTCTTCTTCAAAATCACGGTCTGCCGCCAGGCCTGGAC
AAAGAGGCGGAAATGATCGCTAAATCCAGGAGAAGTACTGGAGCTAGAGGACCTCATATG
AGGGTTTTAAAGGATGAACCCATACCGGATGGATACTTGCGTTTCCGGTTTAATGAGGAT
TGCGCCTATCAGCAGTGCGGATATAGGGAACATCAAACTCACTTCCACTGCACAAGAAAG
GATTGCGGTTACTCATTCTGCGATAAAACAAGATTCGTCCAACACACGGCTAGACATGAA
CGTTTGGACACTCTAATGGGTGGGGACTTCCAACAGTACCGAGCGAACGTGTACTGCCAG
CGACCAGAGTGCCCTCACGCCTCCACATTCGGCACGGGACAGAACAAAGCTTCCCATTTT
CACTGTCTCAAGTGTGACTTCGTATGTACGGACACCAATAAGGTTGTTGCTCATCGCCGA
CAACATCAGAAGCTCGACTCCATACAGGCCGCTGGCTTCCAGAAATTTCCTCCAAGCAAA
GCGTGTGGTTACGAACCCCAGTGCATTCACAGCAAGAAACAGACCCACTACCACTGTTTG
CAATGCGGTTTCGCTGTTCTTGGGTTATCGCAGATGACGTCACACAAGTACAAGCACCAG
GAGGCGAGCCTCGGACCGTCGACCAGCTCGACCAACTGA
Protein sequence:
MTTAMAHQESIHERLQSGPEEPLYLHHGASWNSETSEHQEDHSTDMSPYYNMPKRNKRKN
FKPRCAPNATTFSDDSNGGNCESSLTNGNDRTTPEQADEETDGYNKIGVKNFGLIKGGAV
DLSRRLSTDSNENIESTYQNITDEKKIDSFQRSYIHSLKVNQPAERDHTVLNFSNLQNKT
FCAKNPFAISQLIKTNSPPRRRESDSDDEAKGQDISANDRLENQVIGESNEGQQNGDPSN
ATNRILRDYAMNTMKELLGIYGLRALEALQGKPFTQLPLSLKRRHDSGDGTDRSIRRSSS
ATEDEGSTNITPPKTPDNDIEAQRQRALRIINQQSMRLFRSGREEDSSDDGDQTLDLSVS
RDTDGQEMSTTSSAGNSVSEYDQNMMMRKNLEDLANLQMQNFFQKQSMMDPEDSQEPKLQ
ASGLLSALNHLDQARKNSMQTTVDYSRYVKRYSSTLECGSSYCKDLGYREHFHCMDCTAR
VFCKKEEMIRHFKWHKKRDESLAHGFMRYSPLDDCSERFSDCPHNRSQTHYHCIQDGCDK
VYISTSDVQMHANYHRKDSAILQEGFQRFRATESCSAPHCMFAGQRTTHFHCRRPGCTYT
FKNKADMEKHKTYHIKDEALSKDGFKKYMKSEACPYRDCRFSRTCNHIHCIRPHCNYVLH
SSSQIFTHKRKHERKEQEMSFGLPTSVIQNALMQGDLSYAMHDDDISVEDYTQAYNQPLV
DEVARKYIETFNGAKEAEDKCSKCSAFNKHYHCLAEDCKMAHSCHNTMVKHVMEHEQQSQ
VTEAYFVTYTRNNPCSNSACQHIKDITHYHCIWENCGAVILSSEEHPFRRLEHYRQHANL
SPMSPNLAARFAPTFTPQLSAQLHPNMAAQLGQVTNLPNLPPNLAQLAQMAPSLTSQLTP
NLQAQLNLGPNPTIAPQQVNPSSSLDEMFSRKRGRPPKNRVVEVWTDNVTPQAIFTSFKL
PKSNQLPPLVQSPIIATVEEFPRIRFQNFEVFTTPDSCIQYYANCALRSTRLHFHCFNCS
FTGETHVTMETHMRESHSIAPLLEGFDYFSSFDQCGGKSCFKNQLRSHFHCAQGNNCPAI
LTQYSALATHKHGRGTPVIKSESEQEKSYEEAKDYSIKERASADSVASMKDQAESFTPTN
FSIKSEFEREQDNISVVKATGTFYPSSHLRNSSPSPRSIYDASPSKDIKYEKKFEPKLTG
PTIPPSCHDQNCPLNKNAGAMNLHYHCPHCSQAYVDLKLLFSHMVKKHSNAIEPGPVGLA
ATKETLEREYPEISILPSTATSSNAQTPQQSHRPPNPAEQVQAVQSLLLQQYLGSGRKSL
QDQLKMQQYSSLAGLPGLAQVALFSQGGSAFPMYPTMLYPPELLLEQSLLQNHGLPPGLD
KEAEMIAKSRRSTGARGPHMRVLKDEPIPDGYLRFRFNEDCAYQQCGYREHQTHFHCTRK
DCGYSFCDKTRFVQHTARHERLDTLMGGDFQQYRANVYCQRPECPHASTFGTGQNKASHF
HCLKCDFVCTDTNKVVAHRRQHQKLDSIQAAGFQKFPPSKACGYEPQCIHSKKQTHYHCL
QCGFAVLGLSQMTSHKYKHQEASLGPSTSSTN
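
The nucleotide and protein sequences above can be cross-checked against the reported CDS length. The following is a minimal sketch, not part of the original record. It assumes the standard genetic code (NCBI translation table 1) and frame 0. The names `translate`, `first_line`, and `last_line` are illustrative. It translates only the first and last lines of the nucleotide sequence; the last line begins in frame because the 79 preceding lines of 60 nt total 4740 nt, a multiple of 3.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the nucleotide sequence, copied from the record.
first_line = "ATGACTACTGCGATGGCACATCAGGAATCCATACACGAGAGACTGCAGTCGGGACCCGAG"
last_line = "GAGGCGAGCCTCGGACCGTCGACCAGCTCGACCAACTGA"

cds_length = 4779                        # "CDS Length" field of the record
expected_residues = cds_length // 3 - 1  # codons minus the terminal stop

print(translate(first_line))   # MTTAMAHQESIHERLQSGPE
print(translate(last_line))    # EASLGPSTSSTN*
print(expected_residues)       # 1592
```

Both translated fragments agree with the start and end of the protein sequence listed above, and 1592 residues matches its length (26 lines of 60 residues plus a final line of 32).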