New model in OGS2.0 | DPOGS200154  |
---|---|
Genomic Position | scaffold2452:+ 6209-11084 |
See gene structure | |
CDS Length | 1122 |
Paired RNAseq reads   | 954 |
Single RNAseq reads   | 2467 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA001621 (2e-84) |
Best Drosophila hit   | crooked legs, isoform C (2e-17) |
Best Human hit | zinc finger protein 324B (3e-21) |
Best NR hit (blastp)   | zinc finger transcription factor KRAB-E2S [synthetic construct] (1e-20) |
Best NR hit (blastx)   | zinc finger transcription factor KRAB-E2S [synthetic construct] (2e-24) |
GeneOntology terms    | GO:0005634 nucleus GO:0003677 DNA binding GO:0005622 intracellular GO:0006355 regulation of transcription, DNA-dependent GO:0046872 metal ion binding GO:0008270 zinc ion binding |
InterPro families    | IPR007087 Zinc finger, C2H2-type IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding IPR015880 Zinc finger, C2H2-like |
Orthology group | MCL39559 |
Nucleotide sequence:
ATGCCTTCGAACGAGCCCATCACCCTCGACAGTGACGACGAACCAGATTTGGGTTCCCCG
AGTCAGGCTTCGGATTCCCTGAGTATGGATGAAAATTTAAGAGCGTTGCGTCACATACAC
GAGAAATACGCGAATATACATAAGGACGACCTGTACGAAGCCCATCTCCTTGCTGCGAAT
TTCAAACAACCAGATGCGAGGGGTGACGACGACGTGCTCTCAGTAGCTAGCTCCGATGGT
AGCGCGTCAGTAAAAGAATTAGGTACAAGAGTAGAAAAATTGGAGGATAGTTCGAATTCC
GAGTCAGGCAGTAGCAGTTCGGAGTCAGATAGTAGTAGTTCATCGTCGTCTAGCTCGTCG
GACAGCTCACACTCGTCGGACTCGTCGGATTCTGAGGAAGACGAGAAGCCAAACAGCAGC
GTCCATGATGACAGATCCTTCGGCGCCTGGTCCCACACGCCGGAGTCGGATTCGGATCCG
GAGCGCGGGGCCCCGGTCGCTCGGACCGATGACGACATCAGCGAATCGGATAACGAGTCA
GCTGATAAGAGCTTCCCATGTAGGGTCTGCGGGAAGTGGTATTCGACCAGGGTCACGCTC
AAGATCCACGCGCGCGTTCATCAGAACAAGGGCGGCGGCGGCTCCAGGTCCAGGGCGCGT
TCCTCGGACAGATACGAGTGTGACTGTTGCAGCGAGACGTTCAGCAGGAGAGAGAAGCTT
TGGGAGCATAAGGCTGAAGCCCACCGCGGCGCTATGACTGTCCGATGCGAGGTTTGTCGT
CGGTGCTTCGAGGACGACAACGAGCTGGCGGCGCACGCCACCACACACACTAGTGATGAC
AGGATAGGTCGTTGCTCGGACTGCGGCTCGTCGTTCGCCAGATACGACCAGCTGCGCCGC
CACCGCGCCTCCGTCCACGGCTCGGCGCCCGCCCGCCTGCCGCACGCGTGCGTCCAGTGC
GGCAAGAGATTCTCACACGCGCACTCCCTCACCAGACACGCGCACAACCACGCCAAGCAA
CTGTACAGATGCGTGGTTTGTAAGGCATCCTTCGCCCGCGCGGACCAACTCGCCCAGCAC
CTGAACAGCCACCTCGCCACCTACAAACGTATGAAGCAGTGA
Protein sequence:
MPSNEPITLDSDDEPDLGSPSQASDSLSMDENLRALRHIHEKYANIHKDDLYEAHLLAAN
FKQPDARGDDDVLSVASSDGSASVKELGTRVEKLEDSSNSESGSSSSESDSSSSSSSSSS
DSSHSSDSSDSEEDEKPNSSVHDDRSFGAWSHTPESDSDPERGAPVARTDDDISESDNES
ADKSFPCRVCGKWYSTRVTLKIHARVHQNKGGGGSRSRARSSDRYECDCCSETFSRREKL
WEHKAEAHRGAMTVRCEVCRRCFEDDNELAAHATTHTSDDRIGRCSDCGSSFARYDQLRR
HRASVHGSAPARLPHACVQCGKRFSHAHSLTRHAHNHAKQLYRCVVCKASFARADQLAQH
LNSHLATYKRMKQ