New model in OGS2.0 | DPOGS203330  |
---|---|
Genomic Position | scaffold6:- 241319-254080 |
See gene structure | |
CDS Length | 2016 |
Paired RNAseq reads   | 907 |
Single RNAseq reads   | 2744 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA002091 (0.0) |
Best Drosophila hit   | CTCF (1e-122) |
Best Human hit | transcriptional repressor CTCF isoform 1 (4e-81) |
Best NR hit (blastp)   | PREDICTED: similar to CTCF-like protein [Nasonia vitripennis] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to CTCF-like protein [Nasonia vitripennis] (0.0) |
GeneOntology terms    | GO:0005634 nucleus GO:0003700 sequence-specific DNA binding transcription factor activity GO:0008270 zinc ion binding GO:0016481 negative regulation of transcription GO:0043565 sequence-specific DNA binding GO:0043035 chromatin insulator sequence binding GO:0016564 transcription repressor activity GO:0007379 segment specification |
InterPro families    | IPR007087 Zinc finger, C2H2-type IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding IPR015880 Zinc finger, C2H2-like |
Orthology group | MCL11870 |
Nucleotide sequence:
ATGACTGGCTTTACAATTGGACACGATTTGTCTTCGGGCGTCGAGGCTGGCTGGTTGTGG
GCTGGCACTTCGGTAGACTCCTGTGCCGTCGTAGCCATGCCGCCTCCAGACAAGAAAACT
TCGAAGGAGGAGACCATTTTACAAACCTATTTGAACTCTTTTGATCAAGACAATGAAACA
ACAACCACCATTGCTGTTACTGGTGAAGTTGAAGAAGAAGCTGATACAGGGGTGACATAT
TTTGTTGATGAAGAAGGCAGATATTACTATCAACCGGCCGGTGACAATCAGAACATAGTG
TCACTGCAGCCCGAAGTCACACAGGACAATGACGGAGAGATTACTGAGGATGCACAGATG
CTTGTTGATGGTGAAAGTTACCAGACAGTGACCCTCGTGCCCTCGGACACGGGGAATGGT
GAAGTCAGCTATGTGTTAGTAATGCAAGAGGAAAATAAGCCTGTTGTCAACTTAAATATA
AAGGTGGATCAGGAGGAGAAAGGTGCTGATGTCTACAACTTTGAGGATGAAGAAGAAGCG
GCTGAGGAGGGTTCTGACGACGGAGATGATGCACCAAAGACTAAAACTACTAAGAGGAAT
AAGTATGTTCGTCCCTACTTCACTTGCAGCTTCTGTTCGTACACGAGTCATAGACGCTAC
TTGCTTCTGCGTCACATGAAATCCCATTCGGAGGAAAGGCCGCACAAGTGCAGCGTGTGT
GAACGAGGCTTTAAAACAATAGCCTCACTCCAGAACCACGTGAATATGCACAACGGTGTT
AAACCTCACGTGTGTAAATATTGCAAGAGTCCGTTCACTACATCTGGTGAACTCGTTAGA
CACGTGCGGTATCGCCACACACATGAAAAGCCTCATAAATGCTCTGAATGCGACTACGCC
TCCGTGGAACTGTCCAAATTGAGGCGTCACGTCCGCTGTCACACCGGAGAGAGACCTTAT
CAGTGCCCTCACTGTACCTACGCTTCACCAGATACTTTCAAATTGAAGAGACACTTGCGT
ACACACACCGGAGAGAAGCCGTACAAGTGTGATCATTGCAACATGTGCTTCACGCAATCT
AACTCTCTGAAAGCTCACAAACTCATACACAATGTGGCCGAGAAGCCTGTGTTTGCTTGC
GAGCTCTGTCCGGCCAGATGCGGTCGGAAAACAGATCTACGCATACACGTCCAAAAACTA
CACACGTCGGATAAACCACTTAAATGCAAGCGCTGTGGTAAATCCTTCCCAGACAGATAT
TCCTGCAAGATTCATAACAAGACACACGAAGGGGAGAAATGTTTCAAATGTGAAATGTGC
CCGTACGCCTCGACGACGCTACGTCATCTGAAGACACACATGCTGAAACACACGGACGAG
AAACCCTTCGCTTGTGAGCAGTGCGACCACTCGTTTAGGCAGAAGCAACTGCTGCGTCGC
CACCAGAATTTGTACCACAATCCCCATTACGAGCCGAAGCCACCCAAGGAGAAAACGCAC
ACGTGTCACGAGTGCAAGCGGACCTTCGCCCACAAGGGTAACCTGATCCGTCATCTTGCC
ATCCACGACCCTGAGTCTGGACACCACGAACGAGCACTGGCTCTGAAAATCGGCAGGCAG
AGGAAGATCAAGACCAACACGGGAGGACCATCTCAGGTTGTGGATTCTGACGACGATATG
ATGAAGCTGGGCCTCAATAAGGAGATCAAACGCGGTGAACTGGTCACAGTAGCTGACGGT
GATGGTCAACAGTATGTGGTGTTAGAGGTGATTCAACTCGAGGACGGGACGGAACAACAA
GTGGCTGTGGTGGCACCAGAGTTCATGGAAGAGGAACAAGAAGAGGAAGAGGAAGAAGAG
GAACAGGAAATTGAAACTCCTAAACAGAAAATATTAAACAGAACCATTAAACTAGAGAAG
GAAGTTGACACATGCTTTGGATTTGATGAAGAAGAGGAGGAGGAGGCAGAGGAAGACATA
ACCTACAGCGACAAAGTAGTGTTGCGTTTAGTGTAA
Protein sequence:
MTGFTIGHDLSSGVEAGWLWAGTSVDSCAVVAMPPPDKKTSKEETILQTYLNSFDQDNET
TTTIAVTGEVEEEADTGVTYFVDEEGRYYYQPAGDNQNIVSLQPEVTQDNDGEITEDAQM
LVDGESYQTVTLVPSDTGNGEVSYVLVMQEENKPVVNLNIKVDQEEKGADVYNFEDEEEA
AEEGSDDGDDAPKTKTTKRNKYVRPYFTCSFCSYTSHRRYLLLRHMKSHSEERPHKCSVC
ERGFKTIASLQNHVNMHNGVKPHVCKYCKSPFTTSGELVRHVRYRHTHEKPHKCSECDYA
SVELSKLRRHVRCHTGERPYQCPHCTYASPDTFKLKRHLRTHTGEKPYKCDHCNMCFTQS
NSLKAHKLIHNVAEKPVFACELCPARCGRKTDLRIHVQKLHTSDKPLKCKRCGKSFPDRY
SCKIHNKTHEGEKCFKCEMCPYASTTLRHLKTHMLKHTDEKPFACEQCDHSFRQKQLLRR
HQNLYHNPHYEPKPPKEKTHTCHECKRTFAHKGNLIRHLAIHDPESGHHERALALKIGRQ
RKIKTNTGGPSQVVDSDDDMMKLGLNKEIKRGELVTVADGDGQQYVVLEVIQLEDGTEQQ
VAVVAPEFMEEEQEEEEEEEEQEIETPKQKILNRTIKLEKEVDTCFGFDEEEEEEAEEDI
TYSDKVVLRLV