New model in OGS2.0 | DPOGS204136  |
---|---|
Genomic Position | scaffold1311:11700-15663 (- strand) |
CDS Length | 3030 |
Paired RNAseq reads   | 577 |
Single RNAseq reads   | 1308 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA013621 (6e-168) |
Best Drosophila hit   | CG1233, isoform B (1e-44) |
Best Human hit | transcriptional repressor CTCF isoform 1 (1e-24) |
Best NR hit (blastp)   | PREDICTED: similar to CG1233 CG1233-PB [Tribolium castaneum] (3e-59) |
Best NR hit (blastx)   | PREDICTED: similar to CG1233 CG1233-PB [Tribolium castaneum] (5e-64) |
GeneOntology terms    | GO:0008270 zinc ion binding; GO:0005622 intracellular |
InterPro families    | IPR007087 Zinc finger, C2H2-type; IPR015880 Zinc finger, C2H2-like; IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding |
Orthology group | MCL21545 |
Nucleotide sequence:
ATGGTCAAAAGTTTGGTACAGATTGCTCTCAAAAATGCTAAACTCAAGAAAGAAGTAGTT
TCAAATAACCCACTAATACAGTCCCAATCCAACATATCCGAAATGAAAATACCTCAAGTA
ATAGAAATCGTACCAGACAGACTCAATGTAAATAATGATCAACTTCCAAAGGGCGTTTCA
TTTATATCTGTGGAGGAGTTAAACAAGATGAGCACACCTAATTTTGAAAAAGTTGACCCC
AAGGAGGTCATCCAGGAAGCCAGTATAAATATTGTGTATACAAATGATGTGTCTACACAA
TATGTTAATGTTTCTAATCACGGAGTATCAAGTTCTACTTGTAATTTAATTCCAGTAACA
AGTGTGACACCTGATTTAATATCTTCAGTCCAGCCTCGTATACCCGTCAGTGATATAAAC
CCATCGCCACACAAAGAAGCTGACTCCTCACATATATCGTCAACAATAACAGAAAGTACT
AGAAAATCAACATACACTAATAACCCGGAGGAAAAAGGCACTTTGTGCTCAATATCAAAC
TGTGTTGTTCGTCTAAAAGACCCTAACAATTTGGCCTATCATAGGAAGTGTCATCAAAAT
GGAAAACTGCAGTGTCCAGAGTGCACCAAGAAGTATTCCTGGGTTCACCAGCTACACATG
CATCTGTGGAAGATACATGCCATAGACCTCGAACTGCCTACCTGTACTATATGCGGTTAC
AAAAACTATAAGCGCCACATACTGATCAATGTACACACAAAATGCCATGGAAAGAATAGG
TCGTATACATGTTCAATATGTCAGAAGAAATTTAAAACCTCGAACCAGTTGTCCAAGCAC
CGCATGATTCATAAGTGTAATGTCATCCATCAGTGCCAGATCTGCCAGCGTGAGTTCAAC
CTGGAACGCCACTTGAGAGAACACATGGCAGCTGTGCATGACAAATTAAGACCTTTCAAA
TGTGGTCATTGCAGTTACATGGCCGCTAGGAAATGCGAGCTCAAGCTACACCTCAGACTA
CACACAGGTGATAAGCCATATGCGTGTGATCAATGCGACTATTGCACGCGCGATCACAAC
TCGCTCAGACGGCACAAGTTGAAGCACTCCAACGAGGGTGTTTACAAATGTAAATACTGT
CCCTATAGCGCCATACAGTCTACCGCCTTCGCCTCCCACATGATATCCAAACATCCCAAC
ATGAACCTCGGCGATGTCCACTGCTGTCCGTTCTGCAGCTTCAAGTCCGTCAGCAGAAAT
AACTACGTTGTGCACCTGACGACGCACAGGGACAAAGAAGGCATTAAACTTCTCATCGAC
ATAGCTAAGAGCGTGAAGAACAAGAAACGTAGTTGGACCATACCCAGCGAGGATACGGAG
AAAAGCAATGTGACTAAAGAAGCCGATGAAAAGAAAGAGGATAAAAATTCATCAGACAAT
ATGCCGATTGAAGTTTACGACTGTGATAGCTTACCGGAAACCGTCCCTCAGGAGTACCCC
AAAAACTACTGCACTACGACGTCCAGTGATACTCAGCAGTCCAATTTTAACAGCGAAGTT
GTCACAGACTTGAGCAAAGAGGACAATAACTCGGATTACCTCATGCCCGAAGGTTACACC
ATACAAGAAAGACATCAGTACAGTTCGCAACATCCCTTACAAATGTCGTACGAAACTCCG
CAAAACAATCTGCATTTTTTAAATCCATCAGTCAGCCACGATCAGGGATTATTAGCTAAC
AGCATAGTCACTTCGAGCCATTCCCCTATAATCAACAGTATAAGCAATAATATCATGAAC
AATTTTCCCATACGATTGCCTCCGGCGCCGATTTCGGTGCGCAACAACATAACCCTAAAA
CCGGTGGATAAAATTTCCCTTCCCGTCTCTAAAGCCACCGGACCTATAATAAAACCGATG
CAGATCCTTCCCGTTCCGTCGTCGAGTAATTCGCCTTTCGATATACCCAACGAAATAGAC
GGAATGCCAAGAAAAAAACCTAAAATATCCGTCAAAAGCAATTTAATATTGAAGGGGCCC
GATCAGGTGAACATGTTTCATTCACAACAGAAAATGGCCTTCAAGCGGCTGGAAGATAAC
GAGAGGTTCGGTTTAAGCGGTCCGGTCACATTCAACAACTTAATAACCACCCAGTTTATG
CAACTGCAACCGGAGCCGACTTTAAGCGAGTCCCCCAACACTATCATGACATACCCTCAA
GACAGCATGATGGTGGAATCAGCGACGACTCCGGTCGACAACGAAATCAACGACAACCCT
CAGATCTTCTCGTTCAATCAACAGATGAACGCAAACACGCTGTCCATGATCCCGCCGCCG
CAGAAGGTGCAGAACAACGACCCGAGCTACATCAAAATAGAAGCAACCATCAAACAGAAC
ACACAGTCGCCGAGCTTGGAGAGGATGTGCAACGCGAACCTACTGAACAACCCGGCCATA
AGCAGGGAATATAAAGCGTCGCCTCCGCTAGAGGATATACACAAAAACATGAGCGAGATA
AAGAACGAGGTGAAATCGGACGCGTTCTACAACATGGCCATCACCAGCACGGCCGCCAAC
CCGCCGATTATAGACCAGTACATGATAGACAACATCATACCAGAACAGTATCCGGCTCAC
CTGGACCTGTCCACCGTAGTACTCTCCGAGGTCTCCAACCAGCAGAACGACGTCATAGAG
ATAGATGACAATTCCGACGACAACAAACTATTGAATCGCTTCGACATGAACTTCTCTCTG
GAGTCGCTTTATCTGATGCAAAACGATTTCCACTTCCTCGAAAACGACCTCCCATCGAAC
ATGGCCGAGGTGCCGGTTAACGAGATCAATAGGATGGTGACGGAAGTGCCGATCATAAAC
CAGAAGGATAATTTGGAGGTCGTCCAGGTGGAAGCCGCCAGTGAGAGCGCCAATTTCATT
CAAGGCAAAAAAGACCCCATGATGAACACATCGGTCAGACCCTCGACGAATAAAATTAAC
GTTAAAAATATAGAGCTCATGAAAAATTAA
Protein sequence:
MVKSLVQIALKNAKLKKEVVSNNPLIQSQSNISEMKIPQVIEIVPDRLNVNNDQLPKGVS
FISVEELNKMSTPNFEKVDPKEVIQEASINIVYTNDVSTQYVNVSNHGVSSSTCNLIPVT
SVTPDLISSVQPRIPVSDINPSPHKEADSSHISSTITESTRKSTYTNNPEEKGTLCSISN
CVVRLKDPNNLAYHRKCHQNGKLQCPECTKKYSWVHQLHMHLWKIHAIDLELPTCTICGY
KNYKRHILINVHTKCHGKNRSYTCSICQKKFKTSNQLSKHRMIHKCNVIHQCQICQREFN
LERHLREHMAAVHDKLRPFKCGHCSYMAARKCELKLHLRLHTGDKPYACDQCDYCTRDHN
SLRRHKLKHSNEGVYKCKYCPYSAIQSTAFASHMISKHPNMNLGDVHCCPFCSFKSVSRN
NYVVHLTTHRDKEGIKLLIDIAKSVKNKKRSWTIPSEDTEKSNVTKEADEKKEDKNSSDN
MPIEVYDCDSLPETVPQEYPKNYCTTTSSDTQQSNFNSEVVTDLSKEDNNSDYLMPEGYT
IQERHQYSSQHPLQMSYETPQNNLHFLNPSVSHDQGLLANSIVTSSHSPIINSISNNIMN
NFPIRLPPAPISVRNNITLKPVDKISLPVSKATGPIIKPMQILPVPSSSNSPFDIPNEID
GMPRKKPKISVKSNLILKGPDQVNMFHSQQKMAFKRLEDNERFGLSGPVTFNNLITTQFM
QLQPEPTLSESPNTIMTYPQDSMMVESATTPVDNEINDNPQIFSFNQQMNANTLSMIPPP
QKVQNNDPSYIKIEATIKQNTQSPSLERMCNANLLNNPAISREYKASPPLEDIHKNMSEI
KNEVKSDAFYNMAITSTAANPPIIDQYMIDNIIPEQYPAHLDLSTVVLSEVSNQQNDVIE
IDDNSDDNKLLNRFDMNFSLESLYLMQNDFHFLENDLPSNMAEVPVNEINRMVTEVPIIN
QKDNLEVVQVEAASESANFIQGKKDPMMNTSVRPSTNKINVKNIELMKN
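
Consistency check (illustrative sketch, not part of the original record): a CDS length of 3030 nt corresponds to 1010 codons, which is 1009 residues plus a stop codon. The Python sketch below shows one way to confirm that a nucleotide sequence translates to a listed protein sequence. It assumes the standard genetic code (NCBI translation table 1) and sequences pasted as plain text. The helper names `clean`, `translate`, and `cds_matches_protein` are hypothetical and do not come from any database tool.

```python
# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq):
    """Remove whitespace and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(cds):
    """Translate a CDS in frame 1; codons not in the table become 'X'."""
    cds = clean(cds)
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


def cds_matches_protein(cds, protein):
    """True if the CDS translates to the protein, ignoring the final stop."""
    return translate(cds).rstrip("*") == clean(protein)


if __name__ == "__main__":
    # The first five codons and the last line of the nucleotide sequence above.
    print(translate("ATGGTCAAAAGTTTG"))
    print(cds_matches_protein("GTTAAAAATATAGAGCTCATGAAAAATTAA", "VKNIELMKN"))
```

To check the whole record, pass the full nucleotide and protein blocks to `cds_matches_protein`; line breaks are stripped by `clean`, so the blocks can be pasted as they appear.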