DPGLEAN08942 in OGS1.0

New model in OGS2.0DPOGS204136 
Genomic Positionscaffold1311:- 11700-15663
See gene structure
CDS Length3030
Paired RNAseq reads  577
Single RNAseq reads  1308
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA013621 (6e-168)
Best Drosophila hit  CG1233, isoform B (1e-44)
Best Human hittranscriptional repressor CTCF isoform 1 (1e-24)
Best NR hit (blastp)  PREDICTED: similar to CG1233 CG1233-PB [Tribolium castaneum] (3e-59)
Best NR hit (blastx)  PREDICTED: similar to CG1233 CG1233-PB [Tribolium castaneum] (5e-64)
GeneOntology terms
  
GO:0008270 zinc ion binding
GO:0005622 intracellular
InterPro families

  
IPR007087 Zinc finger, C2H2-type
IPR015880 Zinc finger, C2H2-like
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL21545

Nucleotide sequence:

ATGGTCAAAAGTTTGGTACAGATTGCTCTCAAAAATGCTAAACTCAAGAAAGAAGTAGTT
TCAAATAACCCACTAATACAGTCCCAATCCAACATATCCGAAATGAAAATACCTCAAGTA
ATAGAAATCGTACCAGACAGACTCAATGTAAATAATGATCAACTTCCAAAGGGCGTTTCA
TTTATATCTGTGGAGGAGTTAAACAAGATGAGCACACCTAATTTTGAAAAAGTTGACCCC
AAGGAGGTCATCCAGGAAGCCAGTATAAATATTGTGTATACAAATGATGTGTCTACACAA
TATGTTAATGTTTCTAATCACGGAGTATCAAGTTCTACTTGTAATTTAATTCCAGTAACA
AGTGTGACACCTGATTTAATATCTTCAGTCCAGCCTCGTATACCCGTCAGTGATATAAAC
CCATCGCCACACAAAGAAGCTGACTCCTCACATATATCGTCAACAATAACAGAAAGTACT
AGAAAATCAACATACACTAATAACCCGGAGGAAAAAGGCACTTTGTGCTCAATATCAAAC
TGTGTTGTTCGTCTAAAAGACCCTAACAATTTGGCCTATCATAGGAAGTGTCATCAAAAT
GGAAAACTGCAGTGTCCAGAGTGCACCAAGAAGTATTCCTGGGTTCACCAGCTACACATG
CATCTGTGGAAGATACATGCCATAGACCTCGAACTGCCTACCTGTACTATATGCGGTTAC
AAAAACTATAAGCGCCACATACTGATCAATGTACACACAAAATGCCATGGAAAGAATAGG
TCGTATACATGTTCAATATGTCAGAAGAAATTTAAAACCTCGAACCAGTTGTCCAAGCAC
CGCATGATTCATAAGTGTAATGTCATCCATCAGTGCCAGATCTGCCAGCGTGAGTTCAAC
CTGGAACGCCACTTGAGAGAACACATGGCAGCTGTGCATGACAAATTAAGACCTTTCAAA
TGTGGTCATTGCAGTTACATGGCCGCTAGGAAATGCGAGCTCAAGCTACACCTCAGACTA
CACACAGGTGATAAGCCATATGCGTGTGATCAATGCGACTATTGCACGCGCGATCACAAC
TCGCTCAGACGGCACAAGTTGAAGCACTCCAACGAGGGTGTTTACAAATGTAAATACTGT
CCCTATAGCGCCATACAGTCTACCGCCTTCGCCTCCCACATGATATCCAAACATCCCAAC
ATGAACCTCGGCGATGTCCACTGCTGTCCGTTCTGCAGCTTCAAGTCCGTCAGCAGAAAT
AACTACGTTGTGCACCTGACGACGCACAGGGACAAAGAAGGCATTAAACTTCTCATCGAC
ATAGCTAAGAGCGTGAAGAACAAGAAACGTAGTTGGACCATACCCAGCGAGGATACGGAG
AAAAGCAATGTGACTAAAGAAGCCGATGAAAAGAAAGAGGATAAAAATTCATCAGACAAT
ATGCCGATTGAAGTTTACGACTGTGATAGCTTACCGGAAACCGTCCCTCAGGAGTACCCC
AAAAACTACTGCACTACGACGTCCAGTGATACTCAGCAGTCCAATTTTAACAGCGAAGTT
GTCACAGACTTGAGCAAAGAGGACAATAACTCGGATTACCTCATGCCCGAAGGTTACACC
ATACAAGAAAGACATCAGTACAGTTCGCAACATCCCTTACAAATGTCGTACGAAACTCCG
CAAAACAATCTGCATTTTTTAAATCCATCAGTCAGCCACGATCAGGGATTATTAGCTAAC
AGCATAGTCACTTCGAGCCATTCCCCTATAATCAACAGTATAAGCAATAATATCATGAAC
AATTTTCCCATACGATTGCCTCCGGCGCCGATTTCGGTGCGCAACAACATAACCCTAAAA
CCGGTGGATAAAATTTCCCTTCCCGTCTCTAAAGCCACCGGACCTATAATAAAACCGATG
CAGATCCTTCCCGTTCCGTCGTCGAGTAATTCGCCTTTCGATATACCCAACGAAATAGAC
GGAATGCCAAGAAAAAAACCTAAAATATCCGTCAAAAGCAATTTAATATTGAAGGGGCCC
GATCAGGTGAACATGTTTCATTCACAACAGAAAATGGCCTTCAAGCGGCTGGAAGATAAC
GAGAGGTTCGGTTTAAGCGGTCCGGTCACATTCAACAACTTAATAACCACCCAGTTTATG
CAACTGCAACCGGAGCCGACTTTAAGCGAGTCCCCCAACACTATCATGACATACCCTCAA
GACAGCATGATGGTGGAATCAGCGACGACTCCGGTCGACAACGAAATCAACGACAACCCT
CAGATCTTCTCGTTCAATCAACAGATGAACGCAAACACGCTGTCCATGATCCCGCCGCCG
CAGAAGGTGCAGAACAACGACCCGAGCTACATCAAAATAGAAGCAACCATCAAACAGAAC
ACACAGTCGCCGAGCTTGGAGAGGATGTGCAACGCGAACCTACTGAACAACCCGGCCATA
AGCAGGGAATATAAAGCGTCGCCTCCGCTAGAGGATATACACAAAAACATGAGCGAGATA
AAGAACGAGGTGAAATCGGACGCGTTCTACAACATGGCCATCACCAGCACGGCCGCCAAC
CCGCCGATTATAGACCAGTACATGATAGACAACATCATACCAGAACAGTATCCGGCTCAC
CTGGACCTGTCCACCGTAGTACTCTCCGAGGTCTCCAACCAGCAGAACGACGTCATAGAG
ATAGATGACAATTCCGACGACAACAAACTATTGAATCGCTTCGACATGAACTTCTCTCTG
GAGTCGCTTTATCTGATGCAAAACGATTTCCACTTCCTCGAAAACGACCTCCCATCGAAC
ATGGCCGAGGTGCCGGTTAACGAGATCAATAGGATGGTGACGGAAGTGCCGATCATAAAC
CAGAAGGATAATTTGGAGGTCGTCCAGGTGGAAGCCGCCAGTGAGAGCGCCAATTTCATT
CAAGGCAAAAAAGACCCCATGATGAACACATCGGTCAGACCCTCGACGAATAAAATTAAC
GTTAAAAATATAGAGCTCATGAAAAATTAA

Protein sequence:

MVKSLVQIALKNAKLKKEVVSNNPLIQSQSNISEMKIPQVIEIVPDRLNVNNDQLPKGVS
FISVEELNKMSTPNFEKVDPKEVIQEASINIVYTNDVSTQYVNVSNHGVSSSTCNLIPVT
SVTPDLISSVQPRIPVSDINPSPHKEADSSHISSTITESTRKSTYTNNPEEKGTLCSISN
CVVRLKDPNNLAYHRKCHQNGKLQCPECTKKYSWVHQLHMHLWKIHAIDLELPTCTICGY
KNYKRHILINVHTKCHGKNRSYTCSICQKKFKTSNQLSKHRMIHKCNVIHQCQICQREFN
LERHLREHMAAVHDKLRPFKCGHCSYMAARKCELKLHLRLHTGDKPYACDQCDYCTRDHN
SLRRHKLKHSNEGVYKCKYCPYSAIQSTAFASHMISKHPNMNLGDVHCCPFCSFKSVSRN
NYVVHLTTHRDKEGIKLLIDIAKSVKNKKRSWTIPSEDTEKSNVTKEADEKKEDKNSSDN
MPIEVYDCDSLPETVPQEYPKNYCTTTSSDTQQSNFNSEVVTDLSKEDNNSDYLMPEGYT
IQERHQYSSQHPLQMSYETPQNNLHFLNPSVSHDQGLLANSIVTSSHSPIINSISNNIMN
NFPIRLPPAPISVRNNITLKPVDKISLPVSKATGPIIKPMQILPVPSSSNSPFDIPNEID
GMPRKKPKISVKSNLILKGPDQVNMFHSQQKMAFKRLEDNERFGLSGPVTFNNLITTQFM
QLQPEPTLSESPNTIMTYPQDSMMVESATTPVDNEINDNPQIFSFNQQMNANTLSMIPPP
QKVQNNDPSYIKIEATIKQNTQSPSLERMCNANLLNNPAISREYKASPPLEDIHKNMSEI
KNEVKSDAFYNMAITSTAANPPIIDQYMIDNIIPEQYPAHLDLSTVVLSEVSNQQNDVIE
IDDNSDDNKLLNRFDMNFSLESLYLMQNDFHFLENDLPSNMAEVPVNEINRMVTEVPIIN
QKDNLEVVQVEAASESANFIQGKKDPMMNTSVRPSTNKINVKNIELMKN