DPGLEAN11960 in OGS1.0

New model in OGS2.0DPOGS214648 
Genomic Positionscaffold1106:+ 11596-16974
See gene structure
CDS Length1485
Paired RNAseq reads  428
Single RNAseq reads  1102
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005064 (0.0)
Best Drosophila hit  CG17446 (6e-133)
Best Human hitcpG-binding protein isoform 1 (2e-84)
Best NR hit (blastp)  PREDICTED: similar to cpg binding protein [Nasonia vitripennis] (6e-175)
Best NR hit (blastx)  PREDICTED: similar to cpg binding protein [Nasonia vitripennis] (2e-170)
GeneOntology terms

  
GO:0008270 zinc ion binding
GO:0005515 protein binding
GO:0003677 DNA binding
InterPro families





  
IPR019786 Zinc finger, PHD-type, conserved site
IPR001965 Zinc finger, PHD-type
IPR019787 Zinc finger, PHD-finger
IPR002857 Zinc finger, CXXC-type
IPR011011 Zinc finger, FYVE/PHD-type
IPR022056 CpG binding protein, C-terminal
IPR013083 Zinc finger, RING/FYVE/PHD-type
Orthology groupMCL14789

Nucleotide sequence:

ATGAGTGAAAAGAAGTCTAAACAAAGTAAAGCAGATATTGCTAAACAGTTTGATCTACCC
GAGCGTCAATCGAAGATTACGTCGCTTTTAAATCAAGCCGGGCAAGCTTACTGTATATGC
CGATCTTCAGATAGTTCGCGGTTTATGATAGCTTGTGATGCTTGCGAGGAATGGTATCAC
GGAGATTGTATAAACATTTCTGAAAGAGAGGCGAAGTATATTAAAAACTATTTCTGTGAA
CGCTGCCGTGAGGAAGATCCCACTTTAAAAACAAGATTCAGGCCACAGAAACGAGAAAAT
GATGGAGATTCTGGTAGAGATGATAGAAAGAAAAAACGCAAAGAAAAAGATCATTCAGAA
AACAAGTCATCAAAGAGACCAAATAAAGATGGTTGTGGAGATTGTGGTGGCTGTTCACAA
ACACACGACTGTGGCCACTGTGACGCATGCGAGGATATGTACAAGTATGGAGGTAATAAC
AAGTTAAAGTTGACCTGCCGTCAAAGATTGTGCGTTAAAAGCAAAAAGACCTCTAGAATA
TCAAGTAGTAACCGCATAAAGAAGAAGCATGAACGTGAGGTTTATGAACCCGAAGAGTCA
ATATCCCACCTTCAGACATCAGAGCCTCGGCAATGCTATGGACCGCAATGCTGTAAGGCA
GCTCAGTATGGCTCCAAATACTGTTCCCAGCAGTGCGGAATGAGACTTGCTACCGCTAGG
ATTTATCAGGTGTTGCCGCAGCGCATTCAGGAGTGGTCGCTCTCCAGCTGTGTGGCGGAG
CAGCATAACCGTCGTTCATTGGAGGTGGTCAGGTCGGGGCTGGCTAAAGCGCAGGCAGCG
CTGAGGGCGTTAGACAAACAGCACGCTGAGATAGACGAGATGCTGCAGCGAGCTAAGCAT
GCCACCATAGAACACACTGATGAGAAGGAAGCAGACGACGAGACCTCAATGTACTGCATC
ACGTGCGGCCACGAGATACACTCGCGCTCAGCTGTCAAACATATGGAGAAATGCTTCATA
AAGTACGAGGCCCAGGCCTCGTTTGGCAGCAGACATCGCACACGGATAGACGGACAGAGC
ATGTTCTGTGATTACTACAACCAAATCAATGGCACCTATTGTAAGAGGTTGCGGGTAATG
TGCCCCGAGCACTTCAAGGATCCCAAGGTGAGCGACACGGATGTATGCGGCTGTCCGCTG
GTGAAGAACGTCTTCGATCCCACCGGAGAGTTCTGTAGGGCGCCGAAGAAGTCCTGTCTG
AAGCACTACCAGTGGGAGAAGCTGCGGCGGGCGGAGGTTGACATGGAGAGGGTCCGCCAG
TGGCTGAGGCTGGACGAGCTGGTCGAGCAGGAGAGGAATATACGCCTCGCTATGGCCTCC
AGGGCCGGCGTTTTAGGTTTGATGCTTCACTCGACGTACAACCACGAGGTCATGGAGAGG
ATAACGAAGGCGAACGAAAACGGAAAGGTCAAAGAGGGGTCATGA

Protein sequence:

MSEKKSKQSKADIAKQFDLPERQSKITSLLNQAGQAYCICRSSDSSRFMIACDACEEWYH
GDCINISEREAKYIKNYFCERCREEDPTLKTRFRPQKRENDGDSGRDDRKKKRKEKDHSE
NKSSKRPNKDGCGDCGGCSQTHDCGHCDACEDMYKYGGNNKLKLTCRQRLCVKSKKTSRI
SSSNRIKKKHEREVYEPEESISHLQTSEPRQCYGPQCCKAAQYGSKYCSQQCGMRLATAR
IYQVLPQRIQEWSLSSCVAEQHNRRSLEVVRSGLAKAQAALRALDKQHAEIDEMLQRAKH
ATIEHTDEKEADDETSMYCITCGHEIHSRSAVKHMEKCFIKYEAQASFGSRHRTRIDGQS
MFCDYYNQINGTYCKRLRVMCPEHFKDPKVSDTDVCGCPLVKNVFDPTGEFCRAPKKSCL
KHYQWEKLRRAEVDMERVRQWLRLDELVEQERNIRLAMASRAGVLGLMLHSTYNHEVMER
ITKANENGKVKEGS