DPGLEAN20479 in OGS1.0

New model in OGS2.0DPOGS200066 
Genomic Positionscaffold1939:+ 4447-44713
See gene structure
CDS Length6327
Paired RNAseq reads  4216
Single RNAseq reads  10538
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012543 (0.0)
Best Drosophila hit  six-banded, isoform D (7e-21)
Best Human hitmethyl-CpG-binding domain protein 5 (1e-18)
Best NR hit (blastp)  PREDICTED: similar to six-banded CG13598-PB, isoform B [Apis mellifera] (1e-148)
Best NR hit (blastx)  PREDICTED: similar to six-banded CG13598-PB, isoform B [Apis mellifera] (3e-138)
GeneOntology terms
  
GO:0005634 nucleus
GO:0003677 DNA binding
InterPro families
  
IPR000313 PWWP
IPR001739 Methyl-CpG DNA binding
Orthology groupMCL18540

Nucleotide sequence:

ATGTCCGGCAAGCCGGCGAGTGTGGGGCCGCCGCGAGCGGGCGTGGCTGGCGCGCCTCGC
GTGCTGGTGTACGTGGCGGAGGCGCGCCAGCTAGAGGCGGCTGCCGCGGGGCCCGACCCG
CAGCCTCCACCGCTTGACACCCTCTACCTCCAAGATAACGTGCCGCACCTGCCTATGCCG
AGAAACAACCTGTACCGAAACCCCATCGAACAGAACGATCAGAGAACTGGTGCACTTTAC
TATAAATCCATCAATGATAATGAATCTAAAATAAAACCATTCGAATCGGAATCTACACAG
AGGGAAGAGTTCTACACAACTCCACATTTTATGAAAGATAATGTTAGAAATGATGATAAA
GCTAATGATAGGCAAAGCAGTGTGACGGTTAAAAGAGATTGTGAAAACTCTGAGTGGAAT
AATATAGATTCATACCACGTTAGAGTTCCAATGCAAGTTGGAGACGTTGGGGCATATCAG
AGATGTCCAGTAAGTGTAGATAACATGCCCGTAATGCCGATTGGGCATACCCAGTTCAAT
GATAGTGGTAGTGATATGAACTGTGAGAGGGAGGTGTCGGGGAAGAGTTTTGTAAACTAT
CAGTTGTTAGAACAGAGTGTGACTCATCAGTCTATAGTGAATCAAAGCATAAATATACTG
AACCAGCAGATTGGTGTGAATAGATCTGTAGTTAGTGAGGGTGATAGTAGCCCGCCACAG
CTCATACAGACACCAGATGGTGTTGTGCTTGCGGTGCTCCCACCCTCGGCTGTACCCCAG
ACGGAAGGTGATGTTAGAGCTGTGCATAGTGAATATCCACAAACAATTACCGTGCCGCTG
GGCTGGCGGAGAATTGTTAATGGGACATCTGTTCTATATGTTAGTCCAAGCGGTACAGCT
CTGAGTAACTACACTCAGCTGCGGGAGTACCTTCAGACGGCTGGTACCTGTAAATGCGGC
CTGCCTTGCCCGCTGCGTCCGGAAACCGCCTTCAGTTTCGATCCTAAGGTATCGAGTAAG
CCTTACCAAGTGGCTGCTGGCGCGGAGCTGACAAAGCTTTGCAATCATAAGCGGAAACTT
CTGGCCTCCCTGCAGTCGAGGGTCCAAAGTCCAGTAACTCCGCCACCGGCCATCGATCAG
AAGAAAGGCGGCGGCCGGCGGCACCCCGTCGACCCCCGTCGACTATTGATCTGTCGACGC
CCGCCTCCAACCCTCGGCCACCCAGCGACCGCTGAAACTACTAAGAAGAAAATGAAGAAG
CGTTCCGGTTTCATACCTAACATCAGCGTATCCCAAATGATGGTGCAAAGGGATAGACCC
CTCAACGAGTTGAAGGCTGATAATGATCAGAAGCGGACTGCGTCCCCGGGAGCCATGTCC
CAACGTCCCGGCAATATGCAGTCTTGCTTGCCGTATCAACAGCAAAACATGATGAATATG
GTACAGGATGAGGTTAAACAAGAAACCCTTGTACAAAGAAGTTCACATTATCTGACTGTT
AGCAGTGGATGGATAACACAACATCCTGACAGTATGGAATCTGGACACAAGCAGAATCAA
ATTATCGCAGGACCTGGAAATAATTCAGTGCATGTTGGCCTTCCAGTGCTAGGCACTAAT
GGGCAAGTTATTGGAGTGAATACCTCATATAACAAACACGTTACGAATATCAAGAATACT
GTTAACCTCACACCGCAAAATGTAATGGAACATGAGCGTAAAGAAGATAAAGACATAAAT
CCCGGTAAACAATCATCGAAAGAAGACGACTCGCAAATATTCTACGGTTTGAATCAGCCC
CTAACACCGGAGGTGATGCAAAAGATTAATGAACAGCAGCAAATATTTATACAAAGAAAC
CAGAAACAAATCGAAGTAATGAAGGGCTACAAAACCCAAATGTCAGATAATGTTACTCAA
AACAAACCTCATAACCAAAATCAGTATGTCCCTCAGAACCAGTTAGTTATTCAAAATGGA
ACGGTAGTAAAAACAAATTCTGCAAAGACACCACCGTGGCAAGTGAAGAGGGTAGATAAT
ACATCGAATGCCTTAATATGTCCCAGCCCTAAACAGCTAAAAATTGAGAATTCTGAATAT
GAAGATTCTAATCCCAATCCCATGGAGGATTCTTCGCCATCGGGTTCGTTTGCGAACAAT
GACCAGATGATGAACCCGGCTATGTGTTCCCGAGTTCCTCCTCTACCACAACATTACAAC
ATGATGGGACAGCAATGGCCAAATGTTGATAATAAAAAGAAAGCTAGATGTAACAGCAAA
ACTTCTAAGAAGAAAGCCGCTGCCAATGACCCTAAGAATATGAATTTTGGTAACAATGGA
TTGAGACATATGCAAGACGTCAAAGGCAAGCCTTGTGAAGATATAATGAGAAATAACGTT
CCTTCGTTCATGGAAGATCCTAGTGGTTATCTAGCACAACAAACAGTACTTTTGAATAAC
ACTATATCGAGACAGGTAGGTGTGAATGTTCCATATGAAGCTAACCAGTTTGAGAACTCC
GGAAATTCATATAACACTTTGAATCCAACGGACAAGAAAAATCTCCCGAAGCAAAGTGAA
CCCATGAATGTGTTTAGAAATAATGTTGTAGCTAATTCAACACCATCCCCAAACAACGCT
CCTGATAGTAGTCTTATACCAGAGAAAGTCGTTGAAAATAATCTACAGTCATGTTGCAAG
GGCTGCAATGTTTCCTACAAAGGCAACTGTTCCGATACTTCGGATAACCCGATGCGCTCA
AAGTTATTAAAACAACATACTAATGTTTATGGTCTGGAAATGGACAGTGGGGTGGCTACA
ACGTCTAGTCCTAAACATGGATTCGAAGATAATCCAGTTACATCATCAACCTATATAGAT
CGTAGTATGATGTTAAATGATAATTGCGGTCCGATACAAGCCGGTATTGTTAGCACTAGC
AATGTGTCACCGAGTGAGACTCTGCAACAACCGGAGCCTTCACCTACATTGTCTAATAGC
TCCCGTAATACGGACACTCCTCATAGTAGTGGCAGCAACTCAATGCAAAACTGTACATTC
CCTGTTCCTAGTCCAGCCTATTCCAATCCAGGTTCCAGCAGAGATAACTTTAATAGTAAT
CCTTCTCCTCCAGGCAGTAATCAGCCATATCCGTACAATACCGTAGGGCCACCCAATACA
GAATCGGTATCCGGAAATCAGGTTATGCATTTCATATCAAATCACAATATCCACAAAAAT
CTAATGGAAGTGAATAACATATCAATGGTTGGTGTCAAACCAGGAATGAGGAGCCAAGAA
TATAGAATGGATGGCAAGAAACGTATTGAAAATACAATGCCCGGTTACTGTCCGCCCCCA
CATCTAGGCGGGGGTCCGGCACATTCCCTTATACAAACTTGTTACATACAAACAATTGTT
ACGACTATGGCTAGCGGTTTTTCCGTTACAAGGGATACTGTAACTTCAGTTCTAGCGGGA
AAAGCTAATACTGCGACTACCTCTATAAATGCATCACAAGCAAATTTCATAAGAACTCCG
CCGCCTCCAAGTGTCAATTTGGCGACGACCTATGCTATACCGAATAATCAGCCCGATCCC
TTTATTAACTCTGTTAATTTGCCAACAAGTTATCCGCTGCACATCGCCGGTACAACCGCG
CAAAATATGATATCAAAATCGCCTTTGGAGATGGTACAAAATGTTATAAGCAGTTTACCT
TCCAAGCCGGAGGCGAGCAATTGCCAATCGACCATAGCGCAGAGAAGATCGTGTTCCAAC
TCATCCGGTCAGATACTTATTTCATCGACCGGTCAAATAATTGTGTCAAACAATCAAATG
CCGCCACCTCCACCTAAAAATACAACCACGATGTCACCAATTGGTTCTAACGCTATAACA
AATGTGACCACTTCAGTTACTCAAGTCGTACCAGCTGTTGGCAATGTTCAACCGATTGTC
AATCAACCGACTGTCGTGGTAAATGCATTGCAAACTCCTTTCGTTATTCAACCTCCCATG
ATGGCTGTTGAGGGGCAAGTCGTCCAACCGAATACCGTCCTCCCTCAAATAGTAACTGGA
GGCATCGTGTCTGGACCCAACGAAATTTCACGGCAAATTGATATGAAGAATGGACAAAAC
TTCGTCCAAGGTGTGGCCATGCTTTCTCCTGAAAGTTTAAAGAAGAGAAGTAAAAAGAAG
AAAAATCAAACGGCGAATATAACGAACGTTCTGCAAATCACAGCACCGCAACAGAACCCA
AATAATATCATGGTACACTCATCGCCACAACATAATTCTAGTCCACAGTTTTCGCCACGC
GGTTTTCAATTGTCGCCAACAAACAATATATCGCCGACGCCGATGTTGCAAGCACTAACT
ATAGTACCCGGGAAGTCCGGTACTCCAGCACATATTGTAATGAACGGCCAAGGGAATTCA
AATAACTTTGGATCTCAACAGATAATCACAAACACTTCACCATCACAACAAATTAACTTA
TTACAACCAGTCAATCTAATCAATAACGCCAGTAATGTGATGTCAAATTTCCCGGCGTTC
CAGCAATTTATTCTGCCAAATCTCGGCGGTATGGTAATGACAGCTGATGGCACAGCGATC
ATTCAAGACAATTCAACAGGAATGCCTATGCAATTACAATTGCAAACGGTAAACGGACAA
AACGTTTTAACGCCTGTGCAAAATCCTGGATTGTTTACCGCCGGGAATAACAGCGGTGTC
GTTATAAGAGCACAAAACCAGCAAGGAAAAATAATACAATCGCAACACAGCCCTGGTGCT
CAATTTTTGTCGCCGAACAGTCAAGTTATGGTCAACAGTCCCAACTTCAACGGACAGCTC
AGCCCTTTACTTGCGAATCTTAGTCCAACTAACGTCGCGTTCAATAGTTCGCAAGTGCGA
GCGGGGAACGTCCAAACACAGGAGTTCATCCAGACGAATCAAATGGGCCAAACATTAATG
GTTCCGTTGTCCCCGAAACCAGTTCCCATATCCGCTAGTAACAGAAACTCCACTTTCGTC
CAGAACACGACTATAGTTCAACAACAGACAACACTAGTTTCCAACTCTCATAATACTTCG
ATGAACATGAACAATACCTCTCGATTGAGCATCGATCCAAATGTGCTTCTTACTCAGAAG
GTTGGGAAGAGGTTTCCTCAAGAAAATCAGGAAATAAAGGACAAGATCCTACCGTTTGAT
GTGAAGGAAGATAAAGAGGATAGCGAAGGTTACGCCCGTTTAATGCAGGAGTTGGGTGGT
GTAAGACATTCCGTTTCGACTCAAACTTTGGGTCAAAGAGGGGGTAAATCCCCTAACACA
GCCGGGGGTTCCCCACCAGATACTACGACACACAGTCCGTTGGGAGCCTTAGATTCCGCT
CCAAGCCCAAGATTATATTCCGCAGCAACTCACGCTACCTACGCCGATACCACAACCAAA
TCACCCGAACCCGCGGATGTTCATTCTTCGGCGATGGTGCAATGTGTGTCCAGCAGTGAA
CAAGACATGGCTGAGTCAAGGGAACGAGCGTGGCCCGTGAGGAACGAAGAAACAGACTCA
CACAATCTAATGTCAAACATACAGCTTATGAGGTTAAACCGCCACAACTCTGTGGAGTCG
AGCGAGATGTACCACAGCATGGCTGCGAACCAGCAAATGATGCAAATGCCGTACGAAAGA
CCTCACAGCTACAAACGGAAAAACGACTTCGGGGACGGCTCATCGTATAGGAACGACAAA
ATTATGAGGACCAATTATAACATGGTCCACATGCCCAAAGAATACGACATGAACCAGGAC
AAACAGAGGAGATTCTCGCCCGGCAGTTCGGAGAGCATGCAGAAAGTATTCGATAGACAT
TTAGGCTCGGAAGACGATACAGAGGATAATTACCGGAGATTCGAAGTCGGCGATCTGGTA
TGGGGGCCGGTTAAAGGTTATGTCTCGTGGCCGGGCAAGTTGGTGTCGCGTGTGTCGGAC
ACCAGCTGGAGTGTCCGGTGGTTCGGTGGCAGGCCTACCAGCGAAGTCGAACACTCCAGA
CTGTTGACACTCTGCGAGGGTCTGGAGGCGCATCACGCGGCACGGATGAGACACAGGAAG
AGCAGAAAACTAAATACGCTATTAGAGAACGCCATACAGGAGGCAATGGCTGACTTAGAT
AAGAAGACGGAAAATAACGACGCTGATGATTTACAAGAGGATGTATCTGATGTAACAGAC
ACTACTAAAAGGAAAACACCCAAACTTAGGAAACACAAGAAAAACCCATCCAAAGTCGAC
GGAACCCGTTTGAGGAGCTCGCGATGA

Protein sequence:

MSGKPASVGPPRAGVAGAPRVLVYVAEARQLEAAAAGPDPQPPPLDTLYLQDNVPHLPMP
RNNLYRNPIEQNDQRTGALYYKSINDNESKIKPFESESTQREEFYTTPHFMKDNVRNDDK
ANDRQSSVTVKRDCENSEWNNIDSYHVRVPMQVGDVGAYQRCPVSVDNMPVMPIGHTQFN
DSGSDMNCEREVSGKSFVNYQLLEQSVTHQSIVNQSINILNQQIGVNRSVVSEGDSSPPQ
LIQTPDGVVLAVLPPSAVPQTEGDVRAVHSEYPQTITVPLGWRRIVNGTSVLYVSPSGTA
LSNYTQLREYLQTAGTCKCGLPCPLRPETAFSFDPKVSSKPYQVAAGAELTKLCNHKRKL
LASLQSRVQSPVTPPPAIDQKKGGGRRHPVDPRRLLICRRPPPTLGHPATAETTKKKMKK
RSGFIPNISVSQMMVQRDRPLNELKADNDQKRTASPGAMSQRPGNMQSCLPYQQQNMMNM
VQDEVKQETLVQRSSHYLTVSSGWITQHPDSMESGHKQNQIIAGPGNNSVHVGLPVLGTN
GQVIGVNTSYNKHVTNIKNTVNLTPQNVMEHERKEDKDINPGKQSSKEDDSQIFYGLNQP
LTPEVMQKINEQQQIFIQRNQKQIEVMKGYKTQMSDNVTQNKPHNQNQYVPQNQLVIQNG
TVVKTNSAKTPPWQVKRVDNTSNALICPSPKQLKIENSEYEDSNPNPMEDSSPSGSFANN
DQMMNPAMCSRVPPLPQHYNMMGQQWPNVDNKKKARCNSKTSKKKAAANDPKNMNFGNNG
LRHMQDVKGKPCEDIMRNNVPSFMEDPSGYLAQQTVLLNNTISRQVGVNVPYEANQFENS
GNSYNTLNPTDKKNLPKQSEPMNVFRNNVVANSTPSPNNAPDSSLIPEKVVENNLQSCCK
GCNVSYKGNCSDTSDNPMRSKLLKQHTNVYGLEMDSGVATTSSPKHGFEDNPVTSSTYID
RSMMLNDNCGPIQAGIVSTSNVSPSETLQQPEPSPTLSNSSRNTDTPHSSGSNSMQNCTF
PVPSPAYSNPGSSRDNFNSNPSPPGSNQPYPYNTVGPPNTESVSGNQVMHFISNHNIHKN
LMEVNNISMVGVKPGMRSQEYRMDGKKRIENTMPGYCPPPHLGGGPAHSLIQTCYIQTIV
TTMASGFSVTRDTVTSVLAGKANTATTSINASQANFIRTPPPPSVNLATTYAIPNNQPDP
FINSVNLPTSYPLHIAGTTAQNMISKSPLEMVQNVISSLPSKPEASNCQSTIAQRRSCSN
SSGQILISSTGQIIVSNNQMPPPPPKNTTTMSPIGSNAITNVTTSVTQVVPAVGNVQPIV
NQPTVVVNALQTPFVIQPPMMAVEGQVVQPNTVLPQIVTGGIVSGPNEISRQIDMKNGQN
FVQGVAMLSPESLKKRSKKKKNQTANITNVLQITAPQQNPNNIMVHSSPQHNSSPQFSPR
GFQLSPTNNISPTPMLQALTIVPGKSGTPAHIVMNGQGNSNNFGSQQIITNTSPSQQINL
LQPVNLINNASNVMSNFPAFQQFILPNLGGMVMTADGTAIIQDNSTGMPMQLQLQTVNGQ
NVLTPVQNPGLFTAGNNSGVVIRAQNQQGKIIQSQHSPGAQFLSPNSQVMVNSPNFNGQL
SPLLANLSPTNVAFNSSQVRAGNVQTQEFIQTNQMGQTLMVPLSPKPVPISASNRNSTFV
QNTTIVQQQTTLVSNSHNTSMNMNNTSRLSIDPNVLLTQKVGKRFPQENQEIKDKILPFD
VKEDKEDSEGYARLMQELGGVRHSVSTQTLGQRGGKSPNTAGGSPPDTTTHSPLGALDSA
PSPRLYSAATHATYADTTTKSPEPADVHSSAMVQCVSSSEQDMAESRERAWPVRNEETDS
HNLMSNIQLMRLNRHNSVESSEMYHSMAANQQMMQMPYERPHSYKRKNDFGDGSSYRNDK
IMRTNYNMVHMPKEYDMNQDKQRRFSPGSSESMQKVFDRHLGSEDDTEDNYRRFEVGDLV
WGPVKGYVSWPGKLVSRVSDTSWSVRWFGGRPTSEVEHSRLLTLCEGLEAHHAARMRHRK
SRKLNTLLENAIQEAMADLDKKTENNDADDLQEDVSDVTDTTKRKTPKLRKHKKNPSKVD
GTRLRSSR