DPGLEAN11404 in OGS1.0

New model in OGS2.0DPOGS208752 
Genomic Positionscaffold721:+ 11448-19818
See gene structure
CDS Length4362
Paired RNAseq reads  2952
Single RNAseq reads  6648
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003413 (3e-72)
Best Drosophila hit  MBD-R2, isoform A (3e-48)
Best Human hitPHD finger protein 20-like protein 1 isoform 1 (1e-18)
Best NR hit (blastp)  PREDICTED: similar to MBD-R2 CG10042-PA, isoform A [Apis mellifera] (7e-98)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC010453 [Tribolium castaneum] (5e-93)
GeneOntology terms


  
GO:0008270 zinc ion binding
GO:0003677 DNA binding
GO:0005515 protein binding
GO:0005634 nucleus
InterPro families








  
IPR006612 Zinc finger, C2CH-type
IPR002999 Tudor domain
IPR001739 Methyl-CpG DNA binding
IPR001965 Zinc finger, PHD-type
IPR013083 Zinc finger, RING/FYVE/PHD-type
IPR007087 Zinc finger, C2H2-type
IPR019786 Zinc finger, PHD-type, conserved site
IPR016177 DNA-binding, integrase-type
IPR011011 Zinc finger, FYVE/PHD-type
IPR016197 Chromo domain-like
Orthology groupMCL16848

Nucleotide sequence:

ATGGCTGTAAAAAAATGTTGTGTAGAAAACTGTAATTCATCTTCAACAAGACCAGAAGAC
ATTGGTGTTACATACCACAAGTTCCCTAAAGATAAGACATTACGTGATTTATGGTCGTTG
GTAACGCACTACAAACAAACTAACATAGACTCTACGACATACGTGTGCTCGCGTCACTTT
TGCAAAATCGATTTTCAAATTTACGAGGACTCAAAATACATTCTTAGATCAGATTCCATT
CCCTCTATATTTTCATGGATCCAAAGAGATAAAGATACAAAGATACAGCAATTAGAATCT
AATATGGATGAACCTAATATTTCTGGGGCAGCATCCCCTGTAAATGAAAGCTCAGATGGA
GGTGCTGCAAACCTGAACACCTCATCCAGCAGTAAAGAATCCGAAGGTGAAAATGTTGAA
GCTATTATGAAATTCATTGAAGAACAAGAACAGGAAATAAAAAAGCAACAGAATGAGAGC
CAATCTGAAAAGATTGCAAGCGATCAAAATGTGCCGCTAACCCATAATGACAATATTGAC
AATAATGACAATAATCAAGTGTTCAGTGACATAGCGGAGCCTATGGTGATAGCGACGAGT
GTTATGGACATGATTCTCAGTGAATCAGAAGCTAAAATAGACACAAGGAAAAATATTAAA
CCAATACCACAGAAACTGAGTAAAAATGATAAAGGCAACAGTGTTTCACTGTCGGTCGGA
TCTAAGGTTGAGGCCAAAGATTATGGAGAATTTTGGCATTCGGCTCAGATTGTGGAAGTG
GACTATGACGAAATGGAAGTTCTGGTGCATTATGAGAACACACACAACAAACCCGATGAA
TGGATAAGTGTGAGCAGTCCCAGATTGAGGCTTACGAACAATCCTACACAAAGCACCCCA
GCGAGAAACGTCAGGACTGAAATAAAACCTGAAAAGGAAGAAGTTAAGGTGGAGGAGAAA
CCAAAACAACAGTTTGTTGTTGGTGAAAGATGTCTGGCTCGTTGGAGGGACAACAGACGT
TTCATAGCCACAATACTAACAGATCTCGGCAATGGCAATTACGAGATTATGTTCGACGAT
GGTTTCAAATGGAAATGCACTACGTCAAGGATGTGTAAGCTCAAGGAGTCTAAGACGGAA
CCGCTGGCCATCGATACGTCAGCGTCAGCATCATCTTCCAGTCTTTCACCGATACCAATT
CCTGGGACGGGTCCGACGGGAAATATCCCAAACAGCCAATACACGTTCCACACACATCTA
TTCGATCCCACCCGCGATTATTTGGGCTCTAAGAGCGAGAGGCGAGAAATGAAACGCAAA
TTGAACATAAAAGAGATATTTAATATAGGTCAGAAGAAACAAAAGCGAAAAGATAGCGAT
CAAGGGAAACCGAAAATTGGTAAGGTGAAGAAGGCAAGGGTTATTAAGAAGAAGGTTGAC
ATTAAACCGGAAGCTGAGACCGAAGTCAAGCTTGAAGTGTCGCAAATAACAATGGAGATA
AAAAAAGAGATTCCCGATGCAGTCGCTTCGATAATTGGTACTGTCAGTAAAGATGAAAGC
GATAAAACTAAAATTGATACAGAAATTAATATACCGATGGACGCTGCTAAGGACGTGGAA
GAATCTAGTGCTGAGACTAACATTGAAGACGCAAATGTATTAGACGTAAAAAATACAGGA
ACGGCTTTCGATTCAGAGGAGGTCGTTGAAAAAAATGACATCGGAGATAGTTTAGTTAGT
AGCTCAGATCTGTTAGTCCCTAGCGACTTAGGCTTCCAATCAGAGCCTACATCGGATCCT
GTCATGGAAGAAGAAGCGAAATTAGAACAATTCGAAAAACCGGATGAAAGTAAACAAGAG
GTTATAGAGAAAATGAAAGAGGTTATATGCAAATTAGAGGGCGGTTTAGATATACATAAG
ATAGATACAACGAAACGTGAAGTAGACACGAAGCCAGTAAGCGAAGTTGATACAACAGTA
AAAGACGACAGATCGAAAAGGAAGCTGTCGAAAATAAAGAGGAATAAAAGATTGAGAATG
TTACAGGAGAAGAAAGTTAAGAAGCAGGTGGAGAAAGTGAAGAACGAGCTGGTGGAGATG
AGGAAGCAGATGGAGGAGATGAGGAAACAGATGTTGATGAAGACGGAGGAGATGGCTCGC
CCGCACGAGATGCCCGAGAGCTTCCTGCTACCGGGAGAGTGGTGCTGCAAATGGCTCAAC
GGGCAACCCCTGGGCAATGTGTGCGAGTTTGAAGATAAAGTCGACGGCAAAGGGCTTAAG
AAGATGAGCGTTCAGGTCGAGGATAAAAGACTGCCTCCAGGGTGGACGAAACTTATGGTG
CGTCGAAGTTTTGGACAGTCCGCTGGGAAATGGGACGTCGTTTTAGTTGGACCGGAAAAT
CGTAGGTTTCATACTAAGACGGATATACGGAACTATCTCGAGCAGCACGACGACTCCCTC
AAGCAGTACGAACACGCGCTGTTAGATTTCGGTGTACATCTGAAGCTGTCCCGTAGGATG
GGATGGTATACGACGGATGGCGGCGTTGCACCGGCACTGGTGAAAAGAAAGAAATTAGGT
ATAAACAGGAAGGAAGGAAAGAAAAGAAAGAAAGAGAAGTCAGCCAAGCGTGATATATCC
TTGGAAAGTTTTTATAAACGTACATTTTACCCGGAAAGCCCACCGGTCTTCCTGGAAAAT
CCCGTGGAAGACGATGGTTCTGTGTACGTCGGTTCTATGAAGGTGGAAGTGATCGATAAT
CTCTTACGGTGTCCAGCTGAGGGCTGCCTGAAGAACTTCCGAAATACCACACTACTGCAG
ATGCACATAAAACATTACCACAGAGAAATGAGGGAAATGTTGGGAGCCACCCCAAAAGTT
TTGGACTTAGCGCGCGAAAGAACGAAACCCACTGATATCGAAGTCAAAAAGACGGAATTT
GAATCCAAAATTATTAAAGTCAAGCTACCAAAACTGCCGAAGAGATCCGAGGAACCCAAA
AGTCAAACGAACCCAGAAGTCAAAGAGCCCATTGTGCAAAAAGTAGAGCCTCGACCACAA
ACACCACCTAAACTGGATGTGCCCATACCTAGATCACAGGATTCTCCTAAACTAAGACAA
GCACTAATCACCAAACCGGCTAAAAGACCGAAAGTTCTACTCCCAGTTAGAAAACCAGAG
CCAGAAGAAAAAGAAGAAATCCCCGAGGAAGCTGATGTAGAAAAGATAGATTTCGACGAC
AGCTCCAATACTGCAGAGAAACCGTTTGAGGAGTTCCGAAGGAAGTGTGATAAAAAGCGC
AAATGTTTTTCAACTGTGTCAAGGAAGCCTATCAGCGAGGAGGACGAGTGGTTCGGTGTG
AACTCTGACCTTGACACTCGGTCCAGTTTCCCAGGGTCTGGCACACCGGACTCCAAAAAC
ATGGACAAGGCAGTACCACTTCCGGTTTCCTCCGAGTCCAATGAAGAACAGAAGGACGGC
AATATGTATATGTATACAGAGACTGGCGAACGTATAAAGATCGAGCACATGAAACGCGAG
GAGATCATAAACTGTCATTGCGGTTTCCGCGAGGAAGACGGGCTGATGGTGCAGTGTGAA
CTCTGCCTGTGTTGGCAACACGCGCTGTGTCACAACATACAGAAGGAGTCGGAGGTTCCA
GAGAAATACACTTGCAGTATATGTCTCAATCCTCGGCGTGGGAGACGCTCCAAGCGGTTC
TTGCACGATCAGGACAGACTGTACGAGGGGTTGCTGCCGGGGGCGAAGCCCTGCGAGACT
TTGCGACGCTCTCACGAATTATCAGCTAACCTATTGAAAATTCAGGATGCTCTGCATGCA
ATGCGAGTCAAACACTATGTAGCTACTAAGAAAGACCACCCAAAATTATATCTGTGGGCC
AAAGACTGGGAGAGTCCAGAGGTAAATTTCACCCAAGAAAGACTTAATTCAGATTACTCA
GATCTGAATATTATCATAAATAACATCGGCAAGGAGAATTTGCCGCTGAAACCCGATGAA
GTTAATCCACATCTGGATATAAGAATGCCCATAACTGAAGAGCCTGAAGATAGATTCACT
CAGAGAGACAAACAAGAAGTACAAAGAGTGGTCATCCCTCAGCCCGAGGCAGCCATTGAG
AACAGTGCATGCAGGGAACGCTTGCTGCGACATGTGCAGCGCTGTCAGGGCTTCATTGAC
GCCAGACTCGATTCTATAGAAGCTCAAGTAGCCGAACTCGAATCTCAAGATCCATCATTT
GAGGATGATGAGACAGCGGATTTCTTCCCAAGAACAAAACAAACTATCCAAATGCTGATG
AGGGACCTCGATACGATGGAAGAACTGGGAATTATATCTTGA

Protein sequence:

MAVKKCCVENCNSSSTRPEDIGVTYHKFPKDKTLRDLWSLVTHYKQTNIDSTTYVCSRHF
CKIDFQIYEDSKYILRSDSIPSIFSWIQRDKDTKIQQLESNMDEPNISGAASPVNESSDG
GAANLNTSSSSKESEGENVEAIMKFIEEQEQEIKKQQNESQSEKIASDQNVPLTHNDNID
NNDNNQVFSDIAEPMVIATSVMDMILSESEAKIDTRKNIKPIPQKLSKNDKGNSVSLSVG
SKVEAKDYGEFWHSAQIVEVDYDEMEVLVHYENTHNKPDEWISVSSPRLRLTNNPTQSTP
ARNVRTEIKPEKEEVKVEEKPKQQFVVGERCLARWRDNRRFIATILTDLGNGNYEIMFDD
GFKWKCTTSRMCKLKESKTEPLAIDTSASASSSSLSPIPIPGTGPTGNIPNSQYTFHTHL
FDPTRDYLGSKSERREMKRKLNIKEIFNIGQKKQKRKDSDQGKPKIGKVKKARVIKKKVD
IKPEAETEVKLEVSQITMEIKKEIPDAVASIIGTVSKDESDKTKIDTEINIPMDAAKDVE
ESSAETNIEDANVLDVKNTGTAFDSEEVVEKNDIGDSLVSSSDLLVPSDLGFQSEPTSDP
VMEEEAKLEQFEKPDESKQEVIEKMKEVICKLEGGLDIHKIDTTKREVDTKPVSEVDTTV
KDDRSKRKLSKIKRNKRLRMLQEKKVKKQVEKVKNELVEMRKQMEEMRKQMLMKTEEMAR
PHEMPESFLLPGEWCCKWLNGQPLGNVCEFEDKVDGKGLKKMSVQVEDKRLPPGWTKLMV
RRSFGQSAGKWDVVLVGPENRRFHTKTDIRNYLEQHDDSLKQYEHALLDFGVHLKLSRRM
GWYTTDGGVAPALVKRKKLGINRKEGKKRKKEKSAKRDISLESFYKRTFYPESPPVFLEN
PVEDDGSVYVGSMKVEVIDNLLRCPAEGCLKNFRNTTLLQMHIKHYHREMREMLGATPKV
LDLARERTKPTDIEVKKTEFESKIIKVKLPKLPKRSEEPKSQTNPEVKEPIVQKVEPRPQ
TPPKLDVPIPRSQDSPKLRQALITKPAKRPKVLLPVRKPEPEEKEEIPEEADVEKIDFDD
SSNTAEKPFEEFRRKCDKKRKCFSTVSRKPISEEDEWFGVNSDLDTRSSFPGSGTPDSKN
MDKAVPLPVSSESNEEQKDGNMYMYTETGERIKIEHMKREEIINCHCGFREEDGLMVQCE
LCLCWQHALCHNIQKESEVPEKYTCSICLNPRRGRRSKRFLHDQDRLYEGLLPGAKPCET
LRRSHELSANLLKIQDALHAMRVKHYVATKKDHPKLYLWAKDWESPEVNFTQERLNSDYS
DLNIIINNIGKENLPLKPDEVNPHLDIRMPITEEPEDRFTQRDKQEVQRVVIPQPEAAIE
NSACRERLLRHVQRCQGFIDARLDSIEAQVAELESQDPSFEDDETADFFPRTKQTIQMLM
RDLDTMEELGIIS