DPGLEAN08021 in OGS1.0

New model in OGS2.0  DPOGS212026
Genomic Position  scaffold221:- 159038-168729
CDS Length  4830
Paired RNAseq reads  4187
Single RNAseq reads  9507
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA010180 (1e-09)
Best Drosophila hit  without children, isoform B (0.0)
Best Human hit  zinc finger MYM-type protein 4 (3e-57)
Best NR hit (blastp)  WOC protein, putative [Aedes aegypti] (0.0)
Best NR hit (blastx)  WOC protein [Culex quinquefasciatus] (0.0)
GeneOntology terms
  
GO:0006697 ecdysone biosynthetic process
GO:0003677 DNA binding
GO:0008270 zinc ion binding
GO:0000792 heterochromatin
GO:0000791 euchromatin
GO:0005705 polytene chromosome interband
GO:0035012 polytene chromosome, telomeric region
GO:0016233 telomere capping
GO:0043234 protein complex
GO:0005515 protein binding
GO:0006355 regulation of transcription, DNA-dependent
InterPro families
  
IPR021893 Protein of unknown function DUF3504
IPR011017 TRASH
Orthology group  MCL10637
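
The Genomic Position entry above packs a scaffold name, a strand sign, and a coordinate range into one string. The sketch below shows one way to split it and compute the genomic span. The function name `parse_position` is hypothetical, and two points are assumptions not stated in the record: that the `-` after the colon denotes the minus strand, and that the coordinates are inclusive.

```python
import re

def parse_position(text):
    """Split a 'scaffold:strand start-end' string into its parts.

    Assumption: the character after the colon is a strand sign (+ or -).
    """
    m = re.fullmatch(r"\s*(\S+?):([+-])\s*(\d+)-(\d+)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand, start, end = m.groups()
    return scaffold, strand, int(start), int(end)

scaffold, strand, start, end = parse_position("scaffold221:- 159038-168729")

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span = end - start + 1
print(scaffold, strand, start, end, span)
```

Under those assumptions the locus spans 9692 bp, about twice the 4830 nt CDS length listed in the record.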

Nucleotide sequence:

ATGGATGAAAAAGAAATACCTGAGAATCTCGGTGAAAACGACAATGAGACCGGTGATGGG
ATAAATACGACTGACATAAAAGAAGTAAGCGAAAATAATTGTACTAAAAGTGAAGAAAAT
AGTGAATGTGGACCAAGTGAACCTAATAGTATACAAAATCATCTAGAATGTGACGTTATC
AAAAGTGATGTTGTAGAAAAGAGTGAAAAATGTGACGAACAAGGTGATGTAGAAGTATCT
GTCAGTGTTGGTGATGATACAAAAGATATAGGAGAGATTCAAAAAGAGGAACTTCAAGAA
ATCTCAGAATTAGACAATGAATCAGTAGAGCCAGAAAAAGAAATACAAGATTTAGAAAAG
CAAGTTTTAGACTCAGATTCAAAACGTGATGTCCCTCTGGAGAAACCTAGCTTGTGTGAT
GCTAATGAAGGAGTTTCTAAATCTGAAATTAGTGAAGAGCTTGTTGTTAACAATGCCACA
GACCCCACTGTATCAAAAGAACTATTGAAAGAAGATAGTGCAGTTACAAAATTAGAGACT
GATGAAACCCGCAATGCTGAATGTATAGAGGACATAAAATTAAGTATTGACAACCCTGAT
ACTGAACATTCTGAATGTATAGAAAACATAAAATCAAATATTGACAACCCTGACACTGAA
AATAAAAGCCCTAATATAGAAAGTATTACAAAAGAACCGAGTCAAGAACATGCAAATGAG
GCTAATGAAGAGGATTATGAAAAAAGGGAGGTCGGCAAACCCGAGTTTGAAACAGCCCCT
AATGTGTCTATGGATGATCACCACGAAGATCATACACAGGCTTTAGACCCATTTGATGCT
CTTTTGAAAGACCGAACAGATGCAGCGGAAACATCAGAGACAGCCACTCAAGCTGATCTG
TCCTCAATTAATATAGATGATGATGATCATCACAACGCCGATGATGCCCATGATATGATT
CCTGATGATGAAGATGAACATCATCCGGACGATGAGAGTGGAATACCAGTTATAACAGAG
CAACCAGGAGCTGATGAAGAGGTATGCTTGTTACCCGACACCGAAAGAGAAATATCTGAG
GCTGATAAAGCGGCTGCTGAAAAGGTTTTAGCTGAGAAGAGAAAAAGAGATGAAGAATTG
GCTATGCCCAAAGAGGAAGCTGAAGCGAGTGAATCTAATGAAGGTGTTGCAGAAGAACAG
GTGCAGGAAGATGTAAACGAAAGTCTGGAAGGGGAGGGGGAGGGGGAGTCTGAAATATCC
ACGGGAGATGCGTCACAGGATAATGTAGGGGAAAAAGAAAACGATCAGGAAGATGCAGTC
GATTACGAACAGGAAGAGCCAGAACCTAATTCAATTCGACAAATAAGCGCCACCGACTCC
GTCTGTGTACAGTGTGACGAAGAGAAGTCATGTCAGTACAGATATTCAGACAAAGACGGA
GCCGTACATTATATATGCGTTCTCAGTTGTGTTAAATTGTTCCAAGCGGCGCACGCGGGG
CAGTACATCGTTGTAAATAAAAAATATATGGTCGAAGAAATTGCACCCAAAATACTGACG
TGTTCGGAATGCGAGGAGAAAAAGACTTGCTACTTTTATTACAACTTCGACGGCGAAGAC
ACGAACTACTGTTCAGTAGAGTGTCTGCACAGCATGATGGCGGATGAAAGAGACAAGTAC
ACGTTCAAAAGACGAAGGATCACCGTCGAGGAAAAGTCGCCCAAGGAAGATCAATGTTGC
GTCTGTGAGAGCACCAAGGAGTGTATATATAGTTTGACGCGCTACGGACAACAACTTTGG
ATTTGTGAACAAACATGCCTCAGGGCGATCAATTGTAAAGAAAACGGCAGGTATTTATTG
AGAAAGAAGAGAGTGCAACGAGTACAAGCACAAAAACCCGTCAAGACTAATCCGCCCCTG
CTGAAACTGAAAGTTATTAGTAATGCGACTGATAAATATTTAGACGAGGCTTACAAAGTG
CAAGGAAAAACGCCGGCGATGGTTCAAGCGGCCAGGGAAGAGAGGGAACGGACGTTTATA
AGGTCTTGCATGAATTGTCACATGATCCTCAATAATGAGGAAAAAATGCTAACGTGGGAA
GCTATGGACTATTGTAACGAGACATGTCTGGGAAGATATCAGAATAAATTCGGTTCCAAA
TGCACCAATTGCAAAGTACACGTGCAGCATACGAGCATAGGGAAGTACTGCGTCAGGTTC
GGATATAATATACGACAGTTCTGTAACTCGGCGTGTTTGGAGGACTTCAAAAAGGGTTTG
AAAATTTGTTGTTACTGTCAGAAAGATATATCGGACGGCAGTCAAGGGTTCCTAGCGCCG
GTGGGCGACAAGGGCCAGTTCAAAGACTTCTGTTCGCAGCTGTGTATGGAGAAGTTCGAC
AAGATGAGCAAGAACCCAGTTCCTAGGCCTGTTTGGGCGAAGTGCGCGGTCTGTTCGCTG
GAGAAAGCTACCACAATCGAGGTGGAAGTCGCTCCCGATGAATCACAAAGACTGTGCTCC
GATCCTTGCTTTGCTGCTTTCAAATTCGTCAACAACATTTTCCCTGATCAGTGCCGATGG
TGTAAAATATATTTTGAGAGGAAAATAAGTCAATTTTTTACGATATACGAGGGTTCATCA
CCTCAGTGTTTCTGTTCCAAGTCCTGTATGAATATCTATATAAGTAATTCTAGGCACATA
GTACCCTGTAACTGGTGCAAAGTTAAAAAGTACAATTTCGATATGATCAAACGCGTCCAA
CCCAACGGCCAGGACATTATGATGTGCTCCGTAAACTGCCTGAATCTATATCAAGTGTCC
ATCAACGCTGTGTCCTCGAGGAGAACGAAATGTGACCTGTGCAAGAATTCTGCTCTGGCG
CAATATCACCTCACCATGTCCGACGCCACAGTCCGAAACTTCTGTACATACCAGTGTGTG
ATGACATTCCAGGGACAATATTCAAAACAACCGGCCCCGTTAATGTCTGGAGATTCTATC
GACCAACAGAAAGCCGTTCCCACGGGCGCCCCCAGGCGCACGTATAACGCGAACACTCAC
AAAAATAACACTATGAAGTGTCAAAACCGTTCCGGTTCAGGCATGCCTGTGATATCCAAC
GTGCAGTCATTAGCTGCGCCGCCGCCTCTAGTGCCTACGAACGCTCGCAACAAGACCAAA
CGAGTAACACCCGAGGAGAACGCGGCGGAGCCCGCCATCCCGCAACCGCCTAGACCGCCC
ACACCCCCACCACCGCCCCCCAAGATATACAACCACGTGATCGTCAAGACTCTCCCGCCA
CAAGAAGTCGCCAACAAAGCCACTATGTCCAAACCCATGATGGTGTCCAAAGGAGTGTCG
TGTAGACCCCATCCATGCACGAAAGAGTGTCAGACAGACCCCAGCCTGGAGCGTCGTGTT
CTGATACCGGTTCCAGTTCCTATATACGTTCCGGTCCCCTGTGTGATGTGGTCGCTTCCG
TTTCCGGTCCCCGTGCCCATACCGATTCCCATACCGACGCCGGTGTTTATACCAACTACG
AGGAATTCGGCTAAGGGCATAATGAAAGAAATTAATAAGATCCATGACAAAATGCCGACG
GATCCCTTCGAGGCTGAACTACTGATGATGGCGGAGATGGTGGCGGGAGACAAGAAGAAG
GATCACAGCGACTCGGACACCGAGGACGAGAACGAGGAAGGTTTCAGTCCGGTGGCCGGT
ATGGACGGTAACAATGCGTTCGGCGAGGACGTGCTGCAGATGGCATTGAAAATGGCCACC
GAGTACGAAGACCAGCCCGTGGACCTGGAGTCAGCTATGACCGCCAACACCATCACACCC
AGCTCACATCCCGGAATGCCAGGTCTAGAAGGCGAGGGCATGCACCAGCACCATATGATG
GTACTGGAACAGCAGCGTGCTGTGGCAGCCCTGCGCGCATCAAGCGTGGGCGGCGTGGGC
GTGGGCGGTGTCGGTGTAGGCGTGGGCGTTGCTCGGAAACGAGCGCCCGCGGTTGCGCCT
CGTGGGCGACCCTCCAAGCGACGGCGGGAGCCAGCGCCCGCGCCGCCGCCCGACCCGCCT
CGCGAACCACAGGAGAAACCGGATGCTAATATGTGTCTTAAGTACACTTTCGGCGTCAAC
GCGTGGAAGCAGTGGGTGATGACGAAGAACGCAGAAATAGAGAAGAGTTCGATAAGACGA
AAACCTTTTAAATCTGAAATATTACAGCTGACGGCCGACGAGTTGAACTATTCCCTTTGT
TTGTTTGTTAAAGAGGTGCGGAAACCTAACGGCAGTGAATACGCACCGGACACTATTTAT
TATTTGGTTTTAGGAATTCAACAGTATCTGTTTGAAAACGGTAGGATAGACAATATATTC
ACGGATCCATATTACGAAAAGTTCACCGACTGTTTGGATGAAGTTGCTAGAAAATTTTCA
GTTTTATATAACGATTCCCAGTACATCGTGACCCGTGTGGAGGAGGAGCACCTCTGGGAG
AGTAAACAACTCGGCGCACACTCTCCACACGTGCTGTTGTCAACTCTAATGTTCTTTAAC
ACCAAACATTTTAATCTAGTAACGGTAGAGGAACACATGCAATTATCATTCTCACATATA
ATGAAGCACTGGAAGCGAAATCCCAACCAGCCGGGACAAGCCAAAATACCCGGCTCTAGG
AACGTTCTGCTCAGATTCTACCCTCCACAGTCAGCTCTAGAGGCGAATTCAAGAAAAAAG
AAAGTTTATGAACAACAAGAGAATGAGGAGAACCCGCTGAGATGTCCCGTTAAATTATAT
GAATTTTATATATCGAAATGGTATGTTTGA

Protein sequence:

MDEKEIPENLGENDNETGDGINTTDIKEVSENNCTKSEENSECGPSEPNSIQNHLECDVI
KSDVVEKSEKCDEQGDVEVSVSVGDDTKDIGEIQKEELQEISELDNESVEPEKEIQDLEK
QVLDSDSKRDVPLEKPSLCDANEGVSKSEISEELVVNNATDPTVSKELLKEDSAVTKLET
DETRNAECIEDIKLSIDNPDTEHSECIENIKSNIDNPDTENKSPNIESITKEPSQEHANE
ANEEDYEKREVGKPEFETAPNVSMDDHHEDHTQALDPFDALLKDRTDAAETSETATQADL
SSINIDDDDHHNADDAHDMIPDDEDEHHPDDESGIPVITEQPGADEEVCLLPDTEREISE
ADKAAAEKVLAEKRKRDEELAMPKEEAEASESNEGVAEEQVQEDVNESLEGEGEGESEIS
TGDASQDNVGEKENDQEDAVDYEQEEPEPNSIRQISATDSVCVQCDEEKSCQYRYSDKDG
AVHYICVLSCVKLFQAAHAGQYIVVNKKYMVEEIAPKILTCSECEEKKTCYFYYNFDGED
TNYCSVECLHSMMADERDKYTFKRRRITVEEKSPKEDQCCVCESTKECIYSLTRYGQQLW
ICEQTCLRAINCKENGRYLLRKKRVQRVQAQKPVKTNPPLLKLKVISNATDKYLDEAYKV
QGKTPAMVQAAREERERTFIRSCMNCHMILNNEEKMLTWEAMDYCNETCLGRYQNKFGSK
CTNCKVHVQHTSIGKYCVRFGYNIRQFCNSACLEDFKKGLKICCYCQKDISDGSQGFLAP
VGDKGQFKDFCSQLCMEKFDKMSKNPVPRPVWAKCAVCSLEKATTIEVEVAPDESQRLCS
DPCFAAFKFVNNIFPDQCRWCKIYFERKISQFFTIYEGSSPQCFCSKSCMNIYISNSRHI
VPCNWCKVKKYNFDMIKRVQPNGQDIMMCSVNCLNLYQVSINAVSSRRTKCDLCKNSALA
QYHLTMSDATVRNFCTYQCVMTFQGQYSKQPAPLMSGDSIDQQKAVPTGAPRRTYNANTH
KNNTMKCQNRSGSGMPVISNVQSLAAPPPLVPTNARNKTKRVTPEENAAEPAIPQPPRPP
TPPPPPPKIYNHVIVKTLPPQEVANKATMSKPMMVSKGVSCRPHPCTKECQTDPSLERRV
LIPVPVPIYVPVPCVMWSLPFPVPVPIPIPIPTPVFIPTTRNSAKGIMKEINKIHDKMPT
DPFEAELLMMAEMVAGDKKKDHSDSDTEDENEEGFSPVAGMDGNNAFGEDVLQMALKMAT
EYEDQPVDLESAMTANTITPSSHPGMPGLEGEGMHQHHMMVLEQQRAVAALRASSVGGVG
VGGVGVGVGVARKRAPAVAPRGRPSKRRREPAPAPPPDPPREPQEKPDANMCLKYTFGVN
AWKQWVMTKNAEIEKSSIRRKPFKSEILQLTADELNYSLCLFVKEVRKPNGSEYAPDTIY
YLVLGIQQYLFENGRIDNIFTDPYYEKFTDCLDEVARKFSVLYNDSQYIVTRVEEEHLWE
SKQLGAHSPHVLLSTLMFFNTKHFNLVTVEEHMQLSFSHIMKHWKRNPNQPGQAKIPGSR
NVLLRFYPPQSALEANSRKKKVYEQQENEENPLRCPVKLYEFYISKWYV
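
The nucleotide and protein sequences above can be checked against each other and against the listed CDS length: 4830 nt is 1610 codons, which is 1609 residues plus one stop codon. The sketch below translates the first and last lines of the nucleotide sequence. It assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and are not part of the record.

```python
# Standard genetic code in the compact NCBI layout, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame coding sequence; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGATGAAAAAGAAATACCTGAGAATCTCGGTGAAAACGACAATGAGACCGGTGATGGG"
last_line = "GAATTTTATATATCGAAATGGTATGTTTGA"

print(translate(first_line))  # start of the protein
print(translate(last_line))   # end of the protein, followed by the stop

# Residue count implied by the listed CDS length (one codon is the stop).
cds_length = 4830
expected_residues = cds_length // 3 - 1
print(expected_residues)
```

The two translated fragments correspond to the opening residues MDEKEIPENLGENDNETGDG and the closing residues EFYISKWYV of the protein sequence, with the final TGA codon read as the stop.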