DPGLEAN21146 in OGS1.0

New model in OGS2.0DPOGS200421 
Genomic Positionscaffold578:+ 4727-17452
See gene structure
CDS Length2550
Paired RNAseq reads  44
Single RNAseq reads  135
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008996 (0.0)
Best Drosophila hit  hamlet (4e-56)
Best Human hitPREDICTED: PR domain zinc finger protein 16-like (1e-57)
Best NR hit (blastp)  conserved hypothetical protein [Pediculus humanus corporis] (8e-128)
Best NR hit (blastx)  PREDICTED: similar to hamlet CG31753-PA [Apis mellifera] (2e-149)
GeneOntology terms




















  
GO:0016563 transcription activator activity
GO:0030154 cell differentiation
GO:0006350 transcription
GO:0003677 DNA binding
GO:0045449 regulation of transcription
GO:0046872 metal ion binding
GO:0008270 zinc ion binding
GO:0003676 nucleic acid binding
GO:0005622 intracellular
GO:0043457 regulation of cellular respiration
GO:0005515 protein binding
GO:0003713 transcription coactivator activity
GO:0046332 SMAD binding
GO:0017053 transcriptional repressor complex
GO:0050873 brown fat cell differentiation
GO:0005634 nucleus
GO:0022008 neurogenesis
GO:0050872 white fat cell differentiation
GO:0016564 transcription repressor activity
GO:0043565 sequence-specific DNA binding
GO:0045941 positive regulation of transcription
GO:0016481 negative regulation of transcription
InterPro families


  
IPR007087 Zinc finger, C2H2-type
IPR003656 Zinc finger, BED-type predicted
IPR015880 Zinc finger, C2H2-like
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL16515

Nucleotide sequence:

ATGGGGTGGTCAATGATAAATGCCCAAGCTGTTATCCCGCAGACACTGACAGATACTAAT
AGTGCCATCTATCGTGAAGCCGCGCTGTCCGCTGTCAGAACTCGCCAGGTGCCGCAAGAT
GACGCCACCGCTGTTTTCGGTAAAAACGGCGTTCGTTCGTGGCTGGACGCGGCTGCGGAC
AAGTCTAACTGGTTCAAACTCGTCCGCTGCGCAACCTCTCCGCACGAAGTCAATCTGCAA
CACGAAAAGTTTGCAGGACAAGTCTGGTATAAAGTGACTCGTGACGTGTCAGCAGGACAA
GAGCTGTTGGTCGGAGCTTGGACGTCACTGCCGTTACAAGATGTTCTCACAACTGGTAGA
GAGAGTGCCAGCAGTCACTCTCACCAGCAACAGGACGAAGAAGACAGAGAGGATACAAAA
CCACGATGTTCCTTCTGTGACGAACCATTCCCTAATATTGATGCACTTGACCGTCACTTG
ATTCAAGCACATGCCCAGCCAGCTTCGGCATATCATTGCGAGCTGTGCAACAGAGCGTAC
AGTTCCCGGGCACTTCTCCTAAGACATCGGGCGTTAACACATACCGATATCAGGAAATAT
CCCTGCGAGAATTGTCCTAAGGTATTTACCGATCCTTCCAACCTCCAGCGCCACATCCGC
GCGCAGCACGTGGGTGCCCGCAGCCACGCCTGCCCTGAATGCGGCAAGACCTTCGCTACC
AGCTCCGGCCTTAAGCAGCACACACATATTCACTCCAGTGTCAAGCCCTTCCAGTGCAAA
GTCTGCTTCAAGGCATACACTCAATTCTCTAATTTGTGCAGACACAAACGAATGCACGTT
GCGTGTAGAGCATTGGTAGAGTGTGGGAAATGTGGACAATCATTTACGTCGTACGCATCT
CTTACTAAACATAAAAGATTTTGCGATACTGCTTCTGCAACGAACGTAAATCTGAGAGGA
CAAATTGGTCAAGGATTACCGCAGATACCACCTATTCCAAACGTCATGAATAATCCGAAT
AATACAAATCCATTCCCCATGTACAGAGGTCCAGCTCTGCCGTTACCATACAACACTTTC
GCACATTACCCAGCCTTTATCTCCGCTGCTGCCGCCGCAGCTTGCCCTCCAGACTTTTTA
AGTCCCCTCCTCTTCAATGTCCAAGGAGCGAGGTTAGCTATGGAGCATGATTTGGCACTT
AACGCCAGTTTAATGGCCAAGCAACAACAGGAAGAACGCATGTCAGTAAAAAAGGAAACA
GAGAGCATAGATAGTTCAACATCTGTAGATATAATAAATAAAGCGAAAGAAATTACCAGA
GATGAGAAAGATATGGATGTAGACAGAGTAACACCGAAACCTCAGGAACATTATGTAAAA
CAATCACCGCCGTCGGCTGAAGAGGCTACTTCAAAACAACGTCCTTCTCCAGTGATGCCA
CTGTCGACTACTGTTGGACCCTTTGATTTTGCAAGAAATGAAACAAAACACAACTCTATG
TACGATTTTTCTTTGAAAAATAACAACGAAACTTTAGAAAATAAGTCAATGTCTCCTCAA
CCTAAAGATTTGACGAGGAACAACATGTCCAGTGATGTGGAAAAACAATCAAGATATTCT
AATTTAGAAGAAGAAATAAAAGAGCAGAATGACCAACCGCTCGACTTATCCGTCACTCGA
AAACAACGTGACAAGGAGTCAGACCTAGAAAATGATGATCATTCCTTTCGAAATTCATCG
ATTAAATCTTATTCACCTGCTGAAAGTCCTGTTGATAGAGAGAATAAGACTCCGGAAAAT
GAGACAACTGATGTTGACGTGGAAGCAGTTGAACCCAAAAGAGAGGATTCCCCAGTATCA
ATGATGTCTCCTCCCTTAGCATTCCCAATGGCTGTACATGCTCAGCACAATAACAGTCTC
ATGAACGCAATGTACCCACCACGTTTTACACGTTTCCATTCGACTTCTGACTCCATACTA
AGCGCACAGCACTCACCATACGTTCCCAGCCCGTTTAATTTTTTATCGCCACTTCTCGGC
ACTGATGGCCCCGATAGGCAATCAAGTGCCTATGCGAAATTTCGAGAACTTAGCGCTGGT
TCCGGCAAACTGCGAGATCGCTACGCTTGCAAATTTTGCGGAAAAGTATTTCCGCGAAGT
GCCAACTTAACGCGTCATTTACGTACGCACACCGGCGAGCAACCATACAAGTGCAAATAT
TGTGAGCGTTCCTTTTCCATATCCTCTAATTTACAGCGACACGTAAGAAACATTCATAAC
AAAGAGAGACCGTTTAGATGTCAGTTATGCGATAGATGTTTCGGTCAGCAGACTAACCTA
GATCGACACCTTAAGAAACATGAGGCGGAAGGTGGTGATTCACCAAGTTCCGGGGATACT
GAACACGACGCGTGTTTTGATGATATTCGTTCTTTCATGGGGAAGGTGACCTGTTCTCCT
GGAGCAGGATCCCCAGCAGCGACTTCTCCTCACCCATCTCACGCCCCACATCCTTCTCAT
CGACCTTCAGCGCTTTCCATTTCCACCTAG

Protein sequence:

MGWSMINAQAVIPQTLTDTNSAIYREAALSAVRTRQVPQDDATAVFGKNGVRSWLDAAAD
KSNWFKLVRCATSPHEVNLQHEKFAGQVWYKVTRDVSAGQELLVGAWTSLPLQDVLTTGR
ESASSHSHQQQDEEDREDTKPRCSFCDEPFPNIDALDRHLIQAHAQPASAYHCELCNRAY
SSRALLLRHRALTHTDIRKYPCENCPKVFTDPSNLQRHIRAQHVGARSHACPECGKTFAT
SSGLKQHTHIHSSVKPFQCKVCFKAYTQFSNLCRHKRMHVACRALVECGKCGQSFTSYAS
LTKHKRFCDTASATNVNLRGQIGQGLPQIPPIPNVMNNPNNTNPFPMYRGPALPLPYNTF
AHYPAFISAAAAAACPPDFLSPLLFNVQGARLAMEHDLALNASLMAKQQQEERMSVKKET
ESIDSSTSVDIINKAKEITRDEKDMDVDRVTPKPQEHYVKQSPPSAEEATSKQRPSPVMP
LSTTVGPFDFARNETKHNSMYDFSLKNNNETLENKSMSPQPKDLTRNNMSSDVEKQSRYS
NLEEEIKEQNDQPLDLSVTRKQRDKESDLENDDHSFRNSSIKSYSPAESPVDRENKTPEN
ETTDVDVEAVEPKREDSPVSMMSPPLAFPMAVHAQHNNSLMNAMYPPRFTRFHSTSDSIL
SAQHSPYVPSPFNFLSPLLGTDGPDRQSSAYAKFRELSAGSGKLRDRYACKFCGKVFPRS
ANLTRHLRTHTGEQPYKCKYCERSFSISSNLQRHVRNIHNKERPFRCQLCDRCFGQQTNL
DRHLKKHEAEGGDSPSSGDTEHDACFDDIRSFMGKVTCSPGAGSPAATSPHPSHAPHPSH
RPSALSIST