New model in OGS2.0 | DPOGS200421  |
---|---|
Genomic Position | scaffold578:+ 4727-17452 |
See gene structure | |
CDS Length | 2550 |
Paired RNAseq reads   | 44 |
Single RNAseq reads   | 135 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA008996 (0.0) |
Best Drosophila hit   | hamlet (4e-56) |
Best Human hit | PREDICTED: PR domain zinc finger protein 16-like (1e-57) |
Best NR hit (blastp)   | conserved hypothetical protein [Pediculus humanus corporis] (8e-128) |
Best NR hit (blastx)   | PREDICTED: similar to hamlet CG31753-PA [Apis mellifera] (2e-149) |
GeneOntology terms    | GO:0016563 transcription activator activity GO:0030154 cell differentiation GO:0006350 transcription GO:0003677 DNA binding GO:0045449 regulation of transcription GO:0046872 metal ion binding GO:0008270 zinc ion binding GO:0003676 nucleic acid binding GO:0005622 intracellular GO:0043457 regulation of cellular respiration GO:0005515 protein binding GO:0003713 transcription coactivator activity GO:0046332 SMAD binding GO:0017053 transcriptional repressor complex GO:0050873 brown fat cell differentiation GO:0005634 nucleus GO:0022008 neurogenesis GO:0050872 white fat cell differentiation GO:0016564 transcription repressor activity GO:0043565 sequence-specific DNA binding GO:0045941 positive regulation of transcription GO:0016481 negative regulation of transcription |
InterPro families    | IPR007087 Zinc finger, C2H2-type IPR003656 Zinc finger, BED-type predicted IPR015880 Zinc finger, C2H2-like IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding |
Orthology group | MCL16515 |
Nucleotide sequence:
ATGGGGTGGTCAATGATAAATGCCCAAGCTGTTATCCCGCAGACACTGACAGATACTAAT
AGTGCCATCTATCGTGAAGCCGCGCTGTCCGCTGTCAGAACTCGCCAGGTGCCGCAAGAT
GACGCCACCGCTGTTTTCGGTAAAAACGGCGTTCGTTCGTGGCTGGACGCGGCTGCGGAC
AAGTCTAACTGGTTCAAACTCGTCCGCTGCGCAACCTCTCCGCACGAAGTCAATCTGCAA
CACGAAAAGTTTGCAGGACAAGTCTGGTATAAAGTGACTCGTGACGTGTCAGCAGGACAA
GAGCTGTTGGTCGGAGCTTGGACGTCACTGCCGTTACAAGATGTTCTCACAACTGGTAGA
GAGAGTGCCAGCAGTCACTCTCACCAGCAACAGGACGAAGAAGACAGAGAGGATACAAAA
CCACGATGTTCCTTCTGTGACGAACCATTCCCTAATATTGATGCACTTGACCGTCACTTG
ATTCAAGCACATGCCCAGCCAGCTTCGGCATATCATTGCGAGCTGTGCAACAGAGCGTAC
AGTTCCCGGGCACTTCTCCTAAGACATCGGGCGTTAACACATACCGATATCAGGAAATAT
CCCTGCGAGAATTGTCCTAAGGTATTTACCGATCCTTCCAACCTCCAGCGCCACATCCGC
GCGCAGCACGTGGGTGCCCGCAGCCACGCCTGCCCTGAATGCGGCAAGACCTTCGCTACC
AGCTCCGGCCTTAAGCAGCACACACATATTCACTCCAGTGTCAAGCCCTTCCAGTGCAAA
GTCTGCTTCAAGGCATACACTCAATTCTCTAATTTGTGCAGACACAAACGAATGCACGTT
GCGTGTAGAGCATTGGTAGAGTGTGGGAAATGTGGACAATCATTTACGTCGTACGCATCT
CTTACTAAACATAAAAGATTTTGCGATACTGCTTCTGCAACGAACGTAAATCTGAGAGGA
CAAATTGGTCAAGGATTACCGCAGATACCACCTATTCCAAACGTCATGAATAATCCGAAT
AATACAAATCCATTCCCCATGTACAGAGGTCCAGCTCTGCCGTTACCATACAACACTTTC
GCACATTACCCAGCCTTTATCTCCGCTGCTGCCGCCGCAGCTTGCCCTCCAGACTTTTTA
AGTCCCCTCCTCTTCAATGTCCAAGGAGCGAGGTTAGCTATGGAGCATGATTTGGCACTT
AACGCCAGTTTAATGGCCAAGCAACAACAGGAAGAACGCATGTCAGTAAAAAAGGAAACA
GAGAGCATAGATAGTTCAACATCTGTAGATATAATAAATAAAGCGAAAGAAATTACCAGA
GATGAGAAAGATATGGATGTAGACAGAGTAACACCGAAACCTCAGGAACATTATGTAAAA
CAATCACCGCCGTCGGCTGAAGAGGCTACTTCAAAACAACGTCCTTCTCCAGTGATGCCA
CTGTCGACTACTGTTGGACCCTTTGATTTTGCAAGAAATGAAACAAAACACAACTCTATG
TACGATTTTTCTTTGAAAAATAACAACGAAACTTTAGAAAATAAGTCAATGTCTCCTCAA
CCTAAAGATTTGACGAGGAACAACATGTCCAGTGATGTGGAAAAACAATCAAGATATTCT
AATTTAGAAGAAGAAATAAAAGAGCAGAATGACCAACCGCTCGACTTATCCGTCACTCGA
AAACAACGTGACAAGGAGTCAGACCTAGAAAATGATGATCATTCCTTTCGAAATTCATCG
ATTAAATCTTATTCACCTGCTGAAAGTCCTGTTGATAGAGAGAATAAGACTCCGGAAAAT
GAGACAACTGATGTTGACGTGGAAGCAGTTGAACCCAAAAGAGAGGATTCCCCAGTATCA
ATGATGTCTCCTCCCTTAGCATTCCCAATGGCTGTACATGCTCAGCACAATAACAGTCTC
ATGAACGCAATGTACCCACCACGTTTTACACGTTTCCATTCGACTTCTGACTCCATACTA
AGCGCACAGCACTCACCATACGTTCCCAGCCCGTTTAATTTTTTATCGCCACTTCTCGGC
ACTGATGGCCCCGATAGGCAATCAAGTGCCTATGCGAAATTTCGAGAACTTAGCGCTGGT
TCCGGCAAACTGCGAGATCGCTACGCTTGCAAATTTTGCGGAAAAGTATTTCCGCGAAGT
GCCAACTTAACGCGTCATTTACGTACGCACACCGGCGAGCAACCATACAAGTGCAAATAT
TGTGAGCGTTCCTTTTCCATATCCTCTAATTTACAGCGACACGTAAGAAACATTCATAAC
AAAGAGAGACCGTTTAGATGTCAGTTATGCGATAGATGTTTCGGTCAGCAGACTAACCTA
GATCGACACCTTAAGAAACATGAGGCGGAAGGTGGTGATTCACCAAGTTCCGGGGATACT
GAACACGACGCGTGTTTTGATGATATTCGTTCTTTCATGGGGAAGGTGACCTGTTCTCCT
GGAGCAGGATCCCCAGCAGCGACTTCTCCTCACCCATCTCACGCCCCACATCCTTCTCAT
CGACCTTCAGCGCTTTCCATTTCCACCTAG
Protein sequence:
MGWSMINAQAVIPQTLTDTNSAIYREAALSAVRTRQVPQDDATAVFGKNGVRSWLDAAAD
KSNWFKLVRCATSPHEVNLQHEKFAGQVWYKVTRDVSAGQELLVGAWTSLPLQDVLTTGR
ESASSHSHQQQDEEDREDTKPRCSFCDEPFPNIDALDRHLIQAHAQPASAYHCELCNRAY
SSRALLLRHRALTHTDIRKYPCENCPKVFTDPSNLQRHIRAQHVGARSHACPECGKTFAT
SSGLKQHTHIHSSVKPFQCKVCFKAYTQFSNLCRHKRMHVACRALVECGKCGQSFTSYAS
LTKHKRFCDTASATNVNLRGQIGQGLPQIPPIPNVMNNPNNTNPFPMYRGPALPLPYNTF
AHYPAFISAAAAAACPPDFLSPLLFNVQGARLAMEHDLALNASLMAKQQQEERMSVKKET
ESIDSSTSVDIINKAKEITRDEKDMDVDRVTPKPQEHYVKQSPPSAEEATSKQRPSPVMP
LSTTVGPFDFARNETKHNSMYDFSLKNNNETLENKSMSPQPKDLTRNNMSSDVEKQSRYS
NLEEEIKEQNDQPLDLSVTRKQRDKESDLENDDHSFRNSSIKSYSPAESPVDRENKTPEN
ETTDVDVEAVEPKREDSPVSMMSPPLAFPMAVHAQHNNSLMNAMYPPRFTRFHSTSDSIL
SAQHSPYVPSPFNFLSPLLGTDGPDRQSSAYAKFRELSAGSGKLRDRYACKFCGKVFPRS
ANLTRHLRTHTGEQPYKCKYCERSFSISSNLQRHVRNIHNKERPFRCQLCDRCFGQQTNL
DRHLKKHEAEGGDSPSSGDTEHDACFDDIRSFMGKVTCSPGAGSPAATSPHPSHAPHPSH
RPSALSIST