DPGLEAN14035 in OGS1.0

New model in OGS2.0DPOGS212569 
Genomic Positionscaffold9:+ 88989-97320
See gene structure
CDS Length2205
Paired RNAseq reads  181
Single RNAseq reads  494
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA002076 (0.0)
Best Drosophila hit  CG7759, isoform A (1e-60)
Best Human hitSET and MYND domain-containing protein 4 (4e-11)
Best NR hit (blastp)  set and mynd domain-containing protein, putative [Pediculus humanus corporis] (3e-111)
Best NR hit (blastx)  AGAP002999-PA [Anopheles gambiae str. PEST] (1e-66)
GeneOntology terms



  
GO:0008270 zinc ion binding
GO:0005634 nucleus
GO:0005737 cytoplasm
GO:0016481 negative regulation of transcription
GO:0016564 transcription repressor activity
InterPro families
  
IPR019734 Tetratricopeptide repeat
IPR011990 Tetratricopeptide-like helical
Orthology groupMCL17088

Nucleotide sequence:

ATGGATCAAAACGGCGTCGACTTTTCATACGCTACCATGTGCAGTGATGTGACGCTGTGT
GCGAACAATAAGGGTTTTTTTAAGAACTTCGCAGAAGAAATGGTTTCATTGGCAGAAATT
GATGGATGGTTGGATGACTTTGAACTTATTGAGGACAAAGAAAAAGTCCTAGCAGTAAGG
AATAATCAAAAAATTATGGAACCTATCAATGAGTTATTATCTAGAATTCAACCCTTATTC
CGAGGTAAGGACGCAAGAGTATCGCATGAAAAGAAAACTGCAGCTCTGATAGCCTTAAAA
AATGGAGATCTCGTTAAAAGCCTTTCTCTAGCCAACCAAGCCGTGCTTCGAGCTCCAATG
ACTGGTACCGATGAAATAATAGATAGTGGTATAACACTCGCTTTAAGCCTCTGGGTTCGT
TCGGAAGTTTTGTTAAGTCTAAACCGTCCGAAACCAGCTTTAGAAGACCTTAAGCTAGCT
TTGAAAGAACGGCTTCCAGCTCGTATGAGAGCAGATTATTATTGGAGGATGGGCCACTGT
TACAAGGGTACTGGTGAAACAACACGAGCTAAGGTTTCATATGAATTAGCTAGCCGGTTA
TTGGGGGATAAAAAAGAAGCAAAAATTCAACTAGCTAATGATATCGAATCATTAAAACAC
TCTACACAATCCGAAAGTCCTTCCAAACTGAAAGAACCTCAACTCACAAGCGGTGCAACG
TTAAATTTACCAGCTCTTTCAAAATTATTAAAAATTACTGAAGATAACGAAAAGGGCCGT
TACGCAGTTGCTAATGCTCCAGTAAAAACCGGTGACATAGTTTTAGTTGAAAGTCCCTAC
GCTGCTTGTTTACTCGCTGATTGCCATGGCTCTCACTGCCTTCATTGCTTTGTAAGATTA
GAAGATTTTGAGGACTCGGCTCCAATATGGTGTCCCAATTGCTCAGGAGTAGCATTTTGT
TCGATACAATGTCGAGATGCTGCAATTTCCACATATCATTTATACGAGTGCCCGTTTTTT
AACCTATTTATTGGTTCCGGAATGTCGGTACTTAGCCACATTGCTCTCCGTATGGTAACC
CAAGCCGGACTGGACACAAGTCTTTCAATACATTCGAAGTTTTTAAGCAATGAAGTTAAG
ACTATACAGAGTCCGGTATTAAACGATGTTGAAGGAGAAAAAAAAAAGTTTAAGATAAAA
AGTAGAAAAGAGAGATTGAACAGAACAAGAAAAGGTATGAACATTATCGAAAATAAAACT
TCCGATACACAAGAAATTGAACCACAAATTAAAAATGAGACGAGTTACAATGAAAAGATA
GAAATGGCAGCTGAGCAAATTTATTCACTGCTGGCTCATTCACGACAAAGGAAGGGAGCA
GATTACCTAAAGCGTATAATTATGGGCATGTTTCTAACGGAATGTTTGAAGAAAACCGAT
TTTTTTAAAAATTGTGAAAAAGAAAATATAACAAGAGCTGAAATATCAATTTGCGAATTG
ATAGTTCGTAACTTGCAATTATTACAATTTAATGCCCACGAGATATATGAAACAGTGCGT
GGAGAACATCAATTTAGAGGATCTAAACCAGTCTACATAGGCGTAGGAATTTATCCTACA
GGAGCCTTATTTAATCACGAATGTTATCCCGCAGTGGCACGATATTTCTATGGTAAAAAA
ATGTCATACCGCGCGATACGACCTCTTGAACCAGGAGAGATTGCCGCTGAGAACTATGGA
CCGCATTTTTTGATGCGCACGCTTAAGGAACGCCAAAGGATGCTGACGTGTCGATACTGG
TTCAGATGTCAATGTATAGCCTGCGTTGAGGATTGGCCGACTCTCAAAGAAACTGAATCT
AAATCACCAATATACTTGAGGTGTCTCAATAAGAAGTGCCACGGAAAAATTAAAGTTATC
AAAAATCCAACAAACTTGAAGTGCCCGAAATGTTCTATGGCCTTTAATAAGACTTCTTTG
AAAGAATGTTTAAACGAGGTTGACATAGTTCTCTCGCAGTACGAGGCAGGTGCGAAGCTA
ATGGAACAGCAGCGGCCCCAAGATGCTATCGAAATATTCTCAAAAGCCATTGATTGCTTT
TATGACTTTGCAATGCCTCCACATCGAGAAACACATATAGCACAAGAATCGCTAAGGTCG
TGTTATGCTACATTTGGAAACACCCATATTTTAAAAGAAAAATGA

Protein sequence:

MDQNGVDFSYATMCSDVTLCANNKGFFKNFAEEMVSLAEIDGWLDDFELIEDKEKVLAVR
NNQKIMEPINELLSRIQPLFRGKDARVSHEKKTAALIALKNGDLVKSLSLANQAVLRAPM
TGTDEIIDSGITLALSLWVRSEVLLSLNRPKPALEDLKLALKERLPARMRADYYWRMGHC
YKGTGETTRAKVSYELASRLLGDKKEAKIQLANDIESLKHSTQSESPSKLKEPQLTSGAT
LNLPALSKLLKITEDNEKGRYAVANAPVKTGDIVLVESPYAACLLADCHGSHCLHCFVRL
EDFEDSAPIWCPNCSGVAFCSIQCRDAAISTYHLYECPFFNLFIGSGMSVLSHIALRMVT
QAGLDTSLSIHSKFLSNEVKTIQSPVLNDVEGEKKKFKIKSRKERLNRTRKGMNIIENKT
SDTQEIEPQIKNETSYNEKIEMAAEQIYSLLAHSRQRKGADYLKRIIMGMFLTECLKKTD
FFKNCEKENITRAEISICELIVRNLQLLQFNAHEIYETVRGEHQFRGSKPVYIGVGIYPT
GALFNHECYPAVARYFYGKKMSYRAIRPLEPGEIAAENYGPHFLMRTLKERQRMLTCRYW
FRCQCIACVEDWPTLKETESKSPIYLRCLNKKCHGKIKVIKNPTNLKCPKCSMAFNKTSL
KECLNEVDIVLSQYEAGAKLMEQQRPQDAIEIFSKAIDCFYDFAMPPHRETHIAQESLRS
CYATFGNTHILKEK