DPGLEAN11275 in OGS1.0

New model in OGS2.0  DPOGS213148
Genomic Position  scaffold566:+ 9441-14336
CDS Length  1986
Paired RNAseq reads  177
Single RNAseq reads  481
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA007907 (0.0)
Best Drosophila hit  CG7759, isoform A (1e-127)
Best Human hit  SET and MYND domain-containing protein 4 (1e-23)
Best NR hit (blastp)  hypothetical protein AaeL_AAEL003516 [Aedes aegypti] (1e-155)
Best NR hit (blastx)  hypothetical protein AaeL_AAEL003516 [Aedes aegypti] (4e-150)
GeneOntology terms
GO:0008270 zinc ion binding
GO:0005634 nucleus
GO:0005737 cytoplasm
GO:0016481 negative regulation of transcription
GO:0016564 transcription repressor activity
InterPro families
IPR019734 Tetratricopeptide repeat
IPR001214 SET domain
IPR013026 Tetratricopeptide repeat-containing
IPR011990 Tetratricopeptide-like helical
Orthology group  MCL16398

Nucleotide sequence:

ATGAGTAAAGACAGCGAAGGGTTATTTAAGAATTTTCACGAAAGTATAAACAACTCGCTC
GACGACGTCGTGAGAAATAATTTCGCAAATTTAGAATCGAATGAGAAAAGAATATCGTAT
TTATGCAGCCTTCCGTGCGTGAAAAATTATGATATATCAAAAGAGATTAAAAAATTCGAA
AGTGGTGGTGAGTTTCCGGTAAAAAAAGACCTGGAAAAAGCTCTTCAGTTGAAAGATGAA
GGAAACAAGGCTGTACAGAAAGGGGATTGGGGTCGATCTTTACAATACTACAATGAAAGT
ATTATTCTCATGCCGGAAATAAAAAGCGAAGAGTTGTCGATCGTTTTAGCTAATCGCTCG
GCCGCCTTAAACCATTTGGCACAGTACGAAGACACACTGAGAGATATTCAACGTTGCCTG
GCGCTGGGTTACCCCCGACACTTAAGATATAAAGTTTATGAGAGGAGAGCTCGTTGTCTA
CTAGCTCTTAAAAGAAACCAAGAAGCTGTAACGGCATTTCAAAACACAATCACCGCATTA
GATGAAGCAAAAAATCTTGATAAGGAAAAACGGCTGAAACTGCGAACGGATGCGAAGCTT
ATGTTGGAAGTTCTGAACAAGGGCTTGGTTTTAGCTGGTAATCCAAAAGATCCAGAGCCA
TTTAAAAACACACCGCCAAAACCTAAACTTTCTGGAAAACATAATAAGCTATTCCCAGCA
GCTTCAGAAACTGTTGAAATCGCTTTTGATGAAACTAGAGGCCGATTTGCAAAAGCTGAT
AGAGATATACAAGCTGGTGAAATCTTGCTGATTGAGGAACCGCATGGTGGTGTCTTACTA
TCAGAATTTTCTAAAAGCCACTGTCAAAATTGTTTTAATAAATGTCTCATTCCCTTGCCA
TGTCCGAAATGCCCAAACGTTATTTTTTGCAGCGAAAAATGTTTAGACATTGCATTAAAA
TCTTATCACGGATACGAATGCCATATTCTTCCACTTCTATGGAAGTCTGGCTGTTCTGTA
ACTTGCCACATCGCTTTAAGGATGATAACACAAAACAGTAAAGATTATTTTATGAAAATT
ATGCAGGATTTGAAAGAAAAACCAACTGGCCCGTACAAAACTGAGGACTACCGTAACATT
TATCACCTCGTTTCACACGAAAACAAGAGGACTAAACAAGACATCTTGCATAGGACTGAA
ATGGCAATATTTCTTCTTAAGCTTTTAGAGATTAGCGGTTACTTTAATGATGATGCAGCG
TCATTTGGATGTTTGATACTGAAAAATCTTCAAGTTCTTCAATTTAATGCTCATGAAGTT
TTCGAAATACAATGTCTTAAACCAAAGGATGGTACACGTTTCCTTAAACATGAAGGTAAA
TCGGTGTTTATTGGCGGTGCCGTATATCCGACACTAGCTTTGTTTAATCATTCTTGCGAA
CCTGGTATAGTAAGATACTTCTGTGGTTCGCGAATAGTTGTCTGTGCCGTGAAAAATATA
AGAAAAGGTGAAGAGGTTGCAGAAAACTATGGACCTATATTTACAACAGTGCCCAAGGAT
AAACGACAGTCACAGCTAAAAGAGCAGTACTGGTTCGATTGCAAATGCTTGCCTTGCGAA
CAAAATTGGCCAAAATATGAAGACATGACAGAAAATTACTTGCGATTCAAATGCGATTCA
GACCAGCCATGTTCTAATGTAATACCAGTACCATATGACTGCATGGAATTTATGGTACAG
TGTGGTCTATGCCAACAGTATACAAATATTTTAAAAGGGTTAAAGTCATTACAGGATACA
GAGACGCTTTATAAACTTGGACGAGCGGCGATGGGAGAAGGAAAGTATGGAGAAGCAATT
AAAAAGTTTATCGAAACATTAAAACTCTATGATACAACATTAGCACCACCTTACAAATCT
TATTATGATTGTGTTCAAGATTTAAGAAGTTGTATGCTATCATTGGGAAATTACAGCTTT
GTTTAA

Protein sequence:

MSKDSEGLFKNFHESINNSLDDVVRNNFANLESNEKRISYLCSLPCVKNYDISKEIKKFE
SGGEFPVKKDLEKALQLKDEGNKAVQKGDWGRSLQYYNESIILMPEIKSEELSIVLANRS
AALNHLAQYEDTLRDIQRCLALGYPRHLRYKVYERRARCLLALKRNQEAVTAFQNTITAL
DEAKNLDKEKRLKLRTDAKLMLEVLNKGLVLAGNPKDPEPFKNTPPKPKLSGKHNKLFPA
ASETVEIAFDETRGRFAKADRDIQAGEILLIEEPHGGVLLSEFSKSHCQNCFNKCLIPLP
CPKCPNVIFCSEKCLDIALKSYHGYECHILPLLWKSGCSVTCHIALRMITQNSKDYFMKI
MQDLKEKPTGPYKTEDYRNIYHLVSHENKRTKQDILHRTEMAIFLLKLLEISGYFNDDAA
SFGCLILKNLQVLQFNAHEVFEIQCLKPKDGTRFLKHEGKSVFIGGAVYPTLALFNHSCE
PGIVRYFCGSRIVVCAVKNIRKGEEVAENYGPIFTTVPKDKRQSQLKEQYWFDCKCLPCE
QNWPKYEDMTENYLRFKCDSDQPCSNVIPVPYDCMEFMVQCGLCQQYTNILKGLKSLQDT
ETLYKLGRAAMGEGKYGEAIKKFIETLKLYDTTLAPPYKSYYDCVQDLRSCMLSLGNYSF
V
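
The CDS length reported above (1986) can be cross-checked against the two sequences. The sketch below is a minimal illustration, not part of the database record. It assumes the standard genetic code applies to this nuclear gene and that the CDS includes the terminal stop codon. The helper names `translate` and `lengths_consistent` are hypothetical. Only the first and last rows of the nucleotide sequence are translated here; the protein length of 661 is counted from the listing above (eleven rows of 60 residues plus the final V).

```python
from itertools import product

# Standard genetic code (an assumption for this record), laid out in the
# conventional TCAG order for first, second and third codon positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def lengths_consistent(cds_length: int, protein_length: int) -> bool:
    """True if the CDS length equals 3 * (residues + 1 stop codon)."""
    return cds_length == 3 * (protein_length + 1)


# First and last rows of the nucleotide sequence listed above.
first_row = "ATGAGTAAAGACAGCGAAGGGTTATTTAAGAATTTTCACGAAAGTATAAACAACTCGCTC"
last_row = "GTTTAA"

print(translate(first_row))           # MSKDSEGLFKNFHESINNSL
print(translate(last_row))            # V*
print(lengths_consistent(1986, 661))  # True
```

The translated first row matches the first 20 residues of the protein sequence, and the last row gives the final V followed by a stop codon, which is consistent with 661 residues plus one stop codon accounting for 1986 nucleotides.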