DPGLEAN08794 in OGS1.0

New model in OGS2.0DPOGS203374 
Genomic Positionscaffold6:+ 231471-237773
See gene structure
CDS Length2331
Paired RNAseq reads  265
Single RNAseq reads  619
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA002109 (7e-101)
Best Drosophila hit  CG1868, isoform B (3e-24)
Best Human hitSET and MYND domain-containing protein 4 (1e-31)
Best NR hit (blastp)  PREDICTED: hypothetical protein [Nasonia vitripennis] (2e-103)
Best NR hit (blastx)  PREDICTED: hypothetical protein [Nasonia vitripennis] (6e-87)
GeneOntology terms

  
GO:0046872 metal ion binding
GO:0005488 binding
GO:0008270 zinc ion binding
InterPro families
  
IPR011990 Tetratricopeptide-like helical
IPR001214 SET domain
Orthology groupMCL12197

Nucleotide sequence:

ATGAACAGTGTCAATTTGCCGGAGGACTGTGCCAAACAATGGGAAATTTTGTTGTTGTTA
TTGTCGTCAGAGGAAAAGAAAATCGATCGTACTCACGAAACCGAGATCGATACTATGAGT
TATTTCTACAACAACAAGGGTGTTAGAAGGATTTTACTCTACTGGCTTCAGCAGATGAAG
GACACATACGCACGTAAAACATGTGATACAGATACAGCGCTGAAATGCGACGGAGTGTCG
CTGTGCTGGCGACAGAAGGGGAACGAGAAATTCAGGGCTAATTTGGTGGAAGAGAGCTAT
AAGTGTTACACTAATAGCGTATTGTACGCTAAACCTAATAGTCTTATGTACACACTGGCT
CTAGCGAACCGGTCGGCTGCTTTATTGAGGTTAAAGAGATTTCAGGAATGTTTGTCGGAT
GTGTCTCTGGCTATAGACAGCGGTTATCCCGCGGAACAGAGACACAAACTTTTACTGCGT
AGAGCAGATTGTTACATTGAATTGCAAAGTAAGGAAGCAAGGACCGCTCTCAACTCGGCT
ATACAGTATGCAGGGACACTCAACATGACAGTGGCGAACAAATTGGAATTCGAGCGTCAC
ATAAAGATTTTAGAGAAGAAACTAGAGTGTGTCAAACGGGATGTTGTTAAAGAGGAAGAG
GTGCTGCTACCTGATTGTTACCTGGGACCGAATCCTGACTTTGTATCGGCCTCCAAATCT
ATTGAACTAAGATATAAGGAGTCACGCGGTCGTCACGTGGTCGCGGTAGAGCCGGCAGGT
CGTGGGGACGTGCTCTTCTCAGAGGAGCCCTACGCATGGGTCGCTCTGCCCTCAGACGAC
GCCATTTGTGAGATGTGCTGCGACACAGACATCAATCCAGTGCCCGTGTACTGTGGGTGT
GAGTGTGCGTCTCGTGCCATATCCTTCCACCGCTGGGAGTGTGTGGGTGCACAGTGCTCA
CTCTTCCCCACTATAGGCATCGCGCATTTAGCTTTAAGGGTGCTACTAATAAGTACGAAC
AACGGATTCCCCCCGTCGCCGGTGTCGTTGCCGCAGGCGTGTACCGCCGGCGAGTTGTTC
AGGAGCTACGGGCTGGTAGACAACATCCAGATATACAAGACGGGCACGGATCCCTTCTAC
AGGATGTTCAATCTGGTGACCAACTTCAACAAGATGGACAACACAGATTACATACAATAC
GCCTTAACGGCCACGATGTTGACGTTGTATTTGGAGAATTTCACGAGTTTTTTTGATTAT
CTACCAAGCAAAATGCCGTGTAGTATGTCTGAGAGTCAATTGAAGTTGTTCGCTGCGGCC
GTCATACTGAGGAGTATGGGCCAGTTGGTTTGTAACGGACACGCAACCTTAAGCCTGGCA
GTCGTGGAAGAGGACGATGGCAGAAACGGCAAAACGATAACGGAAAAGGAAGTCCGTAGA
GCCACCGCCATCTATCCCTCGGCCGCCATGATGAATCACTCCTGCGATCCCAACATAATA
AACACTTTCTACAAGAGTCGTCTTATAGTCCGATGCCAGCGCGAGTTACCGGCAGGAGGC
GAAGTGTTCAACTGTTATGGTCCACACCGAGCCCGCGCGCCCGCCGCAGCACGACGGAAA
GCGCTCAAGGCTCAGTATATGTTCACGTGCCACTGTGCTGACTGCAACGACACGGAGAGG
AAAGACTTCGTGTCGTTGTTCAGCGCGTACCTGTGTCAGTCGTGTAAGGGTCCGGTGTGG
GCGCACTGTGTTCGTCCTCTGTGCACACAGTGTCGGTCAGCACTCCACCTGGAACGTGCA
CACACACTACTGGATCGAGCTGACGACCTCGCAACACAAGCGGAACAGGTCGTCAGTTTG
GAGGAGCGTTGCGAAAAGATGGCGGCCTCGTACCGGCTGAAACAACAGGTGTGGCATCGA
CACCACGCCTCGCTCAGAATGGCGGCGGATAGACTGGCAAGACTGTATGCTGACACAGGT
GATTTCGGTAAAAGTATGGAGCTCATCAAACAGAACATCCAGAGCCTCGAGTATCGCTTT
GGTTCTTTCAGTGTAGAGGTCGCCCATGAACTCCGTAAACTGTCGGATGTTATGTTAGAA
AGGATTTTGAATTCACCGCAGCATCTGGAATACAGAGAATGGTGTCTAGAAGCTCATAAG
GTGGTCAAAAAGGCTATACAATTGATGGAATTGAACTACGGCTCGTGGGAACCTCTTGTG
TCTAGACTGAAGCAACACGAGTGCTACCTCGCGGCGACGCTCGCCGAGAGCAGGACTCCC
GAGGCTGTGGACTGTGTTCATCACAACTTACATTACAATCTTAAAATATAA

Protein sequence:

MNSVNLPEDCAKQWEILLLLLSSEEKKIDRTHETEIDTMSYFYNNKGVRRILLYWLQQMK
DTYARKTCDTDTALKCDGVSLCWRQKGNEKFRANLVEESYKCYTNSVLYAKPNSLMYTLA
LANRSAALLRLKRFQECLSDVSLAIDSGYPAEQRHKLLLRRADCYIELQSKEARTALNSA
IQYAGTLNMTVANKLEFERHIKILEKKLECVKRDVVKEEEVLLPDCYLGPNPDFVSASKS
IELRYKESRGRHVVAVEPAGRGDVLFSEEPYAWVALPSDDAICEMCCDTDINPVPVYCGC
ECASRAISFHRWECVGAQCSLFPTIGIAHLALRVLLISTNNGFPPSPVSLPQACTAGELF
RSYGLVDNIQIYKTGTDPFYRMFNLVTNFNKMDNTDYIQYALTATMLTLYLENFTSFFDY
LPSKMPCSMSESQLKLFAAAVILRSMGQLVCNGHATLSLAVVEEDDGRNGKTITEKEVRR
ATAIYPSAAMMNHSCDPNIINTFYKSRLIVRCQRELPAGGEVFNCYGPHRARAPAAARRK
ALKAQYMFTCHCADCNDTERKDFVSLFSAYLCQSCKGPVWAHCVRPLCTQCRSALHLERA
HTLLDRADDLATQAEQVVSLEERCEKMAASYRLKQQVWHRHHASLRMAADRLARLYADTG
DFGKSMELIKQNIQSLEYRFGSFSVEVAHELRKLSDVMLERILNSPQHLEYREWCLEAHK
VVKKAIQLMELNYGSWEPLVSRLKQHECYLAATLAESRTPEAVDCVHHNLHYNLKI