DPGLEAN04058 in OGS1.0

New model in OGS2.0  DPOGS206684
Genomic Position  scaffold3672:+ 1719-16111
CDS Length  1470
Paired RNAseq reads  296
Single RNAseq reads  1008
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA008518 (1e-65)
Best Drosophila hit  CG32732 (2e-55)
Best Human hit  SET domain-containing protein 3 isoform a (2e-74)
Best NR hit (blastp)  PREDICTED: similar to SET domain containing 3 [Apis mellifera] (3e-91)
Best NR hit (blastx)  SET domain-containing protein, putative [Pediculus humanus corporis] (1e-79)
GeneOntology terms
  
GO:0008150 biological_process
GO:0003674 molecular_function
InterPro families
  
IPR015353 Rubisco LS methyltransferase, substrate-binding domain
IPR001214 SET domain
Orthology group  MCL11047

Nucleotide sequence:

ATGGGACGAAAACTTCAGTCAAAGCTTACATGCAAGAAGAAAAATGTTAAAGAAGGGAAT
AGATTTTTGCAGCAGAGACGTAAAGAATTAGCAGTTTTAGTAGATACATTACTTAAATTA
ACCAGCACGTTTCAAAGTACGGGTAAAAGCTTTGAGCACCATTTACAAATCGAAAAAATT
ATCAAAGAAATTATAAATATTGAGTCCATTTCAAATAAAAGTACAAACAGACAGAGGAAA
TTATATATAGAAAACTATGTCAGTTGGTTACATGAACATGGAGCTGAATTTGAAGGAGTG
GAAATAAGTGAATTTGATGGTTACGGGTTCGGTTTAAAGGCGACCAAAGATTTTTCAGAA
GGATCACTTATATTAACTGTACCTGGCAAAGTTATGATGAGTGAGAAAGATCCAAAAGCA
TCCGACTTATCAGAATTTATCAACATAGATCCACTTTTACAGAATATGCCAAATGTTACC
CTAGCGTTGTTTTTGCTTTTAGAAAAGAATAATCCCAACTCTTTCTGGAAGCCATACATT
GATGTACTGCCTGAGAAGTATTCCACAGTACTATACTTTAACTCAGAAGAACTAGCCGAG
CTGAGGCCTTCACCTGTTTTTGAGTCGTCATTAAAATTGTACAGAAGTATTGTAAGACAA
TACGCCTACTTCTACAACAAAATTCACACAATAGACTTGCCAGTTCTCAAAAATCTACAA
GATATATTCACATTTGATAACTACAGATGGGCGGTGTCCACTGTGATGACCCGCCAGAAC
AACATAGTTCAGGGGACTGCCTTCACGTTGACGAACGCTTTCATACCGCTCTGGGACATG
TGCAATCATAAACACGGCAAGATAACGACCGATTTCAATTTGGAGCTGAACCGCGGCGAG
TGTTACGCGTTACAAGACTACAGACGAGACGAACAGATATTCATATTTTACGGAGCGAGA
CCGAACTCGGATCTCTTCCTGCATAATGGTTTTGTGTATCCGGATAATGATTACGATAGT
TTGTCTATCGCGTTGGGTATAAGTCCCAACGACGCTTTGAGGAACGGAAAAGTCAATCTA
TTGAATAAGCTCGGCCTGTCTGGTGTCACAAACTTCTCGCTATACAAAGGCGCGAGTCCC
ATCAGCGTGGAACTGCTCGCCTTTATAAGGATTTTCAATATGAACCAAGAGGAATTAGAG
AAGTGGTCGGCCGAGAGCATCCCTAGTGATTTGCTGTCTTTTGAGACAGGAACCGAATAC
AATATGGCGTCGATTGATAAAAGAGGATTTACATACCTTCTGACCAGGTGCGGCCTCATC
AGGGGTACTTACAAAGACAGTGGGGGTGATGTCCAGTCTGAGCACAGGAAAAACATAAAA
CTATTGAAGCAATGCGAAGTACAAATATTAGAAAATGCCATAAAGTACTTAAGGGACGTC
ATAGACAAGATTTCCGGAGACGAGAAATAA

Protein sequence:

MGRKLQSKLTCKKKNVKEGNRFLQQRRKELAVLVDTLLKLTSTFQSTGKSFEHHLQIEKI
IKEIINIESISNKSTNRQRKLYIENYVSWLHEHGAEFEGVEISEFDGYGFGLKATKDFSE
GSLILTVPGKVMMSEKDPKASDLSEFINIDPLLQNMPNVTLALFLLLEKNNPNSFWKPYI
DVLPEKYSTVLYFNSEELAELRPSPVFESSLKLYRSIVRQYAYFYNKIHTIDLPVLKNLQ
DIFTFDNYRWAVSTVMTRQNNIVQGTAFTLTNAFIPLWDMCNHKHGKITTDFNLELNRGE
CYALQDYRRDEQIFIFYGARPNSDLFLHNGFVYPDNDYDSLSIALGISPNDALRNGKVNL
LNKLGLSGVTNFSLYKGASPISVELLAFIRIFNMNQEELEKWSAESIPSDLLSFETGTEY
NMASIDKRGFTYLLTRCGLIRGTYKDSGGDVQSEHRKNIKLLKQCEVQILENAIKYLRDV
IDKISGDEK
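
Consistency check (illustrative sketch, not part of the original record): the snippet below assumes the standard genetic code (NCBI translation table 1) and uses hypothetical helper names (`translate`, `expected_cds_length`). It translates the first and last lines of the nucleotide sequence above and compares them with the corresponding ends of the protein sequence, then checks that a 489-residue protein plus one stop codon accounts for the stated CDS length of 1470.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
# Assumption: this record uses the standard code; the source does not say so.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame CDS fragment, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def expected_cds_length(protein_length: int) -> int:
    """CDS length in nucleotides for a protein plus one terminal stop codon."""
    return 3 * (protein_length + 1)


# First and last lines of the nucleotide sequence, copied from the record.
first_line = "ATGGGACGAAAACTTCAGTCAAAGCTTACATGCAAGAAGAAAAATGTTAAAGAAGGGAAT"
last_line = "ATAGACAAGATTTCCGGAGACGAGAAATAA"

print(translate(first_line))
print(translate(last_line))
print(expected_cds_length(489))
```

The same `translate` call can be applied to the full 1470-nt sequence once its lines are joined into a single string.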