DPGLEAN10227 in OGS1.0

New model in OGS2.0DPOGS207920 
Genomic Positionscaffold828:+ 44180-50422
See gene structure
CDS Length1839
Paired RNAseq reads  3485
Single RNAseq reads  8777
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA000069 (6e-166)
Best Drosophila hit  Rtf1 (1e-87)
Best Human hitRNA polymerase-associated protein RTF1 homolog (5e-74)
Best NR hit (blastp)  PREDICTED: similar to Rtf1 CG10955-PA [Tribolium castaneum] (5e-112)
Best NR hit (blastx)  PREDICTED: similar to Rtf1 CG10955-PA [Tribolium castaneum] (3e-97)
GeneOntology terms






  
GO:0006352 transcription initiation
GO:0003677 DNA binding
GO:0005634 nucleus
GO:0042800 histone methyltransferase activity (H3-K4 specific)
GO:0007219 Notch signaling pathway
GO:0051568 histone H3-K4 methylation
GO:0051571 positive regulation of histone H3-K4 methylation
GO:0045747 positive regulation of Notch signaling pathway
InterPro families
  
IPR004343 Plus-3
IPR018144 Plus-3 domain, subgroup
Orthology groupMCL13518

Nucleotide sequence:

ATGAAGACGAGGTGGGAGATAGAACGCAAGCTGCGACTAGCGAGGCGGTCAGCGGCCGAG
AGAGACGTGTCTCCTACTGAGCTGCAGAGGAGGAGGGAGGCGAGGAGGCGGCGGAGGGAG
AGGCGGGGTAGGAGGGGGGAGAGAGAAGCCGTTGTAGAAGAAAAGAGGAAAGAGGAACGC
GAAGAGGAGAAAGAGAGAGAGAAGCCTCCGCCCAGCCCCGGGGAGGTGACAGACGATCAA
AAAGATACAGAGAGAGATCAGGACCGGTCCGCCTCCCCGCTGTTCGGTGCCAAGACTGAG
AGGAAGAGGAACGTGGACGACAGGAGAGTGAACGCTATGGCGGCGCTCAGGGCCCAGAGA
GACGCGCGACAGAGGAACGTGGAGACCAAACAGAAAAAGAGGGCGCTGGAGAGGAAGGAG
GAGGACGACGAAGCGGATCCGGAAATAATAGGAGGCACCAGCAAACAGAGCGTCAAGCTG
AAGGCGTCCGACATATACTCTGACGACTCGGGCTCGGACTCCGAGGACAAGTCACAGGGA
AAAAGAAGCTCCTCGAGTTCCTCCACATCAGACGCCGAGGAAGAAGAGAAGAAGAGAGAG
AGAGAGGAAGTTGAAGTGAAGTACGCGGACACCAGGGAACAGATAAATAAGCTGAGGCTT
AGTAGGTTCAAGTTAGAGCGTCTCGTACATTTACCTTTCTTCTCGCGCGTCGTGTCCGGG
TGTTTCGTTCGTATCGGCATCGGCAATAACAACGGAAACCCGGTGTACAGGGTCGCCGAA
ATTATAGATGTATACGAGACGGCAAAGGTGTATAACTTAGGAAACACGAGGACTAACAAG
GGCTTCAAGCTGAGACACGGCACGCAGGACAGGGTGTTTAGGCTGGAGTTCGTGAGCAAT
CAGGAGTTCACAGAAAATGAATTCCAGAAATGGCATCGAGCCATCAAGGAAGCCAACAAG
AAGCCTCCCACCATGGACTTCGTTAGGAACAAGATACTGGAGGTTAAGGACGCGCTCATG
TACGAGTTCAAGGAGTTCACAGAAAATGAATTCCAGAAGTGGCATCGAGCCATCAAGGAG
GCCAACAAGAAGCCTCCCACCATGGACTTCGTTAGGAATAAGATACTGGAGGTTAAGGAC
GCGCTCATGTACGAGTTTAAGGAAGAGGATATAGAGAAGATTGTAGCGGAGAAGGAGAGG
TTCAGGTCGCACCCGACCAACTACGCCATGAAGAAAACCCAGCTCATGAAGGAGAGAGAT
GTAGCACAGCTGAGAGGTGACGAGGAATTGGTTCTAGAATTAAACTCCAAGCTTCAGGAG
CTGGAAGAGAGAGCCAGCGCCCTGGACAAGACGAGGACCAGCTCCATACAGAGCATCAGC
TACATCAACAACAGGAACCGGAAACTCAACGTGGAGACGGCCGAGAAGGCCATCATGGAG
GAGGTGAAAGCTATGAAGGGGAAGAAGATGGACGATCCCTTCACCAGGAGACACACCAAG
CCCGTCATGAACTTCAAGTCGCACGGCGGGAGCAGATCGCAGGAACTACTGAAGAACGAG
CAGCAAGCGGCGGAGCAGCAGAAACAGAAGGACGAAGAAGAGAGGATAGAGAAGGAGAAA
GAGGAAGAGATACTGAACCGGCCGGTCGCGCCCCGCCCGCTCCCGCCGGACGGCAGTTTG
TATTCTTTACACGACTTCGACATCAACATAGAAATAGATCTCCCCGCGCCCAAGCCGGTG
ACGTCACACTCCAAACAGATAACCATAAAGGTGAAGGACGCCGGCCCAAAAAGGTCATTG
AACCTGGACGATTACAAGAAGAGACACGGCCTCATATAG

Protein sequence:

MKTRWEIERKLRLARRSAAERDVSPTELQRRREARRRRRERRGRRGEREAVVEEKRKEER
EEEKEREKPPPSPGEVTDDQKDTERDQDRSASPLFGAKTERKRNVDDRRVNAMAALRAQR
DARQRNVETKQKKRALERKEEDDEADPEIIGGTSKQSVKLKASDIYSDDSGSDSEDKSQG
KRSSSSSSTSDAEEEEKKREREEVEVKYADTREQINKLRLSRFKLERLVHLPFFSRVVSG
CFVRIGIGNNNGNPVYRVAEIIDVYETAKVYNLGNTRTNKGFKLRHGTQDRVFRLEFVSN
QEFTENEFQKWHRAIKEANKKPPTMDFVRNKILEVKDALMYEFKEFTENEFQKWHRAIKE
ANKKPPTMDFVRNKILEVKDALMYEFKEEDIEKIVAEKERFRSHPTNYAMKKTQLMKERD
VAQLRGDEELVLELNSKLQELEERASALDKTRTSSIQSISYINNRNRKLNVETAEKAIME
EVKAMKGKKMDDPFTRRHTKPVMNFKSHGGSRSQELLKNEQQAAEQQKQKDEEERIEKEK
EEEILNRPVAPRPLPPDGSLYSLHDFDINIEIDLPAPKPVTSHSKQITIKVKDAGPKRSL
NLDDYKKRHGLI