New model in OGS2.0 | DPOGS207920  |
---|---|
Genomic Position | scaffold828:+ 44180-50422 |
CDS Length | 1839 |
Paired RNAseq reads   | 3485 |
Single RNAseq reads   | 8777 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA000069 (6e-166) |
Best Drosophila hit   | Rtf1 (1e-87) |
Best Human hit | RNA polymerase-associated protein RTF1 homolog (5e-74) |
Best NR hit (blastp)   | PREDICTED: similar to Rtf1 CG10955-PA [Tribolium castaneum] (5e-112) |
Best NR hit (blastx)   | PREDICTED: similar to Rtf1 CG10955-PA [Tribolium castaneum] (3e-97) |
GeneOntology terms    | GO:0006352 transcription initiation; GO:0003677 DNA binding; GO:0005634 nucleus; GO:0042800 histone methyltransferase activity (H3-K4 specific); GO:0007219 Notch signaling pathway; GO:0051568 histone H3-K4 methylation; GO:0051571 positive regulation of histone H3-K4 methylation; GO:0045747 positive regulation of Notch signaling pathway |
InterPro families    | IPR004343 Plus-3; IPR018144 Plus-3 domain, subgroup |
Orthology group | MCL13518 |
Nucleotide sequence:
ATGAAGACGAGGTGGGAGATAGAACGCAAGCTGCGACTAGCGAGGCGGTCAGCGGCCGAG
AGAGACGTGTCTCCTACTGAGCTGCAGAGGAGGAGGGAGGCGAGGAGGCGGCGGAGGGAG
AGGCGGGGTAGGAGGGGGGAGAGAGAAGCCGTTGTAGAAGAAAAGAGGAAAGAGGAACGC
GAAGAGGAGAAAGAGAGAGAGAAGCCTCCGCCCAGCCCCGGGGAGGTGACAGACGATCAA
AAAGATACAGAGAGAGATCAGGACCGGTCCGCCTCCCCGCTGTTCGGTGCCAAGACTGAG
AGGAAGAGGAACGTGGACGACAGGAGAGTGAACGCTATGGCGGCGCTCAGGGCCCAGAGA
GACGCGCGACAGAGGAACGTGGAGACCAAACAGAAAAAGAGGGCGCTGGAGAGGAAGGAG
GAGGACGACGAAGCGGATCCGGAAATAATAGGAGGCACCAGCAAACAGAGCGTCAAGCTG
AAGGCGTCCGACATATACTCTGACGACTCGGGCTCGGACTCCGAGGACAAGTCACAGGGA
AAAAGAAGCTCCTCGAGTTCCTCCACATCAGACGCCGAGGAAGAAGAGAAGAAGAGAGAG
AGAGAGGAAGTTGAAGTGAAGTACGCGGACACCAGGGAACAGATAAATAAGCTGAGGCTT
AGTAGGTTCAAGTTAGAGCGTCTCGTACATTTACCTTTCTTCTCGCGCGTCGTGTCCGGG
TGTTTCGTTCGTATCGGCATCGGCAATAACAACGGAAACCCGGTGTACAGGGTCGCCGAA
ATTATAGATGTATACGAGACGGCAAAGGTGTATAACTTAGGAAACACGAGGACTAACAAG
GGCTTCAAGCTGAGACACGGCACGCAGGACAGGGTGTTTAGGCTGGAGTTCGTGAGCAAT
CAGGAGTTCACAGAAAATGAATTCCAGAAATGGCATCGAGCCATCAAGGAAGCCAACAAG
AAGCCTCCCACCATGGACTTCGTTAGGAACAAGATACTGGAGGTTAAGGACGCGCTCATG
TACGAGTTCAAGGAGTTCACAGAAAATGAATTCCAGAAGTGGCATCGAGCCATCAAGGAG
GCCAACAAGAAGCCTCCCACCATGGACTTCGTTAGGAATAAGATACTGGAGGTTAAGGAC
GCGCTCATGTACGAGTTTAAGGAAGAGGATATAGAGAAGATTGTAGCGGAGAAGGAGAGG
TTCAGGTCGCACCCGACCAACTACGCCATGAAGAAAACCCAGCTCATGAAGGAGAGAGAT
GTAGCACAGCTGAGAGGTGACGAGGAATTGGTTCTAGAATTAAACTCCAAGCTTCAGGAG
CTGGAAGAGAGAGCCAGCGCCCTGGACAAGACGAGGACCAGCTCCATACAGAGCATCAGC
TACATCAACAACAGGAACCGGAAACTCAACGTGGAGACGGCCGAGAAGGCCATCATGGAG
GAGGTGAAAGCTATGAAGGGGAAGAAGATGGACGATCCCTTCACCAGGAGACACACCAAG
CCCGTCATGAACTTCAAGTCGCACGGCGGGAGCAGATCGCAGGAACTACTGAAGAACGAG
CAGCAAGCGGCGGAGCAGCAGAAACAGAAGGACGAAGAAGAGAGGATAGAGAAGGAGAAA
GAGGAAGAGATACTGAACCGGCCGGTCGCGCCCCGCCCGCTCCCGCCGGACGGCAGTTTG
TATTCTTTACACGACTTCGACATCAACATAGAAATAGATCTCCCCGCGCCCAAGCCGGTG
ACGTCACACTCCAAACAGATAACCATAAAGGTGAAGGACGCCGGCCCAAAAAGGTCATTG
AACCTGGACGATTACAAGAAGAGACACGGCCTCATATAG
Protein sequence:
MKTRWEIERKLRLARRSAAERDVSPTELQRRREARRRRRERRGRRGEREAVVEEKRKEER
EEEKEREKPPPSPGEVTDDQKDTERDQDRSASPLFGAKTERKRNVDDRRVNAMAALRAQR
DARQRNVETKQKKRALERKEEDDEADPEIIGGTSKQSVKLKASDIYSDDSGSDSEDKSQG
KRSSSSSSTSDAEEEEKKREREEVEVKYADTREQINKLRLSRFKLERLVHLPFFSRVVSG
CFVRIGIGNNNGNPVYRVAEIIDVYETAKVYNLGNTRTNKGFKLRHGTQDRVFRLEFVSN
QEFTENEFQKWHRAIKEANKKPPTMDFVRNKILEVKDALMYEFKEFTENEFQKWHRAIKE
ANKKPPTMDFVRNKILEVKDALMYEFKEEDIEKIVAEKERFRSHPTNYAMKKTQLMKERD
VAQLRGDEELVLELNSKLQELEERASALDKTRTSSIQSISYINNRNRKLNVETAEKAIME
EVKAMKGKKMDDPFTRRHTKPVMNFKSHGGSRSQELLKNEQQAAEQQKQKDEEERIEKEK
EEEILNRPVAPRPLPPDGSLYSLHDFDINIEIDLPAPKPVTSHSKQITIKVKDAGPKRSL
NLDDYKKRHGLI
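
The nucleotide and protein sequences above can be cross-checked against each other. The following is a minimal sketch, assuming the CDS is translated with the standard genetic code (NCBI translation table 1) and that translation stops at the first stop codon; the names `CODON_TABLE` and `translate` are illustrative and not part of the record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), in TCAG order.
# Assumption: this record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove line breaks and spaces so a multi-line sequence can be pasted as is.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        # Raises KeyError on ambiguous bases such as N.
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the nucleotide sequence from this record.
first_row = "ATGAAGACGAGGTGGGAGATAGAACGCAAGCTGCGACTAGCGAGGCGGTCAGCGGCCGAG"
print(translate(first_row))  # MKTRWEIERKLRLARRSAAE
```

Applied to the full nucleotide sequence, the same function gives a quick consistency check: the listed CDS length of 1839 corresponds to 613 codons, that is, 612 residues plus the terminal stop codon.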