New model in OGS2.0 | DPOGS201750  |
---|---|
Genomic Position | scaffold115:- 28555-47096 |
See gene structure | |
CDS Length | 3876 |
Paired RNAseq reads   | 3722 |
Single RNAseq reads   | 8504 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA002656 (1e-52) |
Best Drosophila hit   | grappa, isoform B (1e-130) |
Best Human hit | histone-lysine N-methyltransferase, H3 lysine-79 specific (5e-124) |
Best NR hit (blastp)   | PREDICTED: similar to histone h3 methyltransferase [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to histone h3 methyltransferase [Tribolium castaneum] (2e-163) |
GeneOntology terms    | GO:0003677 DNA binding GO:0008168 methyltransferase activity GO:0016740 transferase activity GO:0018024 histone-lysine N-methyltransferase activity |
InterPro families   | IPR013110 Histone methylation DOT1 |
Orthology group | MCL14856 |
Nucleotide sequence:
ATGGCTTTGGATTTACGCTTGCACTCTCCTGCAGGCGCGGAACCTGTTGTTTATACATGG
CCCCTCACATCGGGTCATGGCTCGGACAAACACGATGGAGCATTAGAAATAGTAGAGACA
ATAAGGTCGGATATAAAAGAACTAGTTTGGCAAGTTGACCCGTTCGCGTTCGGTCCCTGC
GCTCGTGCGGTCGGGGTGGCGAGGGCGGCCGCGGCCTCCTCTCGACCGGACGCGTTCACT
CTTCCTTTATGGGTCTGCGACGACCTGCCGGAGATGAAAGGTGCATTGGAGAAAAACATC
TTAAGCGATTACGACACCCACTCATACGAAAGTATGAGGGCTTTATGCGATCGCTTCAAC
CGCGCCATAGACTCTGTTGTAGCTCTAGAAAAAGGAACTTCACTGCCAGCTCAGCGTTTA
AACAAATATCCCTCACGTGGTCTTCTAAAACATATTTTACAACAAACCTACAATCAAGCC
GTCTCAGATCCTGACAAACTTAACCAATATGAACCGTTCTCCCCTGAGGTATACGGAGAA
ACGTCGTACGAACTAGTCTGTCAGATGATAGATCAGATAAATATATCAGCTGAGGATGTG
TTCGTTGACTTGGGATCGGGCGTGGGGCAGGTGGTGCTGCAGATGGCCGCCGCTACACCG
TGCCGCATCTGTTTCGGTGTCGAGAAAGCTGAAGTGCCTAGCAAATATGCTGAGAGTATG
GATTTACATTTTAGAATGTGGATGAGATGGTATGGGAAAAAGTATGGAGAATATAAATTA
ATAAAAGGCGATTTCCTGATGGATGAGCATAGAGAGAAAATTAATTCCGCCACCATTGTG
TTCGTCAATAATTTTGCATTCGGTCCCCATGTGGACCATCAATTGAAAGAGAGGTTTGCC
GACCTCAAGGATGGAGCTAAGATTGTCTCGTCTAAGAGCTTCTGCCCTCTCAACTTTAGG
ATAACGGATAGAAATCTGAGCGACATTGGGACAATCATGCACGTTAGCGAAATGTCGCCG
CTCAAAGGCTCCGTGTCTTGGACCGGCAAACCAGTATCATACTATTTACATATAATAGAT
AGGACCAAATTGGAGAGATACTTTCAAAGACTGAAAAATCCGAAACTCAAGGTCAGCAAA
AGGAACTGCCAGGACGAGGCCGAGGGAAAGAGGAACGTGAGCGCGCCTCCAACAAGACAA
CCCACACCGGACCTGTTGAACGGCAACAGCAACCACAGCACCGGCTCGCTACCAGAGAGA
CGAAGGAAGGTCGCCCGTCCGAGACCCATCAGGGGCGAAGCCGGACGGACTCGAGCGCGT
GTGGCGGCGGCGGCCAAGAGGCGCGTCGTGAGACGCTCCAGCGACGAGAGTGAGGAAGAG
TCCAGCGGCACCGACGAGCCGCCGGGACCCGAGCCCGCGCCGAGAGAGTGGGGGGCGCCC
TGGGCCTCCTCGCCGCACTCCAATCGGAACCGGCGCACAACTAACAAACGCAGCGCTAGC
GCCGGCGGTGCTCGTCGTCGTGTCGCTCGCGCCAAGCGTCGCCGCGCCGCGCCCGCTGCC
ATCGCCGGCTTGGATCTGCTCCACTCCACCACCCTGGCGTCCACGCTGCACGCGGGCGCG
GTGACGGCGCCGCCCCCGGGCTGCGTGGAGCAGCGTCTGTCGGCGCTGGGCGTGCAGCTG
CAGCCGGCAGACCGTCTGCACTCGGAGCTGGACATACCCCGTGCTCCTCACGCGCCCTAC
TCCCTGCAACTGCTGTTGGACATGTTCCGCGATCAGTACCTCGCCTTCATCCGCCGCATG
AACACACCGGACTACGCACTCGAGATTCGCGCGCAAATCGACAAAGAGAAGGAAAGAAAT
CAAAAACTGAAGTCTCGCGCATCTCAGTTGGATAAACAGATCAACGTATTAATTAGCGAC
AGCGTGGCTTTGTTGAAGGCACGGATGAGCGAGTTGGGGATACACGCGAATAATACGGTC
GACCTGCTGGCCAAGGCGAAAGAGATTGTGGGGCGACACAAAGAGTTACAGAGCAAAGCA
AGCAAATTACAGGCTCAGGTGAATAATATAGAGGCTGAACAAGCGATGCTTGTGAAACAA
CGAACTTTTGAAATTACAGAGAAGTATAGGCAGCTGGGACATATACCCCCAGATGTCGAG
ATCACGCAGAGTATAGCTCACGATTGTATATTGAAGGAGATCTCGGCGACGTTGGCGCAC
AGGAAACGTCTCCATGCACAAGTTGGACACTTGCAGAACGAGATCATTCAGATGGAACGA
GCCAGCGAACAACAGAAAGTGGCGGCTCCTACTGTACCCGTGGCGACCGTCAAACCTCAC
ACTGTCAATTCAAAACCAAGAAAGTCGAGGGAACATCGGTCACGTTCGCAGGAATGGCCG
GATGTGCCGGATGTGGGGAAGATCGAAGAACAAAATCCAGAGATCCTGGCCCAAAAGATA
TTAGAAACTGGCAGACAGATTGAGGCTGGGAAGATTGCTAAACCGAATGTTATTGTAAAT
GGGTATATAAGAGATCCTGAAAGACATGTTGATCATAGGCAGGGTCGGGCGGTCGCTCGA
GCCGCGCCGCCGTTGGCGCGACACTCGCCCGTCAAGTCTCAGAAACCGCTCAACGTCGTG
GCCAAGGTACAGGAGTCACCGAAGGTCATCAACTTCGAAGACCGACTCAAGAGCATCATC
ACATCCGTCCTCAACGAGGACCAGGAACAGAGGAAGGCGTCGCGGCTAGAGCCTCGTCAA
GCAGTAGCGAATCAAGCGTACGCTAACGGGTACGTCCGACCCGTAGCGGTGACGTCCAGC
ACGCTCGGCGCGTACGCAGCGAGGCCGGCCGTCTCCGCCGCGCCGCCAGCCGTGTCGCCA
TACGCGCGTGGCCTGCGGGATGCGAGGGACGTGCGGGACGCGAGGGACGCTCGTGACGTA
CGGGACGTTCGTGACACACGGGACGCGAGGGACGTACGGGACATGCGGCGGGAACGCTTC
GGCTTCGATCGTCGGGAGCCGCGGCCGCACCCGCATCACGAAGCACGGCCGCCGGACATG
CGTCATCATCACTCCGCGCAGCCTGATTACACACAGGTGTCTCCGGCGAAATTGGCACTG
CGTCGGCATCTATCACAAGAGAGGCTGGCGGCGCCGGGGGCTCGCACCATCGGCGACCTC
GTCAACGGAGAGATCGAGCGCACGCTAGAGATATCAAATCAGAGCATCATCAACGCCGCC
GTCCACATGAGCGCGCGCTACCACGAACCTGCGGCCGCGCACCAGCCGCTTGAGGGTCTA
GCCGCCTGTTTACAAGCACGCGTTCTAGCGTCGGAGTACTGGCGCGGGCGGAACGGCCCC
GCGCCCGCCGGCGAGCGGCAGGAGGAGGCCGGCCGGCGGCGCTCGCCCGCGCCCGACCCT
CACTCCAACACCTCCACGCCACTCGTCGACGAGCCGCCGGAGCCCCGGCCGGCCGCGCCG
GATGCCGAGGCCGGCGAGGAGGTCGAGGAGAGCAAGTGGCAAGATCGGATAGCGTTTCGC
TTCGATCAGATCATATCGTTCGCTTCCACCGCCATGGACGACAAGCGGCGGCGGTCCGAC
GAGGCGTGTAACACCTCACCCGACTCCGGCATAGGTCACGGCGAAGCGGCCCGCGGCGCC
GCAGACGGCGTCACGACCGCGCCGGCGGGAGAGACGGGCGGGGGAGACGCCACGGGCGCG
CCGCCCGCGGCCGAGCCGGCGCCGGTCCGCCGCTCGCCGTCCCCGCCGGCGCCGCACCAC
TTCAAAAAGCGTTTCTTCCGCCGCGAACGATGGGGCGCGTGGTCGTCGGGCACGCCGCCG
CCGGCCGCCGTGGAGTGGGAGCGTCCGCCGATGTGA
Protein sequence:
MALDLRLHSPAGAEPVVYTWPLTSGHGSDKHDGALEIVETIRSDIKELVWQVDPFAFGPC
ARAVGVARAAAASSRPDAFTLPLWVCDDLPEMKGALEKNILSDYDTHSYESMRALCDRFN
RAIDSVVALEKGTSLPAQRLNKYPSRGLLKHILQQTYNQAVSDPDKLNQYEPFSPEVYGE
TSYELVCQMIDQINISAEDVFVDLGSGVGQVVLQMAAATPCRICFGVEKAEVPSKYAESM
DLHFRMWMRWYGKKYGEYKLIKGDFLMDEHREKINSATIVFVNNFAFGPHVDHQLKERFA
DLKDGAKIVSSKSFCPLNFRITDRNLSDIGTIMHVSEMSPLKGSVSWTGKPVSYYLHIID
RTKLERYFQRLKNPKLKVSKRNCQDEAEGKRNVSAPPTRQPTPDLLNGNSNHSTGSLPER
RRKVARPRPIRGEAGRTRARVAAAAKRRVVRRSSDESEEESSGTDEPPGPEPAPREWGAP
WASSPHSNRNRRTTNKRSASAGGARRRVARAKRRRAAPAAIAGLDLLHSTTLASTLHAGA
VTAPPPGCVEQRLSALGVQLQPADRLHSELDIPRAPHAPYSLQLLLDMFRDQYLAFIRRM
NTPDYALEIRAQIDKEKERNQKLKSRASQLDKQINVLISDSVALLKARMSELGIHANNTV
DLLAKAKEIVGRHKELQSKASKLQAQVNNIEAEQAMLVKQRTFEITEKYRQLGHIPPDVE
ITQSIAHDCILKEISATLAHRKRLHAQVGHLQNEIIQMERASEQQKVAAPTVPVATVKPH
TVNSKPRKSREHRSRSQEWPDVPDVGKIEEQNPEILAQKILETGRQIEAGKIAKPNVIVN
GYIRDPERHVDHRQGRAVARAAPPLARHSPVKSQKPLNVVAKVQESPKVINFEDRLKSII
TSVLNEDQEQRKASRLEPRQAVANQAYANGYVRPVAVTSSTLGAYAARPAVSAAPPAVSP
YARGLRDARDVRDARDARDVRDVRDTRDARDVRDMRRERFGFDRREPRPHPHHEARPPDM
RHHHSAQPDYTQVSPAKLALRRHLSQERLAAPGARTIGDLVNGEIERTLEISNQSIINAA
VHMSARYHEPAAAHQPLEGLAACLQARVLASEYWRGRNGPAPAGERQEEAGRRRSPAPDP
HSNTSTPLVDEPPEPRPAAPDAEAGEEVEESKWQDRIAFRFDQIISFASTAMDDKRRRSD
EACNTSPDSGIGHGEAARGAADGVTTAPAGETGGGDATGAPPAAEPAPVRRSPSPPAPHH
FKKRFFRRERWGAWSSGTPPPAAVEWERPPM