New model in OGS2.0 | DPOGS208774  |
---|---|
Genomic Position | scaffold6418:+ 512-4106 |
See gene structure | |
CDS Length | 1632 |
Paired RNAseq reads   | 1104 |
Single RNAseq reads   | 2664 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA007949 (4e-33) |
Best Drosophila hit   | G9a (2e-68) |
Best Human hit | histone-lysine N-methyltransferase, H3 lysine-9 specific 5 isoform 1 (2e-78) |
Best NR hit (blastp)   | histone-lysine N-methyltransferase, H3 lysine-9 specific, putative [Pediculus humanus corporis] (3e-98) |
Best NR hit (blastx)   | PREDICTED: similar to euchromatic histone methyltransferase 1 isoform 2 [Apis mellifera] (1e-86) |
GeneOntology terms    | GO:0005634 nucleus GO:0016740 transferase activity GO:0018024 histone-lysine N-methyltransferase activity GO:0008168 methyltransferase activity GO:0008270 zinc ion binding GO:0016568 chromatin modification |
InterPro families    | IPR020683 Ankyrin repeat-containing domain IPR002110 Ankyrin repeat IPR001214 SET domain IPR007728 Pre-SET domain IPR003606 Pre-SET zinc-binding sub-group |
Orthology group | MCL10502 |
Nucleotide sequence:
ATGATTAAATTAATTATCCGATGTACCACAGCACTACTAGACAAAGAACATTCGGAGGAA
ACGGAATTCGAAGGCAACGAAGCGGAAGTGTCGGTTAAGAAAGAAGATGAAGTTGTTGCA
AATGACGAAGACAAGGTAAAGACGGAGACCGAAGACAAAGAAGACGCCGAGCACGAGCTC
AAGGACGGCCAAGAAGTCCCCGAGGACCCCAGCAGACCGGCCGACGACGAGCTCCTTAGT
GTTATCAAATACTTAATAGCAGCTGGCTGTGACGTCAACAAACAGGGTCCGGAGGGCATG
TCGGGCCTGCACATGTCGTGTCAGTACGGCGGCGCGGCCGTGTGCCTTATGTTACTGGAG
GCAGGCGCTGCGGTCGACGCCAGGGACCACGGCGGGTGGACGCCGCTCGTGCGCGCCGCC
GAGAACAAACACGCCGCCGTCGTCAGGTTACTGCTGGCAGCCGGGGCGGACGCCGCGTCT
TGTGACAACGAAGGCAACCAGCCCATACACTGGTGTACACTGGCGGGCGACTCGCGCTGC
CTCGCCATGATACTGAGGGCCGCGCCGCACGCCACCAACGCTCCTAACGCTCACACTGAC
ACACCGCTTCACATCGCCGCTCGCGAGGGTCACTACTCGAGTGTGGTCGTGCTGCTCGCC
CACGGAGCCAGGACGGATATAGAGAACTCGTCCGGAGAACTTCCGGTGGAGGTGTGCAGC
GGTCCGTGCCACGAGGCCATCTCTATGAACATGCAAATGACACTCGCCGTCAAAGACACT
ATGACACGGGTGAAGGTCATTACGAGTGACCTGTCCAACGGCCGCGAGCCGTACCCCGTG
AGTGTGGTCAACGAGGTGGACGACGCCTCGCCCGCCGCCTTCACGTACGTGTCACAGCAT
GTGCTCACTGAACACCTCACCATAGACAACACCATAGAGACCATGCAGGGCTGCGAGTGT
GCGGGTGGGTCGTGCGACGGCGAGTGCGGCTGCTGCGTGCTGTCCGTGCGGCGTTGGTAC
CGCGCCGGCCGCCTGCCGCCCGCCTTCCCCCACCACGACCCGCCCGTCATGTTCGAGTGT
AACTACACGTGCGGCTGTAACATGAAACGGTGCACAAACCGCGTGGTGGGTCGGATGGAG
AGCGCGGGGTCGCTGAACACCCCGGTGCAGGTGTTCAGGACCAGGACGCGCGGCTGGGGA
CTGAGGGTGCTGACCAGGGTGAGCCGGGGGGAGCTGCTGGCCCTGTACCGGGGGGAACTC
GTCACCAGCGAGCGAGCCGACGCGCGGACCGACGATCAGTACATGTTCGCCTTGGACCTG
AAGCCCGACCTACTGGAGCAATGCAGTGACAAGACGCTGCTGTGTGTGGACGCGTGTCGC
TTCGGTAGCGCGGCTCGGTTCATGAACCACAGCTGCCGGCCGTCCGCGGCGCCCGTGAGG
GTGTTCACCTCGGGCCGCGATCTGCGCCTGCCGCACGTCGCCTTCTTCGCTCTCAGAGAC
CTCGCGCCCGGCGACGAGCTCACTTTCGACTACGGAGACAAATTTTGGTCAGTGAAGTCG
AAATGGATGAAATGCGAGTGCGAGTCGCCCGACTGCAGATACCCGACCAAGATGGAGGAG
GCTGATACATAG
Protein sequence:
MIKLIIRCTTALLDKEHSEETEFEGNEAEVSVKKEDEVVANDEDKVKTETEDKEDAEHEL
KDGQEVPEDPSRPADDELLSVIKYLIAAGCDVNKQGPEGMSGLHMSCQYGGAAVCLMLLE
AGAAVDARDHGGWTPLVRAAENKHAAVVRLLLAAGADAASCDNEGNQPIHWCTLAGDSRC
LAMILRAAPHATNAPNAHTDTPLHIAAREGHYSSVVVLLAHGARTDIENSSGELPVEVCS
GPCHEAISMNMQMTLAVKDTMTRVKVITSDLSNGREPYPVSVVNEVDDASPAAFTYVSQH
VLTEHLTIDNTIETMQGCECAGGSCDGECGCCVLSVRRWYRAGRLPPAFPHHDPPVMFEC
NYTCGCNMKRCTNRVVGRMESAGSLNTPVQVFRTRTRGWGLRVLTRVSRGELLALYRGEL
VTSERADARTDDQYMFALDLKPDLLEQCSDKTLLCVDACRFGSAARFMNHSCRPSAAPVR
VFTSGRDLRLPHVAFFALRDLAPGDELTFDYGDKFWSVKSKWMKCECESPDCRYPTKMEE
ADT