New model in OGS2.0 | DPOGS207038  |
---|---|
Genomic Position | scaffold1:+ 1333875-1344810 |
See gene structure | |
CDS Length | 4476 |
Paired RNAseq reads   | 1726 |
Single RNAseq reads   | 4243 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012978 (9e-35) |
Best Drosophila hit   | CG40351, isoform H (6e-109) |
Best Human hit | histone-lysine N-methyltransferase SETD1A (4e-88) |
Best NR hit (blastp)   | GM18767 [Drosophila sechellia] (1e-141) |
Best NR hit (blastx)   | PREDICTED: similar to CG40351 CG40351-PC [Tribolium castaneum] (1e-157) |
GeneOntology terms    | GO:0006338 chromatin remodeling GO:0005634 nucleus GO:0018024 histone-lysine N-methyltransferase activity GO:0000166 nucleotide binding GO:0003676 nucleic acid binding |
InterPro families    | IPR000504 RNA recognition motif domain IPR001214 SET domain IPR003616 Post-SET domain IPR015722 Histone-lysine N-methyltransferase IPR012677 Nucleotide-binding, alpha-beta plait |
Orthology group | MCL14407 |
Nucleotide sequence:
ATGAATGGAGGAATGGAGCACAAAACACCAGGCCACAATGCAGTTCTTCACAAGGGACCT
AAAAATTATAAACTTCTAATAGACCCATTCCTAGTAAAGGGAGCGACGAAAGTGTACAGA
TATGATGGTACTGTTCCTGGTACGTCTTACCCGTCGATACAATGCAGAGACCCGAGACCT
CAACCGTCCAGAATATGGAATAAATTAGAACCAGCAGATTTACCGATACCTAGGTTTAGA
ATTGATAAGAATTATGTTGGTGTTCCGCCACAGTTAGAAATAACTATTGTAAATTTGAAT
GATAACATCGATAAAGCTTTCTTGTCCGACATGATGAATAAAGTTGGACCTTATGAAGAA
TTGACAATATTTTATCATCCGATGACTAATAGGCATTTAGGTTTTGCTAGAATTGTATTT
CAGGATGTCAAATATTCCAAAATATGTATCGAAAAATATAATGGAAAATCTGTCATGGGG
CAGGTACTTGAAGTTTTCCATGATTCATTTGGTAAGAAGTGTCAGGAGATGTTTGAAGAT
AAAACGGTGGAGAAAAAGCCCCAGCCGGCTCCGATCAAGCCTCCCGAGGATGCCCGAGTG
GCCAAGCTAGATCCAGCTCTCAGCAAGAGATTAGAGGATAGCAAACTAGTTGATAAGGAC
CCATACCTTCGCAAGGAGCTGGAACACAATGACAGCAACACAAGATGGTCAGATGATGAA
AGGGACAGAGAGTACAAACATCGCCTCCGCAGTAGAAGTGAGAGAGACAGAGATATTGAC
AGAGATAGAGATGGCAACAGAGAGAGAGAAAGACACAGAGACCGCTATGCCCGTACCAGT
GAACAGAGTGAGTCCTATTCCAGCGCCCACGTTGAGATACCGTACGCTCCAACGCCAGTA
CCATACGACCCATACTACCAGACCCCTGGCTATGGATACGGCTACGGCACGAGCGCCGGC
GCTGTTTGGTGGGGGGACTGGAGACAGCCGCACACCTCTCACCATCACTCACATTCGGAG
CAGAGCAGCAGCAGCACGTGGACGGCCGCGGGCGAGCCGACCCCCTCCCCGCGGCACACA
CCCCTCGCGGCCCCTCAAACCCCGTACACGCCCGCGCCCCCCACTCCGCTCCCGGAAAAG
GAGGTGAAGTGTAAGCCCGAGGAGCCTCTTCCTCCGAGTACGTCCGTCGTCAGCGACCCT
GAGCCCAAGCCGCCTCCGCCCAGCGACGAACCCAAGAACGTCGACCTTGACACAAGGATA
GCTTTGTTGCTGAAGGGAGCCAGCGGCGGAGGGGGTCTGGCGCCTCCCTTCCTGTCTCTG
GGAATGACCTCTGAAGAGGAAGACGAAGACAGGAAGCCTAGGAACATACCTGACCTGGAC
ACACACAACCCGCCGTCAGACGATGAAGGTTCTGTAAGTGAGGATAGAGAGAGTATAATA
TCATTGAACCAGAATAGAGAAGTTAACCCGGAACCGTTGTCTAATACTCCTTCACCATAC
CTATCAAGAGAGTTCTATCTTGAATGTCTTAAAGCGACGGTCGAGAGGAAAGCAAAAGAA
GAGGAACGCAAAAAGTTCCCGCCAATAGACAAAATAGGTTCCGATATATCGTCGTCTGAA
GACGAACTGCTCACTGGGGAGGAACCACGACGTTCACCTGTTAACCCGCCCGATAGAGAT
CAAGATAATTTGGACGATGATCAGATGTCTTTATCGTCTCTGTCGTCAACGGAGGCCAAA
ATCGAGGAGCAAGTCCCCGCTGAGGCGTATTACTACCCGCCCGCACACCCGCATTACTAT
CAGGCGATGTGGCCGACAACTGCCTATCCTCCGGGTGCTGTGGGCGCTATGGGGTCGGTT
GCCGCTATGGGGCCCGTGGGAGCGGTGGGCCCAGTGGGCGCGGTGGGCGCTGCATACCCA
GCCGCCGGCGACATGTCATTGTATGCTGGTGGCTTCGCTCCTCCCGTGATACACAGCTAC
CCTCCACCACGCACCGTCACTCACGAGGAGCTCGACAACCCTTACTACCCAACAATCAAT
AGTGTGATAGAGCGCGTCACGACTGAGCTTAAACAAATACTTAAGAAGGATTTCAATAAG
AAAATGATAGAGAGCACTGCCTTTAAGAACTTCGAGGTTTGGTGGGACGAGCAGAGTCGG
AAGACGAGACAGACTGTGAAACAAACTAAAGAAGATGTCGGACAACCATTACAAGATGTA
TCAAATAAGAAGGAGGAATCGGTGGATTCAATAAAATCTATAATGGAGTCTAGAGATCTG
GGTCTAGATCTAGGCGGGTACAGTGTTGGTATTGGTCTTGGTCTCAGGGCGACCATACCA
AAGATGCCCAGTTTCAGGAAGAAGAGAAAAATACCTTCGCCTGTTGTTATGGACGAGGAC
TCCAGTAAGAGACTGAGTGATCAGGAGGAAATCGTCCAGAACTCTGACGAAGAGAAGGAA
GTACCGACCAGTCCTCGGAATAGGACAACAGGTTCATACCTCTCGACTGGCAGAAGAAGA
CAGTCGAGCAGCTCATCGAGGTCTTCGTCGTCGTCGTCTTCGCGATCCTCTTGGTCGGGT
TCGGAGCGCTCTGTGAGGAAGGTCGCCCCAAGAATATACTCCGACACAGACGACTCGGAC
CTCGAAGACGCTGAAGTGCAGCAAATCAAGTTGGTGTCCAACAAGGAGAGACTCAGGCGA
GTGTACTCATCGTCATCGGACAGCGAGGAAGAACAGAGAAGAAGAGAAAAAACTCCGATA
CCGGAAGTGGAAGCATCAGACGACCGCCTCGGTTCACCTATTCTGTCGCCGGAAGAGGAA
CCCAGAGATACAATACTCGATCGTGTATACTCGGACTCTGAGGAAGAGAGGGAATACCAG
GAGCGTCGCCGTCGTAACACTGAGTATATGGAACAGATCGAACGAGAGTTCCTCGAGGAA
CAGCGTCGAGGACAACAGACCTCTGACACGGATGCACAGCAACAAAACGATAGTATAACG
GAACCGAAACAAGAAAAGAGTAGAAGTAGTCCAAGCAAGAACTATCTGAAATCACCTGAG
AAGAATAAAATGGCAGCTGAGGGTGATGTTGAAGAAGGTGAAATAAGTTCAGAGGAAGAA
CCTTTAGAAGTTAGAAGAAAGAAGGAGAAGAAACAGAAAAAGAAGACTGACAAGAGACGA
AGAGTCACATCCGTCAGCGACCACAGCTTTACCGAGTCTGCGGTTAGCGTGAATGGCGTT
AAGGAGGCTAGCGGTGCTGTGTCGGAGACGTCTTCGCCTCAGTCGCAGGCGTCCCAGGCT
TCTCAGGCGTCCCAAGTGGCGTTGGACCACTCGTACTGTCGCCCGCCGCCCACTGAACGA
CCTACCACTACACATCTACAACACGATCACGGTTACACTTGGATGGCTGAACCGGAACCG
GAAGCAGAATCGCCACCAGTCGCCATGGAAGAGAAGAGACGGGAGAAGACGGAGAGACCG
TACAAAAGAAAACATCAGAATAAAAAGTTATCTGAAATTCAGAATAAATTATACGACGGT
CGCGATGATTATAACAATAAGTACTCGTCCGTGACATTCAAGCAGCGCGATATAATGGCG
GAGGTCCAAGTGATGTACGAGTTCCTCACCCGCGGTATAGACAGAGAGGATATAGAGTAC
CTCAGGCGGGCGTACGAGGCTCTGTTGGCGGAGGATGCTCAGGGGTACTGGCTCAATGAC
ACCCACTGGGTCGAACATCCGCCCACTGACCTCACGTACTCACCGCCCAAGAAGAAATCC
AAGCGATACAATAACATCTACGAGGACTTGCAAGGCCACTCGAGCGGTTCAGCACGTACG
GAGGGCTACTACAAGATGGACGCCAAGTTGAAGGCGAAGTACAAGTATCACCACGGAAGA
ACCGCTGCGTTACCCCCGCCTGATGATAAGAAAGCCAGCAAGATGCAGCTGCTGTCGAGA
GAGGCGCGCTCCAACCAGAGGAGACTGCTCACGGCATTCGGAACCGACACTGACTCAGAT
CTCCTCAAGTTCAATCAGCTCAAGTTCAGGAAGAAACAGCTCAAGTTCGCCAAATCTGGT
ATACACGACTGGGGTCTCTTTGCTCAGGAGGCGATAGCGGCGGACGAGATGGTTATCGAA
TACGTCGGTCAAATGGTCCGTCCCATAGTAGCGGATGTCCGCGAGGCTCACTACGAGGCC
ACTGGCATCGGTTCTTCATATCTGTTCCGTATAGACTTGGACACTATTATTGATGCAACC
AAGTGCGGTAACCTGGCGCGTTTCATCAACCACAGCTGCAATCCAAACTGTTACGCAAAG
ATAATAACTATAGAATCACAGAAGAAAATCGTCATATACTCGAAACAGCCCATAGGAGTC
GACGAGGAGATAACCTACGACTACAAGTTCCCTCTCGAAGACGAGAAGATACCTTGCCTG
TGTGGAGCGCCGCAATGCCGTGGCTACCTTAACTAG
Protein sequence:
MNGGMEHKTPGHNAVLHKGPKNYKLLIDPFLVKGATKVYRYDGTVPGTSYPSIQCRDPRP
QPSRIWNKLEPADLPIPRFRIDKNYVGVPPQLEITIVNLNDNIDKAFLSDMMNKVGPYEE
LTIFYHPMTNRHLGFARIVFQDVKYSKICIEKYNGKSVMGQVLEVFHDSFGKKCQEMFED
KTVEKKPQPAPIKPPEDARVAKLDPALSKRLEDSKLVDKDPYLRKELEHNDSNTRWSDDE
RDREYKHRLRSRSERDRDIDRDRDGNRERERHRDRYARTSEQSESYSSAHVEIPYAPTPV
PYDPYYQTPGYGYGYGTSAGAVWWGDWRQPHTSHHHSHSEQSSSSTWTAAGEPTPSPRHT
PLAAPQTPYTPAPPTPLPEKEVKCKPEEPLPPSTSVVSDPEPKPPPPSDEPKNVDLDTRI
ALLLKGASGGGGLAPPFLSLGMTSEEEDEDRKPRNIPDLDTHNPPSDDEGSVSEDRESII
SLNQNREVNPEPLSNTPSPYLSREFYLECLKATVERKAKEEERKKFPPIDKIGSDISSSE
DELLTGEEPRRSPVNPPDRDQDNLDDDQMSLSSLSSTEAKIEEQVPAEAYYYPPAHPHYY
QAMWPTTAYPPGAVGAMGSVAAMGPVGAVGPVGAVGAAYPAAGDMSLYAGGFAPPVIHSY
PPPRTVTHEELDNPYYPTINSVIERVTTELKQILKKDFNKKMIESTAFKNFEVWWDEQSR
KTRQTVKQTKEDVGQPLQDVSNKKEESVDSIKSIMESRDLGLDLGGYSVGIGLGLRATIP
KMPSFRKKRKIPSPVVMDEDSSKRLSDQEEIVQNSDEEKEVPTSPRNRTTGSYLSTGRRR
QSSSSSRSSSSSSSRSSWSGSERSVRKVAPRIYSDTDDSDLEDAEVQQIKLVSNKERLRR
VYSSSSDSEEEQRRREKTPIPEVEASDDRLGSPILSPEEEPRDTILDRVYSDSEEEREYQ
ERRRRNTEYMEQIEREFLEEQRRGQQTSDTDAQQQNDSITEPKQEKSRSSPSKNYLKSPE
KNKMAAEGDVEEGEISSEEEPLEVRRKKEKKQKKKTDKRRRVTSVSDHSFTESAVSVNGV
KEASGAVSETSSPQSQASQASQASQVALDHSYCRPPPTERPTTTHLQHDHGYTWMAEPEP
EAESPPVAMEEKRREKTERPYKRKHQNKKLSEIQNKLYDGRDDYNNKYSSVTFKQRDIMA
EVQVMYEFLTRGIDREDIEYLRRAYEALLAEDAQGYWLNDTHWVEHPPTDLTYSPPKKKS
KRYNNIYEDLQGHSSGSARTEGYYKMDAKLKAKYKYHHGRTAALPPPDDKKASKMQLLSR
EARSNQRRLLTAFGTDTDSDLLKFNQLKFRKKQLKFAKSGIHDWGLFAQEAIAADEMVIE
YVGQMVRPIVADVREAHYEATGIGSSYLFRIDLDTIIDATKCGNLARFINHSCNPNCYAK
IITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDEKIPCLCGAPQCRGYLN