New model in OGS2.0 | DPOGS200462  |
---|---|
Genomic Position | scaffold1810:+ 6798-23019 |
See gene structure | |
CDS Length | 5100 |
Paired RNAseq reads   | 2820 |
Single RNAseq reads   | 6848 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA011407 (0.0) |
Best Drosophila hit   | eggless (2e-127) |
Best Human hit | histone-lysine N-methyltransferase SETDB1 isoform 2 (1e-54) |
Best NR hit (blastp)   | PREDICTED: similar to histone-lysine n-methyltransferase [Nasonia vitripennis] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to CG30426-PA [Apis mellifera] (6e-156) |
GeneOntology terms    | GO:0005634 nucleus GO:0018024 histone-lysine N-methyltransferase activity GO:0048477 oogenesis GO:0051038 negative regulation of transcription, meiotic |
InterPro families    | IPR001739 Methyl-CpG DNA binding IPR003606 Pre-SET zinc-binding sub-group IPR007728 Pre-SET domain IPR001214 SET domain IPR016177 DNA-binding, integrase-type |
Orthology group | MCL12327 |
Nucleotide sequence:
ATGGCATCAAAACAAAACATGGAAGATGAAGAAAATCTCGTTGGAAAAAAAGAGCTTGAT
GATACGACGAATGTTAATGATCAAAAAGACGGAGAGGTGTACGAAAGCGTTGATGATGAT
ATGGAATTAAAGTGGGAAGATGATGATATTGACGACGTATCGATCACAAATGAAGACGCG
CTTCTCGAAGATGTAGCTATGGACAATGATGATAAATTATTACCTTCCGACTCTGTAATA
CCGGTAGCTAGCCAAGAAAGTATCACGTATGAGATCAATCCTAATGAATTTATAAATAAA
GCGAATTTAGATGAAACGTTTGAACCGGCGAAAGCGCATGGCTTAAATAAACCCGACATG
ATGCTGGAGATACTAACTCATAACCTTAGCGATTTAAGTGACGATGAAGATCTTACAAAT
CTTAAGATGTCGCCTGACGTCGATTTGGAACGGTGCAGTCCTAACAAAGATTCACAGATT
GAAAATAAAGCTGACTTCATGGAAACTGATTTGAATGATGATTTTGATAGAATTTCATCA
CTAGTTCATGAAAATGTTGATGATGACTTAGATAAAGGCGCAGATATTTCAATGTGTGAA
GACAGCAAGCCCGCAGAGACATCACTATCTAGGAAACTTAGTGTGAACGATGACTCCATT
GATGAAGATATTCTACTTGCAGATGATGATAAAGATGAACAGGATGAAGGTGGAATGGAA
GAACTATTGGATGATAAGATTGATTTGGATGCTGTTGATATATTAGAGATTAATTCTGAA
GAGAAGTTGGAATTAGAAAGTGAGAAAAATAAGCTATTACAAAATATTCCCGATACAGAT
GGTCTAAAACTTAATAATGAGGAAGACGTGTCAGGAATTAAAGCAGAGTGTGGAGCTGAA
TGTAAAGAAGTAGACGAGATAATAAACGTCCCGAGTCCAAAACCAGAGATATCAGAAGCA
CATGTACCTAAAGTTATAAATGCAGATCAATCCAATGATTCACCAACTTCAGAAAAGGAA
GTTCAGGTCTCAAAAACTGGTATAAAAAGGAAAAAATTATCTTTGAGACTGAGATTGGAT
AAAACACAGAGCACAGGATCTGATGTGATTATGGATGAACACACTTCATTGAAATCTGAT
GGATCAGAAGTTGCATTAAGGGACCCGGGAAATGATCAAATTAATACAAATAATGCTCAC
TTGACCTCCGAAGATGTAGCAAATCAACCAGACATTGCAAACAAAGATTTAATTACCGCA
GCAGACTTGGAATCCGAACATACTAAGAGAAAGAAAGATTCCGAGGACACTAGTTCACAA
CTACCAGATAACGATGTGTCATTAAAACCTAAAAAAGACCTGAAATCTAAGGAGAAGAAA
TTAACCCCTGATATAGAACCGCAGCCATCGACGAGTGGTTCGAAAAACATTAAAATTAAT
ATTGAATCAGCGTCAAAGAGTGACAGTCGACTAACTTCAAACATCTCTAAACTAAGTTCA
CCGGCAGAAGACATTCCTGGAACAACTGATAATCTTGACCTCCTGGCTGAATCGTCGCGC
GTGACACATGACGATGAAGCAGAAGATGAATATATGGATGACGAGGAGGGAGAGGATTTT
GAGCAGTTTGACGAAAGCAGCAATCAGATGGCAGCGGAGCAGTCCGAGGATTCAGAGCAG
CATCACTCGGATAACGCTCACGAGACAACACACAGTAATGAAAAGGAATTCAGCTTTACC
ATCACTGATGTCGTCACTGAAAATGTAGTTAAGGTGGACATTGAGAATCAAGACAGTATT
AAGTCAGAGAATTTGGAATCAGTGCCGATAGGAAATGTGTGTCAGAACATGGACGTGTCC
AAGGTTGGAGGAATAGAAACCGATTTTGGAAATGAGAATAAAAAGTTGACATCGGAAGAA
GAAAAGAATCTGGATTATCAAGACGAGACAAAAGATTGCACCGATGTGAAAAAATCAGAG
GCCCTGAGCTATGTTGAGTTAGAGGAGAGCTCGGAAGAAGATGCTCTAGACGCAAATAAA
ACTGATGAAGTAGGCAATATGAATACAACACAAGATATCAACGAAGATGAGCCAGCCAAG
ACAGAGGACACGGGACTGGATGACAGTAACACACTAGTTGAGGATACTACGAACCAAGAT
ACAAAAGCATTGGATAAAGATGACAAGACGAAATCCCAAGGTTTGGAAGTATTCAATCTA
GACTCGGACGAGGAGGATGTTGGTGAAAAGAATAAAACGGACATTTCCCATCAAGAAACC
CCTGAGAATCCGAAGCCCCAATCCCAGTGGGTGAAGTGCATCAACAAGTCCTGTGCCAAC
ACATCGTCAGACTATTACAAGGCTGACGGCATCACAGTCAACTTCTATGACCCGGAGAGA
AAGAAAAGAGGCTATGTTTGCCAAACCTGTCTCAATTTGGTGGAAGAGAGGAATCAGTTG
TTGATCAGCGGCATCAAGTCCCTGGTGCCGCTGCTGAAGCTGGAGCCCGGCCGGCCGGAA
GAGGATCTGGTCGAGATATCAGACTCGGAGTCCGAAGACGAGGCGGAGCCGGAGGACGAC
GATGACGTCATAGGAGTGGAGGGGGCTAGGGTGATAGAAGAGAAGTTGACTGATGTCCTG
AACGAGACGTGGGTGAAGTACAACTTGGATGACCGGCTGCAGGAGGCACAGGACCAGCTC
AAACAACAGCTGGAACAGCTGCAAAAGGACAGTTTGGAAATCAACCAGCTCCTAGACGAG
TGCCAGCTATCCACAGACAAGCTGCGATCAGAGCTCTACTCTAGCTTCGAGCGCGACATT
AAAGAACTCCCATCGCTTCTAATATTCGACGTGCCTAATTGCTCTTACACCTGCGTCGAT
CCATCCGGAGAGGGAAGCAGACTACTGAAGCGCAGGAAGTCATCTGTATCCGAGTCCCCG
GCAAAGAAATCTGCATTGTCAACAGGCGATCAAGACACAAACACCAAAGACATGACAGAC
GAGAAAACGGAAGAGGATAATCCTGATGTGTCTGTGGTACATCTCTCCGTGGAATCCGCG
CCGCCCGACCTTCCTCCCGCGGGGGAGGTAACCTACCCCCCCTTAAGAGTGGGGATGACG
ATCTACGCGTCCAAAAATGCCCTGGGTTCCTGGATGAAAGCCAAAATTGTAGAGATCACT
CCGAAATCATCACTTCCGAACTGTTTTACGCTGTGTCGCGTCAAGTACGAATACAAACAG
TCTAAGCCAACCAAAATATTACCAGCGAGGTGTATCGCCTACATAGACCCACCAGACGTT
AGAATGACTATAGGTACCCGTGTGATAGCTCTGTTCAAAGACATAACCATGAAGGAGTCC
TTCTACCCGGGGATTGTTGCTGAAATACCGAACCCAGTCAACAATTACCGCTACCTGATA
TTCTTCGACGATGGCTACTCTCAATACGCGCCGCACTCTAAGGTCCGTCTGGTGTGCGAG
TGCGCGTCTCACGTGTGGGAGGAAGTACAGCCCAAGTCGCGGGAATTCGTCCGAAAATAT
CTCCTGGCTTACCCTGAGAGACCCATGGTGAGGTTGCACCCTGGACAGAGCTTGAAGACG
GAATGGAAGGACAACTGGTGGTCATCCGTGGTGGTGTCGGTGGACGCGTCGCTGGTGGAA
GTCCAGTTCCTCCAGCTGGACAGACGAGAGTGGATCTACCGAGGATCCACGAGACTCGCC
CCCCTGTACCTGGAACTGCAGGCCGCGGAGAGACACAGGCCCAGGGCCCTGCCACGGGCA
CAGACCACGAGGACGAACATGCCCTACGTGGAGTACACCAGATCTGAAGAACAGACGAGC
AAACAGGCCGAGACTTCGCCACAGCAACAACAGAGTGAGTACTACACGCCGAAGAAACAG
GTGAAGCCGTACAAGATGGTGCCACACACTTGCTCGCCGGCGTGCAAAAGAACGGATGTT
CTGGCACTTAAGGATTTGAGAACTTATAATCCGTTAGCCAAGCCGCTACTGAGCGGCTGG
GAGAGGCAGATAGTTCTTTTCAAGGGCAACAAGGTTGTGTTGTACGTGTCTCCGTGTGGT
CGCCGCATCCGCTCTCCGCGGGAGCTACATCGCTATCTGCGGACCGTTGGGTCAGACCTG
CCAGTCGACCTCTTCGACTTCACACCATCCACGCACTGTCTGGCCGAGTTTGTGCTCAAC
AAATGCTACGTTGGCAAAAAGGATTTGTCCCATGGCAAAGAGAACGTCCCAGTGCCTTGT
GTCAATTACTACGACGAATCACTGCCAGAGTTCTGTTCCTACAACACTGAGCGGACTCCG
ACCGCTGGGGTTCCACTCAACCTGGACCCGGAGTTCCTGTGTGGCTGTGACTGTGAGGAC
GACTGCGAGGACAAGAGCAAGTGCGCCTGCTGGCAGCTGACTCTGGAGGGCGCCAGGACG
ATAGGTCTGGAGGGGGAGAACGTCGGTTACGTTTACAAAAGACTGCCAGAACCACTGCCT
AGCGGTATATACGAGTGTAATTCGAGGTGTAAATGTAGAGACACGTGCCTTAACCGCGTC
GCTCAACATCCGCTGCAGCTGAAGTTACAAGTGTTCAAGACCCTCAACCGCGGGTGGGGG
ATTCGCGCCCTCAACGACATACCGAAAGGGGCCTTCCTTTGCGTCTACGCTGGAAATTTG
CTCACCGACGCTACAGCAAACCTTGACGGTCTGAACGAGGGTGACGAGTACCTGGCGGAG
TTGGACTACATCGAGGTCGTGGAACAGATGAAGGAGGGTTACGAAGAGGACATACCAGAG
AACATCAAGAAGATGGATGAGGCGGAAATAGCGAAACAGCAGTTGATGCCGGACGACGAG
ATGGAATCCTCGTCATCAGAGGAAGGGAGCAGCACCAAGAACGGCGAGGAAGACGATGAC
TTCAGTCCCGGATACATCGGCCTGGGTGTCACCGAGTTCAACAAACGATTACGGAAAAGG
GATTCGAAGGTAGCTAAAGAAAAGTCTATGGCCAAAGACAAGGATAAAACCGAAGCGAGG
AAGGAGAACGAAGAGGATTGCATCACCATCAGTGATGATGAGGAAGGTGGGGACTCGTGA
Protein sequence:
MASKQNMEDEENLVGKKELDDTTNVNDQKDGEVYESVDDDMELKWEDDDIDDVSITNEDA
LLEDVAMDNDDKLLPSDSVIPVASQESITYEINPNEFINKANLDETFEPAKAHGLNKPDM
MLEILTHNLSDLSDDEDLTNLKMSPDVDLERCSPNKDSQIENKADFMETDLNDDFDRISS
LVHENVDDDLDKGADISMCEDSKPAETSLSRKLSVNDDSIDEDILLADDDKDEQDEGGME
ELLDDKIDLDAVDILEINSEEKLELESEKNKLLQNIPDTDGLKLNNEEDVSGIKAECGAE
CKEVDEIINVPSPKPEISEAHVPKVINADQSNDSPTSEKEVQVSKTGIKRKKLSLRLRLD
KTQSTGSDVIMDEHTSLKSDGSEVALRDPGNDQINTNNAHLTSEDVANQPDIANKDLITA
ADLESEHTKRKKDSEDTSSQLPDNDVSLKPKKDLKSKEKKLTPDIEPQPSTSGSKNIKIN
IESASKSDSRLTSNISKLSSPAEDIPGTTDNLDLLAESSRVTHDDEAEDEYMDDEEGEDF
EQFDESSNQMAAEQSEDSEQHHSDNAHETTHSNEKEFSFTITDVVTENVVKVDIENQDSI
KSENLESVPIGNVCQNMDVSKVGGIETDFGNENKKLTSEEEKNLDYQDETKDCTDVKKSE
ALSYVELEESSEEDALDANKTDEVGNMNTTQDINEDEPAKTEDTGLDDSNTLVEDTTNQD
TKALDKDDKTKSQGLEVFNLDSDEEDVGEKNKTDISHQETPENPKPQSQWVKCINKSCAN
TSSDYYKADGITVNFYDPERKKRGYVCQTCLNLVEERNQLLISGIKSLVPLLKLEPGRPE
EDLVEISDSESEDEAEPEDDDDVIGVEGARVIEEKLTDVLNETWVKYNLDDRLQEAQDQL
KQQLEQLQKDSLEINQLLDECQLSTDKLRSELYSSFERDIKELPSLLIFDVPNCSYTCVD
PSGEGSRLLKRRKSSVSESPAKKSALSTGDQDTNTKDMTDEKTEEDNPDVSVVHLSVESA
PPDLPPAGEVTYPPLRVGMTIYASKNALGSWMKAKIVEITPKSSLPNCFTLCRVKYEYKQ
SKPTKILPARCIAYIDPPDVRMTIGTRVIALFKDITMKESFYPGIVAEIPNPVNNYRYLI
FFDDGYSQYAPHSKVRLVCECASHVWEEVQPKSREFVRKYLLAYPERPMVRLHPGQSLKT
EWKDNWWSSVVVSVDASLVEVQFLQLDRREWIYRGSTRLAPLYLELQAAERHRPRALPRA
QTTRTNMPYVEYTRSEEQTSKQAETSPQQQQSEYYTPKKQVKPYKMVPHTCSPACKRTDV
LALKDLRTYNPLAKPLLSGWERQIVLFKGNKVVLYVSPCGRRIRSPRELHRYLRTVGSDL
PVDLFDFTPSTHCLAEFVLNKCYVGKKDLSHGKENVPVPCVNYYDESLPEFCSYNTERTP
TAGVPLNLDPEFLCGCDCEDDCEDKSKCACWQLTLEGARTIGLEGENVGYVYKRLPEPLP
SGIYECNSRCKCRDTCLNRVAQHPLQLKLQVFKTLNRGWGIRALNDIPKGAFLCVYAGNL
LTDATANLDGLNEGDEYLAELDYIEVVEQMKEGYEEDIPENIKKMDEAEIAKQQLMPDDE
MESSSSEEGSSTKNGEEDDDFSPGYIGLGVTEFNKRLRKRDSKVAKEKSMAKDKDKTEAR
KENEEDCITISDDEEGGDS