New model in OGS2.0 | DPOGS200457  |
---|---|
Genomic Position | scaffold1378:+ 13817-41183 |
CDS Length (bp) | 7131 |
Paired RNAseq reads   | 1312 |
Single RNAseq reads   | 4039 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA005445 (0.0) |
Best Drosophila hit   | eggless (2e-116) |
Best Human hit | histone-lysine N-methyltransferase SETDB1 isoform 1 (2e-56) |
Best NR hit (blastp)   | PREDICTED: similar to conserved hypothetical protein [Acyrthosiphon pisum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to conserved hypothetical protein [Acyrthosiphon pisum] (0.0) |
GeneOntology terms    | GO:0005634 nucleus; GO:0018024 histone-lysine N-methyltransferase activity; GO:0048477 oogenesis; GO:0051038 negative regulation of transcription, meiotic |
InterPro families    | IPR001739 Methyl-CpG DNA binding; IPR001214 SET domain; IPR007728 Pre-SET domain; IPR003616 Post-SET domain; IPR016177 DNA-binding, integrase-type; IPR003606 Pre-SET zinc-binding sub-group |
Orthology group | MCL12327 |
Nucleotide sequence:
ATGGACAGCAATATAAATAGCGGGTTAAATAAGCCGGTAAGTGAGGAAAAAACTAAGGAT
AGTGAGAAAATAGCTGATTTTGAATTAATTTTACTACGTAGAAGACAGAATGCCGCGAGA
GTTCGTGCCTGTAGGGAAAGGAAAAAAGCTCTTGGCCTCCAGTCATCTTTACCAGTTGAT
ATTAAAACTGAGCCGCCAGATGACATCAGTGGGCTAACAGATTCAATGCCAAGTACAAGC
TGGAGTCCAGCGTCAAATCTTGGATTTAATGCAATTTCAAGTGAGAAGCCCGCAGTTGAT
GATATTGATGCTGAAAAGGATCTGTATCAAAAGCAGCTAAATGCAGAAAGATGTCGTAGG
TATCGTCAAAAATGTAAACTGAAATCTGGCGTCCGTACTAAGCGAACAACAGGGGAACTG
GATACTAGTTCCAGTCGAGACTTAGTTGCTGGGGGAGATGGATTTGAAAGCTCTACCAGT
CAGACACAAGGTTCTTCCAACGACGAATCTACATCATCAGGTGCTAAAAGTACAACCGCC
GCTACAAACAACGCTAATTCCAGCTCTGCTGCGGGGTTGTACTGTCGTCGGTACAGGGAG
AAACTGAATGCACGAAGAAAACGAGTAAAAGAAGATCCCTCATCATTCTATACATTGTAT
GTCAAACATAACGGAGCTCATAACTTATTCGAAAACTTATTCGATAATAATCCATTTGGT
TTTTCGTGTACTGTTTGTGATAGACTATGGTTTGAAAATGAATTGAGAAGTCCTCCTTCG
TCTTGCGGTGAAATTTTAAGACAGATATGCCCAAATGTCCCCTGTCAAGATATTGTAGTT
TGTGCCGCCTGTAAGGTCTCATTAGTTGCTGGAAAAATTCCGAATTTGGCGGTGTACAAT
GGTTTTAAGTATCCGCCTAAGCCGAATCTCCCTCAAATGGATATGGTTTCAGAGCGATTA
ATATCACCAAGATTACCCTTTATGCAAATTAGACGTTTGCGATACGTTGAGGGACAACAT
AGTGTCACTGGTCAAGTTATCAATGTGCCCATTCATGTTGACAACTTGGTCCAAACTCTG
CCTAGAAATATGGTCGACGATTTCTGTATAAATGTTCACGTAAAAAACAAATCACTGCAC
AAATCTAGTTATTTGCAAGGATTATTAAAAAAGCGGGTTATTAGAGACTGGCTTGATTAT
CTTATAGACACCCCGCTTTATAAGCATTATAATATAAAAATCAATCCGTATTTCTTAGAG
GATTTGAACAACGAGAGTGAAATGCCAGACATTGATTTAAAGGATATCGCTGAACCAATT
GTAATTGGTGATAGCCTAGTTGCTGAACAGCATACACTTTTATGGTCCACGGAAAGAGAT
CTACAAATAGCACCTGGAGAGAATAAAAGGTCTTTGAGTTTACTGTTTGATGCTTATGCG
GAAGAATTATCTTTCCCAACTATTTATTACGGTCAATTTCGGAAATTCAAAGACGGTGTT
AATTGTAAAGCACACTCAATTGCAACAAGTGAAATTCGGCGGACCGATCGACGAGGGGCC
ATTCCTCGCCACTTGCTCTTCTTAGTCATGAAGGTTATTTTATTTAGACTGAGTGAAAAC
ATCGGTATTGCTTCTAAATATATCGTGGAAGACACAAAAGTAACGAAAGAGCAAATATTA
TCTTCCGACTATCTTAACGACTGTCGGGAACCAAATTTGTCTTTTCTCAAATTCATACCA
AATTCAGTACAATATTGGCAAAATCGAAAAAAAGATCTCTTTGCAATGATAAGACAACTT
GGGACCCCAACAGTTTTTTTGTCCCTGAGCGCTAATGAAATATCATGGAAATGGCTTTTA
AAAACTTTGCATAAACTGAAACACGGCACGGAGATCTCTGATTTAGAAATTGATCAGATG
CATTATAAAGTCAAGGCAGAATTAATAAATGAAGATGCTGTGACTTGCGCCATTTATTTT
AATAAACTCGTCAATGTTATAATGACAATTCTTCAAAATAAAACCGTTAGCCCATTCGGT
AAACATTATGTACGACATTATTTTAGGAGGATTGAATTTCAACACAGAGGAAATACTCAT
GCACATATCTTGCTGTGGTTAAATCAAGCGCCTAACGATGCTTTTGGAGGTGACATGACT
TCTGCTATTAAACTTATCGATAACTTAATTTCAGTATCAAAGACAGAATGTTCAGGGCAC
ATTGAATTGGTCACACACCATCACACATATTCGTGTTATAAAAATAATCAAAACCAACTA
AAGTGCAGATTCAATGCTCCATATATGCCTAGTAGAACTACCGTTTTGCTTGAACCTATG
GCTAAATCGTCTGATGAGGAAAAACGAATTTATAACGAGTATAAAAAGAGATATCACATC
ATACACCAAAAACTGGAATGTCATGACTATTATAATATTGATGATTTTTATCGTAAAAAT
GGCATAAAGTCAGATGTAGAGTATTATAAAATTCTGTCCGCTGGAATTCTACGACCGATG
GTTTTCGTTAAACGTCACCCTAATGAAAAATGGCACAACTTTTTCAATCCCTTCATATTT
CACCATTTACAATCGAGCATGGATATCCAATACATCACAGACGAGTATTCTTGTGCTGCT
TATATTGCGGAGTGTGTAAACAAATCCGATCGTAGCGTCAGTAATCTTCAACGGGAGCTG
TTGGATCTTTTGGAGAAAAATCCAAATCTAGATTTGGTTGACATGACCAAACATATGAGT
GTTAATATTTTGAATGCAATGGAAATGTCCAGCCAAGAAGCAGCATGGTTCCTTCTTAGG
GAACCATTGTGTAAGTCGACTCTTAAAGTAGAATTCATACCCACAATGTGGCCTCAGGAA
CGTCATCGTTTTAGAAAAACTGAAAAAGAATTAGACCGTCGTCCCGATGAAGACACAAGT
GTTTGGAAAGAAAACTATTTTGAGAATTACGAAAATCGACCAGCAGAGTTGGAAGATGTC
TCACTGATTCAGTTTGTTGCTTGGTATAAGACCAGAACTAGAAAAAAAATGTCAGGACCT
CAAATTGCGAACCAAAACTGGGATTCGGAAGACGAAGAAGACGTTGAAGAAGATATAACA
CCAGAAGAAAATATGAATCAGCAAAGTGAAAAAGTATTTTACCGTCGTAAAACTCCAAGA
GTCGTAAGCTATTTGCGTTACGATATGACTGATCATGAGCTAGACTACAAACGAGAAATG
GTTACCTTGTTTATTCCATTTCGTAACGAAGAAAGGGACATTTTAGCTGATATGAGTTTC
AATCAAATCTATGAAGAAAATGAAGAACTCATTTTAGTTCGCCGAGAAGACTTCGAAGGA
AATTTGGACATTGATAAAATTTTTGCGGCATACAGAATATTATGTCACCCTGATCAGAAT
GGCAGAGACCTAGAAATTTTTCCGCTTATGCATCACGATGCGGATCCATTCAGGGAATTT
TCCAACAATCCTATTCCAGAGCAGGAAGTTGTTATATTAAATGCTGAAGATGAGTCAGAA
GAAGTCATTGATAGTAGAAAATTAGATGTAGCTGAGAGCTATGTTGATTTACGAGACAGT
TCCGAGGAAGACACTCAAGACACGAAGAAAACTGATCATGAAGTAGACAATCTTAATACA
ACGGAAGATATTGATGAAGATGAGCCAGCGAAGAAAGAAGACACGGTATTGCCCATGGTG
AGGTGCGTCAACAAGATGTGCGCTCGCACTTCCTTCGACTTCTACACAGCTGAGCGCAGC
ACAGTGGACTTCTATGATCCAGAGAGAAAGAAAAGAGGTTATGTTTGCAGAACCTGTCTC
AGTCTGGTCGAAGAAAGGAATCAGCTGTTGATCAGCGCCTTTAAATCCCAGACGCCCCTC
CTACAATTGGAGACTCGTCAGCAGGAAGAGGATCTGGTAGAGATATCAGAATCGCAGTCC
GAAGATAATCTAATACCGGAAGTTGATGATGACGTCATAGGCGAGGAGGGGGCTAGGTTT
ATAGAGGAGAAGTTGACTGATGTCCTGAACGAGACCTGGGTCAAGTACAACATGGATGAC
CGGCTCCAGGAGGCACAGGATCAGCTAAAACAACAGCTGGAACAACTGCAAAAGCACAGT
TTGGAGATCGACCAGTTACTGGACGAGTGCCAGATGTCCACAGATAAGCTCCGCACAGAG
ATTTACTCTACGTTCAAGCCAGACATCAGGAAACTTCCGTCGCTACTTATATACGACGTG
CCAGATTGCTCTTACACCTTCGTCGACCTCGCTGAACAGGGAAGTAGACTCTTAAATCTT
CGGAAATCATCTTTATCTGAGTCTCCAACAAAAAAATCCACAACAGATCAGGATTCTGAT
GAATCAGTGGTACATATATCTGTGGAATCCGCTCCCTCCCACCTGCCTCCCGTGGGGGAG
TTGTCTTATCCTCAGTTGGAGGTGGGAATGATAGTCTACGCATCTAAGAATGCTTTGGGG
ACCTGGATGAAGGGCAAGATATTGGAGATAACACCAAAGTCAGAAGATATTGAACGGATC
TACTTGTGGGATCTTCAGACCAGAGTTGCTATTCTAATTTACTCTATCAACATTGGTCAT
CCAGCCAATCAAGCTACTCTCATTGATCATTCAGTCAATCATCCAATCCATTTAAGCAGA
GAACTGCCACACTGTACACGTGTGATAGCCTTGTTCAAGGACATCATGAGGCGCGAGTTT
TTTTACCCGGGTATCGTCGCAGAAATGCCCAACCCAAGGAATAGTTACCGCTACCTGATA
TTCTTCGACGATGGCTACTCTCAATACGCGCCGCACTCTAAGGTCCGTCTGGTGTGCGAG
TGCGCGTCTCACGTGTGGGAGGAAGTACAGCCCAAGTCGCGAGAATTCGTCCGAAAATAT
CTCCTGGCTTACCCTGAGAGACCCATGGTGAGGTTGCACCCTGGACAGAGCTTGAAGACG
GAATGGAAGGACAACTGGTGGTCATCCGTGGTGGTGTCGGTGGACGCGTCGCTGGTGGAA
ATCCAGTTCCTCCAGCTGGAGAGACGAGAGTGGATCTATCGAGGATCCACGAGACTCGCC
CCCCTGTACCTGGAACTGCAGGCCGCGGAGAGACACAGGCCCAGAGCCCTGCCACGGACA
CAGACCACGAGGACGAACATGCCCTACGTGGAGTACACCAGATCTGAAGAACAGACGAGC
AAACAAGCCAAGACTTCGCCACAGCAACAACAGAGCGAGGGATTTCCTCGTCAGCGAGCC
GTTGCCAAGAAGACTACCACGAAGACTCGCCAACCACCCCGTACAGCCGTACAGAGCCTC
GACCACTTTACTAGTAAACTAGTGGGATTTCCTCGTCAGCGCGCCGTTGCCAAGAAGACT
ACCACGAAGACTCGCCAATCATCCCGTACAGCCGTACAGAGCCTCGACCACTTTACTAGT
AAACTAGTGTACTACAGTCCAAAGAAACATGTGAAGCCATACAAGATGGTGCCCCATACT
TGCTCGACTGCGTGCAAGAGGACGGATGTTTTGGAACTCAAAGATTTAAAATCTTACAAT
CCATTAGCCAAGCCACTGTTGAGTGGCTGGGAGAGACAGATAGCCAATTTCAAGGGCAAC
AAGGTTGTATTGTACTTGTCTCCGTGCGGTCGCCGCGTCCGCTCTCCGCGGGAGCTACAT
CGCTATCTGCGAACCGTTGGTAGTCTGGACGGTCAGCTGGAGAAGCTCTTCACACCATCC
ACGCACTGTCTGGCCGAGTTTGTGCTCAACAAATACTGCGTCAGCAAGAAGGACTTATCA
AATGGCAAAGAGAACGTCCCAGTGGCTTGCGTCAATTACTACGACGGATCACTGCCAGAG
TTCTGTTTCTACAACACTGAGCGGACTCCGACCGCTGGGGTTCCACTCAACCTGGACCCG
GAGTTCCTGTGTGGCTGTGACTGCGAGGACGACTGCGAGGACAAGAGCAAGTGCGCCTGC
TGGCAGCTGACTCTGGAGGGCGCTAGGACGATAGGTCTGGAGGGGGAGAACGTCGGTTAC
GTTTATAGAAGACTAATGGAACCGCTCCCGACTGGTATTTACGAGTGCAACTCTAGGTGC
AAGTGTAAAGACACTTGTCTTAACCGCGTCGCTCAATATCCACTTCAGCTAAATTTGCAG
GTGTTCAAGACCCAGAACCGCGGTTGGGGCATTCGCACCCTGAATGACATACCCAAGGGG
AGCTTCCTCTGTACTTACGCAGGGAAACTACTAACAGAGGCCACAGCTACCCTCGACGGT
CTGAACGAGGGTGACGAGTACCTGGCGGAGTTGGACTACATCGAGGTCGTGGAACAGATG
AAGGAGGGTTACGAAGAGGACATACCAGAGAACATCAAGAAGATGGATGAGGCTCAAATA
GCGGAACAACTCTCGATGGCGGGCGAAGAAACACAGTCATCGTCTTCAGGGGAAAGCAGC
CCCAAAAGCGCTGAAAATGACGACCTTAGCCTCGAAGACATTGGTCCGGGGGTCACAGAG
TCCAGCAAAGAACTAAGGGGGAAAGACTCAAAGACAGACGAAGAAATAGAGAGTGCGGTG
CTGAAAGTTACCGAGAGATTAGTGCCCACAGAAGAAGATGAAACAGTTTTCACAGAGGAA
CAGAAATCTGTTGTCATAGAAGTGGAAAGTTCAGTGCCCACGGAGGACGAACTCTCTGAA
ATGCAGGAGGAAATCGATGAAGATTATGATTCTTCGAGTGATGACGGAGAAGATCGAGAA
CCTTCGAATTTCTCAGCCAGTGCTGGGATGGGAGCAAAGAAGTTTAAATCAAAGTATAGG
TCTGTCCGTAGTCTGTTTGGTGAAGATGAAGCCTGCTACATCTTGGACGCCAAGGTTCAA
GGGAATATAGGCAGATATCTCAATCACTCGTGCGTGCCGAACGTGTTCGTCCAGAACGTG
TTCGTGGACACGCACGACCCTCGCTTCCCGTGGGTGGCTTTCTTCGCTCTCACAGCCGTG
CGGGCCGGGGGCGAGCTCACCTGGAACTACAACTACGACGTAGGTTCCGTGCCCGGGAAG
GTCCTCTACTGTTACTGCGGGGCTCCGACGTGTCGCGGCAGGTTACTGTGA
Protein sequence:
MDSNINSGLNKPVSEEKTKDSEKIADFELILLRRRQNAARVRACRERKKALGLQSSLPVD
IKTEPPDDISGLTDSMPSTSWSPASNLGFNAISSEKPAVDDIDAEKDLYQKQLNAERCRR
YRQKCKLKSGVRTKRTTGELDTSSSRDLVAGGDGFESSTSQTQGSSNDESTSSGAKSTTA
ATNNANSSSAAGLYCRRYREKLNARRKRVKEDPSSFYTLYVKHNGAHNLFENLFDNNPFG
FSCTVCDRLWFENELRSPPSSCGEILRQICPNVPCQDIVVCAACKVSLVAGKIPNLAVYN
GFKYPPKPNLPQMDMVSERLISPRLPFMQIRRLRYVEGQHSVTGQVINVPIHVDNLVQTL
PRNMVDDFCINVHVKNKSLHKSSYLQGLLKKRVIRDWLDYLIDTPLYKHYNIKINPYFLE
DLNNESEMPDIDLKDIAEPIVIGDSLVAEQHTLLWSTERDLQIAPGENKRSLSLLFDAYA
EELSFPTIYYGQFRKFKDGVNCKAHSIATSEIRRTDRRGAIPRHLLFLVMKVILFRLSEN
IGIASKYIVEDTKVTKEQILSSDYLNDCREPNLSFLKFIPNSVQYWQNRKKDLFAMIRQL
GTPTVFLSLSANEISWKWLLKTLHKLKHGTEISDLEIDQMHYKVKAELINEDAVTCAIYF
NKLVNVIMTILQNKTVSPFGKHYVRHYFRRIEFQHRGNTHAHILLWLNQAPNDAFGGDMT
SAIKLIDNLISVSKTECSGHIELVTHHHTYSCYKNNQNQLKCRFNAPYMPSRTTVLLEPM
AKSSDEEKRIYNEYKKRYHIIHQKLECHDYYNIDDFYRKNGIKSDVEYYKILSAGILRPM
VFVKRHPNEKWHNFFNPFIFHHLQSSMDIQYITDEYSCAAYIAECVNKSDRSVSNLQREL
LDLLEKNPNLDLVDMTKHMSVNILNAMEMSSQEAAWFLLREPLCKSTLKVEFIPTMWPQE
RHRFRKTEKELDRRPDEDTSVWKENYFENYENRPAELEDVSLIQFVAWYKTRTRKKMSGP
QIANQNWDSEDEEDVEEDITPEENMNQQSEKVFYRRKTPRVVSYLRYDMTDHELDYKREM
VTLFIPFRNEERDILADMSFNQIYEENEELILVRREDFEGNLDIDKIFAAYRILCHPDQN
GRDLEIFPLMHHDADPFREFSNNPIPEQEVVILNAEDESEEVIDSRKLDVAESYVDLRDS
SEEDTQDTKKTDHEVDNLNTTEDIDEDEPAKKEDTVLPMVRCVNKMCARTSFDFYTAERS
TVDFYDPERKKRGYVCRTCLSLVEERNQLLISAFKSQTPLLQLETRQQEEDLVEISESQS
EDNLIPEVDDDVIGEEGARFIEEKLTDVLNETWVKYNMDDRLQEAQDQLKQQLEQLQKHS
LEIDQLLDECQMSTDKLRTEIYSTFKPDIRKLPSLLIYDVPDCSYTFVDLAEQGSRLLNL
RKSSLSESPTKKSTTDQDSDESVVHISVESAPSHLPPVGELSYPQLEVGMIVYASKNALG
TWMKGKILEITPKSEDIERIYLWDLQTRVAILIYSINIGHPANQATLIDHSVNHPIHLSR
ELPHCTRVIALFKDIMRREFFYPGIVAEMPNPRNSYRYLIFFDDGYSQYAPHSKVRLVCE
CASHVWEEVQPKSREFVRKYLLAYPERPMVRLHPGQSLKTEWKDNWWSSVVVSVDASLVE
IQFLQLERREWIYRGSTRLAPLYLELQAAERHRPRALPRTQTTRTNMPYVEYTRSEEQTS
KQAKTSPQQQQSEGFPRQRAVAKKTTTKTRQPPRTAVQSLDHFTSKLVGFPRQRAVAKKT
TTKTRQSSRTAVQSLDHFTSKLVYYSPKKHVKPYKMVPHTCSTACKRTDVLELKDLKSYN
PLAKPLLSGWERQIANFKGNKVVLYLSPCGRRVRSPRELHRYLRTVGSLDGQLEKLFTPS
THCLAEFVLNKYCVSKKDLSNGKENVPVACVNYYDGSLPEFCFYNTERTPTAGVPLNLDP
EFLCGCDCEDDCEDKSKCACWQLTLEGARTIGLEGENVGYVYRRLMEPLPTGIYECNSRC
KCKDTCLNRVAQYPLQLNLQVFKTQNRGWGIRTLNDIPKGSFLCTYAGKLLTEATATLDG
LNEGDEYLAELDYIEVVEQMKEGYEEDIPENIKKMDEAQIAEQLSMAGEETQSSSSGESS
PKSAENDDLSLEDIGPGVTESSKELRGKDSKTDEEIESAVLKVTERLVPTEEDETVFTEE
QKSVVIEVESSVPTEDELSEMQEEIDEDYDSSSDDGEDREPSNFSASAGMGAKKFKSKYR
SVRSLFGEDEACYILDAKVQGNIGRYLNHSCVPNVFVQNVFVDTHDPRFPWVAFFALTAV
RAGGELTWNYNYDVGSVPGKVLYCYCGAPTCRGRLL
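
Consistency check (illustrative sketch, not part of the original record): the CDS length of 7131 should equal three nucleotides per residue plus one stop codon, and the ends of the nucleotide listing should translate to the ends of the protein listing. The Python below assumes the standard genetic code (NCBI translation table 1). The names `translate`, `FIRST_60_NT`, `LAST_51_NT`, and `PROTEIN_LENGTH` are hypothetical helpers introduced here, and the residue count of 2376 was counted from the protein listing above rather than stated in the record.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1).
# Codons are enumerated in T, C, A, G order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the nucleotide listing, copied from the record.
FIRST_60_NT = "ATGGACAGCAATATAAATAGCGGGTTAAATAAGCCGGTAAGTGAGGAAAAAACTAAGGAT"
LAST_51_NT = "GTCCTCTACTGTTACTGCGGGGCTCCGACGTGTCGCGGCAGGTTACTGTGA"

# Matching ends of the protein listing, copied from the record.
PROTEIN_START = "MDSNINSGLNKPVSEEKTKD"
PROTEIN_END = "VLYCYCGAPTCRGRLL"

CDS_LENGTH = 7131      # from the "CDS Length" row
PROTEIN_LENGTH = 2376  # assumption: counted from the protein listing

print(translate(FIRST_60_NT) == PROTEIN_START)
print(translate(LAST_51_NT) == PROTEIN_END)
print(CDS_LENGTH == 3 * (PROTEIN_LENGTH + 1))
```

The same `translate` helper could be applied to the full concatenated nucleotide listing to compare against the complete protein sequence.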