DPGLEAN20394 in OGS1.0

New model in OGS2.0DPOGS203203 
Genomic Positionscaffold4990:+ 1443-4724
See gene structure
CDS Length1698
Paired RNAseq reads  344
Single RNAseq reads  1031
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011089 (4e-25)
Best Drosophila hit  Sex comb on midleg (3e-115)
Best Human hitpolycomb protein SCMH1 isoform d (6e-79)
Best NR hit (blastp)  PREDICTED: similar to lethal(3)malignant brain tumor [Tribolium castaneum] (2e-166)
Best NR hit (blastx)  PREDICTED: similar to lethal(3)malignant brain tumor [Tribolium castaneum] (1e-148)
GeneOntology terms









  
GO:0005634 nucleus
GO:0003704 specific RNA polymerase II transcription factor activity
GO:0045449 regulation of transcription
GO:0016458 gene silencing
GO:0005515 protein binding
GO:0008270 zinc ion binding
GO:0030713 ovarian follicle cell stalk formation
GO:0030708 germarium-derived female germ-line cyst encapsulation
GO:0035102 PRC1 complex
GO:0007409 axonogenesis
GO:0022008 neurogenesis
InterPro families




  
IPR004092 Mbt repeat
IPR021129 Sterile alpha motif, type 1
IPR010507 Zinc finger, MYM-type
IPR001660 Sterile alpha motif domain
IPR010993 Sterile alpha motif homology
IPR013761 Sterile alpha motif-type
Orthology groupMCL12241

Nucleotide sequence:

ATGTCAAACAATACTGTCGGCTCGTCAGGGCATGCTCCCGGCAAAATACGTGGACCTGGA
AGACCCCCTAAACGGACATGCACTTGGTGTGCTGAGAGTAAAACGCCTTTGAAGTATGTT
TTGCCTACGGAAAATGGGAAAAAAGAATTTTGTTCAGAAACGTGTTTGTCAGAATTCCGA
CAAGCATATAGCAAAGGCGCTTGTCTTCACTGTGACAATGTTATACGCGGCAATGCTCCA
TCCAGTAGCAAAAATTTTTGTTCGACGTATTGTTTAAATAAATATCAAAAGAAAAATGAG
AAGAGAACAACTTCGCCGCAATCAGGGAATGGAGCGAACGGTACCGAACCTCATCAGAAC
AATAATTCGACAGGATCGTTCTATGACATATACCAGTCGTTTGATTGGAATGAATATATG
AAGGAAACTAATAGTGTTGCTGCACCCCAAGAGTGTTTTAAGCAAGCTCCAAATCCTCCA
GTGAATGACTTTAAAGTTAATATGAAACTCGAAGCTTTGGACCCTCGTAATTTGACATCA
ACTTGCATTGCTACAGTTGTTGGTGTATTGGGTCCAAGATTGAGGCTTAGACTTGATGGA
AGTGATAATAAAAATGATTTTTGGAGGCTTGTTGATGCTGGTGATATTCATCCTATAGGT
TATTGTGAGAAAAATGATGGTATGTTGCAACCACCTCTTGGTTTTCGTATGAATGCCAGC
AGTTGGCCTATGTTCTTGCTAAAAACATTAAATGGGGCGGAGATGGCTCCATCAAAGGTC
TTTCAACCTGAACCACCTACTCCTAAATCAAATTTGTTTGTTGTTGGTCAAAAATTAGAA
GCTGTTGATAAAAAAAATCCACAACTTATATGTTGTGCAACTGTTGGTGCCGTGAAAAAT
GATCAGATACATGTTACTTTTGATGGTTGGAGGGGGGCTTTTGATTACTGGTGTAAATAT
GACTCTCGAGACATATTTCCTGTTGGCTGGTGTGCAAGAGCAGGTCACTTATTACAGCCA
CCTGGTCAAAAAAGTGCTACAGCGCCTTCTAGATTTAAATTGCGTCCCAGTGGTATTCCT
AATCCAGCTTTACCAGAAGGGGGATCAACTGGTACAGGCAATGCAAATGGAGCTAACACT
GTAACTCCATTAGCAAATGTTGTTTTACGTATCCGTAATAGTTGTTCTGGAGGGAACGCC
GCCTTACCATCCTCCATCACCGGTGTCGGTGCTTCAGGTGTAGCTGAAAACCTAGTTAAA
GAGTTACTTGTCACATATACAGATCCTCAAAAACTCACAAGGGCAATACTTACTGCATCA
AATAGCTATTCAAATAATACTAATGTACAGGAGTCTCCAACTCCATGCACTAATGGCGAT
TCCGGTTGTAAGCTGGCTCGTGTGTCTCCTGAACGTCCTGAGCCGGCGTCATCGACTACG
TGTGCCCCCCCTCCCCCTGCCCCCGCCGCCGCCCCCGCTGACTGGTCCGTGGAGGATGTC
ATCGGATTTATCGCTGCAGCTGACCAAGCTCTTGCCGCCCATGCTGATCTATTCAGGAAG
CATGAAATAGATGGCAAGGCGCTCTTGTTATTGAATTCTGACATGATGATGAAATACATG
GGTCTGAAACTTGGTCCCGCCTTAAAAATATGCAATTTGGTATCTAAAATAAAAAATCGT
CGACATTATAGCACCTAG

Protein sequence:

MSNNTVGSSGHAPGKIRGPGRPPKRTCTWCAESKTPLKYVLPTENGKKEFCSETCLSEFR
QAYSKGACLHCDNVIRGNAPSSSKNFCSTYCLNKYQKKNEKRTTSPQSGNGANGTEPHQN
NNSTGSFYDIYQSFDWNEYMKETNSVAAPQECFKQAPNPPVNDFKVNMKLEALDPRNLTS
TCIATVVGVLGPRLRLRLDGSDNKNDFWRLVDAGDIHPIGYCEKNDGMLQPPLGFRMNAS
SWPMFLLKTLNGAEMAPSKVFQPEPPTPKSNLFVVGQKLEAVDKKNPQLICCATVGAVKN
DQIHVTFDGWRGAFDYWCKYDSRDIFPVGWCARAGHLLQPPGQKSATAPSRFKLRPSGIP
NPALPEGGSTGTGNANGANTVTPLANVVLRIRNSCSGGNAALPSSITGVGASGVAENLVK
ELLVTYTDPQKLTRAILTASNSYSNNTNVQESPTPCTNGDSGCKLARVSPERPEPASSTT
CAPPPPAPAAAPADWSVEDVIGFIAAADQALAAHADLFRKHEIDGKALLLLNSDMMMKYM
GLKLGPALKICNLVSKIKNRRHYST