DPGLEAN09901 in OGS1.0

New model in OGS2.0DPOGS207664 
Genomic Positionscaffold975:- 5912-7102
See gene structure
CDS Length1191
Paired RNAseq reads  416
Single RNAseq reads  1055
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA010525 (2e-162)
Best Drosophila hit  CG1620, isoform A (1e-38)
Best Human hitmesoderm induction early response protein 3 (3e-55)
Best NR hit (blastp)  PREDICTED: similar to mesoderm induction early response 1 [Tribolium castaneum] (7e-98)
Best NR hit (blastx)  PREDICTED: similar to mesoderm induction early response 1 [Tribolium castaneum] (3e-100)
GeneOntology terms





  
GO:0045449 regulation of transcription
GO:0005634 nucleus
GO:0006350 transcription
GO:0003677 DNA binding
GO:0005575 cellular_component
GO:0003674 molecular_function
GO:0008150 biological_process
InterPro families


  
IPR000949 ELM2 domain
IPR017884 SANT, eukarya
IPR009057 Homeodomain-like
IPR001005 SANT domain, DNA binding
Orthology groupMCL15451

Nucleotide sequence:

ATGTCAGACTGTGCGCTGGTAACCAGTGTAAGCGAACACGATGCTAGTATGGATGTGGGA
AACGACAAATCCCTCTTCGAGCCGACTATTGATATGATGGTAAATGATTTCGACGACGAG
AGAACATTAGACGAAGAAGAAGCTTTGGCGGCAGGCGAGCAACAAGATCCGAAAGCGGAA
CTTAATAGCTTGCAACGTGAAGGTGATATGCCTTTAGAGGAATTACTTGCATTGTATGGT
TATAACAGAGGTATGGATAAAGCAAGCCCTGAACAACCACCAGAGGTGGTACCGGAAGAA
AATGAGAAAGCTGAGTCTGCTCTACAGCAGTTATACACTGAGACCACAAGCCCTGAAGCC
ACACGGTGTCTCCGCTCTGGCTCAAGGCCTCCTTCTGAAGAAGAAGATGATTATGACTAT
AGTCCCGATGAGGATGACTGGAAAAAAACTATCATGGTAGGTAGTGATTATCAAGCTGGT
ATACCAGAAGGTCTCTGCAGTTATGATGATGCTTTGCCATATGAGAATGAAGATAAATTG
TTGTGGAACCCAAGTGTCCTTGATGAAAAGGTGATAGAAGATTATATGAGAAAAATATGT
GCTATGAATTCCGGCACAGGTATTGATGCTGTGCCTAGAGGAAAGCAGCTGAGAGATGAT
GAAGAAGCATTGTTCCTATTGCAACAATGTGGTCATAATGTTGAGGAAGCTCTCAGGAGG
AGAAGAATATCGGCACAAACCCCTGCCCACGCCAGTGTATGGTCCGAAGAGGAATGCAGA
AACTTTGAAAACGGTATCAAAGTTCACGGCAAGGACTTTCACTTAATACGCCAACAGAAA
GTCAGGACGAGATCTGTTGGGGAGCTAGTACAATTTTATTATATCTGGAAAAAAACTGAA
CGACATGATATATTTGCTAACAAGACGAGACTAGAAAAGAAAAAATACACACTACATCCT
GGGCATACCGATTATATGGACAGATTTTTGGAGGAACAGGAAGCTACAGGGGCTGCTAAT
GTCGTCCGACCTGTCTCTCCGTCCCCTATGATGGTGTATGTACCTTCACCGGCCACCCAG
CCGGATCCCTTGGCTTTGGGAGAGAAAGAGGTTTTCTCTCAATTAAATCCCCATACTACA
CCACCAAGAACCCTCTCCATCGAGGATCAAGAACCAGACGTTGTTTCCTAA

Protein sequence:

MSDCALVTSVSEHDASMDVGNDKSLFEPTIDMMVNDFDDERTLDEEEALAAGEQQDPKAE
LNSLQREGDMPLEELLALYGYNRGMDKASPEQPPEVVPEENEKAESALQQLYTETTSPEA
TRCLRSGSRPPSEEEDDYDYSPDEDDWKKTIMVGSDYQAGIPEGLCSYDDALPYENEDKL
LWNPSVLDEKVIEDYMRKICAMNSGTGIDAVPRGKQLRDDEEALFLLQQCGHNVEEALRR
RRISAQTPAHASVWSEEECRNFENGIKVHGKDFHLIRQQKVRTRSVGELVQFYYIWKKTE
RHDIFANKTRLEKKKYTLHPGHTDYMDRFLEEQEATGAANVVRPVSPSPMMVYVPSPATQ
PDPLALGEKEVFSQLNPHTTPPRTLSIEDQEPDVVS