DPGLEAN17970 in OGS1.0

New model in OGS2.0  DPOGS214187
Genomic Position  scaffold546:+ 11322-12671
CDS Length  1128
Paired RNAseq reads  108
Single RNAseq reads  298
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA006230 (4e-143)
Best Drosophila hit  escargot (2e-66)
Best Human hit  zinc finger protein SNAI2 (2e-57)
Best NR hit (blastp)  zinc finger protein SLUG, putative [Pediculus humanus corporis] (9e-76)
Best NR hit (blastx)  PREDICTED: similar to escargot CG3758-PA [Apis mellifera] (2e-75)
GeneOntology terms
GO:0007489 maintenance of imaginal histoblast diploidy
GO:0005634 nucleus
GO:0006357 regulation of transcription from RNA polymerase II promoter
GO:0003704 specific RNA polymerase II transcription factor activity
GO:0007417 central nervous system development
GO:0003702 RNA polymerase II transcription factor activity
GO:0003677 DNA binding
GO:0035156 fusion cell fate specification
GO:0007424 open tracheal system development
GO:0007422 peripheral nervous system development
GO:0030718 germ-line stem cell maintenance
GO:0055059 asymmetric neuroblast division
GO:0035147 branch fusion, open tracheal system
GO:0008270 zinc ion binding
GO:0003676 nucleic acid binding
GO:0042048 olfactory behavior
GO:0048076 regulation of compound eye pigmentation
GO:0001708 cell fate specification
InterPro families
IPR015880 Zinc finger, C2H2-like
IPR007087 Zinc finger, C2H2-type
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group  MCL13631

Nucleotide sequence:

ATGCTCGTTGAAGATTCTCAAACGGCTCGTTCGTTGATGTCGAAGAAGTATGCGCACTGT
CCGCTGAAGAAGCGACCGGTGCTCGTTCGGGAGGAGCGACCAGCGACGCCGCCCACTACA
CCGCCTCACCCCGCCCACATGGCCACCAGACTCTATTATGATTATCACTGTGATACAGAA
AATGATGAACCAGAAAACTTAAGCACAAAACCGGAAGATCTCTCCAAGACTGGCAATTAC
CCAAGTAAAGCTGCATCACCGGTGTCCACCGCAGACATTAAATCTGAACCACGAGAGTGG
CCCCAACAGCAACTGGAGTACCTAGCCGCGTGCCGAGCTCGTCTGGAGCCAATGCCCACA
GAACTGGCTCGCCCTACACCTCAATATGCATATCTCCCTACACTGTATCCAGCTTATCCT
ATGGAAGAACTCTATCCGACTGCCCCAGCCCTGTCCCCTCCCGTCCAACCTCAATACTAC
GCCAGATACTCTCCGGCATCACCGCCCTCCTCGTGCTCGCCGCCTCCATGCCCTGAGGAC
CTCCGTTCCCCCGGCTCAGTCTCCTCTGACTCTGGAGTCTCCGTGTCTGGTCCACGCCGC
CCTCGTTACCAGTGCCCAGACTGTGCTAAGTCCTACTCCACCTACTCTGGACTGTCAAAA
CATCAGCAGTACCACTGCGCTGCCGCCGAGGGAAGTCTCGCTAGGAAATCGTTTAGCTGC
AAGTACTGCTCTAAGGTGTACACTTCTCTGGGTGCTCTCAAGATGCACATAAGGACTCAC
ACCCTGCCGTGTAAGTGTCACCTGTGCGGTAAAGCTTTCTCCCGTCCGTGGCTTCTCCAG
GGACACATCCGTACTCACACCGGCGAGAAGCCGTTCTCATGCCACCACTGCCGGCGAGCG
TTCGCTGACCGATCAAACCTCCGAGCTCATCTACAGACCCACTCCGATGTTAAGAAATAC
TCGTGCACGGGATGCGGGAAGACTTTCTCTCGGATGTCACTGCTGAGTAAACATTTGGAG
GGAGGCTGCGGGGCTCCCAGCACGTCTCCATACGAATACCGACCGGAGGCTCACCAAACA
TCACATCCGCACCAGCTCCCGCCTTCTGCGCCTGTTCATGCCTATTAG

Protein sequence:

MLVEDSQTARSLMSKKYAHCPLKKRPVLVREERPATPPTTPPHPAHMATRLYYDYHCDTE
NDEPENLSTKPEDLSKTGNYPSKAASPVSTADIKSEPREWPQQQLEYLAACRARLEPMPT
ELARPTPQYAYLPTLYPAYPMEELYPTAPALSPPVQPQYYARYSPASPPSSCSPPPCPED
LRSPGSVSSDSGVSVSGPRRPRYQCPDCAKSYSTYSGLSKHQQYHCAAAEGSLARKSFSC
KYCSKVYTSLGALKMHIRTHTLPCKCHLCGKAFSRPWLLQGHIRTHTGEKPFSCHHCRRA
FADRSNLRAHLQTHSDVKKYSCTGCGKTFSRMSLLSKHLEGGCGAPSTSPYEYRPEAHQT
SHPHQLPPSAPVHAY
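
The nucleotide and protein sequences above can be cross-checked against each other. The following is a minimal sketch, assuming the CDS is translated with the standard genetic code (NCBI translation table 1) and read in frame from its first base. The names `CODON_TABLE` and `translate` are illustrative and not part of the record. Only the first and last lines of the nucleotide sequence are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGCTCGTTGAAGATTCTCAAACGGCTCGTTCGTTGATGTCGAAGAAGTATGCGCACTGT"
last_line = "TCACATCCGCACCAGCTCCCGCCTTCTGCGCCTGTTCATGCCTATTAG"

print(translate(first_line))  # start of the protein
print(translate(last_line))   # end of the protein, followed by '*'

# A 1128-nt CDS holds 376 codons: 375 residues plus one stop codon.
cds_length = 1128
print(cds_length // 3 - 1)
```

Pasting the full nucleotide sequence into `translate` should, under the same assumption, reproduce the full protein sequence with a trailing `*`.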