DPGLEAN18675 in OGS1.0

New model in OGS2.0  DPOGS203153
Genomic Position  scaffold117:- 237559-239889
CDS Length  1107
Paired RNAseq reads  469
Single RNAseq reads  1503
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA011498 (6e-145)
Best Drosophila hit  stripe, isoform A (2e-72)
Best Human hit  early growth response protein 1 (2e-52)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC004846 [Tribolium castaneum] (1e-76)
Best NR hit (blastx)  conserved hypothetical protein [Culex quinquefasciatus] (2e-78)
GeneOntology terms
GO:0007517 muscle organ development
GO:0007417 central nervous system development
GO:0003702 RNA polymerase II transcription factor activity
GO:0005634 nucleus
GO:0045449 regulation of transcription
GO:0016204 determination of muscle attachment site
GO:0042440 pigment metabolic process
GO:0007427 epithelial cell migration, open tracheal system
GO:0003676 nucleic acid binding
GO:0005622 intracellular
GO:0008270 zinc ion binding
GO:0006911 phagocytosis, engulfment
GO:0007398 ectoderm development
GO:0007390 germ-band shortening
GO:0007525 somatic muscle development
InterPro families
IPR007087 Zinc finger, C2H2-type
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
IPR015880 Zinc finger, C2H2-like
Orthology group  MCL39205

Nucleotide sequence:

ATGGGCGCGGATGGAGACCCACCGACCGGGCCTCACCTCCTCTCGCTGGCGGACGTCGGG
GCGCTGGGCTTCGACTGCGCTCTGAAGCCGGTGACCGCGCCTATGACAGGCGGCGCTCCG
GCCGATCTCAACACACCCGTGTCCACATCGGAACTTCCCGCTTTCTTCCCGAGCCTGCTC
GAGCCTCCTCCGATATCAGGTACTTTACCAGGCGATGAGTTACTGGGGTGCTCCCCTCGT
CGTCACAAGCACGAAGCGTCTTTGTCACCGGGAGCGAGGGCTGAGGACGCTAGCAATGCC
TCTAGTGCTAGCGCCTCTCTATACGGACCGCCGATGGGCGGCAAAAGAGCTCCCTCACCA
CCACTACAATGGTTGCTACCATCTGGACCCGGTCCCGGCAGCGTCGATAAATACTTCCAA
CAAGAATACGAGGAACGCGTCGAGCTTCTACCGCCCGAATGTCAGCCTTCTTACTGTACA
GCACCGCAGCAATGCCAGCCGCAACACTGCGACTACAGACCCCAACCTCCACCACAACCC
CAACACTCGTGGGAGACGCAGGAGTACGCGAGCGTGCCGCAGCCAACACCGGGTCCCTCC
GGAGTCCCCAAAAGAGAACCCTATCCAAACACAACAGGCGACAGACCCGTGCAACTAGCA
GAATACAACCCGTCCACGAGCAAAGGCCATGAGATATTATCTCAAGTGTATCAACAGAGC
GCTCAACCACTGCGTCTAGTCGCCGTCAAACCTCGCAAGTATCCCAACCGTCCGAGTAAA
ACACCCGTACATGAAAGGCCCTATGCCTGTCCAGTGGACGAGTGTGATCGCAGGTTTTCG
AGATCAGACGAGCTGACAAGGCACATACGCATACACACAGGACAAAAACCGTTCCAGTGT
CGTATCTGTATGCGCTCGTTCAGTCGATCGGATCATTTGACGACACATGTCAGAACTCAC
ACAGGGGAGAAGCCGTTTGCGTGCGACGTGTGCGGTCGTAAGTTCGCGAGGTCTGATGAG
AAGAAGCGTCACGCGAAGGTTCACCTTAAGCAGCGTCTCAAACGCGAGCGGGGCAGTGGA
CCGGCTCACCCACACGCGCCGCTCTAG

Protein sequence:

MGADGDPPTGPHLLSLADVGALGFDCALKPVTAPMTGGAPADLNTPVSTSELPAFFPSLL
EPPPISGTLPGDELLGCSPRRHKHEASLSPGARAEDASNASSASASLYGPPMGGKRAPSP
PLQWLLPSGPGPGSVDKYFQQEYEERVELLPPECQPSYCTAPQQCQPQHCDYRPQPPPQP
QHSWETQEYASVPQPTPGPSGVPKREPYPNTTGDRPVQLAEYNPSTSKGHEILSQVYQQS
AQPLRLVAVKPRKYPNRPSKTPVHERPYACPVDECDRRFSRSDELTRHIRIHTGQKPFQC
RICMRSFSRSDHLTTHVRTHTGEKPFACDVCGRKFARSDEKKRHAKVHLKQRLKRERGSG
PAHPHAPL
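
Consistency check (illustrative):

The CDS length, nucleotide sequence, and protein sequence listed above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The `translate` helper and all variable names are hypothetical and are not part of the database or of any library. Only the first 18 nucleotides and the final line of the nucleotide sequence are used here, to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
# Assumption: this table is appropriate for the record above.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(cds: str) -> str:
    """Translate a coding sequence into one-letter amino acids ('*' marks a stop)."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Ambiguous bases such as N are not handled and raise KeyError.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Start of the nucleotide sequence in the record.
print(translate("ATGGGCGCGGATGGAGAC"))           # MGADGD
# Last line of the nucleotide sequence in the record.
print(translate("CCGGCTCACCCACACGCGCCGCTCTAG"))  # PAHPHAPL*

# CDS Length field from the record: 1107 nt is 369 codons,
# i.e. 368 residues plus one stop codon.
cds_length = 1107
print(cds_length % 3 == 0, cds_length // 3 - 1)  # True 368
```

Both translated fragments agree with the start (MGADGD) and end (PAHPHAPL) of the protein sequence in the record, and the CDS length of 1107 implies 368 residues followed by a stop codon. Pasting the full nucleotide sequence into `translate` would extend the same check to the whole protein.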