DPGLEAN02427 in OGS1.0

New model in OGS2.0  DPOGS211454
Genomic Position  scaffold2688:+ 5927-11983
CDS Length  1068
Paired RNAseq reads  216
Single RNAseq reads  628
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA002159 (2e-53)
Best Drosophila hit  pannier, isoform A (2e-57)
Best Human hit  transcription factor GATA-4 (3e-50)
Best NR hit (blastp)  PREDICTED: similar to AGAP002235-PA [Tribolium castaneum] (5e-66)
Best NR hit (blastx)  GE24361 [Drosophila yakuba] (2e-62)
GeneOntology terms
GO:0007391 dorsal closure
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0007350 blastoderm segmentation
GO:0007507 heart development
GO:0045449 regulation of transcription
GO:0005634 nucleus
GO:0030154 cell differentiation
GO:0007179 transforming growth factor beta receptor signaling pathway
GO:0007389 pattern specification process
GO:0045893 positive regulation of transcription, DNA-dependent
GO:0042440 pigment metabolic process
GO:0045892 negative regulation of transcription, DNA-dependent
GO:0008407 bristle morphogenesis
GO:0007498 mesoderm development
GO:0007513 pericardial cell differentiation
GO:0010002 cardioblast differentiation
GO:0007510 cardioblast cell fate determination
GO:0007398 ectoderm development
GO:0006355 regulation of transcription, DNA-dependent
GO:0043565 sequence-specific DNA binding
GO:0008270 zinc ion binding
GO:0048542 lymph gland development
GO:0035050 embryonic heart tube development
GO:0035051 cardiac cell differentiation
GO:0060047 heart contraction
InterPro families
IPR000679 Zinc finger, GATA-type
IPR013088 Zinc finger, NHR/GATA-type
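
The GATA-type zinc finger annotation can be illustrated directly on the protein sequence in this record. The sketch below is only an approximation: it assumes the simplified spacing C-x2-C-x17-C-x2-C commonly quoted for GATA fingers, which is much looser than the InterPro models, and the names `EXCERPT`, `GATA_FINGER` and `find_gata_fingers` are hypothetical. The excerpt is residues 61-180 of the protein sequence listed in this record.

```python
import re

# Residues 61-180 of the protein sequence in this record
# (its second and third 60-residue lines, joined).
EXCERPT = (
    "DALQRPPAYDGVLEAYEEGRECVNCGANNTPLWRRDSTGHYLCNACGLYHKINGVNRPLV"
    "KPSKRLSAARRHGQSCTNCGSRNTTLWRRNNEGEPVCNACGLYYKLHGINRPLAMRKDGI"
)

# Assumed, simplified spacing pattern: C-x2-C-x17-C-x2-C.
# This is an illustration, not the InterPro signature itself.
GATA_FINGER = re.compile(r"C.{2}C.{17}C.{2}C")


def find_gata_fingers(protein):
    """Return every non-overlapping stretch matching the simplified pattern."""
    return [match.group(0) for match in GATA_FINGER.finditer(protein)]


for finger in find_gata_fingers(EXCERPT):
    print(finger)
```

Under this assumption the excerpt contains two candidate fingers, which fits the GATA-4 and pannier hits reported above.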
Orthology group  MCL16279

Nucleotide sequence:

ATGGATCTGCAGTTAACTCAGTCTTGGTTACAAAACATAGCCCTCATCGCAGGAGGGAGC
GGTGTGGGCAGCGTGGGCACCGTGGGCGGCGTCGGTAGTGTGGGCGGTGTGGGTGGCGTG
GGCGGTGTGGGCGGTCTGTACCCCCAGAACATGGTGATGGGATCTTGGTGCGGGCCCTAC
GATGCCCTCCAAAGACCTCCAGCTTACGATGGAGTGCTGGAGGCGTACGAGGAGGGTCGC
GAGTGTGTGAACTGCGGCGCCAACAACACGCCGCTGTGGCGCCGCGACTCCACCGGCCAC
TACCTATGCAACGCGTGCGGTCTCTACCACAAGATCAACGGAGTGAACCGGCCGCTCGTG
AAGCCGAGCAAGCGGCTGTCAGCGGCTCGTCGACACGGTCAAAGCTGCACCAACTGTGGC
TCCAGGAACACGACCCTCTGGAGGAGGAACAACGAGGGCGAGCCCGTCTGTAACGCGTGT
GGACTCTACTACAAGCTCCATGGAATCAATAGACCTCTGGCTATGAGGAAAGATGGAATA
CAAACCAGGAAACGTAAGCCGAAGAAATCGGCGAACGGCGTGAAGCCTTCACCGGAGACG
ACTAAGAAGGACGAGCAGACGTCACCGGGAGTGGACGAGAGCAAGCCCAGTATACCAGAG
GTTCCGTTGCCTCTGACAGCTACCCTATCGGGGCACTCCTCGAGCAAGTCCCGCGAGCCT
CTCGGTGCTACGCCCTCGCCGCATGCACACAGTCACTCGCACTCTCACGCACACTCCCAC
ACACACTCCCACACACACTCGCAGAACTCTCAGTATCCCCTGGCGCTGCCCTCTGCCCCC
GCCTTCCTCTCCAACCCTTCGCTGTTCAACATCAAGAGCGAACCGAACGCAGCCTCCGGC
TACGAGGGTTACGGCTCGCACGCCGCCTCTAACGGCCCGTATCACTCTCAACAGCACTAC
CTGCACGCCTTACAGTACGGCCTGGGCGGTGCTGAGGAGGAGGAGGGCAGCGGGTTCCTT
CATCAGCGGAACGTGACCGCACACGCCAAGCTCATGGCCTCCACGTAG

Protein sequence:

MDLQLTQSWLQNIALIAGGSGVGSVGTVGGVGSVGGVGGVGGVGGLYPQNMVMGSWCGPY
DALQRPPAYDGVLEAYEEGRECVNCGANNTPLWRRDSTGHYLCNACGLYHKINGVNRPLV
KPSKRLSAARRHGQSCTNCGSRNTTLWRRNNEGEPVCNACGLYYKLHGINRPLAMRKDGI
QTRKRKPKKSANGVKPSPETTKKDEQTSPGVDESKPSIPEVPLPLTATLSGHSSSKSREP
LGATPSPHAHSHSHSHAHSHTHSHTHSQNSQYPLALPSAPAFLSNPSLFNIKSEPNAASG
YEGYGSHAASNGPYHSQQHYLHALQYGLGGAEEEEGSGFLHQRNVTAHAKLMAST
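
The two sequences above can be cross-checked against each other and against the stated CDS length: 1068 nucleotides is 356 codons, that is 355 residues plus a stop codon. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene. The names `CODON_TABLE`, `translate`, `FIRST_LINE` and `LAST_LINE` are hypothetical, and only the first and last lines of the nucleotide sequence are used to keep the example short.

```python
from itertools import product

# Standard genetic code in TCAG order (assumed to apply to this gene).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate an in-frame coding sequence, dropping one trailing stop."""
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = "".join(
        CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First and last lines of the nucleotide sequence in this record.
FIRST_LINE = "ATGGATCTGCAGTTAACTCAGTCTTGGTTACAAAACATAGCCCTCATCGCAGGAGGGAGC"
LAST_LINE = "CATCAGCGGAACGTGACCGCACACGCCAAGCTCATGGCCTCCACGTAG"

print(translate(FIRST_LINE))
print(translate(LAST_LINE))
```

Passing the full nucleotide sequence, with line breaks removed, to `translate` allows the same comparison over the whole protein.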