DPGLEAN20699 in OGS1.0

New model in OGS2.0  DPOGS211448
Genomic Position  scaffold608:+ 42579-44275
CDS Length  804
Paired RNAseq reads  1089
Single RNAseq reads  3277
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA002156 (6e-50)
Best Drosophila hit  serpent, isoform B (7e-38)
Best Human hit  transcription factor GATA-4 (5e-35)
Best NR hit (blastp)  BmGATA beta isoform 2 - silkworm (fragment) (1e-51)
Best NR hit (blastx)  BmGATA beta isoform 2 - silkworm (fragment) (2e-44)
GeneOntology terms
  
GO:0007391 dorsal closure
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0007350 blastoderm segmentation
GO:0007507 heart development
GO:0045449 regulation of transcription
GO:0005634 nucleus
GO:0030154 cell differentiation
GO:0007179 transforming growth factor beta receptor signaling pathway
GO:0007389 pattern specification process
GO:0045893 positive regulation of transcription, DNA-dependent
GO:0042440 pigment metabolic process
GO:0045892 negative regulation of transcription, DNA-dependent
GO:0008407 bristle morphogenesis
GO:0007498 mesoderm development
GO:0007513 pericardial cell differentiation
GO:0010002 cardioblast differentiation
GO:0007510 cardioblast cell fate determination
GO:0007398 ectoderm development
GO:0006355 regulation of transcription, DNA-dependent
GO:0043565 sequence-specific DNA binding
GO:0008270 zinc ion binding
GO:0048542 lymph gland development
GO:0035050 embryonic heart tube development
GO:0035051 cardiac cell differentiation
GO:0060047 heart contraction
InterPro families
  
IPR000679 Zinc finger, GATA-type
IPR013088 Zinc finger, NHR/GATA-type
Orthology group  MCL18541

Nucleotide sequence:

ATGGCGGAGGTATTCACGGAGGGCAGAGAGTGCGTGAACTGCGGCGCCATCGACACGCCG
CTGTGGCGACGAGACGGCACCGGACACTACCTGTGCAACGCTTGCGGTCTCTACACCAAG
ATGAACGGCATGAACCGGCCGCTGAAGCCCCCGCGGCGGCTGGTACGTCAGCGGCACGCG
GCGCAGGCGCCGGCGCCCGCTCCCGACGTACGCAGCCTCGCCCTCACGACCTCCGCTCGA
CCGACCCTCCCCCTCCACCACCCCGCGACCCTCGCCCTCCCCGCGCCCGCGAGGAATCCG
CGCCCGAGCATGGGTACGAAGCGGCAGGGAGTGTGTTCTAACTGTGAGACCACCATCACT
ACTTTATGGCGTCGGAATCCGCTCGGGGAGAACGTGTGCAACGCTTGCGGGTTGTACTTC
AAACTGCACGGCATCAACCGCCCGAAGAACATGAAGAAGGATTCGATCCAGACGAGAAAG
CGAAAATCTAAGAACAATACGAAAACGGAGCGCAATATAAGTAAAACCACCGTTCGCTCG
ACTCTCAACGTAGGGACGACGGAGTTAGAGAACATATTGGATATAGGCGCCTCGTCGAGC
GGTGCTCGCGGCCGTTCGCTGGGGTACTACGTGCAGCCGCACACTGTGAAACTGGAGGAG
CCCGCGCCAGCCCACGCGCACCAGCAGCAGCAACAACACCAGCAACACCAACAGCAGCAG
CAGGCCTACTACGACGACGAGTACCGCCGCGTCGAGCCCCAGGAGCGCCTGGAGCGGCCG
ACTGTGGTGTCGCTCGGCAGCTGA

Protein sequence:

MAEVFTEGRECVNCGAIDTPLWRRDGTGHYLCNACGLYTKMNGMNRPLKPPRRLVRQRHA
AQAPAPAPDVRSLALTTSARPTLPLHHPATLALPAPARNPRPSMGTKRQGVCSNCETTIT
TLWRRNPLGENVCNACGLYFKLHGINRPKNMKKDSIQTRKRKSKNNTKTERNISKTTVRS
TLNVGTTELENILDIGASSSGARGRSLGYYVQPHTVKLEEPAPAHAHQQQQQHQQHQQQQ
QAYYDDEYRRVEPQERLERPTVVSLGS
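
Consistency check (illustrative sketch, not part of the original record): the nucleotide and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The function name `translate` and the table construction below are assumptions made for this example; only the first and last sequence lines of the record are used as inputs.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS; a trailing stop codon is dropped."""
    cds = "".join(cds.split()).upper()  # tolerate line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGCGGAGGTATTCACGGAGGGCAGAGAGTGCGTGAACTGCGGCGCCATCGACACGCCG"
last_line = "ACTGTGGTGTCGCTCGGCAGCTGA"

print(translate(first_line))  # expected to match the start of the protein
print(translate(last_line))   # expected to match the end of the protein
```

With the full 804-nt CDS as input, the same function should yield 267 residues (268 codons including the terminal TGA stop), which agrees with the length of the protein sequence listed above.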