New model in OGS2.0 | DPOGS211454  |
---|---|
Genomic Position | scaffold2688:+ 5927-11983 |
CDS Length | 1068 bp |
Paired RNAseq reads   | 216 |
Single RNAseq reads   | 628 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA002159 (2e-53) |
Best Drosophila hit   | pannier, isoform A (2e-57) |
Best Human hit | transcription factor GATA-4 (3e-50) |
Best NR hit (blastp)   | PREDICTED: similar to AGAP002235-PA [Tribolium castaneum] (5e-66) |
Best NR hit (blastx)   | GE24361 [Drosophila yakuba] (2e-62) |
GeneOntology terms    | GO:0007391 dorsal closure; GO:0003700 sequence-specific DNA binding transcription factor activity; GO:0007350 blastoderm segmentation; GO:0007507 heart development; GO:0045449 regulation of transcription; GO:0005634 nucleus; GO:0030154 cell differentiation; GO:0007179 transforming growth factor beta receptor signaling pathway; GO:0007389 pattern specification process; GO:0045893 positive regulation of transcription, DNA-dependent; GO:0042440 pigment metabolic process; GO:0045892 negative regulation of transcription, DNA-dependent; GO:0008407 bristle morphogenesis; GO:0007498 mesoderm development; GO:0007513 pericardial cell differentiation; GO:0010002 cardioblast differentiation; GO:0007510 cardioblast cell fate determination; GO:0007398 ectoderm development; GO:0006355 regulation of transcription, DNA-dependent; GO:0043565 sequence-specific DNA binding; GO:0008270 zinc ion binding; GO:0048542 lymph gland development; GO:0035050 embryonic heart tube development; GO:0035051 cardiac cell differentiation; GO:0060047 heart contraction |
InterPro families    | IPR000679 Zinc finger, GATA-type; IPR013088 Zinc finger, NHR/GATA-type |
Orthology group | MCL16279 |
Nucleotide sequence:
ATGGATCTGCAGTTAACTCAGTCTTGGTTACAAAACATAGCCCTCATCGCAGGAGGGAGC
GGTGTGGGCAGCGTGGGCACCGTGGGCGGCGTCGGTAGTGTGGGCGGTGTGGGTGGCGTG
GGCGGTGTGGGCGGTCTGTACCCCCAGAACATGGTGATGGGATCTTGGTGCGGGCCCTAC
GATGCCCTCCAAAGACCTCCAGCTTACGATGGAGTGCTGGAGGCGTACGAGGAGGGTCGC
GAGTGTGTGAACTGCGGCGCCAACAACACGCCGCTGTGGCGCCGCGACTCCACCGGCCAC
TACCTATGCAACGCGTGCGGTCTCTACCACAAGATCAACGGAGTGAACCGGCCGCTCGTG
AAGCCGAGCAAGCGGCTGTCAGCGGCTCGTCGACACGGTCAAAGCTGCACCAACTGTGGC
TCCAGGAACACGACCCTCTGGAGGAGGAACAACGAGGGCGAGCCCGTCTGTAACGCGTGT
GGACTCTACTACAAGCTCCATGGAATCAATAGACCTCTGGCTATGAGGAAAGATGGAATA
CAAACCAGGAAACGTAAGCCGAAGAAATCGGCGAACGGCGTGAAGCCTTCACCGGAGACG
ACTAAGAAGGACGAGCAGACGTCACCGGGAGTGGACGAGAGCAAGCCCAGTATACCAGAG
GTTCCGTTGCCTCTGACAGCTACCCTATCGGGGCACTCCTCGAGCAAGTCCCGCGAGCCT
CTCGGTGCTACGCCCTCGCCGCATGCACACAGTCACTCGCACTCTCACGCACACTCCCAC
ACACACTCCCACACACACTCGCAGAACTCTCAGTATCCCCTGGCGCTGCCCTCTGCCCCC
GCCTTCCTCTCCAACCCTTCGCTGTTCAACATCAAGAGCGAACCGAACGCAGCCTCCGGC
TACGAGGGTTACGGCTCGCACGCCGCCTCTAACGGCCCGTATCACTCTCAACAGCACTAC
CTGCACGCCTTACAGTACGGCCTGGGCGGTGCTGAGGAGGAGGAGGGCAGCGGGTTCCTT
CATCAGCGGAACGTGACCGCACACGCCAAGCTCATGGCCTCCACGTAG
Protein sequence:
MDLQLTQSWLQNIALIAGGSGVGSVGTVGGVGSVGGVGGVGGVGGLYPQNMVMGSWCGPY
DALQRPPAYDGVLEAYEEGRECVNCGANNTPLWRRDSTGHYLCNACGLYHKINGVNRPLV
KPSKRLSAARRHGQSCTNCGSRNTTLWRRNNEGEPVCNACGLYYKLHGINRPLAMRKDGI
QTRKRKPKKSANGVKPSPETTKKDEQTSPGVDESKPSIPEVPLPLTATLSGHSSSKSREP
LGATPSPHAHSHSHSHAHSHTHSHTHSQNSQYPLALPSAPAFLSNPSLFNIKSEPNAASG
YEGYGSHAASNGPYHSQQHYLHALQYGLGGAEEEEGSGFLHQRNVTAHAKLMAST
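
Consistency check: the CDS length should equal three times the protein length plus one stop codon, and here 355 x 3 + 3 = 1068. The relationship between the two sequences can be checked with a minimal Python sketch like the one below. It assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and are not part of the database or of any library.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in TCAG order so they line up with the amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS; a single trailing stop codon is dropped."""
    cds = "".join(cds.split()).upper()  # remove line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGATCTGCAGTTAACTCAGTCTTGGTTACAAAACATAGCCCTCATCGCAGGAGGGAGC"
last_line = "CATCAGCGGAACGTGACCGCACACGCCAAGCTCATGGCCTCCACGTAG"

print(translate(first_line))  # MDLQLTQSWLQNIALIAGGS
print(translate(last_line))   # HQRNVTAHAKLMAST
```

Passing the full 1068-nt sequence to `translate` in the same way allows the result to be compared against the protein sequence listed above.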