New model in OGS2.0 | DPOGS201552  |
---|---|
Genomic Position | scaffold1145:- 8256-30018 |
CDS Length | 2424 |
Paired RNAseq reads   | 457 |
Single RNAseq reads   | 1450 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA006027 (0.0) |
Best Drosophila hit   | u-shaped (3e-46) |
Best Human hit | ND |
Best NR hit (blastp)   | u-shaped [Tribolium castaneum] (1e-163) |
Best NR hit (blastx)   | u-shaped [Tribolium castaneum] (9e-136) |
GeneOntology terms    | GO:0007362 terminal region determination; GO:0008293 torso signaling pathway; GO:0007390 germ-band shortening; GO:0008258 head involution; GO:0046665 amnioserosa maintenance; GO:0008134 transcription factor binding; GO:0032583 regulation of gene-specific transcription; GO:0005634 nucleus; GO:0030528 transcription regulator activity; GO:0004879 ligand-dependent nuclear receptor activity; GO:0003700 sequence-specific DNA binding transcription factor activity; GO:0007507 heart development; GO:0048749 compound eye development; GO:0007393 dorsal closure, leading edge cell fate determination; GO:0030097 hemopoiesis; GO:0007391 dorsal closure; GO:0042440 pigment metabolic process; GO:0045449 regulation of transcription; GO:0005515 protein binding; GO:0042690 negative regulation of crystal cell differentiation; GO:0009996 negative regulation of cell fate specification; GO:0007398 ectoderm development; GO:0008270 zinc ion binding; GO:0048542 lymph gland development; GO:0035167 larval lymph gland hemopoiesis |
InterPro families    | IPR015880 Zinc finger, C2H2-like; IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding; IPR007087 Zinc finger, C2H2-type |
Orthology group | MCL16093 |
Nucleotide sequence:
ATGCAGGGCGGGCGGAAGGCCACCTGCGGGTTTCTGGCCCGTCTTAATGCATCCGCACAT
ATTTATTCCCGAGCTATATGTTTCCTGATCGCCATCGATATGATTTTAAAAATGGCCGCT
TCCGCCATTTTGGCCGGTGAGGACGAGGAATGGGGAAACGAGAGCGAGGCGATGCCTGGA
GAACCTCGCACACCAAGCTCGGGCGGTGAGCCACCGGCTTCTAGTGGAGGAGCTTCACCG
GCGTCCGAAGGCTCAGCAGCCACCTCACCGCCGCGCCTTCGACTGAACACTCGACTGGCG
ACCGATCCCGCTCTCGCACCTCAGGCCACAACTCTGAAACACGAACCGCCATCCCCTTCG
CCGCCTGCGCCGTCACCAGCACAACAAGCGAGGGATTTCCTTAATCTCACAGCAGCGAAT
TTCCCAGCTCTTTTTCCCGGAGCCCCAGCTGTAGCGGCACCGCCCGCTTACACATGCATT
CCTTGTGGGATACGGTATTCTTCATTAAGTACGTTACAAGCGCATCAAGAGCATTACTGC
TCCAAACGGAGATCGAAACCTGATGGAACAGATGTGCCGACAGAAACAGTAGCAGATGAC
TCGAGTGGTGATTCGAAAACACCACGACCTCCGGGGAAACAATACGCCTGTACTTACTGT
TCATACAGCGCAGATAAGAAAGTTAGTTTAAATCGTCATATGCGTATGCATTCTTCTTCA
CCTATAAGTAGTAGTACTCCAGTACCACCTCCGACATCCAACGGGGAAGCGACTGATGGA
CAACCAGCCCAGGACCGTTATTGTGTTGATTGTGATATTCATTTTAGCTCCATTAAGACT
TACCGTGCTCATAAGGCTCATTACTGTAACACGAGACAGATTGTCAAGCAAGTTTTACCC
ACAGCCCGAGCGGGTTCTACCACATCGGGATCGGCTCCCACTTCCCCCGGTGCAACCCCA
CCAGCTCAGAATCAATATGCGTTAGCATTACCGACTAATCCAATACTAATTGTCCCATAT
TCACTTTTGAGAAGCGCTAGCACTCTTCCTGGTGCAACATTGCCAGATCCCGATACGCCA
TGCTTTTGGTTGCCAAACGGAACGTTTCAACCTATAAGTCGTGCGTTACCAAATGTAAAT
ACGGAGGTAAAGGAACCCGAGGTTTTAAAATCGGCCAATAGACCGCGAGAACCGTCAAGA
GATGGTGCTACACCTTTAGATCTGAGTGTTCGTCGTACACCAGAGTCGGTAACTACGGAT
GAGCACGAAAAAGAGAACAGAATGCGTTCTACGACACCAGAACAAATAGTATGTGCTCCA
TCCTTACCTGCCTCTCCTGCAACTCCATCACCTTCGAGGCGGTCATCATCGCCTAGTGGA
GAGAGTTCACCGAAACGACGGAGAAAGAATTCAAGAGATCCAACTCCAAAGCCGCCCAGC
GTACCTTCACCGTCAGAAGATAAAATTTCAGCCGTCATACCTCCCCCAGCATTCCCACCT
TCCTTAGCTTTGCGTTTGACTACAGATCCAATAACGACTGTTTCTCCACAAGTTCTAGTA
AAGCAAGGAGTTTCAAAGTGCCAGGAGTGCAATATAGTTTTTTGCAAATTCGATAATTAC
CGTATACACAAACGGCACTACTGTTCCGCTGGCGGCGGAGACGAGCGGGCCAGCCCGGCA
CCCCCTGAACCAGGCCCTCCGACTCAGTACCGACAGCTCATCTGTATGGCGTGCGGCATA
CATTTCAGTTCCTACGATAACTTAACGACTCATCAATCCTACTACTGCACGAAAAGAGAG
ACGCGTTCTCCGCGAGCTGTTCTAGAAACATCAAGACCTTCTTCGGGGTCAGACGGTGGT
TGGAAGTGTCCTTGCTGCGATGTTGTTTCTCCAACAGCCGCCGCTGCTCAGAGACACATG
GAGGCGCATGCGGGAGTGAAAGCTTTCCGTTGCACCATTTGCAAATACAGAGGAAACACT
CTACGTGGTATGAGGACACACATCCGAGTTCACTTCCGTGAAAAGCCCTCTGATTTGCAG
GAAGAGAGCTACATATCCTGCGTGCTCGAAGAGGAAAGTCGCGAGAGTACCTCGCCGGCC
CCCGCTGCGGGCGAGCGCGTCCATCGCTGCAGCTCGTGCGCGTACACATCGACTTACCGC
GGGAATGTTGTCCGACACGCACGACTCGTGCACGCCGACGAACCAGAACCCGACAGAACA
TCGCCGCCGCCCGACATCAAAAAAGAACCTGACGTAGACGAAGACACGCCCAACTTCTGC
AAATCCTGCAACATATCCTTTAAATACGTTAACACTTATAAAGCACACAAACAGTTCTAT
TGCACCGCGGCCAATCAGGACGCAGCGGCCAACAACAACGTCCCCGCTCGTGTGCACGAT
GTGTCCGTTGTTAAATCACAATAA
Protein sequence:
MQGGRKATCGFLARLNASAHIYSRAICFLIAIDMILKMAASAILAGEDEEWGNESEAMPG
EPRTPSSGGEPPASSGGASPASEGSAATSPPRLRLNTRLATDPALAPQATTLKHEPPSPS
PPAPSPAQQARDFLNLTAANFPALFPGAPAVAAPPAYTCIPCGIRYSSLSTLQAHQEHYC
SKRRSKPDGTDVPTETVADDSSGDSKTPRPPGKQYACTYCSYSADKKVSLNRHMRMHSSS
PISSSTPVPPPTSNGEATDGQPAQDRYCVDCDIHFSSIKTYRAHKAHYCNTRQIVKQVLP
TARAGSTTSGSAPTSPGATPPAQNQYALALPTNPILIVPYSLLRSASTLPGATLPDPDTP
CFWLPNGTFQPISRALPNVNTEVKEPEVLKSANRPREPSRDGATPLDLSVRRTPESVTTD
EHEKENRMRSTTPEQIVCAPSLPASPATPSPSRRSSSPSGESSPKRRRKNSRDPTPKPPS
VPSPSEDKISAVIPPPAFPPSLALRLTTDPITTVSPQVLVKQGVSKCQECNIVFCKFDNY
RIHKRHYCSAGGGDERASPAPPEPGPPTQYRQLICMACGIHFSSYDNLTTHQSYYCTKRE
TRSPRAVLETSRPSSGSDGGWKCPCCDVVSPTAAAAQRHMEAHAGVKAFRCTICKYRGNT
LRGMRTHIRVHFREKPSDLQEESYISCVLEEESRESTSPAPAAGERVHRCSSCAYTSTYR
GNVVRHARLVHADEPEPDRTSPPPDIKKEPDVDEDTPNFCKSCNISFKYVNTYKAHKQFY
CTAANQDAAANNNVPARVHDVSVVKSQ
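
The CDS length and the two sequences above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and a CDS that ends in a single stop codon. The names `translate` and `expected_protein_length` are hypothetical helpers introduced for illustration and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; stop codons become '*', unknown codons 'X'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS that includes exactly one terminal stop codon."""
    return cds_length // 3 - 1


if __name__ == "__main__":
    # First and last codons of the nucleotide sequence listed above.
    print(translate("ATGCAGGGCGGG"))       # MQGG
    print(translate("GTTAAATCACAATAA"))    # VKSQ*
    print(expected_protein_length(2424))   # 807
```

Under these assumptions, a CDS of 2424 nt corresponds to 807 residues plus a stop codon, which agrees with the length of the protein sequence listed above.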