New model in OGS2.0 | DPOGS215846  |
---|---|
Genomic Position | scaffold56:+ 173918-189377 |
See gene structure | |
CDS Length | 4374 |
Paired RNAseq reads   | 1212 |
Single RNAseq reads   | 3192 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA002946 (5e-08) |
Best Drosophila hit   | spalt major (2e-57) |
Best Human hit | sal-like protein 3 (3e-46) |
Best NR hit (blastp)   | PREDICTED: spalt-like protein [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | spalt [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0035277 spiracle morphogenesis, open tracheal system GO:0005634 nucleus GO:0003704 specific RNA polymerase II transcription factor activity GO:0045449 regulation of transcription GO:0008586 imaginal disc-derived wing vein morphogenesis GO:0046845 branched duct epithelial cell fate determination, open tracheal system GO:0016481 negative regulation of transcription GO:0007424 open tracheal system development GO:0007427 epithelial cell migration, open tracheal system GO:0007423 sensory organ development GO:0007438 oenocyte development GO:0001751 compound eye photoreceptor cell differentiation GO:0045465 R8 cell differentiation GO:0048098 antennal joint development GO:0030528 transcription regulator activity GO:0007379 segment specification GO:0008270 zinc ion binding GO:0003676 nucleic acid binding GO:0005622 intracellular GO:0045466 R7 cell differentiation GO:0035310 notum cell fate specification |
InterPro families    | IPR007087 Zinc finger, C2H2-type IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding IPR015880 Zinc finger, C2H2-like |
Orthology group | MCL15850 |
Nucleotide sequence:
ATGAGCCGGGCGCGGCTCCGCCTCCTTGTCATAACCTTATACCTGGTGCGCGCGCGCGAC
TCCCGCCGTCCTCCACCACCCGCCACCACCGCCCGGAGGCTCGCGGCTCGCGCTCAGTCC
CGCGTCAGATGGCGAGAGGTGAGCCCGTCTGTATACCGCGTCCGATCACACAGGCTCCGC
GCGAGTGAGTACTCTACCACCGCCGGCGATGCGAGGGCTGTGAGCCGCCAGCGCTACCAC
CGACCACCTACCACCGACCGACCACCGGCCAGCATGCCGCGCGTCAAGCCAGCCTGCGTC
CGACGCGTCTCCATCGGTACATATCCATCAGCTCGATCACCAAACCGACACACGACCGTA
CATAGGATCTGTGATAGCTCGGGATCCTGTTCAGAAGAGGATATAAATAATGTGTTACCC
GATGACGGAAGAGATCGGCCTGAGCCACATATGTGCCCTCGCTGTCATGAACAGTTCGAG
AACCTACACGACTTCTTGTATCACAAGAGGGTTTGTGATGAAAAAGCAATACAAATGGGT
GAAGAAAGAATGCACTCTGATCCCGAGGATATGGTAGTGTCGGGGGATGAAGAAATGGAT
GGACCCAATAAGCGGTTAGATCAAGTAAGAAGGCATCGGCAGGATGCTGAAAATAATAAC
AGCCTAGAGGATGGTGAAGCTGAGGTACCTGAAGCCGATATGCCCCCCGTAGGTCTTCCC
TTCCCCGTAGCAGGCCATGTTACTCTTGAAGCTCTTCAGAATACAAAAGTAGCTGTTGCT
CAATTCGCTGCGACGGCAATGGCTAATAATGCAAATAATGAAGCCGCTTTACATGAGCTT
GCCGTATTACAAAGTACGCTATTTACTCTACAGCATCAACAAGTATTTCAACTTCAGTTA
ATACGCCAACTACAAAATCAATTGTCATTAACACGACGGAAAGAGGATCAACATTCAAGC
CCGCCTCCGAGTGAAATTGAACAAAACGCTCCGTCGACGCCAGCTCGCTCGCCTTCACCA
CCGCGTCCGCCACGGGAGCCATCTCCTGCTGAACCCACTCATCCTACTAGCCAAAATTTG
CCGTCTACCCACACAAATCTTACGCCCAAAACGGAACCTATCTCTGTTCCCAAGCTTCCG
ACTTCATCGCCACCAATGATGTCCCACCCACCTTACAGCTCCATATCTTCCTCTCTAGCA
TCTTCAATCATTACAAACAACGATCCTCCGCCATCTCTTAATGAACCAAACACTCTCGAA
ATGCTTCAAAAACGTGCACAAGAAGTACTAGATAACGCATCACAAGGCCTTCTCGCTAAT
AATCTTGCCGATGAACTAGCTTTTAGAAAATCTGGAAAAATGTCACCCTATGACGGAAAA
TCTGGTGGACGCAACGAACCATTCTTTAAACATCGCTGTCGATATTGTGGAAAAGTTTTT
GGAAGTGACTCAGCGCTTCAGATTCATATAAGATCTCATACAGGCGAAAGACCATTTAAA
TGTAACGTTTGTGGATCTCGCTTTACAACGAAAGGAAACCTTAAAGTACATTTCCAAAGA
CATACTTCTAAATTTCCACATGTTAAGATGAATCCCAACCCTGTACCCGAACATTTAGAC
AAGTATCATCCACCACTACTAGCACAACTATCACCGGGGCCGATACCGGGAATGCCACCC
CATCCACTTCAATTCAATCCAGGTGCACCAGCACCTTTTCCACCAAGCTTGCCATTATAT
AGGCCACCACACCATGATTTACTTCCACCTCGACCTCTTGGAGATAAACCTCTTTCACAT
CATCCACTTTTCGCGATGCGAGAAGAACAAGACGTACCAGCAGATCTCAGCAAACCATCT
GCACCAAGTCCTACTCAGCCTGCATCTGAGGTTTTTAAATCCGAACCACAAGATGAAGAA
AGCCAACGTGACTCTAGCTTTGACGAGTCTGATCGCATATCACCTAAAAGAGAACCTGAA
GAAAACGATGCTGGACAAGATCAAGAACACGATCGATATCCATCAACATCGCCCTATGAT
GACTGCAGCATGGATTCAAAATACAGCAACGAAGACCAAATCGGCAGAGAAAGTCCACAA
GTGAAGGCTGATCCAGACCAACCGGAAAATCTTTCAAGTAAGAATTCTACGATATCTGGA
CCAATTTCAATAGCAACGGGACTTCGTACTTATCCTTCTTATCCATTGTTTCCACAATCC
CCACCTAGCAGCGTGTCTTCTGAGAGTCTTACTCCGTTTTCAAATAATCCTGTCCTTGGA
GACACTGATGTAACACGAGACCCCATATTTTATAATTCGCTCTTACCGCGTCCAGGTAGC
AACGATAATTCTTGGGAAAGTTTAATAGAAATTACAAAAACATCTGAAACCTCCAAATTA
CAGCAACTAGTTGATAATATTGATAACAAAGTATCTGAGCCTAATGAATGCATAGTTTGT
CACCGTGTTCTGTCTTGTAAAAGTGCTTTACAAATGCACTACCGCACACATACTGGTGAG
CGCCCATTTCGATGTAAATTATGCGGTCGTGCTTTTACCACAAAAGGTAATTTGAAGACA
CATATGGGTGTACACCGCATAAAACCTCCCTCACAATTATTGCATCAATGTCCCGTGTGC
CATAAAAAGTTTACAGATCCTTCTATGTTACATCAACATATCAGAATTCATACGGGTGAA
CGACTCAATAACCCTTTCAATGAAGTTAACGACAATAACGCGAATAGTTGTCAGTCTTAT
AACAACGAATCGGACATTACAGACTGTTCTTATCGTCCCATTCCGGCTCCAATTTTCCCT
ACACCTTCCACTCCCGGCGACCGAAGGGCGGACTCCCGCGGGACCGACGATGAGAGCGGC
AGAGATTCTCGGGAGTTCGATGAGGACTCAGATATAAAAAATCGTCGAGCTTCGCCGCTC
TCCGTCTGCGCCTCGGCGTCCGAATGCGAAGTAAAAACCATCACCACAACGGCTTCCCTC
CCATCGGCGACAGGTTCGGAGAGCGGGCGCAGTGCACGGGCCTCCCCACCGTCGCCGTCT
CCGTCGGTGCTGTCGACGCCCCCGCGGCTGCCGCAGCACTCGCCGCTGCCTTCGCCGCCG
ACCCCGCTCGCCGCGCTCGGTGCTCTCGGTGGACCTTTCAGCCCGCTCGGACTCGCTTTC
CCTCCCGCAGTGCGCGGAAATACAACATGTACTATCTGCTACAAGACTTTCGCCTGCAAT
TCGGCGTTGGAAATCCATTACCGCAGTCACACTAAAGAGCGACCATTCAAATGCACCGTC
TGCGACAGAGGCTTCTCTACTAAGAGCAGTGGCGGCGGTTGCCGGTGTAGGCGCCCACGC
GCACCCCGCCCGCCGCACGCCACTGCTTTGGACCTCTGGAACGCCTTCGTCTACCCGGGA
AACATGAAACAGCACATGTTGACGCACAAGATACGTGACATGCCGCCTGGTTTCGAAAAG
AGCGTAAGTGGACCAACTGGTCCGCCAAGTGAAGAAGGCCGGGAACCAAGTCCAGACAGA
CGATCGTCTCCTGAGAAGATCGATTTGAAACGATCACCCCCCGCCTTACCCCCCGTCACT
ATAGCGCACCCACCCCCCATTGATATGCCACCACTGCCTAAAAGACCTACAGTACCTATA
GTACCAAATCATCCGCCACCATCTCAGTCATCGAAGCACCTTTGCGGAGTTTGTCGCAAA
AACTTCTCATCGTCATCGGCTTTACAAATCCACATGCGCACGCACACCGGAGACAAACCC
TTCCGATGCGCTGTTTGCCAAAAAGCTTTTACCACCAAAGGAAACCTTAAGGTTCACATG
GGCACACACATGTGGAGCGGAGGTGCGTCTCGTCGCGGCCGTCGTATGTCTCTGGAACTG
CCCCCACGGCCGCTACATGAGCCTCACGACCTTCTCCGACGACCTGATCTCTTCTACCCC
TACTTGCCTGCGCCATTTCTCAATGGCATGCAACAAAAACTGAACGAGATATCAGTAATA
CAACAACAAAATGCCGGCCAGAATGGCGTAGCTGGTAAATTCCCGGGACTACTTGGTTTC
GGAGCTTTTGGAGCTGGAAGACCGGGTGCCGCCTCCCCTCTCGAGAGGCCACCGTCACTC
GACGGTGATGAACGTCAGGCGGCGATGAGGGAACTCGCTGAGAGAGGCAGGGAGCTAGCT
GAACGGAGCCGACAGATGCGTGAGGAAAATGAACGGGAGCACTACCGGTCCGCTGGCGGT
CATCCAGCGCATGCGCCACCGACAGCACAGGCTTCGCCTCCAGCACCACATAATCTACCG
CATCCCCTGGCATCAATACCGCCGCCCGCGCGCACCGAAGGTCTCACAGTTTAA
Protein sequence:
MSRARLRLLVITLYLVRARDSRRPPPPATTARRLAARAQSRVRWREVSPSVYRVRSHRLR
ASEYSTTAGDARAVSRQRYHRPPTTDRPPASMPRVKPACVRRVSIGTYPSARSPNRHTTV
HRICDSSGSCSEEDINNVLPDDGRDRPEPHMCPRCHEQFENLHDFLYHKRVCDEKAIQMG
EERMHSDPEDMVVSGDEEMDGPNKRLDQVRRHRQDAENNNSLEDGEAEVPEADMPPVGLP
FPVAGHVTLEALQNTKVAVAQFAATAMANNANNEAALHELAVLQSTLFTLQHQQVFQLQL
IRQLQNQLSLTRRKEDQHSSPPPSEIEQNAPSTPARSPSPPRPPREPSPAEPTHPTSQNL
PSTHTNLTPKTEPISVPKLPTSSPPMMSHPPYSSISSSLASSIITNNDPPPSLNEPNTLE
MLQKRAQEVLDNASQGLLANNLADELAFRKSGKMSPYDGKSGGRNEPFFKHRCRYCGKVF
GSDSALQIHIRSHTGERPFKCNVCGSRFTTKGNLKVHFQRHTSKFPHVKMNPNPVPEHLD
KYHPPLLAQLSPGPIPGMPPHPLQFNPGAPAPFPPSLPLYRPPHHDLLPPRPLGDKPLSH
HPLFAMREEQDVPADLSKPSAPSPTQPASEVFKSEPQDEESQRDSSFDESDRISPKREPE
ENDAGQDQEHDRYPSTSPYDDCSMDSKYSNEDQIGRESPQVKADPDQPENLSSKNSTISG
PISIATGLRTYPSYPLFPQSPPSSVSSESLTPFSNNPVLGDTDVTRDPIFYNSLLPRPGS
NDNSWESLIEITKTSETSKLQQLVDNIDNKVSEPNECIVCHRVLSCKSALQMHYRTHTGE
RPFRCKLCGRAFTTKGNLKTHMGVHRIKPPSQLLHQCPVCHKKFTDPSMLHQHIRIHTGE
RLNNPFNEVNDNNANSCQSYNNESDITDCSYRPIPAPIFPTPSTPGDRRADSRGTDDESG
RDSREFDEDSDIKNRRASPLSVCASASECEVKTITTTASLPSATGSESGRSARASPPSPS
PSVLSTPPRLPQHSPLPSPPTPLAALGALGGPFSPLGLAFPPAVRGNTTCTICYKTFACN
SALEIHYRSHTKERPFKCTVCDRGFSTKSSGGGCRCRRPRAPRPPHATALDLWNAFVYPG
NMKQHMLTHKIRDMPPGFEKSVSGPTGPPSEEGREPSPDRRSSPEKIDLKRSPPALPPVT
IAHPPPIDMPPLPKRPTVPIVPNHPPPSQSSKHLCGVCRKNFSSSSALQIHMRTHTGDKP
FRCAVCQKAFTTKGNLKVHMGTHMWSGGASRRGRRMSLELPPRPLHEPHDLLRRPDLFYP
YLPAPFLNGMQQKLNEISVIQQQNAGQNGVAGKFPGLLGFGAFGAGRPGAASPLERPPSL
DGDERQAAMRELAERGRELAERSRQMREENEREHYRSAGGHPAHAPPTAQASPPAPHNLP
HPLASIPPPARTEGLTV