DPGLEAN09888 in OGS1.0

New model in OGS2.0DPOGS209613 
Genomic Positionscaffold44:+ 112381-187808
See gene structure
CDS Length3270
Paired RNAseq reads  802
Single RNAseq reads  1926
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006678 (0.0)
Best Drosophila hit  tiptop (6e-155)
Best Human hitteashirt homolog 2 isoform 1 (3e-07)
Best NR hit (blastp)  teashirt-like protein [Tribolium castaneum] (0.0)
Best NR hit (blastx)  teashirt-like protein [Tribolium castaneum] (0.0)
GeneOntology terms







  
GO:0005634 nucleus
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0045449 regulation of transcription
GO:0008270 zinc ion binding
GO:0003677 DNA binding
GO:0006355 regulation of transcription, DNA-dependent
GO:0048730 epidermis morphogenesis
GO:0007380 specification of segmental identity, head
GO:0048749 compound eye development
InterPro families
  
IPR007087 Zinc finger, C2H2-type
IPR015880 Zinc finger, C2H2-like
Orthology groupMCL15923

Nucleotide sequence:

ATGAGGGAGGCGGCCGATATCGTCGTCTCACATAAATCTGTCACCGGGGCTGCGCCTGCG
CCGGCCGAGGAATCAACAAGTCCAGAAAGCGGAGTGAAGGAATTAGGAGGACGCGAACGG
GAGGCGCGGGGAGAGGCAGGGGAGTCGCGCTCTCCATCGCCAGCATCCCGTGCCTCCCCC
ACACCCGAAGATCGGGATATAGAGCACAGCATACCAGCTACCCTCATACAGGATCCCAAT
GCTGAAAGGGAGAGTCCAAGATGTTTATCGCGGGAGTCGTCCGGCGCGCCGCGATGTCCC
TCTAACGACTCGGTGTATTCGGGTCGGAGCGCGCCCAGCCTGCCCTTGCCAGCCGCCCTA
TCAGCAGCGTTACCGGCAGCCCTGCCCGCAGCGCTGATGCCACCCCACTCCGCTGCCGTC
GCAGCCTATCTCGGAGCAGCAGCTGCGGCAGCCCAGCAACGATTACTCATGTCCTACCAG
GAAGACATTACGGACGCTGAAAGAGCGGATGCCGTATTAGACTTCAGCACTAAACGAAGT
GAATCCCCGGTCGACGATGAGGAGGATGACGCCGTTAATCTCACAAAGAATGAAAATGGT
CCATTAGACTTATCTGTAGGTACTAGAAAAAGGGGGCCAGAGGATTCTCCATCTCCCGTC
CCTAGTAGAAAAAGTTCTCGTACTTCCGACTTCAAAGCTTTATCGACACCTTGGTCTACA
CCGGTCGCGCCACATCTTCCTTATTTTGCTGCCGCCGTTGCTGCTGCAAGCTTATCACCA
AAAGGTGGAGTTCCAGCTGATTGGAATGGTAAACTTAAACATGGAGCGCCTACACCAAGC
GATGCTACTAAAGCACTGGAAAAAATGAGCGAATTGAGTAGATTAGGTGGAGAAGAACTT
TTTAGATCTGTTCAAAGTGCAGCTTTGGGTGCAGGTCTTACACCAAATGCAGCTGCACGA
CATTCAGCTTGGCAATCTCATTGGCTGAATAAAGGAGCAGACCAGACAAAAGATGTCCTA
AAATGTGTATGGTGCAAAAAGAGCTTCAATTCACTTGCTGATCTAACTGTTCACATGAAG
GAAGCTAAGCATTGTGGAGTTAACGTTCCTGTACCCCCTTCAACTGGAGCTCCGATTCCG
CCTTCACTACAACCACCATCAAGTTCGCCTTCCACGCCATCCCATAATTCGTCGTCCTCG
AGTGGGTCGTCAAAACCAAATCATAATGATTTAAATATGCTTATAAAAGAAAACATGCCG
ATTCCTAGAAAATTAGTACGAGGTCAAGATGTTTGGCTAGGAAAGGGTGCAGAGCAAACT
AGGCAAATTCTAAAATGCATGTGGTGTGCAGAAAGCTTTCGTTCCTTAGCTGAAATGACG
AGTCATATGCAACGCACTCAGCATTATACTAATATTATATCACAGGAACAAATAATTTCC
TGGAAATCCTCAGATGAAGCTAAGGGATCTAACTCTAGCACCCCGGGTACAAATAACGCT
GTTCCTCCAACAACAGGAACAAGTAGCCATGTTAGCGCGGTATTAACTTGTAAGGTTTGC
GACCAAGCGTTTAGTTCCTTAAAAGAGTTAAGCAATCATATGGTAAAGAATTCTCATTAT
AAAGAACATATTATGCGATCTATTACGGAGAGTGGTGGTAGAAGACGCCAGACACGCGAA
AAACGAAAGAAATCGTTACCAGTAAGAAAATTACTTGAACTTGAACGAGCCCAACATGAG
TTCAAAAATGGCGAAGGTAACGGTGTTCCCATGGGAAAACCGATCAGGGATTTCGGTGCT
GGTAGCCGTATTACTTGCGAAAAATGTGGAGACAAAATAGAGACTGCTGTATTTGTAGAG
CATATTCGTCAATGCATTGGATCACCAATGTCAAACACCCAAAGGAATTTTCTAAAAAGT
GCTCTTCTTTCTAATAATATTATTCCACCTGATGTACCTGGCCATATCACCCCCACTAGT
CGCGATGGTCGAAAAAGCATTAACGAGGAAATTCCATCTCCTGGTTCAGCTCATCACCGT
TCCCCTTCTTCGGTTAATGATTCTTCTCCCAGTTCCAAAGATCATAATGCCAGCAACGAC
AAAAGTTCATCTCCATCGGTGCTTAATGCTATAGAACAATTAATAGAAAAAAGCTTTGAT
ACACGCTCCCGACATTCAGTACCAGGTATACCAGGTGGAGCTTCACATGCTCCAATCGGG
TCAAGTATCCTAAAAAGGTTAGGAATAGATGAAAGCGTAGATTATACCAAACCGTTAGTA
GATCCTCAGACGATGAATATGCTTAGAAGTTACCACCATCAACAGGGATACGGTCGCCGT
GAACGCAGCGGTAGTGAGTCTAGTTCTATGTCAGAAAGGGGTGGTAGTAGGGTTGAATCT
CTAACCCCAGACAGGAAGCTGGATTCCTACCACATGACGCCTCGTACTACTCCTGATACT
CGTGGCTCTCAAACTCCGGCATCTGAGGAACGGCTCACTGAGGTTAGGATAAAAAAAGAA
GTCACAGATGAAGAAGAACGCGAAAACGGTGTAGACTTGAGTAGCCAACCAGTTAGAGTA
AAAACTGAAGTTGAGGATGAGGAAGAGCAACAGAGACCAAGCAGTGCAGTTGACGAGGAC
GTAAAGCCAACTGTTCCAAAACGTGAAAGTGAGGGCCCAAGTCCAGCTGCTAGTCCTCGC
AGTCCGGCCAGTGACCGATCAGCGCCAACGCCCGGTACTGACAGGAAACCGGCTTCCAGC
CTAGGAGCTCTCTCTTCTATGTTTGATAATCTAACCGGCGGAGGTTCCTCAAACGAGCCA
AGTTCTTCTCGTCGCGGAGGCAGTCACCCTTTAGCAGCTTTACAAAAACTTTGCGATAAA
ACGGAAACGAATTCATCTCGTGCTCCTGCCCCAGCCCCATCTCCCGCTGGTCCACCTAGC
ATCCTTACTTTTAGCTGGGCCTGCAACGATGCAGTAGTGACTGACTCTATAATGAAATGC
GCCTTATGTGATACACCGTTTATATCAAAGGGCGCTTATCGGCATCATTTATCGAAGATG
CATTTCGTTAAAGACGGCGCCCTGCCGGAGCCTGTGCCAGTGAAGGCTCCACCGGCGGCA
CCATCCCCAGGACCTCACAAGAGCAGCGGATCAAACGCGGCCTCACCTCAAGATCCGAGA
AGTCCGTCTCAATCTTTCGATGAGAGTCCTCACTCTAAATTCCTCAAGTATACGGAACTG
GCTAAACAATTATCCAGCAAGTACGTCTAA

Protein sequence:

MREAADIVVSHKSVTGAAPAPAEESTSPESGVKELGGREREARGEAGESRSPSPASRASP
TPEDRDIEHSIPATLIQDPNAERESPRCLSRESSGAPRCPSNDSVYSGRSAPSLPLPAAL
SAALPAALPAALMPPHSAAVAAYLGAAAAAAQQRLLMSYQEDITDAERADAVLDFSTKRS
ESPVDDEEDDAVNLTKNENGPLDLSVGTRKRGPEDSPSPVPSRKSSRTSDFKALSTPWST
PVAPHLPYFAAAVAAASLSPKGGVPADWNGKLKHGAPTPSDATKALEKMSELSRLGGEEL
FRSVQSAALGAGLTPNAAARHSAWQSHWLNKGADQTKDVLKCVWCKKSFNSLADLTVHMK
EAKHCGVNVPVPPSTGAPIPPSLQPPSSSPSTPSHNSSSSSGSSKPNHNDLNMLIKENMP
IPRKLVRGQDVWLGKGAEQTRQILKCMWCAESFRSLAEMTSHMQRTQHYTNIISQEQIIS
WKSSDEAKGSNSSTPGTNNAVPPTTGTSSHVSAVLTCKVCDQAFSSLKELSNHMVKNSHY
KEHIMRSITESGGRRRQTREKRKKSLPVRKLLELERAQHEFKNGEGNGVPMGKPIRDFGA
GSRITCEKCGDKIETAVFVEHIRQCIGSPMSNTQRNFLKSALLSNNIIPPDVPGHITPTS
RDGRKSINEEIPSPGSAHHRSPSSVNDSSPSSKDHNASNDKSSSPSVLNAIEQLIEKSFD
TRSRHSVPGIPGGASHAPIGSSILKRLGIDESVDYTKPLVDPQTMNMLRSYHHQQGYGRR
ERSGSESSSMSERGGSRVESLTPDRKLDSYHMTPRTTPDTRGSQTPASEERLTEVRIKKE
VTDEEERENGVDLSSQPVRVKTEVEDEEEQQRPSSAVDEDVKPTVPKRESEGPSPAASPR
SPASDRSAPTPGTDRKPASSLGALSSMFDNLTGGGSSNEPSSSRRGGSHPLAALQKLCDK
TETNSSRAPAPAPSPAGPPSILTFSWACNDAVVTDSIMKCALCDTPFISKGAYRHHLSKM
HFVKDGALPEPVPVKAPPAAPSPGPHKSSGSNAASPQDPRSPSQSFDESPHSKFLKYTEL
AKQLSSKYV