New model in OGS2.0 | DPOGS209613  |
---|---|
Genomic Position | scaffold44:+ 112381-187808 |
CDS Length | 3270 |
Paired RNAseq reads   | 802 |
Single RNAseq reads   | 1926 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA006678 (0.0) |
Best Drosophila hit   | tiptop (6e-155) |
Best Human hit | teashirt homolog 2 isoform 1 (3e-07) |
Best NR hit (blastp)   | teashirt-like protein [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | teashirt-like protein [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0005634 nucleus; GO:0003700 sequence-specific DNA binding transcription factor activity; GO:0045449 regulation of transcription; GO:0008270 zinc ion binding; GO:0003677 DNA binding; GO:0006355 regulation of transcription, DNA-dependent; GO:0048730 epidermis morphogenesis; GO:0007380 specification of segmental identity, head; GO:0048749 compound eye development |
InterPro families    | IPR007087 Zinc finger, C2H2-type; IPR015880 Zinc finger, C2H2-like |
Orthology group | MCL15923 |
Nucleotide sequence:
ATGAGGGAGGCGGCCGATATCGTCGTCTCACATAAATCTGTCACCGGGGCTGCGCCTGCG
CCGGCCGAGGAATCAACAAGTCCAGAAAGCGGAGTGAAGGAATTAGGAGGACGCGAACGG
GAGGCGCGGGGAGAGGCAGGGGAGTCGCGCTCTCCATCGCCAGCATCCCGTGCCTCCCCC
ACACCCGAAGATCGGGATATAGAGCACAGCATACCAGCTACCCTCATACAGGATCCCAAT
GCTGAAAGGGAGAGTCCAAGATGTTTATCGCGGGAGTCGTCCGGCGCGCCGCGATGTCCC
TCTAACGACTCGGTGTATTCGGGTCGGAGCGCGCCCAGCCTGCCCTTGCCAGCCGCCCTA
TCAGCAGCGTTACCGGCAGCCCTGCCCGCAGCGCTGATGCCACCCCACTCCGCTGCCGTC
GCAGCCTATCTCGGAGCAGCAGCTGCGGCAGCCCAGCAACGATTACTCATGTCCTACCAG
GAAGACATTACGGACGCTGAAAGAGCGGATGCCGTATTAGACTTCAGCACTAAACGAAGT
GAATCCCCGGTCGACGATGAGGAGGATGACGCCGTTAATCTCACAAAGAATGAAAATGGT
CCATTAGACTTATCTGTAGGTACTAGAAAAAGGGGGCCAGAGGATTCTCCATCTCCCGTC
CCTAGTAGAAAAAGTTCTCGTACTTCCGACTTCAAAGCTTTATCGACACCTTGGTCTACA
CCGGTCGCGCCACATCTTCCTTATTTTGCTGCCGCCGTTGCTGCTGCAAGCTTATCACCA
AAAGGTGGAGTTCCAGCTGATTGGAATGGTAAACTTAAACATGGAGCGCCTACACCAAGC
GATGCTACTAAAGCACTGGAAAAAATGAGCGAATTGAGTAGATTAGGTGGAGAAGAACTT
TTTAGATCTGTTCAAAGTGCAGCTTTGGGTGCAGGTCTTACACCAAATGCAGCTGCACGA
CATTCAGCTTGGCAATCTCATTGGCTGAATAAAGGAGCAGACCAGACAAAAGATGTCCTA
AAATGTGTATGGTGCAAAAAGAGCTTCAATTCACTTGCTGATCTAACTGTTCACATGAAG
GAAGCTAAGCATTGTGGAGTTAACGTTCCTGTACCCCCTTCAACTGGAGCTCCGATTCCG
CCTTCACTACAACCACCATCAAGTTCGCCTTCCACGCCATCCCATAATTCGTCGTCCTCG
AGTGGGTCGTCAAAACCAAATCATAATGATTTAAATATGCTTATAAAAGAAAACATGCCG
ATTCCTAGAAAATTAGTACGAGGTCAAGATGTTTGGCTAGGAAAGGGTGCAGAGCAAACT
AGGCAAATTCTAAAATGCATGTGGTGTGCAGAAAGCTTTCGTTCCTTAGCTGAAATGACG
AGTCATATGCAACGCACTCAGCATTATACTAATATTATATCACAGGAACAAATAATTTCC
TGGAAATCCTCAGATGAAGCTAAGGGATCTAACTCTAGCACCCCGGGTACAAATAACGCT
GTTCCTCCAACAACAGGAACAAGTAGCCATGTTAGCGCGGTATTAACTTGTAAGGTTTGC
GACCAAGCGTTTAGTTCCTTAAAAGAGTTAAGCAATCATATGGTAAAGAATTCTCATTAT
AAAGAACATATTATGCGATCTATTACGGAGAGTGGTGGTAGAAGACGCCAGACACGCGAA
AAACGAAAGAAATCGTTACCAGTAAGAAAATTACTTGAACTTGAACGAGCCCAACATGAG
TTCAAAAATGGCGAAGGTAACGGTGTTCCCATGGGAAAACCGATCAGGGATTTCGGTGCT
GGTAGCCGTATTACTTGCGAAAAATGTGGAGACAAAATAGAGACTGCTGTATTTGTAGAG
CATATTCGTCAATGCATTGGATCACCAATGTCAAACACCCAAAGGAATTTTCTAAAAAGT
GCTCTTCTTTCTAATAATATTATTCCACCTGATGTACCTGGCCATATCACCCCCACTAGT
CGCGATGGTCGAAAAAGCATTAACGAGGAAATTCCATCTCCTGGTTCAGCTCATCACCGT
TCCCCTTCTTCGGTTAATGATTCTTCTCCCAGTTCCAAAGATCATAATGCCAGCAACGAC
AAAAGTTCATCTCCATCGGTGCTTAATGCTATAGAACAATTAATAGAAAAAAGCTTTGAT
ACACGCTCCCGACATTCAGTACCAGGTATACCAGGTGGAGCTTCACATGCTCCAATCGGG
TCAAGTATCCTAAAAAGGTTAGGAATAGATGAAAGCGTAGATTATACCAAACCGTTAGTA
GATCCTCAGACGATGAATATGCTTAGAAGTTACCACCATCAACAGGGATACGGTCGCCGT
GAACGCAGCGGTAGTGAGTCTAGTTCTATGTCAGAAAGGGGTGGTAGTAGGGTTGAATCT
CTAACCCCAGACAGGAAGCTGGATTCCTACCACATGACGCCTCGTACTACTCCTGATACT
CGTGGCTCTCAAACTCCGGCATCTGAGGAACGGCTCACTGAGGTTAGGATAAAAAAAGAA
GTCACAGATGAAGAAGAACGCGAAAACGGTGTAGACTTGAGTAGCCAACCAGTTAGAGTA
AAAACTGAAGTTGAGGATGAGGAAGAGCAACAGAGACCAAGCAGTGCAGTTGACGAGGAC
GTAAAGCCAACTGTTCCAAAACGTGAAAGTGAGGGCCCAAGTCCAGCTGCTAGTCCTCGC
AGTCCGGCCAGTGACCGATCAGCGCCAACGCCCGGTACTGACAGGAAACCGGCTTCCAGC
CTAGGAGCTCTCTCTTCTATGTTTGATAATCTAACCGGCGGAGGTTCCTCAAACGAGCCA
AGTTCTTCTCGTCGCGGAGGCAGTCACCCTTTAGCAGCTTTACAAAAACTTTGCGATAAA
ACGGAAACGAATTCATCTCGTGCTCCTGCCCCAGCCCCATCTCCCGCTGGTCCACCTAGC
ATCCTTACTTTTAGCTGGGCCTGCAACGATGCAGTAGTGACTGACTCTATAATGAAATGC
GCCTTATGTGATACACCGTTTATATCAAAGGGCGCTTATCGGCATCATTTATCGAAGATG
CATTTCGTTAAAGACGGCGCCCTGCCGGAGCCTGTGCCAGTGAAGGCTCCACCGGCGGCA
CCATCCCCAGGACCTCACAAGAGCAGCGGATCAAACGCGGCCTCACCTCAAGATCCGAGA
AGTCCGTCTCAATCTTTCGATGAGAGTCCTCACTCTAAATTCCTCAAGTATACGGAACTG
GCTAAACAATTATCCAGCAAGTACGTCTAA
Protein sequence:
MREAADIVVSHKSVTGAAPAPAEESTSPESGVKELGGREREARGEAGESRSPSPASRASP
TPEDRDIEHSIPATLIQDPNAERESPRCLSRESSGAPRCPSNDSVYSGRSAPSLPLPAAL
SAALPAALPAALMPPHSAAVAAYLGAAAAAAQQRLLMSYQEDITDAERADAVLDFSTKRS
ESPVDDEEDDAVNLTKNENGPLDLSVGTRKRGPEDSPSPVPSRKSSRTSDFKALSTPWST
PVAPHLPYFAAAVAAASLSPKGGVPADWNGKLKHGAPTPSDATKALEKMSELSRLGGEEL
FRSVQSAALGAGLTPNAAARHSAWQSHWLNKGADQTKDVLKCVWCKKSFNSLADLTVHMK
EAKHCGVNVPVPPSTGAPIPPSLQPPSSSPSTPSHNSSSSSGSSKPNHNDLNMLIKENMP
IPRKLVRGQDVWLGKGAEQTRQILKCMWCAESFRSLAEMTSHMQRTQHYTNIISQEQIIS
WKSSDEAKGSNSSTPGTNNAVPPTTGTSSHVSAVLTCKVCDQAFSSLKELSNHMVKNSHY
KEHIMRSITESGGRRRQTREKRKKSLPVRKLLELERAQHEFKNGEGNGVPMGKPIRDFGA
GSRITCEKCGDKIETAVFVEHIRQCIGSPMSNTQRNFLKSALLSNNIIPPDVPGHITPTS
RDGRKSINEEIPSPGSAHHRSPSSVNDSSPSSKDHNASNDKSSSPSVLNAIEQLIEKSFD
TRSRHSVPGIPGGASHAPIGSSILKRLGIDESVDYTKPLVDPQTMNMLRSYHHQQGYGRR
ERSGSESSSMSERGGSRVESLTPDRKLDSYHMTPRTTPDTRGSQTPASEERLTEVRIKKE
VTDEEERENGVDLSSQPVRVKTEVEDEEEQQRPSSAVDEDVKPTVPKRESEGPSPAASPR
SPASDRSAPTPGTDRKPASSLGALSSMFDNLTGGGSSNEPSSSRRGGSHPLAALQKLCDK
TETNSSRAPAPAPSPAGPPSILTFSWACNDAVVTDSIMKCALCDTPFISKGAYRHHLSKM
HFVKDGALPEPVPVKAPPAAPSPGPHKSSGSNAASPQDPRSPSQSFDESPHSKFLKYTEL
AKQLSSKYV
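
The CDS length, nucleotide sequence, and protein sequence in this record can be cross-checked against one another. The following is a minimal sketch, not part of the original annotation pipeline. It assumes the standard nuclear genetic code (NCBI translation table 1) and that the genomic coordinates are 1-based and inclusive. The function names `translate`, `expected_protein_length`, and `genomic_span` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate line breaks in the listing
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def expected_protein_length(cds_length):
    """Residue count for a CDS that ends with one terminal stop codon."""
    return cds_length // 3 - 1


def genomic_span(start, end):
    """Length of a locus, assuming 1-based inclusive coordinates."""
    return end - start + 1


# First 18 bases and last 30 bases of the nucleotide sequence above.
print(translate("ATGAGGGAGGCGGCCGAT"))              # MREAAD
print(translate("GCTAAACAATTATCCAGCAAGTACGTCTAA"))  # AKQLSSKYV
print(expected_protein_length(3270))                # 1089
print(genomic_span(112381, 187808))                 # 75428
```

Under these assumptions, a CDS length of 3270 corresponds to 1089 residues plus a terminal stop codon, which agrees with the protein listing above. The locus spans 75428 bp on scaffold44, so most of the gene model consists of intronic sequence.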