DPGLEAN18938 in OGS1.0

New model in OGS2.0  DPOGS214612
Genomic Position  scaffold1724:- 7112-15247
CDS Length  1458
Paired RNAseq reads  5191
Single RNAseq reads  11793
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA001765 (1e-111)
Best Drosophila hit  cap-n-collar, isoform C (2e-60)
Best Human hit  nuclear factor erythroid 2-related factor 1 (4e-35)
Best NR hit (blastp)  cap-n-collar [Tribolium castaneum] (8e-92)
Best NR hit (blastx)  conserved hypothetical protein [Pediculus humanus corporis] (3e-78)
GeneOntology terms
  
GO:0005634 nucleus
GO:0003702 RNA polymerase II transcription factor activity
GO:0003677 DNA binding
GO:0016563 transcription activator activity
GO:0007350 blastoderm segmentation
GO:0007310 oocyte dorsal/ventral axis specification
GO:0008359 regulation of bicoid mRNA localization
GO:0042070 maintenance of oocyte nucleus location involved in oocyte dorsal/ventral axis specification
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0008103 oocyte microtubule cytoskeleton polarization
GO:0007317 regulation of pole plasm oskar mRNA localization
GO:0046982 protein heterodimerization activity
GO:0045450 bicoid mRNA localization
GO:0045451 pole plasm oskar mRNA localization
GO:0046983 protein dimerization activity
GO:0006355 regulation of transcription, DNA-dependent
GO:0043565 sequence-specific DNA binding
InterPro families
  
IPR004827 Basic-leucine zipper (bZIP) transcription factor
IPR008917 Eukaryotic transcription factor, Skn-1-like, DNA-binding
Orthology group  MCL15023

Nucleotide sequence:

ATGTCCAGCAGCGCCATCTATCTTGTTCCACTGGATATCATCGCCCACGGAGGGGCGGGT
TACGCTCCAAACTACCACGCCCCCATACCACCTATACCGGAAAAACATCACGAGGCTTAC
GGTGCCCCAGCGCCGCTGGACGGAGCGTATAAGGTAGAAGCAGCCCATCACCCGCAGCAG
CACGATGGACTGTACTATCAGACACCTACGGAACCACAGCAAGACGGCTTCCTCCAGTCC
ATCCTGAACGACGAGGATCTCCAGCTGATGGATATGGCGATGAATGAGGGCATGTACACG
ATGCGGATGTTGGACGGCGCGCCCACGGTGCACCAGACACACGCGCACATGCCCGTGGCG
GCCGAACGTGATTCGGCATCAGACAGCGCTGTGTCGTCCATGGGCTCGGAGCGCGTACCC
TCGCTCTCTGACGGCGAGTGGTGCGACGGGAGTGACTCCGCCCAGGAGTTCCACAGTTCA
AAATTCCGACCGTACGAGGCTGCTTACGGCAGAGAGAGATCCCACGCGCCACAGAAGAAA
CATCACATGTTCGGGAAGCGATCCTTCCAGGAACAGCCGTCCCAAGAAACCAGACCGGTT
GTGAAATACGAATGCGAACAAACATACCATGAGATGCATATGCATGCAGATTACACCCCT
CGCCAGCACATACCGCCCCAGCTAGGTGTGCAGCCGACGTTGGACATCAATTCACCACAC
TCAAGCCACGCATTGCAACATACAACGCTGCCGAGCCCGAACCCGCCGCGATTCGGGTTC
AGTTCGGGAGATAGAGTGAGACACAACCACACATACAGCGCAGCCCTGCCGCCCACAGAG
GAGAGACTACCCACGAGAGATAAGAGAGGATTACATATCAGCACATATATATCTGTTTTT
TGTCAACGTGTGATCCGCCGTCTAACCGACGGCAGTACTTCCGACAGCGGCAGCGGACAT
CTCAGTAGAGACGAGAAGAGAGCGAAGGCTTTAGGTATACCGCTGGAGGTCCAGGACATC
ATCAACCTGCCCATGGACGAGTTCAACGAGCGGCTCTCCAAACACGACTTGAGCGAGGCG
CAGCTGTCGCTGATCCGAGACATACGGAGGCGGGGCAAGAACAAGGTGGCAGCACAGAAC
TGCCGGAAACGGAAGCTGGACCAGATCACCTCCCTGGCAGACGAGGTCCGCACGGTCCGC
GACAGGAAGGCCCGCACGCAGAGAGACAGACACAACTTGCTGGCCGACAGGCAGAAGCTC
AAGGAGAGGTTCGCCGCGCTCTACAGACACGTGTTCCAGCACCTCCGCGACCCTGAAGGA
CGACCCTTGTCCTCCAGCCAATACTCCCTACAACAAGCGGCTGACGGCAGCGTAGTTCTC
GTGCCCAGGATGGGAGGAGACCACTCCATGAACCGGACGGAGGAGGACCTCGAGCGGAAG
AACAACTACGAGCACTAG

Protein sequence:

MSSSAIYLVPLDIIAHGGAGYAPNYHAPIPPIPEKHHEAYGAPAPLDGAYKVEAAHHPQQ
HDGLYYQTPTEPQQDGFLQSILNDEDLQLMDMAMNEGMYTMRMLDGAPTVHQTHAHMPVA
AERDSASDSAVSSMGSERVPSLSDGEWCDGSDSAQEFHSSKFRPYEAAYGRERSHAPQKK
HHMFGKRSFQEQPSQETRPVVKYECEQTYHEMHMHADYTPRQHIPPQLGVQPTLDINSPH
SSHALQHTTLPSPNPPRFGFSSGDRVRHNHTYSAALPPTEERLPTRDKRGLHISTYISVF
CQRVIRRLTDGSTSDSGSGHLSRDEKRAKALGIPLEVQDIINLPMDEFNERLSKHDLSEA
QLSLIRDIRRRGKNKVAAQNCRKRKLDQITSLADEVRTVRDRKARTQRDRHNLLADRQKL
KERFAALYRHVFQHLRDPEGRPLSSSQYSLQQAADGSVVLVPRMGGDHSMNRTEEDLERK
NNYEH
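
The record's nucleotide and protein sequences can be cross-checked against each other and against the stated CDS length. The following is a minimal sketch, assuming the standard genetic code applies to this gene model; the names `translate` and `CODON_TABLE` are illustrative and not part of the database. It translates the first and last lines of the nucleotide sequence and confirms that a 1458 nt CDS corresponds to 485 residues plus one stop codon.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGTCCAGCAGCGCCATCTATCTTGTTCCACTGGATATCATCGCCCACGGAGGGGCGGGT"
last_line = "AACAACTACGAGCACTAG"

CDS_LENGTH = 1458      # "CDS Length" field of this record
PROTEIN_LENGTH = 485   # residues in the protein sequence above

print(translate(first_line))                    # MSSSAIYLVPLDIIAHGGAG
print(translate(last_line))                     # NNYEH*
print(CDS_LENGTH == 3 * (PROTEIN_LENGTH + 1))   # True
```

The translated fragments match the start (MSSSAIYLVPLDIIAHGGAG) and end (NNYEH) of the protein sequence listed in the record.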