DPGLEAN22002 in OGS1.0

New model in OGS2.0  DPOGS205808
Genomic Position  scaffold195:+ 31-3500
CDS Length  924
Paired RNAseq reads  0
Single RNAseq reads  2
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA010370 (4e-55)
Best Drosophila hit  dissatisfaction (1e-48)
Best Human hit  photoreceptor-specific nuclear receptor isoform b (3e-20)
Best NR hit (blastp)  Orphan nuclear receptor NR6A1, putative [Pediculus humanus corporis] (4e-80)
Best NR hit (blastx)  PREDICTED: similar to Dissatisfaction (Dsf) [Tribolium castaneum] (5e-66)
GeneOntology terms
GO:0004879 ligand-dependent nuclear receptor activity
GO:0005634 nucleus
GO:0018993 somatic sex determination
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0007530 sex determination
GO:0007619 courtship behavior
GO:0007617 mating behavior
GO:0018991 oviposition
GO:0003707 steroid hormone receptor activity
GO:0007620 copulation
GO:0045924 regulation of female receptivity
GO:0008049 male courtship behavior
GO:0048047 mating behavior, sex discrimination
GO:0043565 sequence-specific DNA binding
GO:0008270 zinc ion binding
GO:0006355 regulation of transcription, DNA-dependent
InterPro families
IPR000536 Nuclear hormone receptor, ligand-binding, core
IPR008946 Nuclear hormone receptor, ligand-binding
IPR001723 Steroid hormone receptor
IPR003068 Transcription factor COUP
Orthology group  MCL15908

Nucleotide sequence:

ATGAGCAAATATTGGTATCTCCAGTTTAGTATAAATGGCATCACTGACTCGGCGCCTCTA
AGTACTGCATCATCAGCTGGATCCAGCGGTGCTGGTGGTGCTGGTGCAGTGAGTCCTCTG
CCTCTCATCACGGAACCGTTCATAGCACCTCCTCCTCCTGGGCTTTTGCATATGCTCATG
TCCAGTGATAAATGTCAAGAATTAATATGGAGTGCGAAACAATTGCAACTACAAGGCGAC
CCTTCTCTGTTACGACCTCCGCCGAATGCTTTCGGAGCACCCCTAGCGCCTACTTGGGAG
TTGTTACAGGAAACAAGCGCGCGTCTTCTATTCATGGCAGTGCGGTGGGTGAGATGTTTG
GCTCCATTCCAAGCCTTGGCGGCATCAGATCAGGCGGTGTTGCTGCGTGCTGCTTGGAAG
GATCTGTTCGTGCTGCATCTCGCACAGTGGTCCGCACCATGGGACCTCGCGCCCCTACTG
GCGGCCCCAGCTGCCAGAGCTAGACTGCCCTCTGACCCCTTGGTCGATCTAGAAATTAAC
ACTCTACAGGAAATTCTTTGTAGATTCCGACAAATTGCTCCCGACGGCAGTGAGTGCGGC
TGTATGAAAGCTATTGTTCTTTTTTCACCGGACACGCCCGGTCTAAGCGAAACACAGCCG
GTGGAGATGCTCCAAGATCAGGCTCAGTGTATTCTGGCCGACTACGTAAGGACGAGATAC
ACTCGTCAGCCTACCAGATTCGGCCGACTCCTTCTTCTACTGCCATCTCTACGCGCTGTC
AGAGCTCGTTCTATAGAGTCACTTCTGTTTCGGGAGACGGTTGGCGACGTGTCCGTGGCC
ACTCTGCTTCATGATATGTACCGCATGCAGCCAGCGCCCACGCCTGTACCAGCCTTCCAA
CCACCAAACTGTTCTTCGCCTTAA

Protein sequence:

MSKYWYLQFSINGITDSAPLSTASSAGSSGAGGAGAVSPLPLITEPFIAPPPPGLLHMLM
SSDKCQELIWSAKQLQLQGDPSLLRPPPNAFGAPLAPTWELLQETSARLLFMAVRWVRCL
APFQALAASDQAVLLRAAWKDLFVLHLAQWSAPWDLAPLLAAPAARARLPSDPLVDLEIN
TLQEILCRFRQIAPDGSECGCMKAIVLFSPDTPGLSETQPVEMLQDQAQCILADYVRTRY
TRQPTRFGRLLLLLPSLRAVRARSIESLLFRETVGDVSVATLLHDMYRMQPAPTPVPAFQ
PPNCSSP
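
The nucleotide and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, not part of the original record. It assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene and that the CDS starts in frame at its first base. The names `translate`, `first_row`, and `last_row` are illustrative. Only the first and last 60-column rows of the nucleotide sequence are translated here; because each full row holds 20 complete codons, every row stays in frame.

```python
# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
# Assumption: this table applies to the CDS in this record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the codon -> amino acid lookup from the two strings above.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last rows of the nucleotide sequence in this record.
first_row = "ATGAGCAAATATTGGTATCTCCAGTTTAGTATAAATGGCATCACTGACTCGGCGCCTCTA"
last_row = "CCACCAAACTGTTCTTCGCCTTAA"

print(translate(first_row))  # MSKYWYLQFSINGITDSAPL
print(translate(last_row))   # PPNCSSP*
```

The two outputs match the start and end of the protein sequence, with the trailing `*` marking the TAA stop codon. The listed CDS length of 924 nt corresponds to 308 codons, that is 307 residues plus the stop, which agrees with the 307-residue protein shown.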