DPGLEAN20818 in OGS1.0

New model in OGS2.0  DPOGS201083
Genomic Position  scaffold358:+ 40957-58692
CDS Length  1194
Paired RNAseq reads  725
Single RNAseq reads  2087
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA001391 (4e-116)
Best Drosophila hit  seven up, isoform A (3e-143)
Best Human hit  COUP transcription factor 1 (3e-125)
Best NR hit (blastp)  seven-up alpha [Bombyx mori] (1e-152)
Best NR hit (blastx)  seven-up alpha [Bombyx mori] (3e-141)
GeneOntology terms
GO:0007503 fat body development
GO:0004879 ligand-dependent nuclear receptor activity
GO:0005634 nucleus
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0045449 regulation of transcription
GO:0004872 receptor activity
GO:0048749 compound eye development
GO:0001752 compound eye photoreceptor fate commitment
GO:0007465 R7 cell fate commitment
GO:0007464 R3/R4 cell fate commitment
GO:0007462 R1/R6 cell fate commitment
GO:0007510 cardioblast cell fate determination
GO:0005737 cytoplasm
GO:0007507 heart development
GO:0001700 embryonic development via the syncytial blastoderm
GO:0007417 central nervous system development
GO:0007419 ventral cord development
GO:0042331 phototaxis
GO:0007270 nerve-nerve synaptic transmission
GO:0005515 protein binding
GO:0003707 steroid hormone receptor activity
GO:0008270 zinc ion binding
GO:0043565 sequence-specific DNA binding
GO:0006355 regulation of transcription, DNA-dependent
InterPro families
IPR001628 Zinc finger, nuclear hormone receptor-type
IPR013629 Zinc finger, C4-type, C-terminal domain
IPR000536 Nuclear hormone receptor, ligand-binding, core
IPR008946 Nuclear hormone receptor, ligand-binding
IPR013088 Zinc finger, NHR/GATA-type
IPR003068 Transcription factor COUP
IPR001723 Steroid hormone receptor
Orthology group  MCL12432

Nucleotide sequence:

ACGCCCGCCAGCCAGGCCGCCTCCACGCAGAGCGGCTCCTCCGCCAACGACAAGGGACAG
AACGTGGAGTGCGTGGTCTGCGGAGACAAGTCCTCGGGGAAACACTACGGCCAGTTCACG
TGCGAGGGATGCAAGTCGTTCTTCAAACGTTCCGTGAGACGGAACTTGACCTACTCCTGC
CGCGGTAACAGGAACTGCCCTATCGACCAGCACCACAGAAACCAGTGCCAGTACTGCCGG
CTAAGGAAATGTCTCAAGATGGGGATGAGGAGAGAAGCTGTTCAGAGAGGTCGAGTGCCT
CCGTCTCAGTCCGCCGGCCTGGCGCTGCCGGGGCAATTCGCCTTGACCAACGGTGATCCC
GCCGCGGGCTTGAACAGTCACCCTTACCTCTCCTCGTACATCTCCCTGCTCCTTCGAGCG
GAGCCCTACCCCACGCAGCCGGCCTCGAGGTACGGCCAATGCGTGCAGCCCACCAACGTC
ATGGGTATAGACAATATATGCGAACTAGCCGCCAGGTTGCTCTTCTCCGCCGTCGAGTGG
GCGAGGAACATCCCCTTCTTCCCCGAACTGCAGGTCACGGACCAGGTCGCGCTCCTGCGA
CTGGTTTGGTCCGAGCTGTTCGTCCTCAACGCCTCCCAATGCTCGATGCCCCTCCACGTG
GCGCCGCTGTTGGCCGCCGCGGGTCTACACGCGTCACCCATGGCCGCCGACCGCGTGGTG
GCCTTCATGGACCACATACGGATCTTCCAGGAGCAGGTGGAGAAGCTGAAAGCGCTCCAC
GTGGACTCCGCGGAGTACTCCTGTCTGAAGGCCATCGTCCTCTTCACGACAGGTAAAATT
TTGGACAGCTTATTCGGGGAGGCGAGGTTGCTGCTGTACAGAGTCGCCGGCGCGTTCGCT
GCTATCACGAACCACGGGGAGCTCCTGGCGCTGGTCCGCACGCACTTGGACGCGTACGCC
GAGGCGACCAGGGCTCCCCAGCCGCCCGCGCCGCCGCCTCCGTCCGCAGCCTCCTCGGGC
TACTACTCCACGATGGAGACATCGCTCGGCGTCAACTCCTCCCTGTCCTACGGCAGCTTC
CTGTCTCCGTCGCGTGTGCCGCCTCAGTATACGAGCAGTCCGCGTTTGGACGCGGGTACG
TCATCGTTTAAGATATACGAGGGCAGCGGGAGCAGGGTTGACGCCAAGCGATGA

Protein sequence:

TPASQAASTQSGSSANDKGQNVECVVCGDKSSGKHYGQFTCEGCKSFFKRSVRRNLTYSC
RGNRNCPIDQHHRNQCQYCRLRKCLKMGMRREAVQRGRVPPSQSAGLALPGQFALTNGDP
AAGLNSHPYLSSYISLLLRAEPYPTQPASRYGQCVQPTNVMGIDNICELAARLLFSAVEW
ARNIPFFPELQVTDQVALLRLVWSELFVLNASQCSMPLHVAPLLAAAGLHASPMAADRVV
AFMDHIRIFQEQVEKLKALHVDSAEYSCLKAIVLFTTGKILDSLFGEARLLLYRVAGAFA
AITNHGELLALVRTHLDAYAEATRAPQPPAPPPPSAASSGYYSTMETSLGVNSSLSYGSF
LSPSRVPPQYTSSPRLDAGTSSFKIYEGSGSRVDAKR
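
Consistency check (sketch):

The nucleotide and protein sequences above can be checked against each other. The following is a minimal Python sketch that translates a CDS into protein, assuming the standard genetic code (NCBI translation table 1). The function name `translate` and the constant names are hypothetical and not part of this record.

```python
from itertools import product

# Assumption: standard genetic code (NCBI translation table 1).
# Codons are enumerated in T, C, A, G order (TTT, TTC, TTA, TTG, TCT, ...),
# which matches the order of the amino-acid string below.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; stop codons are returned as '*'."""
    # Drop line breaks and spaces so a wrapped sequence block can be pasted in.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 18 nucleotides of the CDS above.
print(translate("ACGCCCGCCAGCCAGGCC"))  # TPASQA
# Last 9 nucleotides of the CDS above, ending in the TGA stop codon.
print(translate("AAGCGATGA"))  # KR*
```

Applied to the full 1194-nt CDS, this should give 398 codons: the 397-residue protein listed above plus the terminal stop codon.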