DPGLEAN21528 in OGS1.0

New model in OGS2.0DPOGS201398 
Genomic Positionscaffold13:+ 554668-557964
See gene structure
CDS Length1173
Paired RNAseq reads  10
Single RNAseq reads  25
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA002000 (5e-136)
Best Drosophila hit  tailless (7e-78)
Best Human hitnuclear receptor subfamily 2 group E member 1 (4e-83)
Best NR hit (blastp)  Orphan nuclear receptor NR2E1, putative [Pediculus humanus corporis] (3e-108)
Best NR hit (blastx)  GI22125 [Drosophila mojavensis] (4e-82)
GeneOntology terms







  
GO:0006355 regulation of transcription, DNA-dependent
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0043565 sequence-specific DNA binding
GO:0008270 zinc ion binding
GO:0003707 steroid hormone receptor activity
GO:0005634 nucleus
GO:0046872 metal ion binding
GO:0005496 steroid binding
GO:0007275 multicellular organismal development
InterPro families



  
IPR001723 Steroid hormone receptor
IPR001628 Zinc finger, nuclear hormone receptor-type
IPR013088 Zinc finger, NHR/GATA-type
IPR008946 Nuclear hormone receptor, ligand-binding
IPR000536 Nuclear hormone receptor, ligand-binding, core
Orthology groupMCL13864

Nucleotide sequence:

ATGGAAATGCAGTCTATGCCAGCATCATCAAGTCGTATTCTATATGATGTCCCTTGTGCC
GTATGTCGAGATCATTCTTCGGGAAAACATTACGGAGTGTTTGCATGCGACGGATGTGCT
GGTTTTTTCAAGAGATCAGTGCGACGAGACCGTAGATATGCTTGCAAAGCTAGAAACTCC
GGAGCTTGCTTGGTCGACAAAGCACATAGAAATCAATGTCGTGCTTGTAGATTAGCCAAA
TGTCTCGATGTCGGTATGAATAAAGATGCTGTTCAGCATGAGAGAGGACCAAGAAATTCT
ACGATTCGACGGCAAATGGCTTTATTTTTAAAAGATCCAGCACTACCCGCTTCCGAAATG
TCTTTAATGCCTCCAGTTTTGGATTTGGCTATACCAAAGCACTCTCTGCTACCACCCCCA
CCACCTTTATCTTTATTTCACAATCCCTATCAATCTTATAGCAGATTTAATTTATTAGCG
TCTCCATTACCGTCTTGTCCACTGAAAGCTCCATCGCCGCCGCCACCCATGACATCAAAT
TTAATAAGCCCCACTGAACCAGAAGCAATTTGTGAAGCAGCCGCCCGATTATTATTTATG
AACGTTAAGTGGGCCAAAAATGTTCCAGCCTTCTCGTCTTTGTCTTTGCAAGATCGATTG
ATATTATTAGAAGAATCATGGCGAGATCTGTTTGTAATTGGATCGGCACAATTTCTATAT
CCCCTTGACTTAAAAGTTCTCGTAAACACAAAACATACAAAAGTAGATTCTAAACATATC
GCAGATTTCGAAAAGGCTCTTATAGAGCTAACAAAGATGCATCCAGATAATAACGAGTAT
GCATGTCTTCGAGCAATTGTTTTATTTAAAACAAATTTCAATGCTGTCCATACGAACAGT
TTGCCCCAATCCCATATTGAAATTAAGAAACTAAAAGACCTACCTGCAGTAGCTAGTTTG
CAAGATCATTCTCAAGCTGTTTTAAACGAGTATATTACCAGATTATATCCAGGAGATACA
ACACGATCAAATCAATTGCTCCAAAGTTTGTCTGCAGTCCGAAATGTTTCAAGTACTACA
ATAGTGGAACTATTTTTCCGAGCTACCATAGGAGATATACCGATTGAAAGAATAATAAGT
GATATGTACAGAAGCGGCAAAGATACTGTTTAA

Protein sequence:

MEMQSMPASSSRILYDVPCAVCRDHSSGKHYGVFACDGCAGFFKRSVRRDRRYACKARNS
GACLVDKAHRNQCRACRLAKCLDVGMNKDAVQHERGPRNSTIRRQMALFLKDPALPASEM
SLMPPVLDLAIPKHSLLPPPPPLSLFHNPYQSYSRFNLLASPLPSCPLKAPSPPPPMTSN
LISPTEPEAICEAAARLLFMNVKWAKNVPAFSSLSLQDRLILLEESWRDLFVIGSAQFLY
PLDLKVLVNTKHTKVDSKHIADFEKALIELTKMHPDNNEYACLRAIVLFKTNFNAVHTNS
LPQSHIEIKKLKDLPAVASLQDHSQAVLNEYITRLYPGDTTRSNQLLQSLSAVRNVSSTT
IVELFFRATIGDIPIERIISDMYRSGKDTV