DPGLEAN05745 in OGS1.0

New model in OGS2.0DPOGS214137 
Genomic Positionscaffold2257:+ 44220-57181
See gene structure
CDS Length1272
Paired RNAseq reads  1199
Single RNAseq reads  5287
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006183 (1e-91)
Best Drosophila hit  ultraspiracle (4e-71)
Best Human hitretinoic acid receptor RXR-alpha (2e-61)
Best NR hit (blastp)  RecName: Full=Protein ultraspiracle homolog; AltName: Full=Nuclear receptor subfamily 2 group B member 4 (0.0)
Best NR hit (blastx)  RecName: Full=Protein ultraspiracle homolog; AltName: Full=Nuclear receptor subfamily 2 group B member 4 (2e-174)
GeneOntology terms  GO:0005515 protein binding
InterPro families




  
IPR001628 Zinc finger, nuclear hormone receptor-type
IPR000003 Retinoid X receptor
IPR001723 Steroid hormone receptor
IPR000536 Nuclear hormone receptor, ligand-binding, core
IPR008946 Nuclear hormone receptor, ligand-binding
IPR013088 Zinc finger, NHR/GATA-type
Orthology groupMCL10939

Nucleotide sequence:

ATGTCGAGCGTGGCGAAGAAAGATAAGCCGACAATGTCAGTGACGGCGCTTATCAACTGG
GCCCGACCGGCGCCGCCGGGGCCTCAGCAGCAGTTGGCGCAGGCGGTGCCAGTCTCCTCG
ACGGCTCTCCTGCAGTCCCTAGGAACATCCTCGAACATTCCCAACGTCGACTGCTCTATC
GACATGCAATGGCTGAACATAGAATCGGGGTTCATGTCCCCTATGTCTCCACCAGAGATG
AAGCCGGACACAGCGATGCTGGACGGCATGAGGGAGGACGCCACCTCACCCTCGGCCATG
AGGAACTATCCCCCGAATCACCCGCTCAGCGGATCCAAGCACCTCTGTTCCATCTGCGGA
GACAGAGCATCGGGCAAACATTACGGCGTTTATAGCTGCGAAGGCTGTAAAGGATTCTTC
AAGAGGACCGTCCGTAAAGATTTGACGTACGCGTGTCGCGAGGAGAGGAATTGTATAATA
GACAAGCGTCAAAGGAATAGGTGCCAGTACTGCCGCTATCAGAAATGTCTGGCGTGCGGG
ATGAAGAGGGAGGCGGTGCAGGAGGAGAGGCAGAGGGCTGCAAGGGGTGCTGAGGACGTA
CATCCAAGCAGCTCAGTACAGGAGCTGTCAATCGAGCGTCTCCTTGAGATGGAATCTCTG
GTGGCGGACCCTAACGAGGAGTTCCAATTCCTCCGCGTGGGTCCTGACAGTAACGTGCCA
CCGAGATACAGGGCTCCCGTCTCCAGCCTCTGTCAGATTGCATTCCATGGTATCACCGTG
CGGGGGCCGGGTCCATCGCGTTGCGGGGAGAGGAGCTTCAACAGCGCCTGGGATTTGCGA
CCCAGGTGTAATAAACAGATCGCTGCATTAGTAGTATGGGCTCGTGACATACCGCACTTC
AGTCAGCTGGAGTTGGAAGACCAGGTCATACTGATCAAGGCCTCCTGGAACGAGCTCATG
CTGTTCGCCATCGCCTGGAGGAGTATGGAGTACTTGGAAGATGAGAGAGAGAATCTAGAC
GGCACTCGGACAGCGCCACCGCCACAACTGATGTGTCTCATGCCAGGGATGACCCTCCAT
CGTAACTCAGCGCTTCAGGCCGGCGTTGGTCAGATCTTCGACCGCGTGCTCTCTGAACTC
TCGCTGAAGATGAGGGCGCTGAGGATGGACCAGGCCGAGTACGTCGCGCTCAAGGCCATC
GTGCTGCTCAACCCCGACATAAAAGGCCTTAAAAACAGACAGGACGTGGACGTTCTACGA
GAGAAGGTATGA

Protein sequence:

MSSVAKKDKPTMSVTALINWARPAPPGPQQQLAQAVPVSSTALLQSLGTSSNIPNVDCSI
DMQWLNIESGFMSPMSPPEMKPDTAMLDGMREDATSPSAMRNYPPNHPLSGSKHLCSICG
DRASGKHYGVYSCEGCKGFFKRTVRKDLTYACREERNCIIDKRQRNRCQYCRYQKCLACG
MKREAVQEERQRAARGAEDVHPSSSVQELSIERLLEMESLVADPNEEFQFLRVGPDSNVP
PRYRAPVSSLCQIAFHGITVRGPGPSRCGERSFNSAWDLRPRCNKQIAALVVWARDIPHF
SQLELEDQVILIKASWNELMLFAIAWRSMEYLEDERENLDGTRTAPPPQLMCLMPGMTLH
RNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPDIKGLKNRQDVDVLR
EKV