New model in OGS2.0 | DPOGS214137  |
---|---|
Genomic Position | scaffold2257:+ 44220-57181 |
See gene structure | |
CDS Length | 1272 |
Paired RNAseq reads   | 1199 |
Single RNAseq reads   | 5287 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA006183 (1e-91) |
Best Drosophila hit   | ultraspiracle (4e-71) |
Best Human hit | retinoic acid receptor RXR-alpha (2e-61) |
Best NR hit (blastp)   | RecName: Full=Protein ultraspiracle homolog; AltName: Full=Nuclear receptor subfamily 2 group B member 4 (0.0) |
Best NR hit (blastx)   | RecName: Full=Protein ultraspiracle homolog; AltName: Full=Nuclear receptor subfamily 2 group B member 4 (2e-174) |
GeneOntology terms   | GO:0005515 protein binding |
InterPro families    | IPR001628 Zinc finger, nuclear hormone receptor-type IPR000003 Retinoid X receptor IPR001723 Steroid hormone receptor IPR000536 Nuclear hormone receptor, ligand-binding, core IPR008946 Nuclear hormone receptor, ligand-binding IPR013088 Zinc finger, NHR/GATA-type |
Orthology group | MCL10939 |
Nucleotide sequence:
ATGTCGAGCGTGGCGAAGAAAGATAAGCCGACAATGTCAGTGACGGCGCTTATCAACTGG
GCCCGACCGGCGCCGCCGGGGCCTCAGCAGCAGTTGGCGCAGGCGGTGCCAGTCTCCTCG
ACGGCTCTCCTGCAGTCCCTAGGAACATCCTCGAACATTCCCAACGTCGACTGCTCTATC
GACATGCAATGGCTGAACATAGAATCGGGGTTCATGTCCCCTATGTCTCCACCAGAGATG
AAGCCGGACACAGCGATGCTGGACGGCATGAGGGAGGACGCCACCTCACCCTCGGCCATG
AGGAACTATCCCCCGAATCACCCGCTCAGCGGATCCAAGCACCTCTGTTCCATCTGCGGA
GACAGAGCATCGGGCAAACATTACGGCGTTTATAGCTGCGAAGGCTGTAAAGGATTCTTC
AAGAGGACCGTCCGTAAAGATTTGACGTACGCGTGTCGCGAGGAGAGGAATTGTATAATA
GACAAGCGTCAAAGGAATAGGTGCCAGTACTGCCGCTATCAGAAATGTCTGGCGTGCGGG
ATGAAGAGGGAGGCGGTGCAGGAGGAGAGGCAGAGGGCTGCAAGGGGTGCTGAGGACGTA
CATCCAAGCAGCTCAGTACAGGAGCTGTCAATCGAGCGTCTCCTTGAGATGGAATCTCTG
GTGGCGGACCCTAACGAGGAGTTCCAATTCCTCCGCGTGGGTCCTGACAGTAACGTGCCA
CCGAGATACAGGGCTCCCGTCTCCAGCCTCTGTCAGATTGCATTCCATGGTATCACCGTG
CGGGGGCCGGGTCCATCGCGTTGCGGGGAGAGGAGCTTCAACAGCGCCTGGGATTTGCGA
CCCAGGTGTAATAAACAGATCGCTGCATTAGTAGTATGGGCTCGTGACATACCGCACTTC
AGTCAGCTGGAGTTGGAAGACCAGGTCATACTGATCAAGGCCTCCTGGAACGAGCTCATG
CTGTTCGCCATCGCCTGGAGGAGTATGGAGTACTTGGAAGATGAGAGAGAGAATCTAGAC
GGCACTCGGACAGCGCCACCGCCACAACTGATGTGTCTCATGCCAGGGATGACCCTCCAT
CGTAACTCAGCGCTTCAGGCCGGCGTTGGTCAGATCTTCGACCGCGTGCTCTCTGAACTC
TCGCTGAAGATGAGGGCGCTGAGGATGGACCAGGCCGAGTACGTCGCGCTCAAGGCCATC
GTGCTGCTCAACCCCGACATAAAAGGCCTTAAAAACAGACAGGACGTGGACGTTCTACGA
GAGAAGGTATGA
Protein sequence:
MSSVAKKDKPTMSVTALINWARPAPPGPQQQLAQAVPVSSTALLQSLGTSSNIPNVDCSI
DMQWLNIESGFMSPMSPPEMKPDTAMLDGMREDATSPSAMRNYPPNHPLSGSKHLCSICG
DRASGKHYGVYSCEGCKGFFKRTVRKDLTYACREERNCIIDKRQRNRCQYCRYQKCLACG
MKREAVQEERQRAARGAEDVHPSSSVQELSIERLLEMESLVADPNEEFQFLRVGPDSNVP
PRYRAPVSSLCQIAFHGITVRGPGPSRCGERSFNSAWDLRPRCNKQIAALVVWARDIPHF
SQLELEDQVILIKASWNELMLFAIAWRSMEYLEDERENLDGTRTAPPPQLMCLMPGMTLH
RNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPDIKGLKNRQDVDVLR
EKV