Monarch geneset OGS2.0

DPOGS209534
TranscriptDPOGS209534-TA921 bp
ProteinDPOGS209534-PA306 aa
Genomic positionDPSCF300157 - 2794-16434
RNAseq coverage0x (Rank: top 98%)
Annotation
HeliconiusHMEL0115815e-3140.61% 
BombyxBGIBMGA013855-TA2e-10577.10% 
DrosophilaHr51-PC2e-6456.28% 
EBI UniRef50UniRef50_D6WR053e-8860.95%Hormone receptor 51 n=7 Tax=Coelomata RepID=D6WR05_TRICA
NCBI RefSeqXP_002423135.13e-8653.87%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastpgi|2700110381e-8760.95%hormone receptor 51 [Tribolium castaneum]
NCBI nr blastxgi|2700110384e-9361.27%hormone receptor 51 [Tribolium castaneum]
Group
Gene OntologyGO:00037071.5e-55steroid hormone receptor activity
GO:00056341.5e-55nucleus
GO:00063551.5e-55regulation of transcription, DNA-dependent
GO:00434011.5e-55steroid hormone mediated signaling pathway
GO:00037001.5e-55sequence-specific DNA binding transcription factor activity
GO:00036778e-21DNA binding
GO:00048794.9e-09ligand-dependent nuclear receptor activity
KEGG pathway 
InterPro domain[59-306] IPR0089461.5e-55Nuclear hormone receptor, ligand-binding
[126-278] IPR0005362.6e-29Nuclear hormone receptor, ligand-binding, core
[127-148] IPR0017238e-21Steroid hormone receptor
[120-136] IPR0030684.9e-09Transcription factor COUP
Orthology groupMCL12353 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209534-TA
ATGTTGTTATTGAAAGGGTTAACTGTCCAGAACGAGCGTCAGCCACGGAACACCGCCACCATCAGGCCGGAGACGTTAAGGGATATGGACCAAGAAAGAGCTCTGAGAGAAGCCGCCGTCGCTGTCGGAGTATTTGGTCCGCCGGTGTCTCTGGCGATGGCCTTATCCCCGGCCAGATATCCTCTTCTGTCTCCACACTACGCCCCCCTGCCGCCACCGGATTCGACTCACGAACCAGAAAACCATCAAACGGACAGTGACGACGACAGTATTGACGTCACCAATGAAGAAGATTCTTCATCGTATTCTCCACCAGCGAACAGCTATCCACAGAACTGTATATCTTATGGGCCTCTGGGTATAGAGAGTGCTGCGGAAACAGCCGCCAGACTTCTGTTCATGGCGGTCAAGTGGGCGAGGAACCTGCCGTCATTCGCTGGACTGGCGTTCAGAGATCAGGTGACTCTATTAGAGGAGGGTTGGTCAGAGCTGTTCCTATTGAACGCTGTCCAGTGGTGCGCTCCACTAGACGCCGCAGCCAGCGCCCTCTTTGGAACAGAACACGACACCGGTGCTGGCGAGTGTCGGCGCCGTCTCCGCGCGGTGGTCTCTCGCTACCGTTCAGTCTTAGTTGATCCCGCGGAGTTCGCGTGTATGAAGGCCATCGTGCTCTTCAAACCTGAAACCCGAGGTCTAAAAGACCCCCTCCAGATCGAAAACCTTCAAGATCAAGCTCAAGTGATGCTAATGACTCACACAAGGACAGCTCACGGCACGGCCCCCGCTCGGTTCGGGAGGCTCCTACTTCTTCTACCACTCCTCCGCCTCGTCACGCCACAGCAATTGGAGAAGGAGTTCTTCGCGAAAACCATTGGAGAAACACCCATGGAAAAAGTACTCGCTGATATGTATAAGAATTAA

Protein sequence:

>DPOGS209534-PA
MLLLKGLTVQNERQPRNTATIRPETLRDMDQERALREAAVAVGVFGPPVSLAMALSPARYPLLSPHYAPLPPPDSTHEPENHQTDSDDDSIDVTNEEDSSSYSPPANSYPQNCISYGPLGIESAAETAARLLFMAVKWARNLPSFAGLAFRDQVTLLEEGWSELFLLNAVQWCAPLDAAASALFGTEHDTGAGECRRRLRAVVSRYRSVLVDPAEFACMKAIVLFKPETRGLKDPLQIENLQDQAQVMLMTHTRTAHGTAPARFGRLLLLLPLLRLVTPQQLEKEFFAKTIGETPMEKVLADMYKN-