Monarch geneset OGS2.0

DPOGS211084
TranscriptDPOGS211084-TA1839 bp
ProteinDPOGS211084-PA612 aa
Genomic positionDPSCF300007 - 1212322-1222665
RNAseq coverage43x (Rank: top 72%)
Annotation
HeliconiusHMEL0124920.093.95% 
BombyxBGIBMGA002964-TA0.089.34% 
DrosophilaHr38-PB2e-14764.21% 
EBI UniRef50UniRef50_D6WNH90.065.90%Hormone receptor in 38-like protein n=3 Tax=Coelomata RepID=D6WNH9_TRICA
NCBI RefSeqXP_001814072.10.064.90%PREDICTED: similar to AGAP008334-PA [Tribolium castaneum]
NCBI nr blastpgi|1892378010.064.90%PREDICTED: similar to AGAP008334-PA [Tribolium castaneum]
NCBI nr blastxgi|1892378010.065.84%PREDICTED: similar to AGAP008334-PA [Tribolium castaneum]
Group
Gene OntologyGO:00037075.8e-65steroid hormone receptor activity
GO:00056345.8e-65nucleus
GO:00063555.8e-65regulation of transcription, DNA-dependent
GO:00434015.8e-65steroid hormone mediated signaling pathway
GO:00037005.8e-65sequence-specific DNA binding transcription factor activity
GO:00036777.4e-63DNA binding
GO:00048797.4e-63ligand-dependent nuclear receptor activity
GO:00082702.8e-37zinc ion binding
GO:00435652.8e-37sequence-specific DNA binding
KEGG pathway 
InterPro domain[348-612] IPR0089465.8e-65Nuclear hormone receptor, ligand-binding
[349-364] IPR0030707.4e-63Orphan nuclear receptor
[274-345] IPR0016282.8e-37Zinc finger, nuclear hormone receptor-type
[274-340] IPR0130885e-31Zinc finger, NHR/GATA-type
[421-580] IPR0005361.3e-20Nuclear hormone receptor, ligand-binding, core
[338-348] IPR0017236.7e-15Steroid hormone receptor
Orthology groupMCL11558 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211084-TA
ATGCGAGGTGCGTTGCTGACGCCCTCCAGCCAACATTGTGGCCTTAGACAATTTCTTACCACACGGCCGAGCGCCACTGAGCCGCGCTCGCCAAGCGCTTTCACGTCAAATTCCTCCAGCATGCTACTGCTGCAAACACACAGCAACTACGGTTCGTCCTTCACTGATTTACCAAGCCTGTTGCCACAGTATCAAGACGATTCAGCTGAAATCTTAGAAGAGAACTTGGACCCATTTCCTGACGTCGAATTCCATGCGCCTATTCCTTTTGAAATTAAAGCACAACGTACTACTCCAGTAAGTGAGACCCCGTCGCCTACCTTGGGCCCAGCGCTGCCTAGTTTCGAAGAAACATATTCGGTTCGCTATCCAAAACAAGAAATGGCGGAATTTGGTCTTAAAATGGACGAAGACTGTTACAATGTCAGTGCTTATTCGCACCCTGGACATGCATCGACACAATTATTATATCAATATCATCAGCCGACTTTACCCTATGTTCCTTCACCATATTATGCGCCAGCTCAGCCCTGTTCGCCGACATTCGACACAGGTGGAGTTACTACTGCCCAGGATTCTTATTCTTTACCGCCTTTCCCAAGTTCAGTTGACTTACACATATCTACAGAACAAGCTAACAGGCAAAGAAGATCATCATTGCCCGTTCAACGTTCTGAATCTAACAGTTCCAACGATAGTCCCAAACTGCATGGAAGTCGAATCCATTGCATGCAAGCTTCAGCGCCGAGTTCTGCGTCTAGTTCACCTGGAGGTGTACCACAAGACAATAATGCATCTCGAGCTGCGCCACCATCACCCAGCCAACTATGTGCTGTATGTGGAGATACTGCAGCGTGCCAACATTACGGCGTTCGAACCTGTGAAGGATGTAAAGGATTTTTCAAAAGAACTGTTCAGAAAGGATCAAAGTACGTGTGCTTAGCAGAAAAGTCGTGTCCGGTAGATAAAAGAAGAAGAAACAGATGTCAGTTTTGTCGTTTTCAAAAATGTCTTGCTGTTGGTATGGTGAAAGAAGTAGTTAGAACAGATTCTTTAAAGGGCAGACGGGGACGATTGCCTTCAAAACCAAAATGCCCTCAAGAATCTCCACCTAGTCCACCAATATCACTTATAACAGCACTAGTAAGAGCTCACGTAGACACATCTCCTGACTTTGCTAATCTTGATTACTCCCAGTATAGAGAACCAAATCCAATGGAACCTCCTATTTCGGATATAGAAGTAATCCAGCAATTCTATACTCTACTATCCACATCGATCGATATGATAAAAGTTTTTGCTGAAAAGGTGCCAGGCTACGGCGATTTGTGCCCAGAAGACAGAGAGCAATTATTTGCATCAGCGCGACTTGAATTATTTGTGCTCCGTTTAGCCTATCGCACTCGCCCTGATGATACTAAACTCACCTTCTGCAATGGCTTGGTTCTCGACAAACGACAATGTCAACGATCTTTTGGGGACTGGTTGCACGCTGTACTCGACTTCAGTAATACTCTGCACTCTATGGACATTGATATATCCACTTTCGCCTGTCTTTGTGCGTTGACATTAATTACAGAGAGACATGGCTTAAAAGAGCCGCATCGTGTTGAACAATTGCAAATGAAGATAATCGGATGTCTTCGGTCTCACATGCCAGGCGGGGGCGCGGCCAGTGCCGCCGGCGCGCCTCACTTCAGCCGCGTCCTTGGGGCTCTACCCGAACTGCGCTCGCTTTCCGTTCAGGGTCTTCAAAGAATCTTCTACCTGAAGCTTGAAGACTTAGTGCCAGCGCCGCCGCTGATTGAAAACATGTTTCGCGCCAGTTTACCTTTCTAG

Protein sequence:

>DPOGS211084-PA
MRGALLTPSSQHCGLRQFLTTRPSATEPRSPSAFTSNSSSMLLLQTHSNYGSSFTDLPSLLPQYQDDSAEILEENLDPFPDVEFHAPIPFEIKAQRTTPVSETPSPTLGPALPSFEETYSVRYPKQEMAEFGLKMDEDCYNVSAYSHPGHASTQLLYQYHQPTLPYVPSPYYAPAQPCSPTFDTGGVTTAQDSYSLPPFPSSVDLHISTEQANRQRRSSLPVQRSESNSSNDSPKLHGSRIHCMQASAPSSASSSPGGVPQDNNASRAAPPSPSQLCAVCGDTAACQHYGVRTCEGCKGFFKRTVQKGSKYVCLAEKSCPVDKRRRNRCQFCRFQKCLAVGMVKEVVRTDSLKGRRGRLPSKPKCPQESPPSPPISLITALVRAHVDTSPDFANLDYSQYREPNPMEPPISDIEVIQQFYTLLSTSIDMIKVFAEKVPGYGDLCPEDREQLFASARLELFVLRLAYRTRPDDTKLTFCNGLVLDKRQCQRSFGDWLHAVLDFSNTLHSMDIDISTFACLCALTLITERHGLKEPHRVEQLQMKIIGCLRSHMPGGGAASAAGAPHFSRVLGALPELRSLSVQGLQRIFYLKLEDLVPAPPLIENMFRASLPF-