Monarch geneset OGS2.0

DPOGS200543
Transcript         DPOGS200543-TA  1404 bp
Protein            DPOGS200543-PA  467 aa
Genomic position   DPSCF300119 - 113368-119190
RNAseq coverage    473x (Rank: top 26%)
Annotation
Heliconius         HMEL016865        2e-135  59.41%
Bombyx             BGIBMGA010783-TA  8e-116  80.40%
Drosophila         Hr96-PA           1e-88   50.79%
EBI UniRef50       UniRef50_D6X2K2   2e-112  49.68%  Hormone receptor in 96-like protein n=1 Tax=Tribolium castaneum RepID=D6X2K2_TRICA
NCBI RefSeq        XP_968487.1       5e-113  49.68%  PREDICTED: similar to nuclear receptor nhr-48 [Tribolium castaneum]
NCBI nr blastp     gi|91093732       9e-112  49.68%  PREDICTED: similar to nuclear receptor nhr-48 [Tribolium castaneum]
NCBI nr blastx     gi|91093732       3e-109  49.58%  PREDICTED: similar to nuclear receptor nhr-48 [Tribolium castaneum]
Group
Gene Ontology      GO:0003707  4.6e-40  steroid hormone receptor activity
                   GO:0005634  4.6e-40  nucleus
                   GO:0006355  4.6e-40  regulation of transcription, DNA-dependent
                   GO:0043401  4.6e-40  steroid hormone mediated signaling pathway
                   GO:0003700  4.6e-40  sequence-specific DNA binding transcription factor activity
                   GO:0003677  2.3e-09  DNA binding
KEGG pathway 
InterPro domain    [273-462]  IPR008946  4.6e-40  Nuclear hormone receptor, ligand-binding
                   [275-439]  IPR000536  4.3e-17  Nuclear hormone receptor, ligand-binding, core
                   [276-297]  IPR001723  2.3e-09  Steroid hormone receptor
Orthology group    MCL15164   Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200543-TA
ATGGTGAAGGAGTTTATTATGTCCGACGAGGACAAGGCGGAAAAGAGACGAAAAATAGAAGAAAACCGCGCCAAAAAGAGACAACTACAGGATTCGGATGACAGTGTGTCTAGTTCTAAGAATTTTAGACGTGATGTTGAGAGTCCTTACACCACACCCGTACAAGAGAGTACAATACAGTATGATGTTTTAAACAGTACAACATGCAGTCCTCACAGCTCAGCGGAATCCCCCTTGAGCACCGATGTAGATTCTATGCCAACACCAGCCTATGGCCGATATGTGCCGGTGCAGACAGAGCTATTCACAGTAAAGGGTTACCCACCAGAAGAGAAGACAAATCCAAACCAGAGATATTATGAGCCCCGGCAAGAACATTACATGTGTGACAGTATGGATGGCATATACGAACAAACCAAACAAAATAGTATAAGGTCGATCCTGACCAACGGCGAAGGTCTCCCTCATCACCAGGACACGGAGCACGTGTGCGAGGAGATGCCGTCCACCAGCAACCCTGAGGTTAACAAGGCCAGGGACATACTGCAAGACGTCGAGAGGATAGAGCCCAACTCTATGGAGTCAATACTGTGCGAGGCGATTAAGCTGGAGTTCGGGGCTTACTCTTCCGTCAACAGTTGTAGTGGATCATCCAGAGAATTGAATGAGGTGGAGAGAGCCAAGCTGAACGAGCTGATCGTCGCCAACAAAGCGCTCCACGCTCCCATAGACGACGACGTGTCACAACTGATCGGAGACGCGGCCACCGCCGGCCTCAAGGTCGGCGAAGGAAAACATGACCCTCGCCTCATAACGTTGGTCAACCTGACAGCCGTCGCCATACGGAGGCTCATCAAGATCGCCAAGAAGATCAACGCGTTCAAGAACATGTGCGAGGAGGACCAGGTGGCGCTCCTGAAGGGAGGCTGCATAGAGATGATGGTGTTGCGGAGCACCATGACCTACGACGGACAGAGGAACCAGTGGAAGCTGCCTCACAGTCACAAGCAGTACGGCAGCATCCAAACGGACGTGCTGAAGCTGGCCAAGGGGAACATCTACCGCAGCCACGAGGCCTTCATCAGCTCCTTCGAGCACAGGTGGCGCACCGACGAGAACATCATCCTCATCATGTCCGCCATACTGCTGTTCACGCCCGACCGGCCGCGCGTCGTGCACCGCGACGTCATCAAGTTGGAACAGAACTCGTACTACTACCTGCTCCGGCGCTACCTGGAGAGCTCGTTCGCGGGCTGCGAGGCGAAGGCCACGTTCCTCAAGCTGATCGCCAAGATCCTGGAGCTGAGGAAGCTGGCCGAGGAGGTGACGGGCGTCTACCTCGACGTGCACCCCTTGGAACCGCTGCTCGTGGAGATCTTTGACCTCAAACACCACGCGGCATGA

Protein sequence:

>DPOGS200543-PA
MVKEFIMSDEDKAEKRRKIEENRAKKRQLQDSDDSVSSSKNFRRDVESPYTTPVQESTIQYDVLNSTTCSPHSSAESPLSTDVDSMPTPAYGRYVPVQTELFTVKGYPPEEKTNPNQRYYEPRQEHYMCDSMDGIYEQTKQNSIRSILTNGEGLPHHQDTEHVCEEMPSTSNPEVNKARDILQDVERIEPNSMESILCEAIKLEFGAYSSVNSCSGSSRELNEVERAKLNELIVANKALHAPIDDDVSQLIGDAATAGLKVGEGKHDPRLITLVNLTAVAIRRLIKIAKKINAFKNMCEEDQVALLKGGCIEMMVLRSTMTYDGQRNQWKLPHSHKQYGSIQTDVLKLAKGNIYRSHEAFISSFEHRWRTDENIILIMSAILLFTPDRPRVVHRDVIKLEQNSYYYLLRRYLESSFAGCEAKATFLKLIAKILELRKLAEEVTGVYLDVHPLEPLLVEIFDLKHHAA-
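
Consistency check (illustrative sketch):

The transcript and protein lengths listed in this record can be cross-checked against each other. The Python sketch below is not part of the original record. It assumes that the transcript is a pure coding sequence with no UTRs, that the standard genetic code (NCBI translation table 1) applies, and that the trailing "-" in the protein sequence marks the stop codon. The names `translate` and `expected_protein_length` are hypothetical helpers introduced only for this illustration.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# "*" denotes a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(cds: str) -> str:
    """Translate a coding sequence (length divisible by 3) codon by codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        # str.index raises ValueError on any character other than T, C, A, G.
        first, second, third = (BASES.index(base) for base in cds[i:i + 3])
        protein.append(AMINO_ACIDS[16 * first + 4 * second + third])
    return "".join(protein)


def expected_protein_length(transcript_bp: int) -> int:
    """Residue count for a CDS of the given length, excluding the stop codon."""
    return transcript_bp // 3 - 1


# Lengths as listed in the record: 1404 bp transcript, 467 aa protein.
print(expected_protein_length(1404))

# First ten and last three codons of DPOGS200543-TA, copied from the record.
print(translate("ATGGTGAAGGAGTTTATTATGTCCGACGAG"))
print(translate("GCGGCATGA"))
```

Under these assumptions, 1404 bp corresponds to 468 codons, that is, 467 residues plus one stop codon. The first ten codons translate to MVKEFIMSDE, which matches the start of DPOGS200543-PA. The last three codons translate to AA followed by a stop, which matches the final "AA-" of the protein sequence.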