Monarch geneset OGS2.0

DPOGS203727
Transcript        DPOGS203727-TA  2058 bp
Protein           DPOGS203727-PA  685 aa
Genomic position  DPSCF300010 - 657620-665363
RNAseq coverage   180x (Rank: top 49%)
Annotation
Heliconius        HMEL004218        0.0     72.62%
Bombyx            BGIBMGA013363-TA  4e-135  59.40%
Drosophila        Hr78-PF           3e-45   89.53%
EBI UniRef50      UniRef50_Q8T7Y2   3e-146  62.07%  Nuclear orphan receptor n=1 Tax=Bombyx mori RepID=Q8T7Y2_BOMMO
NCBI RefSeq       NP_001037020.1    6e-147  62.07%  nuclear orphan receptor [Bombyx mori]
NCBI nr blastp    gi|112983212      1e-145  62.07%  nuclear orphan receptor [Bombyx mori]
NCBI nr blastx    gi|112983212      2e-156  63.16%  nuclear orphan receptor [Bombyx mori]
Group
Gene Ontology     GO:0003707  4.9e-64  steroid hormone receptor activity
                  GO:0005634  4.9e-64  nucleus
                  GO:0006355  4.9e-64  regulation of transcription, DNA-dependent
                  GO:0043401  4.9e-64  steroid hormone mediated signaling pathway
                  GO:0003700  4.9e-64  sequence-specific DNA binding transcription factor activity
                  GO:0008270  9.2e-37  zinc ion binding
                  GO:0043565  9.2e-37  sequence-specific DNA binding
                  GO:0003677  3.5e-12  DNA binding
KEGG pathway 
InterPro domain   [467-682]  IPR008946  4.9e-64  Nuclear hormone receptor, ligand-binding
                  [19-90]    IPR001628  9.2e-37  Zinc finger, nuclear hormone receptor-type
                  [20-86]    IPR013088  4.7e-30  Zinc finger, NHR/GATA-type
                  [469-666]  IPR000536  8.6e-13  Nuclear hormone receptor, ligand-binding, core
                  [291-301]  IPR001723  3.5e-12  Steroid hormone receptor
Orthology group   MCL11371  Multiple-copy universal gene
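
The figures in the record above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the InterPro coordinates are 1-based and inclusive, and that the 2058 bp transcript consists only of coding sequence plus one stop codon; both are assumptions, and the helper names are hypothetical.

```python
# Values copied from the record; the helper functions are illustrative only.
PROTEIN_LENGTH_AA = 685
TRANSCRIPT_LENGTH_BP = 2058

# (start, end, InterPro accession) as listed in the annotation table.
DOMAINS = [
    (467, 682, "IPR008946"),
    (19, 90, "IPR001628"),
    (20, 86, "IPR013088"),
    (469, 666, "IPR000536"),
    (291, 301, "IPR001723"),
]


def domain_length(start, end):
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def fits_protein(start, end, protein_length):
    """True if the interval lies entirely inside the protein."""
    return 1 <= start <= end <= protein_length


def cds_length(protein_length):
    """Coding length in bp, ASSUMING exactly one stop codon is included."""
    return 3 * (protein_length + 1)


lengths = {acc: domain_length(s, e) for s, e, acc in DOMAINS}
all_fit = all(fits_protein(s, e, PROTEIN_LENGTH_AA) for s, e, _ in DOMAINS)

for acc, n in lengths.items():
    print(f"{acc}: {n} aa")
print("all domains inside protein:", all_fit)
print("CDS length matches transcript:",
      cds_length(PROTEIN_LENGTH_AA) == TRANSCRIPT_LENGTH_BP)
```

Under these assumptions every listed domain falls inside the 685 aa protein, and 3 x (685 + 1) = 2058 agrees with the stated transcript length.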

Nucleotide sequence:

>DPOGS203727-TA
ATGGAGGCCCAGGATCAAATGGAGATCAAGTACAGCGGCAATGAAGTCGGGGGATTGGAGTTGTGTATTGTGTGTGGGGACAGAGCCAGCGGTAGACACTATGGAGCTATAAGTTGTGAAGGTTGTAAGGGATTCTTCAAGCGTTCCATCCGCAAGAAGTTAGGATATCAATGCCGAGGCAGCATGAACTGTGAGGTGACGAAGCACCATCGTAACCGATGTCAATACTGCCGCTTGCAGAAGTGTCTCGCCTGCGGAATGAGAAGTGACTCGGTTCAACATGAGCGGAAGCCTATCGTTGATAAGTCCAAAGAGGAACGAACTGCGTACTCCAAACTGCTGGGACTGGCCGGCAGCGCTCAGTCACAGATAAATCCTAAGGACGAGCCTTCAGATACGTTCAGCGCTGTGTCCCCAGCTGCCCCAGCTCTTAACTTCGCGCTGGCTGCAGCTGTAGCTTTCAATAAAGGAAATCCAGGTCATGTGGATGGTTCATGTGTTCCAGTGTCTCCGTACCTGTCGGGAGGAGACGCGGAGGGAGCGAGGCGACATCAGCTGATGCTGCAGACACACCTGGCCAAGAACCTGTTCAAGATGGGGCAGTTCGTTCATGGTAAACTGGTTATGGAGGCCCAGGATCAAATGGAGATCAAGTACAGCGGCAATGAAGTCGGGGGATTGGAGTTGTGTATTGTGTGTGGGGACAGAGCCAGCGGTAGACACTATGGAGCTATAAGTTGTGAAGGTTGTAAGGGATTCTTCAAGCGTTCCATCCGCAAGAAGTTAGGATATCAATGCCGAGGCAGCATGAACTGTGAGGTGACGAAGCACCATCGTAACCGATGTCAATACTGCCGCTTGCAGAAGTGTCTCGCCTGCGGAATGAGAAGTGACTCGGTTCAACATGAGCGGAAGCCTATCGTTGATAAGTCCAAAGAGGAACGAACTGCGTACTCCAAACTGCTGGGACTGGCCGGCAGCGCTCAGTCACAGATAAATCCTAAGGACGAGCCTTCAGATACGTTCAGCGCTGTGTCCCCAGCTGCCCCAGCTCTTAACTTCGCGCTGGCTGCAGCTGTAGCTTTCAATAAAGGAAATCCAGGTCATGTGGATGGTTCATGTGTTCCAGTGTCTCCGTACCTGTCGGGAGGAGACGCGGAGGGAGCGAGGCGACATCAGCTGATGCTGCAGACACACCTGGCCAAGAACCTGTTCAAGATGGGGCAGTTCGGTGCAATTAACGAGTATCTCCAGACGGCGTACGGCGCGACCCCGGCCGAGGTGAACCTCAACGCGGTGCACGCCGACGACGCACAGAACGAAGAGGAGGGCGTGATCCTGAACACAGCGCAGAGCCTGAGCCTGCCGCTGTCCCTGCCGGCGGGCGCGGGCGGACCGCTCCGTCTGCACGCTGCCTGCGAGGCCGCCGCCCGTCTGCTGGCAGCCTGCGTGCGGTGGACGCTCAGCGTGCCCGCCGCCGCCGCCATGCCGTTCGAGGCGCGAGTGTCCCTGCTGCGGAAGTGCTGGTCGGAGCTGTTCGTGCTGGGTCTGTGTCGCTGGTCCGAGGCTCTGTCTTTGGGATCCCTGCTGCCGGCGCTGGCGGCGCACCTGCGAGCCGAGCTCAGGGAAAGGAGCGAGCAGGGACACGGGGTTTCACATGACGACGGAGGCGCAGAGATAAGTATCTCTGACTACTCGGACGAGCGCATCTCGGAGGTGTGTTCGATGCTGTGCCGCCTCCACCAGTTCGTGTGTCACATGGAGCAGTTCCGCGTGTCTGACCGGGAGTACGCTCACCTGAGAGCTCTCTGTCTCTTCTCGCCCGACGGCGCCCCCGACTTCCTGAGTCATAAGCTGCATCAGCTCCAGTCGTCCGTGGTCCGCTCGCTCCGTGCCGCTTGCTCCTCGGACGACGAGCGCGCCGCCTGCCTCCTGCTGCAACTGCCGGTGCTGAGGACCTTCAGCGGCAGCTTCATCGAGGACGTGTTCTTCGTGGGCTT
CGTCGGTGACGTCAGCATCGATGACGTCATACCCTGCCTGCTCAACGCCGAGCGGTAG

Protein sequence:

>DPOGS203727-PA
MEAQDQMEIKYSGNEVGGLELCIVCGDRASGRHYGAISCEGCKGFFKRSIRKKLGYQCRGSMNCEVTKHHRNRCQYCRLQKCLACGMRSDSVQHERKPIVDKSKEERTAYSKLLGLAGSAQSQINPKDEPSDTFSAVSPAAPALNFALAAAVAFNKGNPGHVDGSCVPVSPYLSGGDAEGARRHQLMLQTHLAKNLFKMGQFVHGKLVMEAQDQMEIKYSGNEVGGLELCIVCGDRASGRHYGAISCEGCKGFFKRSIRKKLGYQCRGSMNCEVTKHHRNRCQYCRLQKCLACGMRSDSVQHERKPIVDKSKEERTAYSKLLGLAGSAQSQINPKDEPSDTFSAVSPAAPALNFALAAAVAFNKGNPGHVDGSCVPVSPYLSGGDAEGARRHQLMLQTHLAKNLFKMGQFGAINEYLQTAYGATPAEVNLNAVHADDAQNEEEGVILNTAQSLSLPLSLPAGAGGPLRLHAACEAAARLLAACVRWTLSVPAAAAMPFEARVSLLRKCWSELFVLGLCRWSEALSLGSLLPALAAHLRAELRERSEQGHGVSHDDGGAEISISDYSDERISEVCSMLCRLHQFVCHMEQFRVSDREYAHLRALCLFSPDGAPDFLSHKLHQLQSSVVRSLRAACSSDDERAACLLLQLPVLRTFSGSFIEDVFFVGFVGDVSIDDVIPCLLNAER-
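
The relationship between the two sequences above can be illustrated with a short translation routine. This is a minimal sketch under stated assumptions, not the pipeline used to produce the record: it assumes the standard genetic code (NCBI translation table 1) and that the trailing `-` in the protein record marks the stop codon. The function name `translate` and its parameters are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1.

    Whitespace and line breaks are ignored, so a wrapped FASTA body can be
    passed in directly. Unknown codons (e.g. containing N) become 'X'.
    The stop symbol '-' is an ASSUMPTION based on how the record above
    ends its protein sequence.
    """
    cds = "".join(cds.split()).upper()
    usable = len(cds) - len(cds) % 3  # drop any incomplete trailing codon
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First ten codons and last five codons of DPOGS203727-TA, copied from above.
print(translate("ATGGAGGCCCAGGATCAAATGGAGATCAAG"))  # MEAQDQMEIK
print(translate("AACGCCGAGCGGTAG"))                 # NAER-
```

Both fragments reproduce the corresponding ends of DPOGS203727-PA (`MEAQDQMEIK...` and `...NAER-`), which is consistent with the nucleotide record being the in-frame coding sequence terminated by a TAG stop codon.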