Monarch geneset OGS2.0

DPOGS205589
TranscriptDPOGS205589-TA2100 bp
ProteinDPOGS205589-PA699 aa
Genomic positionDPSCF300237 + 159975-172325
RNAseq coverage151x (Rank: top 53%)
Annotation
HeliconiusHMEL0112390.083.26% 
BombyxBGIBMGA009688-TA0.079.32% 
DrosophilaHr46-PE6e-10669.53% 
EBI UniRef50UniRef50_O026430.080.48%Hormone receptor 3C n=4 Tax=Endopterygota RepID=O02643_CHOFU
NCBI RefSeqXP_002423364.17e-15750.96%hormone receptor hr3, putative [Pediculus humanus corporis]
NCBI nr blastpgi|20784990.080.48%hormone receptor 3C [Choristoneura fumiferana]
NCBI nr blastxgi|20784990.080.06%hormone receptor 3C [Choristoneura fumiferana]
Group
Gene OntologyGO:00037072.1e-68steroid hormone receptor activity
GO:00056342.1e-68nucleus
GO:00063552.1e-68regulation of transcription, DNA-dependent
GO:00434012.1e-68steroid hormone mediated signaling pathway
GO:00037002.1e-68sequence-specific DNA binding transcription factor activity
GO:00082701.6e-36zinc ion binding
GO:00435651.6e-36sequence-specific DNA binding
GO:00036772.3e-24DNA binding
GO:00048873.7e-08thyroid hormone receptor activity
KEGG pathway 
InterPro domain[450-692] IPR0089462.1e-68Nuclear hormone receptor, ligand-binding
[101-172] IPR0016281.6e-36Zinc finger, nuclear hormone receptor-type
[104-154] IPR0130884.1e-25Zinc finger, NHR/GATA-type
[165-175] IPR0017232.3e-24Steroid hormone receptor
[499-662] IPR0005364.7e-21Nuclear hormone receptor, ligand-binding, core
[442-459] IPR0017283.7e-08Thyroid hormone receptor
Orthology groupMCL10903 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205589-TA
ATGATGAACAACAATCAGTTCCACGAGCTGTTCGGGTCTCAGTGGCCTCCGGACCAACACGGGGGTCACTCGTCGGCGACGACGATGCTGCACCAGGGCCAGCAGCAGCCCCAGATGCAACTGAAGAGGGAGCCCCACGACGTGCCCGGGATGCACTCCATGGGCATGGACATCACGAGCGGCTCCGTCGCCGACAGTACATCCCCGCCCCCTGGAAACAGTGAATCTATGTTCGGATCCTCCATCTCGGGAATGTTCATGGATAAAAAGGCCGCTAACTCTATCAGAGCCCAAATCGAGATCATACCCTGTAAAGTGTGTGGAGATAAGTCCTCAGGGGTCCACTATGGGGTGATCACTTGCGAGGGCTGCAAGGGATTCTTCAGGAGATCACAGAGCACAGTGGTCAACTATCAGTGCCCGAGGAACAAAGCCTGCGTAGTGGACCGCGTCAACAGGAACAGGTGCCAGTACTGCAGGCTCCAGAAGTGTCTTAAGCTGGGAATGAGCCGCGATGCGGTGAAGTTCGGTCGCATGTCCAAGAAGCAGCGGGAGAAGGTGGAGGACGAGGTGAGGTTCCACCAGGCGCAGATGAGGGCGCAGTCTGACGCCGCCCCGGACTCGGTGTACGACGCGCAGCAACAGACGCCCAGCTCCAGCGACCAGTTCCACGGACACTATAACAGCTACCAGAACTACGGTTCCCCGCTGTCGTCATACGGATACAACGCGCCGCTCAACTCCAACCTGAACATTCAGGCCCAACCTCCCCAATACGACGTGTCGGCCAACTACGTGGACTCCACCACGTACGAGCCCAAGCAACCCGGCTTCCTCGACACGGACTTCATAGACCATGACGAACAACAAAAAACTATCCGAGCGTCCACATCCACGACAACAGCGACGACAGCGACGACGACAATGAGACAGTCCATGAGCGACGTCAACAGACCCAGGGTGCAGGAGTTCGACAGATACGATGAGAGGATCCAGAGCCCGCCGGCGAGCGTCATAGCTATCAAGCAGGAAATCAAACCGGAGACCTCCATGGGCGTAGACAATCTAGTGGCCAGCTACGTCGACTCCACCACGTTCCTTCATAGTCCGTCGAACCTGAACAGTCCCATGGACATACAGAACTCGGTGCTAGTGAGCGGCCAGAGCTCGGTGTCGTTGACGAGCGAGGAGCTCAGCCCCGACGATCTCACGAACAGCAACGCCAGGCTGATGGATCCTCTCAACATGAACATGTCGGGCATGGGTATGGTGAACCCCAACGCCGTGTCCACGCGGCGGCAGCAGGGCAGCGCGGACGACCTGCCGTCGGAAGGTGAAATAAGTAAGGTACTGGTTAAAAGTCTTGCGGAGGCTCACGCGAACACGAACCCCAAGTTGGAGTACATACACGAGATGTTCAGGAAGCCGCCGGATGTCTCTAAGTTGCTATTCTATAACTCGATGACGTACGAGGAGATGTGGCTGGACTGCGCCAACAAGCTCACATCCATGATCCAGAACATCATCGAGTTCGCCAAGCTCATCCCCGGCTTCATGAAGCTCAGCCAGGACGACCAGATCCTGCTGCTAAAATCAGGTTCCTTCGAACTAGCGATCGTCCGTCTGTCCCGTCTGATAGACATCAACAGAGATCACGTGCTGTACGGTGACGTGGTGCTGCCTATAAGGGAGTGTGTACACGCACGCGATCCGCGCGACATGTCGCTAGTGGCTGGCATCTTCGACGCTGCTAAGACGATCGCACGACTCAAGCTCACGGAGACTGAACTGGCGCTGTACCAGAGCTTGGTGCTGCTGTGGCCGGAGCGTCACGGTGTCCGCGGTAACCCTGAGATCCAGATGTTGTTCAACATGTCCATGGCTACCATGAGACACGAGATAGAGACGAACCACGCGCCCCTCAAGGGTGACGTCACAGTACTAGACACACTGCTCGCTAAGATACCCACCTTCAGAGAGCTGTCGTTAATGCACCTCGAAGCGCTGTGTCGCTTTAAGGCAGCGCATCCACATCACGTATTCCCAGCGCTTTATAAGGAACTGTTCTCACTGGACAGCGTCCTAGATTACACGCAGTAA

Protein sequence:

>DPOGS205589-PA
MMNNNQFHELFGSQWPPDQHGGHSSATTMLHQGQQQPQMQLKREPHDVPGMHSMGMDITSGSVADSTSPPPGNSESMFGSSISGMFMDKKAANSIRAQIEIIPCKVCGDKSSGVHYGVITCEGCKGFFRRSQSTVVNYQCPRNKACVVDRVNRNRCQYCRLQKCLKLGMSRDAVKFGRMSKKQREKVEDEVRFHQAQMRAQSDAAPDSVYDAQQQTPSSSDQFHGHYNSYQNYGSPLSSYGYNAPLNSNLNIQAQPPQYDVSANYVDSTTYEPKQPGFLDTDFIDHDEQQKTIRASTSTTTATTATTTMRQSMSDVNRPRVQEFDRYDERIQSPPASVIAIKQEIKPETSMGVDNLVASYVDSTTFLHSPSNLNSPMDIQNSVLVSGQSSVSLTSEELSPDDLTNSNARLMDPLNMNMSGMGMVNPNAVSTRRQQGSADDLPSEGEISKVLVKSLAEAHANTNPKLEYIHEMFRKPPDVSKLLFYNSMTYEEMWLDCANKLTSMIQNIIEFAKLIPGFMKLSQDDQILLLKSGSFELAIVRLSRLIDINRDHVLYGDVVLPIRECVHARDPRDMSLVAGIFDAAKTIARLKLTETELALYQSLVLLWPERHGVRGNPEIQMLFNMSMATMRHEIETNHAPLKGDVTVLDTLLAKIPTFRELSLMHLEALCRFKAAHPHHVFPALYKELFSLDSVLDYTQ-