Monarch geneset OGS2.0

DPOGS214137
TranscriptDPOGS214137-TA1401 bp
ProteinDPOGS214137-PA466 aa
Genomic positionDPSCF300014 - 1260694-1274453
RNAseq coverage1392x (Rank: top 9%)
Annotation
HeliconiusHMEL0050040.093.92% 
BombyxBGIBMGA006183-TA2e-11577.24% 
Drosophilausp-PA4e-11553.54% 
EBI UniRef50UniRef50_O762020.083.51%Protein ultraspiracle homolog n=13 Tax=Ditrysia RepID=USP_CHOFU
NCBI RefSeqNP_001037470.10.080.35%protein ultraspiracle homolog [Bombyx mori]
NCBI nr blastpgi|182021510.083.51%Ultraspiracle [Choristoneura fumiferana]
NCBI nr blastxgi|182021510.085.65%Ultraspiracle [Choristoneura fumiferana]
Group
Gene OntologyGO:00037071.6e-75steroid hormone receptor activity
GO:00056341.6e-75nucleus
GO:00063551.6e-75regulation of transcription, DNA-dependent
GO:00434011.6e-75steroid hormone mediated signaling pathway
GO:00037001.6e-75sequence-specific DNA binding transcription factor activity
GO:00082701.5e-37zinc ion binding
GO:00435651.5e-37sequence-specific DNA binding
GO:00036772.6e-29DNA binding
GO:00048794.1e-28ligand-dependent nuclear receptor activity
GO:00054964.1e-28steroid binding
KEGG pathway 
InterPro domain[207-458] IPR0089461.6e-75Nuclear hormone receptor, ligand-binding
[252-427] IPR0005364.6e-46Nuclear hormone receptor, ligand-binding, core
[113-184] IPR0016281.5e-37Zinc finger, nuclear hormone receptor-type
[111-206] IPR0130884e-37Zinc finger, NHR/GATA-type
[177-187] IPR0017232.6e-29Steroid hormone receptor
[184-197] IPR0000034.1e-28Retinoid X receptor
Orthology groupMCL10722 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214137-TA
ATGTCGAGCGTGGCGAAGAAAGATAAGCCGACAATGTCAGTGACGGCGCTTATCAACTGGGCCCGACCGGCGCCGCCGGGGCCTCAGCAGCAGTTGGCGCAGGCGGTGCCAGTCTCCTCGACGGCTCTCCTGCAGTCCCTAGGAACATCCTCGAACATTCCCAACGTCGACTGCTCTATCGACATGCAATGGCTGAACATAGAATCGGGGTTCATGTCCCCTATGTCTCCACCAGAGATGAAGCCGGACACAGCGATGCTGGACGGCATGAGGGAGGACGCCACCTCACCCTCGGCCATGAGGAACTATCCCCCGAATCACCCGCTCAGCGGATCCAAGCACCTCTGTTCCATCTGCGGAGACAGAGCATCGGGCAAACATTACGGCGTTTATAGCTGCGAAGGCTGTAAAGGATTCTTCAAGAGGACCGTCCGTAAAGATTTGACGTACGCGTGTCGCGAGGAGAGGAATTGTATAATAGACAAGCGTCAAAGGAATAGGTGCCAGTACTGCCGCTATCAGAAATGTCTGGCGTGCGGGATGAAGAGGGAGGCGGTGCAGGAGGAGAGGCAGAGGGCTGCAAGGGGTGCTGAGGACGTACATCCAAGCAGCTCAGTACAGGAGCTGTCAATCGAGCGTCTCCTTGAGATGGAATCTCTGGTGGCGGACCCTAACGAGGAGTTCCAATTCCTCCGCGTGGGTCCTGACAGTAACGTGCCACCGAGATACAGGGCTCCCGTCTCCAGCCTCTGTCAGATTGGTAATAAACAGATCGCTGCATTAGTAGTATGGGCTCGTGACATACCGCACTTCAGTCAGCTGGAGTTGGAAGACCAGGTCATACTGATCAAGGCCTCCTGGAACGAGCTCATGCTGTTCGCCATCGCCTGGAGGAGTATGGAGTACTTGGAAGATGAGAGAGAGAATCTAGACGGCACTCGGACAGCGCCACCGCCACAACTGATGTGTCTCATGCCAGGGATGACCCTCCATCGTAACTCAGCGCTTCAGGCCGGCGTTGGTCAGATCTTCGACCGCGTGCTCTCTGAACTCTCGCTGAAGATGAGGGCGCTGAGGATGGACCAGGCCGAGTACGTCGCGCTCAAGGCCATCGTGCTGCTCAACCCCGACATAAAAGGCCTTAAAAACAGACAGGACGTGGACGTTCTACGAGAGAAGATGTTCTCCTGTTTGGACGAGTACTGTCGCCGCGCGCACAGTTCTGAGGAGGGTCGGTTCGCGTCTCTGTTGCTGCGGCTGCCGGCTCTTCGCTCCATCTCCCTCAAGAGCTTCGAGCACCTGTTCTTCTTCCATTTGATCGCCGAGGGCACCATCGGGACCTACATCAGGGACGCCCTCCGCAGCCACGCGCCCACCATAGACACCAACTCGATTATGTAG

Protein sequence:

>DPOGS214137-PA
MSSVAKKDKPTMSVTALINWARPAPPGPQQQLAQAVPVSSTALLQSLGTSSNIPNVDCSIDMQWLNIESGFMSPMSPPEMKPDTAMLDGMREDATSPSAMRNYPPNHPLSGSKHLCSICGDRASGKHYGVYSCEGCKGFFKRTVRKDLTYACREERNCIIDKRQRNRCQYCRYQKCLACGMKREAVQEERQRAARGAEDVHPSSSVQELSIERLLEMESLVADPNEEFQFLRVGPDSNVPPRYRAPVSSLCQIGNKQIAALVVWARDIPHFSQLELEDQVILIKASWNELMLFAIAWRSMEYLEDERENLDGTRTAPPPQLMCLMPGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIVLLNPDIKGLKNRQDVDVLREKMFSCLDEYCRRAHSSEEGRFASLLLRLPALRSISLKSFEHLFFFHLIAEGTIGTYIRDALRSHAPTIDTNSIM-