Monarch geneset OGS2.0

DPOGS205808
Transcript: DPOGS205808-TA, 1209 bp
Protein: DPOGS205808-PA, 402 aa
Genomic position: DPSCF300144 + 315392-320508
RNAseq coverage: 1x (Rank: top 93%)
Annotation
Heliconius: HMEL011581 | 3e-155 | 77.43%
Bombyx: BGIBMGA010370-TA | 9e-91 | 93.30%
Drosophila: dsf-PA | 2e-78 | 59.84%
EBI UniRef50: UniRef50_D6W9X6 | 5e-82 | 48.15% | Dissatisfaction n=3 Tax=Neoptera RepID=D6W9X6_TRICA
NCBI RefSeq: XP_002432637.1 | 6e-84 | 47.91% | Orphan nuclear receptor NR6A1, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|328696467 | 3e-83 | 63.45% | PREDICTED: nuclear receptor subfamily 2 group E member 1-like [Acyrthosiphon pisum]
NCBI nr blastx: gi|91077386 | 3e-91 | 49.08% | PREDICTED: similar to Dissatisfaction (Dsf) [Tribolium castaneum]
Group:
Gene Ontology:
  GO:0003707 | 3.9e-56 | steroid hormone receptor activity
  GO:0005634 | 3.9e-56 | nucleus
  GO:0006355 | 3.9e-56 | regulation of transcription, DNA-dependent
  GO:0043401 | 3.9e-56 | steroid hormone mediated signaling pathway
  GO:0003700 | 3.9e-56 | sequence-specific DNA binding transcription factor activity
  GO:0003677 | 2.9e-15 | DNA binding
  GO:0004879 | 2.1e-09 | ligand-dependent nuclear receptor activity
KEGG pathway:
InterPro domain:
  [49-388] IPR008946 | 3.9e-56 | Nuclear hormone receptor, ligand-binding
  [199-356] IPR000536 | 3.5e-27 | Nuclear hormone receptor, ligand-binding, core
  [200-221] IPR001723 | 2.9e-15 | Steroid hormone receptor
  [193-209] IPR003068 | 2.1e-09 | Transcription factor COUP
Orthology group: MCL15534 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205808-TA
ATGTCCATTACTACTAAAATTTTCACTTTAAGCCGGCGCGGCGTAGCGCGGGAACTAACGTATGGACATAGCATGTGCGGGCACAATAGCCACATTCCCATAGACTCAGGCTTGGCTGATGAACTATGGTGTGACAAACAAGTGAAAATAAAATCTGTTCAACACGAGCGGGGTCCTCGAAAGCCGAAGCCCCACCCGAGTGTCCTGGGGTCACTGCCGCCCCCGCATGCCCACACTCACAAGCCCCACGCTCTCAAACTGTCGCCTCCATCGCCCTACACACCTATACCACAGCCCTTCAATTTCCAGTTTAGTATAAATGGCATCACTGACTCGGCGCCTCTAAGTACTGCATCATCAGCTGGATCCAGCGGTGCTGGTGGTGCTGGTGCAGTGAGTCCTCTGCCTCTCATCACGGAACCGTTCATAGCACCTCCTCCTCCTGGGCTTTTGCATATGCTCATGTCCAGTGATAAATGTCAAGAATTAATATGGAGTGCGAAACAATTGCAACTACAAGGCGACCCTTCTCTGTTACGACCTCCGCCGAATGCTTTCGGAGCACCCCTAGCGCCTACTTGGGAGTTGTTACAGGAAACAAGCGCGCGTCTTCTATTCATGGCAGTGCGGTGGGTGAGATGTTTGGCTCCATTCCAAGCCTTGGCGGCATCAGATCAGGCGGTGTTGCTGCGTGCTGCTTGGAAGGATCTGTTCGTGCTGCATCTCGCACAGTGGTCCGCACCATGGGACCTCGCGCCCCTACTGGCGGCCCCAGCTGCCAGAGCTAGACTGCCCTCTGACCCCTTGGTCGATCTAGAAATTAACACTCTACAGGAAATTCTTTGTAGATTCCGACAAATTGCTCCCGACGGCAGTGAGTGCGGCTGTATGAAAGCTATTGTTCTTTTTTCACCGGACACGCCCGGTCTAAGCGAAACACAGCCGGTGGAGATGCTCCAAGATCAGGCTCAGTGTATTCTGGCCGACTACGTAAGGACGAGATACACTCGTCAGCCTACCAGATTCGGCCGACTCCTTCTTCTACTGCCATCTCTACGCGCTGTCAGAGCTCGTTCTATAGAGTCACTTCTGTTTCGGGAGACGGTTGGCGACGTGTCCGTGGCCACTCTGCTTCATGATATGTACCGCATGCAGCCAGCGCCCACGCCTGTACCAGCCTTCCAACCACCAAACTGTTCTTCGCCTTAA

Protein sequence:

>DPOGS205808-PA
MSITTKIFTLSRRGVARELTYGHSMCGHNSHIPIDSGLADELWCDKQVKIKSVQHERGPRKPKPHPSVLGSLPPPHAHTHKPHALKLSPPSPYTPIPQPFNFQFSINGITDSAPLSTASSAGSSGAGGAGAVSPLPLITEPFIAPPPPGLLHMLMSSDKCQELIWSAKQLQLQGDPSLLRPPPNAFGAPLAPTWELLQETSARLLFMAVRWVRCLAPFQALAASDQAVLLRAAWKDLFVLHLAQWSAPWDLAPLLAAPAARARLPSDPLVDLEINTLQEILCRFRQIAPDGSECGCMKAIVLFSPDTPGLSETQPVEMLQDQAQCILADYVRTRYTRQPTRFGRLLLLLPSLRAVRARSIESLLFRETVGDVSVATLLHDMYRMQPAPTPVPAFQPPNCSSP-