Monarch geneset OGS2.0

DPOGS201084
TranscriptDPOGS201084-TA648 bp
ProteinDPOGS201084-PA215 aa
Genomic positionDPSCF300185 + 157169-157909
RNAseq coverage434x (Rank: top 28%)
Annotation
HeliconiusHMEL0046206e-2365.88% 
BombyxBGIBMGA001392-TA1e-4960.54% 
Drosophiladsf-PA4e-1651.28% 
EBI UniRef50UniRef50_E3X8065e-4696.70%Putative uncharacterized protein n=2 Tax=Coelomata RepID=E3X806_ANODA
NCBI RefSeqXP_001655964.18e-4793.62%coup transcription factor [Aedes aegypti]
NCBI nr blastpgi|3407293065e-4696.70%PREDICTED: steroid receptor seven-up, isoforms B/C-like [Bombus terrestris]
NCBI nr blastxgi|1571318456e-4493.62%coup transcription factor [Aedes aegypti]
Group
Gene OntologyGO:00037078.7e-30steroid hormone receptor activity
GO:00056348.7e-30nucleus
GO:00063558.7e-30regulation of transcription, DNA-dependent
GO:00434018.7e-30steroid hormone mediated signaling pathway
GO:00037008.7e-30sequence-specific DNA binding transcription factor activity
GO:00036776.3e-25DNA binding
GO:00048796.3e-25ligand-dependent nuclear receptor activity
KEGG pathway 
InterPro domain[29-108] IPR0089468.7e-30Nuclear hormone receptor, ligand-binding
[27-39] IPR0030686.3e-25Transcription factor COUP
[31-92] IPR0005366.6e-10Nuclear hormone receptor, ligand-binding, core
Orthology groupMCL19330 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201084-TA
ATGAAATACTCCGACGCTCAATGGAGCGTGAACGACGAGCTGCGTGTGACTCGAATGTCGATTGACCAAAGATTTTCCTCTGTTCCAGATGCCTGTGGACTATCAGACGTAACGCACATAGAGAGCCTTCAAGAGAAATCCCAATGCGCTCTCGAAGAGTACTGCAGAAGCCAGTACCCCAACCAGCCGACGCGTTTCGGGAAACTGCTTCTTAGGCTACCTTCATTACGGACAGTGAGCTCTCAGGTCATAGAACAGCTGTTCTTCGTGAGGCTGGTGGGGAAGACACCCATCGAGACCCTGATAAGGGACATGCTGTTATCCGGCAGCTCGTTCTCGTGGCCCTACATGGCGACCATGTACCGGCGGGAAGCCGAGGACGTCGTGACGTCTTCCAGCGAGTTCCCGGAGGCGGAGGCCGCTCCCTCCGAGGACTACTTCGGCTCGCGCGTCAGAGACGACAGGGCGGAACCCTGCCCCGACTTCGACCCCATCTCAGCGTTCTACCCTCCGAAGAGCACAGAAAACGAAGTCTTCGATACAAGCAATAATTGTGAAACACAAAAAAACGCTCAAAAAACTTCCATGATGATCGAGAACCTCCTCAACATCAGAAGTGACTCGTGTTCCCAGTCCCGCGACAAATAA

Protein sequence:

>DPOGS201084-PA
MKYSDAQWSVNDELRVTRMSIDQRFSSVPDACGLSDVTHIESLQEKSQCALEEYCRSQYPNQPTRFGKLLLRLPSLRTVSSQVIEQLFFVRLVGKTPIETLIRDMLLSGSSFSWPYMATMYRREAEDVVTSSSEFPEAEAAPSEDYFGSRVRDDRAEPCPDFDPISAFYPPKSTENEVFDTSNNCETQKNAQKTSMMIENLLNIRSDSCSQSRDK-