Monarch geneset OGS2.0

DPOGS206120
Transcript        DPOGS206120-TA  1821 bp
Protein           DPOGS206120-PA  606 aa
Genomic position  DPSCF300028 + 714395-717356
RNAseq coverage   504x (Rank: top 25%)
Annotation
Heliconius        HMEL014074        0.0  92.58%
Bombyx            BGIBMGA006839-TA  0.0  91.50%
Drosophila        Eip75B-PB         0.0  53.87%
EBI UniRef50      UniRef50_P50239   0.0  88.72%  Ecdysone-inducible protein E75 n=17 Tax=Ditrysia RepID=E75_GALME
NCBI RefSeq       NP_001106080.1    0.0  91.14%  nuclear hormone receptor E75 isoform B [Bombyx mori]
NCBI nr blastp    gi|309320761      0.0  88.18%  E75 [Spodoptera littoralis]
NCBI nr blastx    gi|374252460      0.0  90.67%  nuclear receptor E75 [Spodoptera litura]
Group
Gene Ontology     GO:0003707  2.6e-59  steroid hormone receptor activity
                  GO:0005634  2.6e-59  nucleus
                  GO:0006355  2.6e-59  regulation of transcription, DNA-dependent
                  GO:0043401  2.6e-59  steroid hormone mediated signaling pathway
                  GO:0003700  2.6e-59  sequence-specific DNA binding transcription factor activity
                  GO:0003677  5.6e-15  DNA binding
                  GO:0004887  5.9e-06  thyroid hormone receptor activity
KEGG pathway 
InterPro domain   [55-301]   IPR008946  2.6e-59  Nuclear hormone receptor, ligand-binding
                  [118-277]  IPR000536  5e-28    Nuclear hormone receptor, ligand-binding, core
                  [119-140]  IPR001723  5.6e-15  Steroid hormone receptor
                  [116-137]  IPR001728  5.9e-06  Thyroid hormone receptor
Orthology group   MCL16475  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206120-TA
ATGGGTGAAGATTTACCGATACTTAAAGGGATTTTAAATGGAGTCGTGAAATATCACAATGCTCCCGTCCGATTCGGTCGTGTTCCAAAGCGCGAGAAGGCTCGCATTTTGGCCGCCATGCAACAATCTTCGTCGTCTCGTGCCCAGGAGCAGGCGGCGGCCGCTGAGCTAGACGACGCGCCCCGGCTCCTGGCCCGCGTGGTCCGCGCACACCTGGACACCTGCGAGTTCACACGCGATCGCGTCGCCTCCATGAGGGCGCGGGCTCGCGACTGTCCCACCTACTCACAACCGACTTTGGCTTGTCCGCTGAATCCAGCTCCAGAGCTACAGTCCGAGAAGGAATTCTCTCAGCGTTTCGCGCACGTTATCAGAGGCGTCATTGATTTCGCCGGCCTCATTCCTGGATTCCAATTACTTACGCAAGATGACAAGTTCACTCTCTTAAAAAGCGGTCTCTTCGACGCCCTGTTCGTGAGACTCATCTGCATGTTCGATGCGCCACTCAATAGCATAATATGCCTCAATGGACAACTCATGAAACGGGATTCCATTCAGAGTGGAGCGAACGCACGCTTCCTTGTTGACTCTACTTTTAAGTTCGCCGAGCGCATGAATTCTATGAATTTAACGGACGCCGAGATCGGTCTCTTCTGCGCCATAGTTCTCATAACTCCCGACCGGCCGGGTCTTCGTAATATTGAATTAGTTGAGCGAATGCACGCGAGACTGAAGGCGTGCCTGCAGACCGTCGTCACACAGAACAGACCTGACAGACCTGGCTTCCTCCGGGAGTTAATGGACACTCTACCTGATCTCCGTACACTGAGTACTCTTCATACTGAAAAGCTCGTAGTTTTCCGAACCGAGCACAAGGAATTACTGAGGCAACAGATGTGGGGAGACGAAGAGGGATGCTCGTGGGCCGACTCCGGAGCAGACGAGTCAGCTCGCAGCCCCATCGGCTCGGTATCCAGCAGTGAATCCGGTGAAGCGATGGGTGACTGTGGAACTCCTCTGCTGGCCGCGACTCTGGCCGGCAGACGGCGCCTGGACTCCCGCGGGTCCGTGGACGAGGAAGCTCTTGGTGTCGCTCACCTCGCTCACAACGGCCTCACCGTCACCCCTGTGCGCCCTCCGCCACGCTACCGCAAACTAGATTCCCCCACAGACTCCGGCATAGAGTCTGGGAACGAGAAGCACGAGAGAATAGTGGGGCCCGGCTCGGGCTGCTCCAGCCCTCGCTCGTCATTGGAGGAACACACGGAGGAGAGGCGACCCGTGCCGGCGGACGACATGCCCGTGCTGAAACGAGTGCTAGAGGCGCCGCCGCTGTACGACACTCCCTCATTAATGGACGAGGCCTACAAACCTCACAAGAAGTTCCGCGCGATGCGTCGCGACACGGGCGAGGCGGAGGCCCGGCCGATGCTGCTGACCCCGTCCCCGCAGCCGCCGCAGCACCCTCACCCCGCCAGCCCCGCTCATCCAGCCCACTCTCCGCGCCCCCTGCGTGCGTCGCTGTCGTCGACGCACTCCGTGCTAGCTAAGAGTTTAATGGAAGGTCCGCGTATGACTCCGGAGCAACTCAAGCGCACGGACATCATCCAGCAGTACATGCGGCGCGGTTCCAGCACGTCGAGCGCCGGCGAGTGTCCCCTCCGCAGCGGTCTGCTGGCTTGTTACCGCGGCGCGTCTCCGTCTCCGGCGCCAGAGCCGGTGCTGGAGCTGCAGGTGGAGGTGGCGGACGCTCCCCTGAACCTGTCCAAGAAGTCTCCCTCGCCGCCGCGCTCCTTCATGCCCCGCATGCTGGAGGCGTGA

Protein sequence:

>DPOGS206120-PA
MGEDLPILKGILNGVVKYHNAPVRFGRVPKREKARILAAMQQSSSSRAQEQAAAAELDDAPRLLARVVRAHLDTCEFTRDRVASMRARARDCPTYSQPTLACPLNPAPELQSEKEFSQRFAHVIRGVIDFAGLIPGFQLLTQDDKFTLLKSGLFDALFVRLICMFDAPLNSIICLNGQLMKRDSIQSGANARFLVDSTFKFAERMNSMNLTDAEIGLFCAIVLITPDRPGLRNIELVERMHARLKACLQTVVTQNRPDRPGFLRELMDTLPDLRTLSTLHTEKLVVFRTEHKELLRQQMWGDEEGCSWADSGADESARSPIGSVSSSESGEAMGDCGTPLLAATLAGRRRLDSRGSVDEEALGVAHLAHNGLTVTPVRPPPRYRKLDSPTDSGIESGNEKHERIVGPGSGCSSPRSSLEEHTEERRPVPADDMPVLKRVLEAPPLYDTPSLMDEAYKPHKKFRAMRRDTGEAEARPMLLTPSPQPPQHPHPASPAHPAHSPRPLRASLSSTHSVLAKSLMEGPRMTPEQLKRTDIIQQYMRRGSSTSSAGECPLRSGLLACYRGASPSPAPEPVLELQVEVADAPLNLSKKSPSPPRSFMPRMLEA-
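
The transcript and protein entries in this record can be cross-checked against each other. The sketch below is one way to do that, not part of the record itself: it assumes the standard genetic code applies, that the coding sequence starts at the first base of the transcript, and that the trailing "-" in the protein sequence marks the stop codon. The names `translate` and `lengths_consistent` are hypothetical helpers introduced only for this illustration.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol; "-" is assumed here because
    the protein sequence in this record ends with "-".
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        protein.append(stop_symbol if residue == "*" else residue)
    return "".join(protein)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript is exactly the codons of the protein plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


if __name__ == "__main__":
    # First 24 bases and last 12 bases of DPOGS206120-TA, copied from the record.
    print(translate("ATGGGTGAAGATTTACCGATACTT"))  # MGEDLPIL
    print(translate("CTGGAGGCGTGA"))              # LEA-
    print(lengths_consistent(1821, 606))          # True
```

Passing the full DPOGS206120-TA sequence to `translate` and comparing the result with DPOGS206120-PA would extend the same check to the whole record.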