Monarch geneset OGS2.0

DPOGS213157
Transcript         DPOGS213157-TA   2073 bp
Protein            DPOGS213157-PA   690 aa
Genomic position   DPSCF300016 + 1283497-1290986
RNAseq coverage    139x (Rank: top 55%)
Annotation
Source           Hit                E-value   Identity   Description
Heliconius       HMEL010318         0.0       69.35%
Bombyx           BGIBMGA007914-TA   0.0       75.17%
Drosophila       Hr39-PB            2e-135    41.24%
EBI UniRef50     UniRef50_E0VMS4    2e-134    42.37%     Ecdysone receptor, putative n=4 Tax=Neoptera RepID=E0VMS4_PEDHC
NCBI RefSeq      XP_001845875.1     3e-141    43.78%     nuclear hormone receptor FTZ-F1 beta [Culex quinquefasciatus]
NCBI nr blastp   gi|3834346         0.0       73.85%     hormone receptor 39 [Bombyx mori]
NCBI nr blastx   gi|3834346         0.0       83.82%     hormone receptor 39 [Bombyx mori]
Group
Gene Ontology
GO:0003707   2.9e-37   steroid hormone receptor activity
GO:0005634   2.9e-37   nucleus
GO:0006355   2.9e-37   regulation of transcription, DNA-dependent
GO:0043401   2.9e-37   steroid hormone mediated signaling pathway
GO:0003700   2.9e-37   sequence-specific DNA binding transcription factor activity
GO:0008270   4.7e-34   zinc ion binding
GO:0043565   4.7e-34   sequence-specific DNA binding
GO:0003677   1e-11     DNA binding
KEGG pathway 
InterPro domain
[487-656]   IPR008946   2.9e-37   Nuclear hormone receptor, ligand-binding
[360-431]   IPR001628   4.7e-34   Zinc finger, nuclear hormone receptor-type
[359-426]   IPR013088   9.6e-28   Zinc finger, NHR/GATA-type
[550-657]   IPR000536   6.8e-16   Nuclear hormone receptor, ligand-binding, core
[424-434]   IPR001723   1e-11     Steroid hormone receptor
Orthology group    MCL12890   Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213157-TA
ATGTCTAGTGAGGGGGAGGCGGTGAAAGTGGAAGGGGGTCATATCTCCGTCACTACTATCAGCATGTCGGGACCGGAGAGTAGCAATGGGCAGTTCTCGTATAGCTCGAGCGGAGTCCGTATCAGCGTGTCCTCGGAGCCTCAAGACGAGGATTCGACGGACGCGGAAATATCCAAAATAGATTTCACCCAACACCAGTATGAGGTCAACATGCGGAATAAGAAGAAGCGTTCGTCCGGCCAGTGTGACCAGGCCAAGGAGCAAGAGAGGCCGATGTCCTGGGAGGGAGAGCTGTCAGACTCGGAAATGGTCATCGACACCAGCACCAATAACGTAGACGAGAACAGCAGCTCCCTGGACCTCCAATCCTCCCGGGATTCAATCGACACGCTCCGCAACGTGTCCATTAAGACTGAACCTCTTAAAACAGAGCTCTTCCACACTATGGAGGACCAACGTGTTCTGGACCTGAAGTATACCGTGCCTCTTCAGTCACATCAGCGCAGTCTTAGTATGAATAGGATATCGGTCCTAACAGACAGTAATTCGTTGTTAGCGCGGCGGCCGTCATCACCCTCGCACTACGACGCGCATCCCGAGGTCCAGAACCTCACTATCAAGAAGGAACAGCTCTCTGGTGGGTATTATGGACCTGAAAGACAGTCCACTGTCCGTGAACTCAAGTCCGAACCGACGTCCAGTGTGGACAAGCTGCTAGGGCTCCACGGTTCCCCCCTGATGGGACGCCTCCCCCGCGTGCAGTCCAGCGCCAGCACTGACTCCGCGGTACACTCAATGTATACACACAGTGTATACAGCAGTCCGTCAGCCAGCCCCCGCCCCTCGCGCCACTACACCCCCTCCCTCTCCCGGAACAACAGCGACGCGTCACACTCCTCGTGCTACTCCTACAGCTCAGAATTTTCCCCCACTCACTCTCCAGTACAAAGTCGTCACCCCCACGTGGTGTACCGCGAGGCGGCCGTGTTCCCCGCCTCCCCCGCTCATGATGAGGACGCGGACGGAACCGACGACAGGCTGCATCATCACCAGGGGATCAGCCGCCAACAACTCATTAACAGTCCGTGTCCAATTTGCGGCGACAAAATCAGTGGCTTCCACTACGGGATATTTTCATGCGAGTCTTGCAAGGGCTTCTTCAAGCGGACGGTTCAAAATCGGAAGAACTACATGTGTCTGAGAGGCGGGAACTGTCCCGTCACCGTCGCCACCAGGAAGAAGTGCCCCGCGTGTAGATTCGATAAGTGTCTGGGATGCGGGATGAAACTCGAAGCTATAAGAGAGGACCGCACACGCGGCGGTCGGTCTACTTATCAATGTTCGTACACGCTGTCTGGCGCGGCCTCCACGGGCTCCTTACTATCAGCGCACGCGCCCGCAACGCTGAGACACGCCTCGAGTCTCACATGTGTGAACGGTCCGGGCTCCTACAACAGAGGCGAATCAAGCAACAGTCGACTCACCCCTGACATACCGCCGTTATTGCAGGAAATAATGGACGTGGAACATCTATGGCAGTACAACGAGTCCGAGCTGAGTCGTATGAGCAAGAGCTCGAGCAGTCCGTCCGCCAACCCCCTGTTAGCGGCCAGCGGCATCACGGCGCAGAACTCTAGCGCCGACTTCCTGGCCGACCTGTGCAATATAGCAGACCACAGATTATACAAGATCGTCAAGTGGTGCAAGAGTCTGCCGCTCTTTAAAAATATCTCCATCGACGATCAGATATGTCTGTTGATAAACAGCTGGTGCGAGTTACTGGTGTTGTCCTGTTGCTACAGAGGAGTCAGCACGCCAGGGGAGGTGCGGGTGGGGGGAGGCAGGGGAATAACGCTGCAGCAGAGCGCCAAGTATGGCCTAACTCCATGTATTGAACGTATGTTGAGTTTCACTGATCACTTGAGGAGGCTGCGTGTGGACCGCTACGAGTACGTCGCGCTCAAGTTTTTCATGCAAGTCGGAAAAGAAATGCTCAATCC
AGCGAACAAGAGTAAGGACGGAGAGGGACCGAGCTTCAACCTCCTCATGGAACTGCTTCGAGGAGATCATTGA

Protein sequence:

>DPOGS213157-PA
MSSEGEAVKVEGGHISVTTISMSGPESSNGQFSYSSSGVRISVSSEPQDEDSTDAEISKIDFTQHQYEVNMRNKKKRSSGQCDQAKEQERPMSWEGELSDSEMVIDTSTNNVDENSSSLDLQSSRDSIDTLRNVSIKTEPLKTELFHTMEDQRVLDLKYTVPLQSHQRSLSMNRISVLTDSNSLLARRPSSPSHYDAHPEVQNLTIKKEQLSGGYYGPERQSTVRELKSEPTSSVDKLLGLHGSPLMGRLPRVQSSASTDSAVHSMYTHSVYSSPSASPRPSRHYTPSLSRNNSDASHSSCYSYSSEFSPTHSPVQSRHPHVVYREAAVFPASPAHDEDADGTDDRLHHHQGISRQQLINSPCPICGDKISGFHYGIFSCESCKGFFKRTVQNRKNYMCLRGGNCPVTVATRKKCPACRFDKCLGCGMKLEAIREDRTRGGRSTYQCSYTLSGAASTGSLLSAHAPATLRHASSLTCVNGPGSYNRGESSNSRLTPDIPPLLQEIMDVEHLWQYNESELSRMSKSSSSPSANPLLAASGITAQNSSADFLADLCNIADHRLYKIVKWCKSLPLFKNISIDDQICLLINSWCELLVLSCCYRGVSTPGEVRVGGGRGITLQQSAKYGLTPCIERMLSFTDHLRRLRVDRYEYVALKFFMQVGKEMLNPANKSKDGEGPSFNLLMELLRGDH-
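
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the standard genetic code (NCBI translation table 1) and that the trailing "-" in the protein marks the stop codon. The names `translate` and `CODON_TABLE` are illustrative only. It translates the first and last codons of DPOGS213157-TA and checks that the reported lengths are consistent (690 codons plus one stop codon gives 2073 bp).

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), codons ordered
# with bases T, C, A, G and the first base varying slowest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop="-"):
    """Translate an in-frame coding sequence; stop codons are written as `stop`."""
    cds = "".join(cds.split()).upper()  # drop line breaks inside the FASTA body
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = (cds[i:i + 3] for i in range(0, len(cds), 3))
    return "".join(CODON_TABLE[c].replace("*", stop) for c in codons)


# First seven and last six codons copied from the DPOGS213157-TA record.
head = "ATGTCTAGTGAGGGGGAGGCG"
tail = "CTTCGAGGAGATCATTGA"

print(translate(head))  # start of DPOGS213157-PA
print(translate(tail))  # end of DPOGS213157-PA, including the stop marker

# Length consistency: 690 aa plus one stop codon, three bases each.
print((690 + 1) * 3)
```

To check the whole record, pass the full nucleotide sequence (both lines of the FASTA body joined) to `translate` and compare the result with the protein sequence.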