Monarch geneset OGS2.0

DPOGS200902
TranscriptDPOGS200902-TA1320 bp
ProteinDPOGS200902-PA439 aa
Genomic positionDPSCF300066 + 50953-57646
RNAseq coverage2x (Rank: top 91%)
Annotation
HeliconiusHMEL0133971e-7139.95% 
BombyxBGIBMGA000670-TA2e-6157.75% 
DrosophilaHr51-PC3e-2848.74% 
EBI UniRef50UniRef50_E2C2612e-4167.48%Photoreceptor-specific nuclear receptor n=7 Tax=Formicidae RepID=E2C261_HARSA
NCBI RefSeqXP_973111.27e-4154.76%PREDICTED: similar to PNR-like [Tribolium castaneum]
NCBI nr blastpgi|3071964238e-4167.48%Photoreceptor-specific nuclear receptor [Harpegnathos saltator]
NCBI nr blastxgi|3071964239e-4068.60%Photoreceptor-specific nuclear receptor [Harpegnathos saltator]
Group
Gene OntologyGO:00056349.2e-32nucleus
GO:00063559.2e-32regulation of transcription, DNA-dependent
GO:00082709.2e-32zinc ion binding
GO:00435659.2e-32sequence-specific DNA binding
GO:00037009.2e-32sequence-specific DNA binding transcription factor activity
GO:00037073.1e-22steroid hormone receptor activity
GO:00434013.1e-22steroid hormone mediated signaling pathway
GO:00036778.2e-08DNA binding
KEGG pathway 
InterPro domain[6-77] IPR0016289.2e-32Zinc finger, nuclear hormone receptor-type
[6-73] IPR0130881.9e-30Zinc finger, NHR/GATA-type
[332-406] IPR0089463.1e-22Nuclear hormone receptor, ligand-binding
[349-413] IPR0005364.7e-08Nuclear hormone receptor, ligand-binding, core
[70-80] IPR0017238.2e-08Steroid hormone receptor
Orthology groupMCL26481 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200902-TA
ATGGATTGCAAGCCGGAAGTCGTGTGTCGTGTCTGCGGAGACAAGGCTTCTGGAAAGCACTACGGCGTTCCGTCCTGTGATGGTTGCAGAGGTTTCTTTAAACGAAGTATCCGAAGGAACCTAGATTACATTTGCAAAGAAAACGGTTCGTGCATAGTGGACGTGAGCAGAAGGAATCAGTGCCAGGCCTGCCGGTTCTCGAAGTGTTTACGAGTTAATATGAAGAAAGATGCTGTACAAAATGAACGTGCACCGAGAACACTGGGCAATCAACATCAACTTGCTCTTCAAAAGCTGAGTTACTTATCACGGCAGTCGGCTTTTATTCCTAATCATTCTCCGGTAGCCTTATCGACATTTTCTCCATATACGTATCCAATACAAGACCGCGTCCAAAATCCTTACTTGGCTAATTCATCCTTTCAAAATTATTCGAATCAATCGTCGATGCCAATGGACGTTCCTAGTCTCAACCCCCTTTTTAACAATCCAAGTGGCATAAATCCGTTCAAATTTCCGCTATTCCCTGGACCTATGCAATACTCTTTACCACATCCGCACACATATTTTTCTGCAAATATTTTCTACCCTCCCATCATATCTGCTGAGAATCCTACACTGTACTTGGATTCACAAGAAACTTCTTCGAACGCATTGCATAACTCACAATTCGGTACATCTATACAAAGGAATATAGCAGAAAAACATGACTCACTAAACTGTGACAAAATAAAAGAAAATGAAGTTACCAGTTCTGAAGAAACCTGCAGGGACACAGCATCAAAAGGCGCAACTATAAACAAAAATCACAATCAGTCAAACCAGAAAGTTGACTGCACAGAAAATATTATCCCTAAATGCAGTAATAAAAATAAGGAACAAGATTTGTTTACGTCAAGAGAAAAACGTTTCTCGCTGTCCGACCTAGCTGATTATTCGGACGTCGTTGGACAGACCAAGGCCAATGTAATGTTTATGGACCGAGGTGTTAAACATAGGATCGATTCACAAAATTCTCAAATGGACATAGAGTTATACAATCCAGGTGCAAGGCTGTTGGTGTCGGCCGTGCAATGGTTACATACAATTCCGTCATTCACACAAATCTCCCAAAAGGAACAAATATTGCTGCTACAAAGTAATTGGAAGGAATTGTTTATAATGCATGCTGCAGAATACTCTTTTTGTTTCGACGAAGATTACTATTATTTCAACAGAACAGATCTCACCGGTTATCACGTCGAAGCGGCCAAATATCAAAGAAGAGATAAGGAAATTGGCAGCGCTCTTGAAAAGAATTTCCCTGTGCCGATTAGATAA

Protein sequence:

>DPOGS200902-PA
MDCKPEVVCRVCGDKASGKHYGVPSCDGCRGFFKRSIRRNLDYICKENGSCIVDVSRRNQCQACRFSKCLRVNMKKDAVQNERAPRTLGNQHQLALQKLSYLSRQSAFIPNHSPVALSTFSPYTYPIQDRVQNPYLANSSFQNYSNQSSMPMDVPSLNPLFNNPSGINPFKFPLFPGPMQYSLPHPHTYFSANIFYPPIISAENPTLYLDSQETSSNALHNSQFGTSIQRNIAEKHDSLNCDKIKENEVTSSEETCRDTASKGATINKNHNQSNQKVDCTENIIPKCSNKNKEQDLFTSREKRFSLSDLADYSDVVGQTKANVMFMDRGVKHRIDSQNSQMDIELYNPGARLLVSAVQWLHTIPSFTQISQKEQILLLQSNWKELFIMHAAEYSFCFDEDYYYFNRTDLTGYHVEAAKYQRRDKEIGSALEKNFPVPIR-