Monarch geneset OGS2.0

DPOGS212382
Transcript: DPOGS212382-TA, 948 bp
Protein: DPOGS212382-PA, 315 aa
Genomic position: DPSCF300019 + 559390-564450
RNAseq coverage: 25x (Rank: top 77%)
Annotation
Heliconius       HMEL005630        2e-136  81.23%
Bombyx           BGIBMGA009688-TA  4e-08   28.89%
Drosophila       eg-PB             6e-38   35.81%
EBI UniRef50     UniRef50_E0VVW8   2e-53   42.51%  Putative uncharacterized protein n=2 Tax=Neoptera RepID=E0VVW8_PEDHC
NCBI RefSeq      XP_002430262.1    4e-54   42.51%  conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastp   gi|242019629      7e-53   42.51%  conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastx   gi|242019629      3e-54   43.18%  conserved hypothetical protein [Pediculus humanus corporis]
Group
Gene Ontology
GO:0005634  1.7e-13  nucleus
GO:0006355  1.7e-13  regulation of transcription, DNA-dependent
GO:0043565  1.7e-13  sequence-specific DNA binding
GO:0008270  1.7e-13  zinc ion binding
GO:0003700  1.7e-13  sequence-specific DNA binding transcription factor activity
GO:0003707  4.2e-07  steroid hormone receptor activity
GO:0043401  4.2e-07  steroid hormone mediated signaling pathway
KEGG pathway 
InterPro domain
[7-55]    IPR001628  1.7e-13  Zinc finger, nuclear hormone receptor-type
[9-51]    IPR013088  4.7e-11  Zinc finger, NHR/GATA-type
[52-107]  IPR008946  4.2e-07  Nuclear hormone receptor, ligand-binding
Orthology group: MCL22686 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212382-TA
ATGAGCTGTTCAGAAGACGAAATGTCTTTCTTCGGGCGAACGTACAACAACCTCTCCTCGATTTCTGAATGCAAGAACAATGGTGAATGTGTCATCAATAAGAAAAACAGGACGGCTTGCAAGGCATGCAGGTTGCGGAAATGTTTGCTAGTCGGCATGTCCAAATCAGGATCACGTTACGGGCGAAGATCGAACTGGTTTAAGATCCACTGCCTTTTACAAGAGCAGCAGCAACAACACATGCAACAATTACAGACAAGTAAATCACAAACCACATTAAATACTTCCATCAGCCCATCTTTTTTGCCAACAAATCTTTTGCCGGCAGCCGCTCTTGCGGAATATTATAAGAGTACAGAAAAGAATATTTTCACCGACGAAGTGACGCGACAGAGCGTTTCGCCTTCCGATTCTGGTGCTTCTTCGGCTGACCCAGAGGATGACAATAGTTCCAGGAGTACCAGCGGCTTGAGTATTTTCAGACCCGCCTCCTCGCCCAGCAGTGAAAAAGACACTCGGATACAGGCAATCAGAAATCAAAATAAAGATGTGAGAAGAAAAAAGCCATCAACATTGCCACCTTCACCATTTGGTGCCATGGGAGCTGCGTCTGGTTTTTCGTTTTCACGGAGCGCCCCCTATCTCCCAACACCAATGGCAAATATGAGATCAGTGCCATCAGGAATAGCTTGGCAAAATCGTAGTGGCGGGGATTTGTTACTCCAATCTCCGGCGATTGCTGGCGTTGCCATCGAACAAGACCAACCTATAGATCTGTCTATTAAATCTACAGCAGTTATATTTAGAAATTCAAAAACCGAAGTGAGTGATTCGGAACCCGAGTTGAGTATCGATCTGAATGACGATAACGGTAAAGACATAATGAAGAATCCCCTCGATTTGAGTCTTGTATCAAAACGAGCCGAGGAAGTTCCGATGACTGGCTAG

Protein sequence:

>DPOGS212382-PA
MSCSEDEMSFFGRTYNNLSSISECKNNGECVINKKNRTACKACRLRKCLLVGMSKSGSRYGRRSNWFKIHCLLQEQQQQHMQQLQTSKSQTTLNTSISPSFLPTNLLPAAALAEYYKSTEKNIFTDEVTRQSVSPSDSGASSADPEDDNSSRSTSGLSIFRPASSPSSEKDTRIQAIRNQNKDVRRKKPSTLPPSPFGAMGAASGFSFSRSAPYLPTPMANMRSVPSGIAWQNRSGGDLLLQSPAIAGVAIEQDQPIDLSIKSTAVIFRNSKTEVSDSEPELSIDLNDDNGKDIMKNPLDLSLVSKRAEEVPMTG-
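
The transcript and protein lengths reported for this gene (948 bp and 315 aa) can be cross-checked with simple codon arithmetic. The sketch below is only an illustration: the function name `expected_protein_length` is hypothetical, and it assumes the transcript is coding sequence only (it starts with ATG and ends with the stop codon TAG, with no UTRs), which appears to hold for the sequence shown here.

```python
def expected_protein_length(transcript_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a coding-only transcript.

    Assumption: the transcript contains no UTRs, so every base belongs
    to a codon; the terminal stop codon (if present) encodes no residue.
    """
    if transcript_bp % 3 != 0:
        raise ValueError("length is not a whole number of codons")
    codons = transcript_bp // 3
    return codons - 1 if has_stop_codon else codons


# DPOGS212382-TA is 948 bp; DPOGS212382-PA is listed as 315 aa.
print(expected_protein_length(948))
```

With 948 bp the transcript holds 316 codons, and removing the stop codon leaves 315 residues, which matches the listed protein length. The trailing "-" in the protein sequence marks that stop codon.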