Monarch geneset OGS2.0

DPOGS205807
Transcript: DPOGS205807-TA (999 bp)
Protein: DPOGS205807-PA (332 aa)
Genomic position: DPSCF300144 + 304211-312356
RNAseq coverage: 1x (Rank: top 93%)
Annotation
Heliconius:      HMEL012854        9e-29  64.29%
Bombyx:          BGIBMGA002000-TA  7e-30  65.52%
Drosophila:      dsf-PA            5e-45  87.36%
EBI UniRef50:    UniRef50_D6W9X6   3e-43  66.67%  Dissatisfaction n=3 Tax=Neoptera RepID=D6W9X6_TRICA
NCBI RefSeq:     XP_002065368.1    6e-44  87.36%  GK14705 [Drosophila willistoni]
NCBI nr blastp:  gi|195434755      1e-42  87.36%  GK14705 [Drosophila willistoni]
NCBI nr blastx:  gi|91077386       2e-42  66.67%  PREDICTED: similar to Dissatisfaction (Dsf) [Tribolium castaneum]
Group
Gene Ontology:   GO:0005634  7.9e-31  nucleus
                 GO:0006355  7.9e-31  regulation of transcription, DNA-dependent
                 GO:0043565  7.9e-31  sequence-specific DNA binding
                 GO:0008270  7.9e-31  zinc ion binding
                 GO:0003700  7.9e-31  sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain: [15-90]  IPR001628  7.9e-31  Zinc finger, nuclear hormone receptor-type
                 [16-86]  IPR013088  5.3e-30  Zinc finger, NHR/GATA-type
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205807-TA
ATGTTATTTGCCTCAGGCTGCAATGAAGGGGATAGGTTACTGGACATACCATGTAACGTGTGCGGTGACAGAAGCTCTGGAAAACATTATGGCATTTACAGTTGTGATGGTTGCTCAGGATTCTTCAAGCGGTCTATCCACAGGAACCGTGTGTACACTTGCAAGGCTGGTGGTGAGATGAAAGGTCGTTGTCCAGTCGACAAGACTCATAGGAACCAGTGCCGAGCCTGTAGACTCGCTAAATGCTTTCAGGCTAACATGAACAAAGATGAAGTGTCACAGACGCGACGGCGGGCGATCGCGCCGCCCGCGCCATCCGACAAGGGGCAAACTTTTCAATTACCGCTATGTCGTGTTACCCCGCAGCGGGTTAAATTAATTCTCTCCACTTCACAAAAAGATTCACCTTCCCCGAGCCTCCCTGAATCGAAGTCCGTGTATGTAGCCGTGAGATCGGGCTCACCTGTCGGCCCCCTCTGCCTCTTCGCTGCCCTCCGCCCTCTCTGGCTCGTTAATGATGATGTGCTCAGCCAGCAGACTATTTCACTTCTGAATACATCTTTCTCACTTTTAAGCTTACATCCCACTCGCACCCAGGTGATCATTATACCTCTGTCCCTTTCCATCAGGTTCGCCTGGTCGCATCCAGCTAGTCAATTCGGCAAACGCGTATCTACTCTGTGTAGGAAATGTAATGGAACAAATGTGGCTAGAATTAACTATCTAGAATTTCGTCTTAGTACGGTCAATAAACCTTTTGACCATTCACCAATTCGGCTTCGTGAAGCTTTCATAGGCCAGGAGGCGAATTTGGATTATTGGGTCATTTATCACACCCTGCCGACGTGGAGGAACGCCCGACCAAGACTTCCACGTATCATAATCGGTGCTCAGAACTCCATTTCCGAGCGGCTGAGTCTAAGATACATAGTTACACCACCTTCCTCCCACTTGAATCTGGAGTGGGTCCTACATACGCTTTGTACATCTGTGCACTGA

Protein sequence:

>DPOGS205807-PA
MLFASGCNEGDRLLDIPCNVCGDRSSGKHYGIYSCDGCSGFFKRSIHRNRVYTCKAGGEMKGRCPVDKTHRNQCRACRLAKCFQANMNKDEVSQTRRRAIAPPAPSDKGQTFQLPLCRVTPQRVKLILSTSQKDSPSPSLPESKSVYVAVRSGSPVGPLCLFAALRPLWLVNDDVLSQQTISLLNTSFSLLSLHPTRTQVIIIPLSLSIRFAWSHPASQFGKRVSTLCRKCNGTNVARINYLEFRLSTVNKPFDHSPIRLREAFIGQEANLDYWVIYHTLPTWRNARPRLPRIIIGAQNSISERLSLRYIVTPPSSHLNLEWVLHTLCTSVH-