Monarch geneset OGS2.0

DPOGS213248
TranscriptDPOGS213248-TA975 bp
ProteinDPOGS213248-PA324 aa
Genomic positionDPSCF300124 + 221906-227586
RNAseq coverage143x (Rank: top 54%)
Annotation
HeliconiusHMEL0080811e-12091.41% 
BombyxBGIBMGA009439-TA2e-13388.60% 
DrosophilaSox102F-PB4e-6195.45% 
EBI UniRef50UniRef50_F4WJY93e-7177.71%Transcription factor SOX-6 n=10 Tax=Formicidae RepID=F4WJY9_ACREC
NCBI RefSeqXP_974417.15e-7274.18%PREDICTED: similar to GA10800-PA [Tribolium castaneum]
NCBI nr blastpgi|910809519e-7174.18%PREDICTED: similar to GA10800-PA [Tribolium castaneum]
NCBI nr blastxgi|910809513e-7947.62%PREDICTED: similar to GA10800-PA [Tribolium castaneum]
Group
Gene OntologyGO:00036773.4e-32DNA binding
GO:00055152.2e-26protein binding
KEGG pathway 
InterPro domain[172-251] IPR0009103.4e-32High mobility group, HMG1/HMG2
[154-243] IPR0090712.2e-26High mobility group, superfamily
Orthology groupMCL17838 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213248-TA
ATGGAAAAACTTGGTCGATTACCTACCAGTGACGCCCGTCCTCAGTATCGGCAGCGGAATATAGGGACGGTTTCCGCTGAGTTGGAGCTCCAGAGACTTCAGCAGGAGCATCTCCGGAGACAGGAGCTGGTACGACGAGGTCACTCCTTGTATCCAGCTCCTCCACTGGCGCTGCTGCCACTGCTGGAACAGATGCGGCCACAACAACCTCAGCCAAACGTGGTTCAAACTTCGAACTGGCCGGCCACAGCTCAACTGGCACAACTGACTGCCAGCGCTCGGTCCCCACCTCCTCAAGATCCTGACGCGCCTCTGAACCTCAGCAAACAACGATCACCGTCTCCCATGATGATGCCACGCTATGTACCCTACCCCCCAATGGAGGAACAGTATATGAAGAAAGACGATGACTTCAACAACGCTTGTAACACATCATCCTGGAATCAATCACCGCCAGAGGAATCCGAAAAGGCTAAGCTCATTCGTCAACCAAAACGCGATGAGTCTGGAAAACCACACATCAAAAGACCAATGAATGCTTTCATGGTTTGGGCAAAGGATGAGCGCCGTAAGATTTTAAAGGCATGCCCGGATATGCACAACTCTAATATATCTAAGATTTTGGGAGCAAGATGGAAGGCCATGTCCAACGCCGAGAAGCAGCCCTACTATGAAGAACAATCGAGACTTTCAAAGCTTCACATGGAAAAGCATCCTGATTATAGGTACCGACCAAGACCCAAACGAACATGCATCGTCGATGGTAAGAAGATGCGGATATCAGAGTACAAAAACCTGATGCGTACACGCAGGCAAGAGATGAGGCAACTTTGGTGCAGGGATGGAGGTAGTGAACTTAACTTCCTCCCATCCCTGTCTAGCCCTGGGCCGTCGAATTCCTCTCCTCCGCCGAACGGGGGCAACTATATGAACCCGGCATTCTCGCCCCCCCTTAGCCCTGGCGAGGAGGATTGA

Protein sequence:

>DPOGS213248-PA
MEKLGRLPTSDARPQYRQRNIGTVSAELELQRLQQEHLRRQELVRRGHSLYPAPPLALLPLLEQMRPQQPQPNVVQTSNWPATAQLAQLTASARSPPPQDPDAPLNLSKQRSPSPMMMPRYVPYPPMEEQYMKKDDDFNNACNTSSWNQSPPEESEKAKLIRQPKRDESGKPHIKRPMNAFMVWAKDERRKILKACPDMHNSNISKILGARWKAMSNAEKQPYYEEQSRLSKLHMEKHPDYRYRPRPKRTCIVDGKKMRISEYKNLMRTRRQEMRQLWCRDGGSELNFLPSLSSPGPSNSSPPPNGGNYMNPAFSPPLSPGEED-