Monarch geneset OGS2.0

DPOGS213485
TranscriptDPOGS213485-TA525 bp
ProteinDPOGS213485-PA174 aa
Genomic positionDPSCF300100 + 192144-192770
RNAseq coverage130x (Rank: top 56%)
Annotation
HeliconiusHMEL0168508e-7296.92% 
BombyxBGIBMGA004367-TA6e-6589.31% 
DrosophilaSox15-PA5e-2088.89% 
EBI UniRef50UniRef50_B3MCY64e-1888.89%GF12898 n=2 Tax=Sophophora RepID=B3MCY6_DROAN
NCBI RefSeqXP_974207.11e-2065.17%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|910868431e-1965.17%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastxgi|910868431e-1857.69%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
Gene OntologyGO:00055151.7e-11protein binding
GO:00036771.9e-11DNA binding
KEGG pathwayspu:5871412e-15 
 K04495 (SOX17)maps-> Wnt signaling pathway
InterPro domain[75-130] IPR0090711.7e-11High mobility group, superfamily
[94-131] IPR0009101.9e-11High mobility group, HMG1/HMG2
Orthology groupMCL22067 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213485-TA
ATGATGGATTCTGGCGGCGCGGGTATAGAATCGCCACCCACCTACCACCGAAGCTACGAACAGTACGTCCAGGGGGCTGTAGAGACCAGCACGGACTCTGGCCAGGATCAGACCAGCCCGGAGCTCGTCGTGTGGTCTACGTTACCATACGGTCTGGACTATAGAGCGCAATACGACTACAGAAGTCCCTATGATACCTCAAGGGATTACTCCTCACAGCAATACGCTAGAGCGCCCTTCACAACAAAGATGGGTCAGGCCAAAGCCTTAAAGGAAGCCAGGATACGAAGACCTATGAATGCTTTCATGGTGTGGGCGAAGGTGGAGCGGAAGAAGCTGGCTGATGAGAACCCGGATCTCCATAACGCTGACTTAAGTAAAATGCTAGGATATTTATTATTCCGTATTCGGGAAGCAAAAAACGCTTCGTTGTTTAAAATAAACGCGTTAAAAAGTACAAAAGCAGTTAAACCCTTGCTCGTGAGTCAGCACAAACGACCAATCTTCGGAAGATTTGAATTTTAA

Protein sequence:

>DPOGS213485-PA
MMDSGGAGIESPPTYHRSYEQYVQGAVETSTDSGQDQTSPELVVWSTLPYGLDYRAQYDYRSPYDTSRDYSSQQYARAPFTTKMGQAKALKEARIRRPMNAFMVWAKVERKKLADENPDLHNADLSKMLGYLLFRIREAKNASLFKINALKSTKAVKPLLVSQHKRPIFGRFEF-