Monarch geneset OGS2.0

DPOGS210897
TranscriptDPOGS210897-TA669 bp
ProteinDPOGS210897-PA222 aa
Genomic positionDPSCF300045 - 701759-702427
RNAseq coverage24x (Rank: top 78%)
Annotation
HeliconiusHMEL0028572e-10382.88% 
BombyxBGIBMGA003078-TA1e-7669.63% 
DrosophilaSox21b-PA3e-3371.26% 
EBI UniRef50UniRef50_G7YF271e-3165.31%Transcription factor SOX1/2/3/14/21 n=1 Tax=Clonorchis sinensis RepID=G7YF27_CLOSI
NCBI RefSeqXP_002048056.13e-3271.26%GJ11556 [Drosophila virilis]
NCBI nr blastpgi|3407216493e-3261.21%PREDICTED: SOX domain-containing protein dichaete-like [Bombus terrestris]
NCBI nr blastxgi|3407216494e-3255.81%PREDICTED: SOX domain-containing protein dichaete-like [Bombus terrestris]
Group
Gene OntologyGO:00036771.3e-30DNA binding
GO:00055158.8e-27protein binding
KEGG pathway 
InterPro domain[24-102] IPR0009101.3e-30High mobility group, HMG1/HMG2
[5-97] IPR0090718.8e-27High mobility group, superfamily
Orthology groupMCL26729 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210897-TA
ATGTCTTATCCGTTGGAAAATCCACAAGCAAAGTCAGTGACGAAACTATCGAGAAATTCAAACCCCTATCACATAAAACGTCCCATGAACGCGTTTATGGTTTGGTCGAGGCTACAGCGGAAGAAGATATCCTCTTTGAACCCGAAACTTCATAACTCCGAGATATCAAAACGGCTCGGTCTAGAATGGAAGAGTCTAGACGACTCCGAGAAGAGACCGTTTATAGATGAGGCAAAGAGGTTGCGGCTGAAACACATGCACGACTATCCAGACTATAAGTACAGACCGCGTCGGAAAAACAGGATGGACGCCTCAATATATGGACCTACTTCTTTATATTCATCTCGAGAGAGCTTCGTAGAAATAGAACCGAGGGTCGAAGCCAGCTATCCGATACCAATCAACTATTCCGATCCGCAGTACATGTACAACAACGTTATAAGTTACACGGTGCCTATAAATCAGGCTCCTTTCGTTTCCCCCATAAGACCCAAAGAGGAGACCTTGCCGAGTTTAGATTTGAGGCCTCTGCCGTCGATCGAGAGCATTTCCCCGAGGCCGTTCGCGGTCGTGAACCAGCAGATGATGGTGAAGTCTTACCCCAACCTGCACTACGTCCCTGAAGACGTCGCGAGGATATCGTATCATTACCCCTTCAGTGTGCAGTAG

Protein sequence:

>DPOGS210897-PA
MSYPLENPQAKSVTKLSRNSNPYHIKRPMNAFMVWSRLQRKKISSLNPKLHNSEISKRLGLEWKSLDDSEKRPFIDEAKRLRLKHMHDYPDYKYRPRRKNRMDASIYGPTSLYSSRESFVEIEPRVEASYPIPINYSDPQYMYNNVISYTVPINQAPFVSPIRPKEETLPSLDLRPLPSIESISPRPFAVVNQQMMVKSYPNLHYVPEDVARISYHYPFSVQ-