Monarch geneset OGS2.0

DPOGS210928
TranscriptDPOGS210928-TA669 bp
ProteinDPOGS210928-PA222 aa
Genomic positionDPSCF300045 + 713021-718971
RNAseq coverage7x (Rank: top 86%)
Annotation
HeliconiusHMEL0133012e-9889.64% 
BombyxBGIBMGA003079-TA2e-3986.05% 
DrosophilaSox21a-PA8e-4395.18% 
EBI UniRef50UniRef50_D6WND15e-5861.16%Sox21a n=1 Tax=Tribolium castaneum RepID=D6WND1_TRICA
NCBI RefSeqXP_971910.19e-5961.16%PREDICTED: similar to Sox21a CG7345-PA [Tribolium castaneum]
NCBI nr blastpgi|910837632e-5761.16%PREDICTED: similar to Sox21a CG7345-PA [Tribolium castaneum]
NCBI nr blastxgi|1107608932e-6260.17%PREDICTED: hypothetical protein LOC726150 [Apis mellifera]
Group
Gene OntologyGO:00036773.9e-33DNA binding
GO:00055152.9e-27protein binding
KEGG pathway 
InterPro domain[9-88] IPR0009103.9e-33High mobility group, HMG1/HMG2
[2-83] IPR0090712.9e-27High mobility group, superfamily
Orthology groupMCL18472 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210928-TA
ATGTCTCTGTCGAAGCAACCTTCGGATCATATAAAGAGGCCCATGAACGCATTCATGGTGTGGTCCAGGGGTCAAAGGAGGAAGATGGCACAGGACAACCCGAAGATGCACAACTCAGAGATATCCAAGAGACTTGGAGCCGAGTGGAAGTTGCTCACTGAGATGGAGAAGAGACCGTTCATCGATGAGGCGAAAAGACTCAGAGCCCTCCACATGAAAGAACATCCCGATTATAAATATCGGCCGCGGCGGAAACCGAAGGCGTTGATCAAGAAGGAGCCCAAGTTCGGTTTCAACATCAGCGGTCTGATGGCACCAGTTCCCCGTCTGATGACGCCCTCTATGCCGCAACCAGTACCACAGATGCCTGTACCACATCACTTGCTGCAGGACAAACCTGACTTGGGACGGACGCTCTTCCCACCCATACCGTACCCATTTTACCCCTTCGCCAAGATTCCCTCCGACGATGGAAAGTTAGCCGCAGAATTAGCGCATCTACAGGCCCTGTACGGCGGCGCGCTGTACAGCTCGGCGTTGTACAACAGCGCCCTGTCTCCTTGCGGCTGCCCTCCTCGCCGGTCTCCGTCCCCCCCGCCGGACGTGAAGCGCCCGGTGGCTTACGTTCTGATGAAGAGTGACGAGGAGCCTCCGCAGCATGTTATATGA

Protein sequence:

>DPOGS210928-PA
MSLSKQPSDHIKRPMNAFMVWSRGQRRKMAQDNPKMHNSEISKRLGAEWKLLTEMEKRPFIDEAKRLRALHMKEHPDYKYRPRRKPKALIKKEPKFGFNISGLMAPVPRLMTPSMPQPVPQMPVPHHLLQDKPDLGRTLFPPIPYPFYPFAKIPSDDGKLAAELAHLQALYGGALYSSALYNSALSPCGCPPRRSPSPPPDVKRPVAYVLMKSDEEPPQHVI-