Monarch geneset OGS2.0

DPOGS210898
TranscriptDPOGS210898-TA609 bp
ProteinDPOGS210898-PA202 aa
Genomic positionDPSCF300045 - 686857-687465
RNAseq coverage28x (Rank: top 76%)
Annotation
HeliconiusHMEL0028562e-9094.58% 
BombyxBGIBMGA003079-TA5e-10093.63% 
DrosophilaSox21b-PA2e-4068.97% 
EBI UniRef50UniRef50_UPI00022476693e-4347.73%UPI0002247669 related cluster n=3 Tax=unknown RepID=UPI0002247669
NCBI RefSeqXP_001688246.11e-3970.69%AGAP010919-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3454941901e-4247.73%PREDICTED: transcription factor Sox-14-like [Nasonia vitripennis]
NCBI nr blastxgi|3407216495e-5154.72%PREDICTED: SOX domain-containing protein dichaete-like [Bombus terrestris]
Group
Gene OntologyGO:00036777.7e-29DNA binding
GO:00055151.6e-23protein binding
KEGG pathway 
InterPro domain[1-73] IPR0009107.7e-29High mobility group, HMG1/HMG2
[1-69] IPR0090711.6e-23High mobility group, superfamily
Orthology groupMCL18241 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210898-TA
ATGAACGCCTTCATGGTCTGGTCCAGGCTACAACGTCGCCAGATCGCCAAGGATAATCCGAAGATGCACAATTCGGAGATATCGAAACGGCTCGGAGCCGAATGGAAGCTGTTGTCTGAAATGCAGAAGAGACCATTCATTGACGAAGCAAAACGACTCCGAGCTCTCCATATGAAAGAGCACCCCGACTACAAGTACCGGCCACGAAGGAAGCCCAAGCCACCGACAGCGGGAGGAGCTCCGGGCGCTGGAGCTTTCCCGAGCTTTCCACTGCCTTACTTCGCGGGTCCAGCACCGACCGTTGGACCGTTGGACGCCCTTTCGTATTCGGCAGTGCCTCCATACTTCCCACATCAGCTGGATCACTTGCAATTCTCAAAACTAATGGCTCCGACCGAGAAGTTGCCGACGGCATCTTCAGCTGCCGCTGTGGTGTCGTCGTTCTATTCATCACTCTACACACAGCCGGCCGCACCTCCGAAGCCGTTCCCATCTCCTCTGTTCCACCAGTACGGAGCAGCGCCGGCTTCTCCTGTGTCTCCGGTGACTTCCACGCAGCACAGCCCTCATGACGACCAGCTCAGGCGGCCGGTTTCAGTTATATATTGA

Protein sequence:

>DPOGS210898-PA
MNAFMVWSRLQRRQIAKDNPKMHNSEISKRLGAEWKLLSEMQKRPFIDEAKRLRALHMKEHPDYKYRPRRKPKPPTAGGAPGAGAFPSFPLPYFAGPAPTVGPLDALSYSAVPPYFPHQLDHLQFSKLMAPTEKLPTASSAAAVVSSFYSSLYTQPAAPPKPFPSPLFHQYGAAPASPVSPVTSTQHSPHDDQLRRPVSVIY-