Monarch geneset OGS2.0

DPOGS209911
Transcript: DPOGS209911-TA; 675 bp
Protein: DPOGS209911-PA; 224 aa
Genomic position: DPSCF300049; + strand; 693185-693859
RNAseq coverage: 387x (Rank: top 31%)
Annotation
Heliconius: HMEL008012; 6e-38; 88.10%
Bombyx: BGIBMGA004151-TA; 3e-97; 85.09%
Drosophila: Sox14-PA; 1e-29; 70.27%
EBI UniRef50: UniRef50_D6WQB9; 2e-43; 43.40%; Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WQB9_TRICA
NCBI RefSeq: XP_973116.1; 3e-44; 43.40%; PREDICTED: similar to Putative transcription factor SOX-14 [Tribolium castaneum]
NCBI nr blastp: gi|91086963; 6e-43; 43.40%; PREDICTED: similar to Putative transcription factor SOX-14 [Tribolium castaneum]
NCBI nr blastx: gi|270009646; 4e-48; 46.36%; hypothetical protein TcasGA2_TC008936 [Tribolium castaneum]
Group
Gene Ontology: GO:0003677; 1.2e-28; DNA binding
               GO:0005515; 1.9e-25; protein binding
KEGG pathway 
InterPro domain: [1-74] IPR000910; 1.2e-28; High mobility group, HMG1/HMG2
                 [1-69] IPR009071; 1.9e-25; High mobility group, superfamily
Orthology group: MCL18287 (Insect specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209911-TA
ATGAATGCGTTTATGGTTTGGTCTCAAATCGAACGCAGAAAAATTTGCGAACAAACTCCAGATATGCATAATGCAGAAATTTCTAAGAACTTAGGGCGAGTTTGGAAAACGCTAAACGATGAGGAAAGGCAGCCTTTTATAGATGAAGCAGAAAGGCTGAGACAACTGCATATGCGCGAGTACCCTGATTATAAATATAGGCCTCGCAAGAAAACAGCAAAACCTGCACAAAGAAGTGGTGCAATAACCAAGCAGAAGCGTAAACAAAGAGCTGACAGTAACAACAACAGAGGTGTATCCAGACGGAGAACGCGGACGGTCCCAAGCATCTCCAGTGTTCCAATGGAAACGCCAGCTCCTCCACCACTGCCAGCATCCCCAGCGGGATCGCCGGATTCTCCTGAATCTGCGTGTTTCTACGACGATAACACGAAACGCGATCAATCAGACCTAACTGAACTTTATTCAATTACGGATTTATTTACGTTACCAGCAGACTGTGAAGTTGATCTAGACGCGTTGACGGAAATGGAGGACTTTGAGACGGCATCTTCTTCGTCAGGATCACACTTTGAGTTCTCATGCACGCCGGACGTGTCTGATATGCTCAGTGAGATTGGCGTAGCAGGGGATTGGGATGACCCTACTTTCTCATCGTACCTTACGTCGTCTTAA

Protein sequence:

>DPOGS209911-PA
MNAFMVWSQIERRKICEQTPDMHNAEISKNLGRVWKTLNDEERQPFIDEAERLRQLHMREYPDYKYRPRKKTAKPAQRSGAITKQKRKQRADSNNNRGVSRRRTRTVPSISSVPMETPAPPPLPASPAGSPDSPESACFYDDNTKRDQSDLTELYSITDLFTLPADCEVDLDALTEMEDFETASSSSGSHFEFSCTPDVSDMLSEIGVAGDWDDPTFSSYLTSS-
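
The protein record above appears to be the conceptual translation of the transcript, with the trailing "-" standing for the stop codon. The following is a minimal sketch for checking that relationship, assuming the standard genetic code applies and that the transcript is a single uninterrupted coding sequence. The names `translate` and `span_length` are hypothetical helpers written for this illustration and are not part of the geneset resource.

```python
from itertools import product

# Standard genetic code, with codons ordered by base T, C, A, G at each
# position (assumption: this nuclear gene uses the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`, matching the "-" that ends
    the protein record above.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate range."""
    return end - start + 1


# First 21 and last 12 nucleotides of DPOGS209911-TA, copied from the record.
print(translate("ATGAATGCGTTTATGGTTTGG"))  # MNAFMVW
print(translate("ACGTCGTCTTAA"))           # TSS-
# The genomic span equals the transcript length (675 bp), which is
# consistent with a single-exon gene.
print(span_length(693185, 693859))         # 675
```

Passing the full nucleotide sequence to `translate` should reproduce the 224-residue protein followed by "-", since 675 bp corresponds to 225 codons including the stop.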