Monarch geneset OGS2.0

DPOGS207792
Transcript: DPOGS207792-TA, 1095 bp
Protein: DPOGS207792-PA, 364 aa
Genomic position: DPSCF300042 + 214451-216244
RNAseq coverage: 189x (Rank: top 48%)
Annotation
Heliconius: HMEL017569  2e-98  73.57%
Bombyx: BGIBMGA005325-TA  8e-99  61.64%
Drosophila: CG9418-PA  5e-33  32.69%
EBI UniRef50: UniRef50_UPI00015B49EF  5e-52  42.40%  UPI00015B49EF related cluster n=1 Tax=unknown RepID=UPI00015B49EF
NCBI RefSeq: XP_625193.1  8e-53  39.87%  PREDICTED: similar to high mobility group 20A isoform 1 [Apis mellifera]
NCBI nr blastp: gi|350397805  4e-53  40.06%  PREDICTED: high mobility group protein 20A-like [Bombus impatiens]
NCBI nr blastx: gi|156542951  3e-52  38.72%  PREDICTED: high mobility group protein 20A-like [Nasonia vitripennis]
Group
Gene Ontology: GO:0003677  1.1e-22  DNA binding
  GO:0005515  1.6e-22  protein binding
KEGG pathway: dme:Dmel_CG12223  3e-08
  K10802 (HMGB1) maps-> Base excision repair
InterPro domain: [70-156] IPR000910  1.1e-22  High mobility group, HMG1/HMG2
  [56-148] IPR009071  1.6e-22  High mobility group, superfamily
Orthology group: MCL11245  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207792-TA
ATGGAACCTGATAATACTCAATCTGTTGAAATACAAACAGTGGAAAACGAGGCATCTATTAAATCTACGACTGCCGATCACGAAGATAGTGCATTAGCTGCAAATTCAACAAATGGTAATGATACCGACGTAGGCACTCAACCCCCAAATTCTAAAGATATTCAGAAACCGTCACCCAAAAAGCCGAAAAAGAGGAAACCTAAAATACCACGGGATGTAACTGCGCCTCGCCAGCCTCTGACTGGTTATGTTAGATACCTCAATGAACGCCGTGATCAGTTAAGAGCTGAACAACCAGAATTGGGTTTTGCGGAGTTAACTAGACAACTAGCCAGTGAATGGAGCAAACTCCCCACTGAAGAAAAACAGCACTATTTGGATGCAGCTGATCAAGATAAGGAAAGATATATTAAAGAATGGGCTGAGTATAAAAAAACTGATGCTTATAAAGAGTTTAGAAAGCAGCAAATGGAACAAAAAGATACAGGAACTGTAACCAAAAAGGTTAAATCATGTGTTGATAATAATATTATAAGTGTTACAGGAGGTGCTACTCAAATTTCTGCTGAACCTGTTGTGGGTGGGAATATATCAACTGGTTCTATTCTCAGTGCAACAAGGCAACAGACTCCACCTCGACCGAGGCCCTGCATAACACCTGCTTCAGGTGAAGAAATGTCTGGAGATACAGATATACCTATTTTCACTGATCAATTCTTACAACACAACAAGTTGAGGGAATCTGAACTTCGTCAGTTGAGAAAAGCAAATTCAGATTATGAGCAGCAGAATGCTATATTACAACGTCATGCTGAAGAAGTAAGTGCTGCTACAGCTAGGCTTCGAGCTGAAACAACCGCAGCAGCCGAACGCACAGCCTTGCTTGTAGCTCATCGCAGACTTTTAGTGACATCACTAGTTCAGGCTTTACAATCTGTTGTGCTGCCTTTACAAAACGGACCAGCTGGTGCATCAGAGTCCAACATAGAGGAGTACATGGAAAAATTACAGAGCTTGGTAACAGAAGGAAAGAACAATCCAGTATTGAAACAGGCGAAAGAAATATTAAATAAAATTGAATTACCTATGAATTAA

Protein sequence:

>DPOGS207792-PA
MEPDNTQSVEIQTVENEASIKSTTADHEDSALAANSTNGNDTDVGTQPPNSKDIQKPSPKKPKKRKPKIPRDVTAPRQPLTGYVRYLNERRDQLRAEQPELGFAELTRQLASEWSKLPTEEKQHYLDAADQDKERYIKEWAEYKKTDAYKEFRKQQMEQKDTGTVTKKVKSCVDNNIISVTGGATQISAEPVVGGNISTGSILSATRQQTPPRPRPCITPASGEEMSGDTDIPIFTDQFLQHNKLRESELRQLRKANSDYEQQNAILQRHAEEVSAATARLRAETTAAAERTALLVAHRRLLVTSLVQALQSVVLPLQNGPAGASESNIEEYMEKLQSLVTEGKNNPVLKQAKEILNKIELPMN-
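
The transcript and protein records above are consistent with each other in length: 1095 bp is 365 codons, that is 364 residues plus one stop codon, which this record writes as a trailing "-". A minimal sketch of how such a check could be run is given below. It assumes the standard genetic code (NCBI translation table 1); the function name `translate` and the `stop_symbol` parameter are hypothetical and not part of the database. The demonstration uses only the first three codons of DPOGS207792-TA followed by its stop codon.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
# Assumption: the record was translated with this table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons are written as stop_symbol, matching the trailing "-"
    used in the protein record. Ambiguous bases such as N are not handled
    and raise KeyError.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(residues)


# First three codons of DPOGS207792-TA plus its final stop codon.
print(translate("ATGGAACCTTAA"))  # prints "MEP-"
```

Passing the full nucleotide sequence to `translate` and comparing the result with the protein sequence would test the whole record in the same way.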