Monarch geneset OGS2.0

DPOGS206067
TranscriptDPOGS206067-TA1896 bp
ProteinDPOGS206067-PA631 aa
Genomic positionDPSCF300028 - 426385-435015
RNAseq coverage756x (Rank: top 17%)
Annotation
HeliconiusHMEL0050340.075.71% 
BombyxBGIBMGA006848-TA0.077.47% 
DrosophilaCG12104-PA3e-1443.75% 
EBI UniRef50UniRef50_B7P7175e-4947.80%High mobility group protein, putative n=1 Tax=Ixodes scapularis RepID=B7P717_IXOSC
NCBI RefSeqXP_002409472.11e-4947.80%high mobility group protein, putative [Ixodes scapularis]
NCBI nr blastpgi|2410950842e-4847.80%high mobility group protein, putative [Ixodes scapularis]
NCBI nr blastxgi|3838474203e-11544.19%PREDICTED: uncharacterized protein LOC100875697 [Megachile rotundata]
Group
Gene OntologyGO:00055153.8e-24protein binding
GO:00036777.7e-22DNA binding
GO:00056347.5e-05nucleus
KEGG pathway 
InterPro domain[209-301] IPR0090713.8e-24High mobility group, superfamily
[223-314] IPR0009107.7e-22High mobility group, HMG1/HMG2
Orthology groupMCL16959 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206067-TA
ATGCATGAGTGCACAAGCAAACATAACAGTTCACTACGGCCCGAAAGCCTTGACACGTCTCTATCCAACACCCGCGACACAAAAGGAAGTGTCTACGCTATGAATGATCAGACTTTTCACACGCCATCTTTCGGAGACGAAGAGTTTGACATTCCTCTGATCCACGGCCAGCATGCTGCCAGTGGACAGAACACACATATGCAATATTCACAACTACATCATTCAGCTCCTCAGGTTGGCATGATGAATCCAGCTCAAGACGGTCTAGCTCCTCCAGGAGGTGCTCCTTCATATCAACAGCCTTTATACTTACAAGAACCCCATACACCTGTCACATCTCACAGCAATGCAACCGCGCCGGCTGGCAATTATATGATACAGCAACAACCTGGAGGCCAGCAGTTATTAATGCTGCAACCGAGCCAAGTAATGAGTGGCCCACCAACTCCTAGCACCCCGACACAGGCTGCCCCTGTCTATGGATCACCACAGAGAGCATCTCCACCTGGAACTACCAGTGATGATTCTGATGATAGTGTACCATCTCAACATTCCCAGATTGGTGTAAGTAATATGGCCGTAAAGAGATCATCACCAGAACCAATGGATAATGGAATAAGTAGGGGACAAATGCAAAAAAAACCTAAGGTTCAGAAGAAAAAGAAAAAGAGAGATCCTAATGAACCCCAAAAACCTGTATCTGCCTATGCCTTATTTTTTCGAGACACTCAAGCTGCCATTAAAGGTCAAAACCCTAATGCCAGTTTTGGAGAAGTGTCAAAAATTGTTGCATCTATGTGGGATGGCCTCGATTCAGAACACAAAAGCGTATATAAACAAAAAACAGAAGTGGCAAAGAAAGAATATTTAAAAGCATTAGCAGCATATCGGGCCAGTTTGGTTTCAAAGGGCGGAGAACAAGAAAATCAAGTCATGTATAATCACAACAACACAAATGCAAATTATGGGAATTATTATCAAGGTCAAGCCTATGGCAATGGTCATCCACCACAGGGCTATGCACCAAATTCGACACCACAAGGTTACACACCACAGAATTTCCCCGGAGGACAACCCCAACCTCCATATGGTGGTAATGGACCACAAGGATACCCAACAAATCCCCAAGCACCATCTCAAAATTATCAAGGAAACATGGGACATAATCCCCAAACATATCAAAATGTACCTGGACAACCACCTCAAGGTTACCAAGTGAATACAACATCTTCGTCTCAAGTGTGTCAACCTAACATTGCTCAATCACCAAGAAACTATCAGCCCGCACAATCTCCTAACTATGGAGCCAATAATGCCCTGTCCCCGCCAGGCTACAGACAAGTTCAACCACAATCGCCACCTGTGCAACAAGTCCATCCTGCTATGCAGTATCATCACTCCCAACAAATGCAGCAAGTACATCAAGTCCAATACCAGCAGCAACAACAAATACATCAGCAAAGTCAGCAGCAAGCTATGCAACATCAGCAACAGCAACAACACCAGCAGCAGCAGCAACAACAACATCAGCAGCAGCAGCAACATCAACAGCAACAACACCAGCAGCAGCAGCATCAGCAACAACATCAACAACAAACACCACAACAAATACCACCTCAACCATTGCCACCACAAGCACCAATTATAAAAAATGAACAGCAGTCTCCGAACAACAATGGAACAGGTGTACCTCATCAGTCCCCTGAGCAAACTGAAAATAGAAGTACTCCATCATGTATACGCCAAGGCTGCACCAATCCTGCCATCCCAAATAGCGAATGGGAAGATGAATATTGTTCTAATGAATGTGTTGTTAGTCACTGCAGGGATGTCTTCAGCTCATGGGTAGCATCAAATACCAACAATCAGATACAAAATTTTTCTGCTGTGAAGTAA

Protein sequence:

>DPOGS206067-PA
MHECTSKHNSSLRPESLDTSLSNTRDTKGSVYAMNDQTFHTPSFGDEEFDIPLIHGQHAASGQNTHMQYSQLHHSAPQVGMMNPAQDGLAPPGGAPSYQQPLYLQEPHTPVTSHSNATAPAGNYMIQQQPGGQQLLMLQPSQVMSGPPTPSTPTQAAPVYGSPQRASPPGTTSDDSDDSVPSQHSQIGVSNMAVKRSSPEPMDNGISRGQMQKKPKVQKKKKKRDPNEPQKPVSAYALFFRDTQAAIKGQNPNASFGEVSKIVASMWDGLDSEHKSVYKQKTEVAKKEYLKALAAYRASLVSKGGEQENQVMYNHNNTNANYGNYYQGQAYGNGHPPQGYAPNSTPQGYTPQNFPGGQPQPPYGGNGPQGYPTNPQAPSQNYQGNMGHNPQTYQNVPGQPPQGYQVNTTSSSQVCQPNIAQSPRNYQPAQSPNYGANNALSPPGYRQVQPQSPPVQQVHPAMQYHHSQQMQQVHQVQYQQQQQIHQQSQQQAMQHQQQQQHQQQQQQQHQQQQQHQQQQHQQQQHQQQHQQQTPQQIPPQPLPPQAPIIKNEQQSPNNNGTGVPHQSPEQTENRSTPSCIRQGCTNPAIPNSEWEDEYCSNECVVSHCRDVFSSWVASNTNNQIQNFSAVK-