Monarch geneset OGS2.0

DPOGS213901
TranscriptDPOGS213901-TA1554 bp
ProteinDPOGS213901-PA517 aa
Genomic positionDPSCF300218 - 295631-297523
RNAseq coverage18x (Rank: top 80%)
Annotation
HeliconiusHMEL0060640.096.06% 
BombyxBGIBMGA004614-TA0.090.95% 
DrosophilaHsp68-PA0.088.67% 
EBI UniRef50UniRef50_Q9U6390.081.73%Heat shock 70 kDa protein cognate 4 n=125 Tax=cellular organisms RepID=HSP7D_MANSE
NCBI RefSeqNP_001037396.10.095.81%heat shock protein 70 [Bombyx mori]
NCBI nr blastpgi|2249992830.096.55%HSP70 [Spodoptera exigua]
NCBI nr blastxgi|2249992830.096.55%HSP70 [Spodoptera exigua]
Group
Gene OntologyGO:00055241.7e-77ATP binding
KEGG pathwaydmo:Dmoj_GI241630.0 
 K03283 (HSPA1_8)maps-> Endocytosis
    MAPK signaling pathway
    Spliceosome
    Antigen processing and presentation
    Protein processing in endoplasmic reticulum
InterPro domain[1-515] IPR0010230Heat shock protein Hsp70
[3-407] IPR0131267.4e-198Heat shock protein 70
Orthology groupMCL10014 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213901-TA
ATGCCCGCTATTGGTATTGATCTCGGTACAACTTACTCCTGCGTTGGTGTTTGGCAACATGGAAATGTCGAAATCATCGCAAACGACCAAGGCAACAGGACGACTCCGTCTTACGTCGCATTCACAGATACGGAGAGATTGATCGGCGACGCTGCTAAGAACCAGGTGGCCCTGAACCCCAACAACACAGTCTTCGACGCGAAACGGTTAATCGGCCGCAAATTCGATGATCCTAAGATACAAGCCGACATGAAGCACTGGCCCTTCAAAGTGGTCAACGACTGTAGCAAACCGAAAATCCAAGTGGAGTTCAAGGGCGAGACGAAGAGATTCGCCCCCGAGGAAATCAGCAGCATGGTGTTGGTCAAGATGAAGGAGACCGCGGAGGCGTATCTCGGTACAACGGTCCGCGACGCCGTAGTCACAGTTCCGGCTTACTTCAACGACTCCCAGCGTCAAGCGACGAAGGATGCCGGAGCGATCGCAGGTCTGAACGTTCTGCGCATCATCAACGAGCCCACAGCCGCCGCACTCGCCTACGGCCTGGACAAGAACCTCAAAGGCGAAAGGAACGTGTTAATCTTCGATCTCGGCGGCGGCACCTTCGACGTGTCCATTCTGACCATCGACGAGGGCTCGCTGTTCGAGGTGAAGGCTACCGCTGGAGACACGCATCTCGGAGGCGAGGACTTTGACAACAGGCTGGTGAATCATTTCGCTGAAGAATTCGTCAGAAAGTACAAGAAAGACCTTCGAGCCAACCCTCGCGCGTTGCGACGCCTCCGCACCGCCGCCGAGCGCGCCAAGAGGACGCTGTCGTCCAGCAGCGAAGCGACGATCGAAATAGACGCTCTGTACGAGGGAATCGACTTCTACACCCGGGTCTCCCGCGCCAGGTTCGAGGAACTCAACTCCGACCTGTTCCGCGGTACCCTGGAGCCGGTCGAGAAGGCTCTGAAAGATGCGAAGATGGACAAGAGTCAGATACACGACGTGGTGCTCGTCGGTGGGTCGACTCGCATCCCGAAGGTGCAGAGCCTACTGCAGAACTTCTTCTGCGGCAAAAAGCTTAACCTGTCCATCAATCCGGACGAAGCGGTGGCCTACGGCGCGGCGGTCCAGGCGGCCATCCTGAGCGGAGAGAGCGACTCGAAGATCCAGGACGTGTTGCTCGTGGACGTGGCTCCGCTGTCTTTGGGCATCGAGACCGCCGGAGGGGCGGAACGCTACAAGGAAGAGGACGAGCGGCAGAGGCAGAGGGTGGCGGCGAGGAACCAGCTGGAGTCGTACGTGTTCAGTGTGAAGCAGGCCTTGGAGGACGCCGGAGAGAAGCTGAGCGACGGAGACAAGAGCGCGGCGAGGAACGAGTGTGACGAGGCGCTGAGGTGGCTGGACAACAACACGCTGGCCGAGAAGGAGGAGTACGAGCACCGGCTGAAGGACCTGCAGAGAGTATGTTCGCCCATCATGAGCAAGCTACACGGCGCGGGGGCGACGACCGGGCCGGCCGGAGCGGCGCGGGGCAGCGGACCCACGGTGGAGGAAGTGGACTAG

Protein sequence:

>DPOGS213901-PA
MPAIGIDLGTTYSCVGVWQHGNVEIIANDQGNRTTPSYVAFTDTERLIGDAAKNQVALNPNNTVFDAKRLIGRKFDDPKIQADMKHWPFKVVNDCSKPKIQVEFKGETKRFAPEEISSMVLVKMKETAEAYLGTTVRDAVVTVPAYFNDSQRQATKDAGAIAGLNVLRIINEPTAAALAYGLDKNLKGERNVLIFDLGGGTFDVSILTIDEGSLFEVKATAGDTHLGGEDFDNRLVNHFAEEFVRKYKKDLRANPRALRRLRTAAERAKRTLSSSSEATIEIDALYEGIDFYTRVSRARFEELNSDLFRGTLEPVEKALKDAKMDKSQIHDVVLVGGSTRIPKVQSLLQNFFCGKKLNLSINPDEAVAYGAAVQAAILSGESDSKIQDVLLVDVAPLSLGIETAGGAERYKEEDERQRQRVAARNQLESYVFSVKQALEDAGEKLSDGDKSAARNECDEALRWLDNNTLAEKEEYEHRLKDLQRVCSPIMSKLHGAGATTGPAGAARGSGPTVEEVD-