Monarch geneset OGS2.0

DPOGS213900
TranscriptDPOGS213900-TA1746 bp
ProteinDPOGS213900-PA581 aa
Genomic positionDPSCF300218 - 306221-307966
RNAseq coverage15x (Rank: top 82%)
Annotation
HeliconiusHMEL0060640.095.41% 
BombyxBGIBMGA014536-TA0.092.48% 
DrosophilaHsp68-PA0.085.64% 
EBI UniRef50UniRef50_Q9U6390.078.86%Heat shock 70 kDa protein cognate 4 n=125 Tax=cellular organisms RepID=HSP7D_MANSE
NCBI RefSeqNP_001037396.10.095.41%heat shock protein 70 [Bombyx mori]
NCBI nr blastpgi|2249992830.096.33%HSP70 [Spodoptera exigua]
NCBI nr blastxgi|2249992830.096.33%HSP70 [Spodoptera exigua]
Group
Gene OntologyGO:00055248.5e-88ATP binding
KEGG pathwaytca:6632930.0 
 K03283 (HSPA1_8)maps-> Endocytosis
    MAPK signaling pathway
    Spliceosome
    Antigen processing and presentation
    Protein processing in endoplasmic reticulum
InterPro domain[1-544] IPR0010230Heat shock protein Hsp70
[3-544] IPR0131265.4e-256Heat shock protein 70
Orthology groupMCL10014 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213900-TA
ATGCCCGCTATTGGTATTGATCTCGGTACAACTTACTCCTGCGTTGGTGTTTGGCAACATGGAAATGTCGAAATCATCGCAAACGACCAAGGCAACAGGACGACTCCGTCTTACGTCGCATTCACAGATACGGAGAGATTGATCGGCGACGCTGCTAAGAACCAGGTGGCCCTGAACCCCAACAACACAGTCTTCGACGCGAAACGGTTAATCGGCCGCAAATTCGATGATCCTAAGATACAAGCCGACATGAAGCACTGGCCCTTCAAAGTGGTCAACGACTGTAGCAAACCGAAAATCCAAGTGGAGTTCAAGGGCGAGACGAAGAGATTCGCCCCCGAGGAAATCAGCAGCATGGTGTTGGTCAAGATGAAGGAGACCGCGGAGGCGTATCTCGGTACAACGGTTCGCGACGCTGTAGTCACAGTTCCGGCTTACTTCAACGACTCCCAGCGTCAAGCGACGAAGGATGCCGGAGCGATCGCAGGTCTGAACGTTCTCCGCATCATCAACGAGCCCACAGCCGCCGCACTTGCCTACGGCCTGGACAAGAACCTCAAAGGCGAACGGAACGTGTTAATCTTTGATCTCGGCGGCGGCACCTTCGACGTGTCCATTCTGACCATCGACGAGGGCTCGCTGTTCGAGGTGAAGGCTACCGCTGGAGACACGCATCTCGGAGGCGAGGACTTTGACAACAGGCTGGTGAATCATTTCGCTGAAGAATTCGTCAGAAAGTACAAGAAAGACCTTCGAGCCAACCCTCGCGCGTTGCGACGCCTCCGCACCGCCGCCGAGCGCGCCAAGAGGACGCTGTCGTCCAGCAGCGAAGCGACGATCGAAATAGACGCTCTGTACGAGGGGATCGACTTCTACACCCGGGTCTCCCGCGCCAGGTTCGAGGAACTCAACTCCGACCTGTTCCGCGGTACCCTGGAGCCGGTCGAGAAGGCTCTGAAAGATGCGAAGATGGACAAGAGTCAGATACACGACGTGGTGCTCGTCGGTGGATCGACTCGCATCCCAAAGGTGCAGAGCCTGCTGCAGAACTTCTTCTGCGGCAAGAAGCTCAACCTGTCCATCAATCCGGACGAAGCGGTGGCCTACGGCGCGGCGGTCCAGGCGGCCATCCTGAGCGGAGAGAGCGACTCGAAGATCCAGGACGTGTTGCTCGTGGACGTGGCTCCGCTGTCTTTGGGCATCGAGACCGCCGGAGGGGTGATGACGAAGATCATAGAACGCAACTGCAAGATCCCCTGCAAGCAGTCGCAGACGTTCACCACGTACTCGGACAACCAGCCGGCCGTCACCATCCAGGTGTACGAGGGCGAGCGAGCGATGACGAAAGACAACAACCTGCTGGGGACGTTCGACCTGACCGGCATACCGCCGGCGCCGCGCGGAGTGCCCAAGATAGACGTGACCTTCGACCTGGACGCCAACGGCATCCTGAACGTGTCGGCCAAGGAGAACAGCACGGGCCGCAGCAAGAACATCGTGATCAAGAACGACAAGGGGCGGCTGTCGCAAAGCGAGATCGAGCGCATGTTGGCGGAGGCGGAGCGCTACAAGGAAGAGGACGAGCGGCAGAGACAGAGGGTGGCGGCAAGGAACCAGCTGGAGTCGTACGTGTGGCGCGGGGGCGGCGACCGGCCCGGCCGGGGCGGCGCGGGGCAGCGGACCCACGGTGGAGGAAGTGGACTAGACTCGCCCACGGCCCGGAGCAGGTCGCTGGTTAGGAGCTAA

Protein sequence:

>DPOGS213900-PA
MPAIGIDLGTTYSCVGVWQHGNVEIIANDQGNRTTPSYVAFTDTERLIGDAAKNQVALNPNNTVFDAKRLIGRKFDDPKIQADMKHWPFKVVNDCSKPKIQVEFKGETKRFAPEEISSMVLVKMKETAEAYLGTTVRDAVVTVPAYFNDSQRQATKDAGAIAGLNVLRIINEPTAAALAYGLDKNLKGERNVLIFDLGGGTFDVSILTIDEGSLFEVKATAGDTHLGGEDFDNRLVNHFAEEFVRKYKKDLRANPRALRRLRTAAERAKRTLSSSSEATIEIDALYEGIDFYTRVSRARFEELNSDLFRGTLEPVEKALKDAKMDKSQIHDVVLVGGSTRIPKVQSLLQNFFCGKKLNLSINPDEAVAYGAAVQAAILSGESDSKIQDVLLVDVAPLSLGIETAGGVMTKIIERNCKIPCKQSQTFTTYSDNQPAVTIQVYEGERAMTKDNNLLGTFDLTGIPPAPRGVPKIDVTFDLDANGILNVSAKENSTGRSKNIVIKNDKGRLSQSEIERMLAEAERYKEEDERQRQRVAARNQLESYVWRGGGDRPGRGGAGQRTHGGGSGLDSPTARSRSLVRS-