Monarch geneset OGS2.0

DPOGS210048
TranscriptDPOGS210048-TA1884 bp
ProteinDPOGS210048-PA627 aa
Genomic positionDPSCF300017 - 1126553-1128436
RNAseq coverage193x (Rank: top 48%)
Annotation
HeliconiusHMEL0104250.091.61% 
BombyxBGIBMGA014536-TA0.089.97% 
DrosophilaHsp68-PA0.080.79% 
EBI UniRef50UniRef50_Q9U6390.074.51%Heat shock 70 kDa protein cognate 4 n=125 Tax=cellular organisms RepID=HSP7D_MANSE
NCBI RefSeqNP_001037396.10.092.30%heat shock protein 70 [Bombyx mori]
NCBI nr blastpgi|2568622120.091.80%heat shock protein 70 [Helicoverpa zea]
NCBI nr blastxgi|2568622120.091.80%heat shock protein 70 [Helicoverpa zea]
Group
Gene OntologyGO:00055241.8e-87ATP binding
KEGG pathwaytca:6552340.0 
 K03283 (HSPA1_8)maps-> Endocytosis
    MAPK signaling pathway
    Spliceosome
    Antigen processing and presentation
    Protein processing in endoplasmic reticulum
InterPro domain[1-625] IPR0010230Heat shock protein Hsp70
[3-607] IPR0131269.2e-267Heat shock protein 70
Orthology groupMCL10014 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210048-TA
ATGCCAGCAATTGGAATCGATCTTGGAACTACGTATTCATGTGTCGGAGTATGGCAACATGGAAATGTGGAAATAATCGCTAACGACCAAGGCAACCGGACCACTCCATCTTACGTCGCATTTACTGACACGGAACGCTTAATTGGAGATGCTGCCAAAAATCAAGTAGCATTAAACCCCAACAACACGGTATTCGATGCCAAGCGATTAATAGGCCGCAAGTTTGACGACCCAAAGATACAACAGGATATGCAACATTGGCCTTTCAAAGTCATCAATGACTGCGGCAAACCAAAGATTCAGGTGGAATTCAAAGGTGAAATTAAACGGTTCGCACCAGAAGAGATAAGTAGCATGGTATTGACTAAGATGAAAGAAACTGCTGAAGCGTTTTTAGGCTCCAGCATACGAGACGCCGTGATCACAGTGCCCGCATACTTTAACGACTCCCAGCGTCAGGCCACCAAAGATGCGGGAGGTATAGCTGGAATAAATGTTTTACGAATAATCAATGAACCTACTGCTGCTGCTTTGGCGTATGGTCTGGATAAAAATCTTAAAGGTGAGAGGAACGTATTAATATTCGACCTCGGAGGCGGCACCTTCGATGTATCCATCCTCACTATAGATGAAGGCTCTTTGTTCGAGGTGAAGTCTACAGCAGGTGACACCCATCTGGGAGGTGAAGACTTCGACAACCGCTTAGTCGATCATTTAGCTGCTGAATTTAAACGCAAGTATAAAAAGGATCTTCGTGGTAATTCACGAGCTCTGCGTAGACTACGTACAGCTGCGGAGAGAGCTAAACGCACACTTTCCTCTAGCACGGAAGCCACTTTGGAAATTGATGCACTACATGAAGGCATCGACTTTTACACTCGAGTCTCTCGAGCTAGATTTGAAGAATTAAACTCTGACTTATTCCGTGGAACGTTGGAACCGGTCGAAAAGGCTTTGAAGGATGCCAAGCTTGACAAAAGTTCCATTCATGATGTGGTCCTCGTCGGAGGTTCTACTCGTATTCCGAAGATTCAAAACATGCTTCAGAACTTCTTCTGTGGCAAAAAATTAAATCTCTCCATCAACCCTGACGAGGCGGTCGCTTACGGAGCGGCGGTGCAAGCGGCCATCCTAAGCGGTGAACAACACAGTAAAATCCAAGACGTTCTGTTGGTGGACGTGGCGCCTCTGTCTTTGGGCATCGAAACAGCTGGAGGAGTCATGACGAAGATCATCGAACGAAATGCTAAAATACCATGCAAGCAAAGTCAAACCTTCACTACATATTCTGATAACCAACCCGCCGTCACCATACAGGTGTACGAAGGTGAAAGGGCTATGACCAAAGACAATAATCTATTAGGTCGTTTCGATCTGACGGGTATTCCACCAGCACCTCGAGGTGTTCCTAAAATAGATGTAACTTTCGATCTCGATGCTAATGGAATCTTGAATGTCTCGGCCAAAGAAAACAGCACGGGCAGAAGCAAGAACATCGTCATCAAGAACGATAAAGGAAGATTGTCGCAGGCTGAGATCGACAGAATGGTTTCCGAAGCCGAGCGTTACAAGGAAGAGGATGAACGGCAACGAGAGAGAGTATCAGCTCGGAACCAACTCGAATCTTACATATTTAACGTGAAACAGGCCATAGATGATGCGGGAGATAAACTGAGTCAACAAGACAAGGATACTGTCAGAAATGAATGCGACGAAACACTGAAATGGCTCGATAATAATGTACTGGCTGAGAAGGAGGAATATGAACACAAACTGAAAGAAATACAACGAGTGTGTTCACCACTCATGAGCAAAATGCACGGAGCATCGAACGGCAACTATCAACAGAACCATACAGGACCCACCGTGGAAGAGGTTGATTGA

Protein sequence:

>DPOGS210048-PA
MPAIGIDLGTTYSCVGVWQHGNVEIIANDQGNRTTPSYVAFTDTERLIGDAAKNQVALNPNNTVFDAKRLIGRKFDDPKIQQDMQHWPFKVINDCGKPKIQVEFKGEIKRFAPEEISSMVLTKMKETAEAFLGSSIRDAVITVPAYFNDSQRQATKDAGGIAGINVLRIINEPTAAALAYGLDKNLKGERNVLIFDLGGGTFDVSILTIDEGSLFEVKSTAGDTHLGGEDFDNRLVDHLAAEFKRKYKKDLRGNSRALRRLRTAAERAKRTLSSSTEATLEIDALHEGIDFYTRVSRARFEELNSDLFRGTLEPVEKALKDAKLDKSSIHDVVLVGGSTRIPKIQNMLQNFFCGKKLNLSINPDEAVAYGAAVQAAILSGEQHSKIQDVLLVDVAPLSLGIETAGGVMTKIIERNAKIPCKQSQTFTTYSDNQPAVTIQVYEGERAMTKDNNLLGRFDLTGIPPAPRGVPKIDVTFDLDANGILNVSAKENSTGRSKNIVIKNDKGRLSQAEIDRMVSEAERYKEEDERQRERVSARNQLESYIFNVKQAIDDAGDKLSQQDKDTVRNECDETLKWLDNNVLAEKEEYEHKLKEIQRVCSPLMSKMHGASNGNYQQNHTGPTVEEVD-