Monarch geneset OGS2.0

DPOGS212775
TranscriptDPOGS212775-TA1893 bp
ProteinDPOGS212775-PA630 aa
Genomic positionDPSCF300012 + 961793-963685
RNAseq coverage188x (Rank: top 48%)
Annotation
HeliconiusHMEL0141630.088.25% 
BombyxBGIBMGA014536-TA0.082.38% 
DrosophilaHsp70Aa-PA0.076.95% 
EBI UniRef50UniRef50_P081070.073.52%Heat shock 70 kDa protein 1A/1B n=2302 Tax=root RepID=HSP71_HUMAN
NCBI RefSeqNP_001037396.10.084.43%heat shock protein 70 [Bombyx mori]
NCBI nr blastpgi|3171354880.082.89%heat shock protein 70 [Spodoptera litura]
NCBI nr blastxgi|3171354880.082.89%heat shock protein 70 [Spodoptera litura]
Group
Gene OntologyGO:00055240ATP binding
KEGG pathwaytca:6632930.0 
 K03283 (HSPA1_8)maps-> Endocytosis
    MAPK signaling pathway
    Spliceosome
    Antigen processing and presentation
    Protein processing in endoplasmic reticulum
InterPro domain[1-628] IPR0010230Heat shock protein Hsp70
[3-606] IPR0131261e-263Heat shock protein 70
Orthology groupMCL10014 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212775-TA
ATGAGCGCAATCGGAATAGATTTAGGAACGACTTATTCTTGCGTCGGTGTATGGCAACATGGTAATGTAGATATTATAGCTAACGATCAAGGTAATAGAACAACACCTTCTTATGTAGCGTTTACGGACACCGAACGCCTCATCGGAGACGCTGCCAAAAACCAGGTCGCTCTTAACCCCGTTAATACAATCTTTGACGCAAAACGTCTCATAGGCAGAAAGTTTGATGATCCTAAGATCCAGCAAGATCTCAAACACTGGCCTTTCAAAGTCATTAACGAGGGCAGTAAACCGAAGATACAAGTAACTTATAAGGGCGAGACCAAACGTTTTGCACCCGAAGAGATCAGTAGCATGGTTCTGACGAAAATGAAAGATACGGCCGAGGCATATCTGGGAAGATCAGTCAAAAATGCTGTCATAACGGTCCCAGCTTACTTCAATGACTCTCAGCGCCAAGCTACTAAAGACGCAGGAGCAATCGCTGGCCTCAATGTCCTCCGTATCATCAACGAACCAACTGCAGCTGCCCTTGCTTATGGGCTGGATAAAAATCTGAAAGGGGAAAGAAATGTTCTAATTTTCGATCTTGGAGGGGGTACATTTGACGTCTCCATACTTAACATCGACGAGGGATCGCTCTTCGAAGTAAAAGCTACAGCAGGAGACACACATCTAGGTGGCGAAGACTTCGACAACCGCCTGGTAAACTTTTTGGCTGATGAATTTAAAAGAAAGCACAAAAAGGATTTGCGTACCAACCCGAAGGCATTGCGTCGTCTTAGGACAGCTGCGGAGCGCGCTAAAAGGACTTTGTCTTCCAGTACTGAAGCGAATATTGAAATCGATGCTTTATTTGAGGGCATTGACTTTTATTCCAGAATTTCCAGAGCGCGTTTTGAAGAACTGAATGCAGATTTGTTCCGCGTAACATTGGATCCAGTAGAAAAAGCGTTGAAAGATGCAAAGTTGGACAAAAACTCCATTAACGACATCGTTTTGGTGGGGGGCTCTACCCGCATTCCGAAGATACAAAGCTTGTTGCAGAACTTCTTTAACGGGAAAAAACTCAACTTGTCTATTAACCCGGACGAAGCCGTTGCTTATGGGGCCGCGGTTCAAGCTGCTATATTGAGCGGGGAGCGTCATTCCAAAATTCAGGACGTTTTGTTGGTGGATGTGACGCCGTTGTCTTTGGGTATCGAAACAGCCGGTGGGGTGATGACCAAAATAGTCGAGCGTAACGCGAAGATACCGTGTAAGCATTCGCAAACCTTCACGACTTATGCGGACAATCAGCCAGCCGTGACAATTCAAGTTTACGAAGGTGAAAGGGCTATGACGAAAGACAATAATCTTCTAGGAACCTTTGATCTCACCGGAATACCTCCAGCGCCACGGAACGTCCCACAAATCGACGTGGCTTTCGATTTGGACGCTAATGGTATATTAAACGTGTCAGCCAAAGAAAACAGTACGGGGAAGAGCAAGAATATAGTCATCAAAAATGACAAAGGACGTTTATCTCAGGCAGATATCGATCGAATGGTTGCTGATGCTGAAAAGTACAGAGACGAAGACGAAAGGCAGAGAGAACGTGTGGCGTCTCGGAATAAACTGGAAACGTACATATTTGGAATCAAACAAGCCATTGACGAAGCGGGGTCTAAGCTCGGGGAGGCGGAAAGCTCCAAAGCCAAGAGAATGTGCGACGATACTCTGCGTTGGCTTGAAAAGAATTCTTTGGCTGAGAAAGAGGAATACGAGGACAAGATCAAGGAGGTATCCGCTGTTTGTGCGCCTCTGATGAGGAGAATTCACGGTGCGGGTACTAAGAAGCCACAGGGCTCCAACAACAGCAACGACGGCCCACTGATCGAAGAAGTGGATTAA

Protein sequence:

>DPOGS212775-PA
MSAIGIDLGTTYSCVGVWQHGNVDIIANDQGNRTTPSYVAFTDTERLIGDAAKNQVALNPVNTIFDAKRLIGRKFDDPKIQQDLKHWPFKVINEGSKPKIQVTYKGETKRFAPEEISSMVLTKMKDTAEAYLGRSVKNAVITVPAYFNDSQRQATKDAGAIAGLNVLRIINEPTAAALAYGLDKNLKGERNVLIFDLGGGTFDVSILNIDEGSLFEVKATAGDTHLGGEDFDNRLVNFLADEFKRKHKKDLRTNPKALRRLRTAAERAKRTLSSSTEANIEIDALFEGIDFYSRISRARFEELNADLFRVTLDPVEKALKDAKLDKNSINDIVLVGGSTRIPKIQSLLQNFFNGKKLNLSINPDEAVAYGAAVQAAILSGERHSKIQDVLLVDVTPLSLGIETAGGVMTKIVERNAKIPCKHSQTFTTYADNQPAVTIQVYEGERAMTKDNNLLGTFDLTGIPPAPRNVPQIDVAFDLDANGILNVSAKENSTGKSKNIVIKNDKGRLSQADIDRMVADAEKYRDEDERQRERVASRNKLETYIFGIKQAIDEAGSKLGEAESSKAKRMCDDTLRWLEKNSLAEKEEYEDKIKEVSAVCAPLMRRIHGAGTKKPQGSNNSNDGPLIEEVD-