Monarch geneset OGS2.0

DPOGS210663
TranscriptDPOGS210663-TA1893 bp
ProteinDPOGS210663-PA630 aa
Genomic positionDPSCF300401 + 191717-193609
RNAseq coverage9x (Rank: top 85%)
Annotation
HeliconiusHMEL0104250.053.72% 
BombyxBGIBMGA001635-TA0.062.64% 
DrosophilaHsp68-PA0.054.13% 
EBI UniRef50UniRef50_Q9U6390.050.92%Heat shock 70 kDa protein cognate 4 n=125 Tax=cellular organisms RepID=HSP7D_MANSE
NCBI RefSeqXP_002099299.10.054.29%Hsp68 [Drosophila yakuba]
NCBI nr blastpgi|3233615680.054.27%heat shock protein 70-S3 [Stratiomys singularior]
NCBI nr blastxgi|3233615680.054.24%heat shock protein 70-S3 [Stratiomys singularior]
Group
Gene OntologyGO:00055248.6e-293ATP binding
KEGG pathwaydya:Dyak_GE108320.0 
 K03283 (HSPA1_8)maps-> Endocytosis
    MAPK signaling pathway
    Spliceosome
    Antigen processing and presentation
    Protein processing in endoplasmic reticulum
InterPro domain[1-629] IPR0010238.6e-293Heat shock protein Hsp70
[3-594] IPR0131267.5e-214Heat shock protein 70
Orthology groupMCL10014 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210663-TA
ATGGTGGCCGTGGGTATAGACTTAGGCACGTCTTCATCCTGTGTGGCCGTGTGGCGGAACGGGTCCGTCGAAGTCATACCGAATGAGGAAGGCAATAAGACGACTCCGTCATACGTAGCATTCTCTGCCAGCGGTCGCGTTGTCGGCGAAACCGCGAAAAATCAAGCATCCGTCAACGCAACGAATACGATTTATGGCGCGAAACGTTTGATCGGGCGGAACTACGACGATATCGGCTTACAGACGGACCTCCGCCACTTCCCGTACGCGGTGATCAATAAAAATGGTAAGCCATATATAATTGTTAACTATGAGGGCCAGAAATGTTTTGCGCCAGAGGAAGTAAGTGCTATGATTCTTTTCAGGATGAAGGGGTTGGCGGAGGCTTGGTTAGGTACTAGTGTTGGTAGGGCCGTGGTTACTGTGCCGGCTTATTTCAACGATTCTCAGAGACAGGCCACCAAGCTGGCGGGCCGTATCGCTGGCTTGGACGTCATTAGGATAATAAACGAACCTACCGCAGCGGCTTTCGCTTACGGGTTTCATAAAAACATCAATTCTGATAGAAATATTTTAGTCTATGACCTTGGCGGCGGGACTTTCGATGTATCAGTTCTGAAGATAGGTCGAGGCTGTGTGTATGAAGTGAAAGCCACAGCTGGCAACACCAGGCTTGGGGGTGAAGACATTGACAACCGTCTGGTGGCGTATTTCCTTGAGGACATCCGTAAACGATATAAGACGGAAGTAAGGAGCGCACGGTCTATGAGGAGGTTGAAGTTTGCTGCGGAGAAAGCCAAAAAGGCTCTGACCTCGTCGAACCACGCTGAAGTGTTTATAGAAGCTCTGTGCGGTATCAACTATATTGGTAAAATATCACGTTCTATATTCGAGCATCTGTGCCTCGATCTGTTCAAAGATACCCTCAAACCGATAGATCAGGCGTTGATGGATGCCAGTATGACCAAGGATGATTTACACGAGGTCATATTAGTCGGAGGCAGCACCAGGATTCCCATAGTGCGGAGGATTCTAAAGGAATATTTCGGCTCTAGGAAGATATCTAGTGAGATAAATCCAGATGAGGCTGTAGCTTGCGGCGCAGCCATCCAAGCCGCCGTATTGTCCGGCGAAACTCACGAGAGGATACAGGAGCTGTTACTAGTGGATGTTGTGCCATTGTCCCTGGGCCTGGAAACGGCCAGGGGCTTAATGTTCAAGGTAATAGAACGAAACACGCCGATACCCTGCCGGGTTGTTAAGGAAATAACTACGTTGGAGGACTATCAGAACGCTATGACCATAGAAATATTTGAAGGTGAACGTACTCTCACCAAAGATAATCACTGCTTGGGTGTATTCGAAATGCAGAACATCCCACCAGTGCCACGAGGAGTCGCAAAATTGGATGTGATTTTCGAAGTGGATGCGAATGGTATCCTGACGGTATCAACTGTGGACAGAACCACCGGTAACAGCAACAGTATCACCATCGAGAACATCAGTAGGTTGCGGCAGCAGGAGATCAGGAGAATGATATCCAATGCTGATAGATTCAAGGAAGAAGACATGGAGAACAAGAGACGCCTGGAAGTGAGAAATCAGCTGGAGTCTTATATATATAACGTTAAGAGATCTGTCGTTGAGAATCTGGACAGTTTAAGCGGGGAGGAGTTCAGAGATATGATTGGTGAATGCGAGGACGCGCTTACCTGGCTCGATGAGAACGAGGATTGCTTGAGAGAGGAGTATGAGAGGAAGATGTCGGAGTTGCTGCAGCGTTGGTCGTTTGATATACGGAAGCTGGACGCCGCGCACAGAGCTAAAAGGCACAGAGAAGAGTCAGTCTGTGACCAGACAGCTATTATAGAAGAACTGTCCGAGGAGCGCTGA

Protein sequence:

>DPOGS210663-PA
MVAVGIDLGTSSSCVAVWRNGSVEVIPNEEGNKTTPSYVAFSASGRVVGETAKNQASVNATNTIYGAKRLIGRNYDDIGLQTDLRHFPYAVINKNGKPYIIVNYEGQKCFAPEEVSAMILFRMKGLAEAWLGTSVGRAVVTVPAYFNDSQRQATKLAGRIAGLDVIRIINEPTAAAFAYGFHKNINSDRNILVYDLGGGTFDVSVLKIGRGCVYEVKATAGNTRLGGEDIDNRLVAYFLEDIRKRYKTEVRSARSMRRLKFAAEKAKKALTSSNHAEVFIEALCGINYIGKISRSIFEHLCLDLFKDTLKPIDQALMDASMTKDDLHEVILVGGSTRIPIVRRILKEYFGSRKISSEINPDEAVACGAAIQAAVLSGETHERIQELLLVDVVPLSLGLETARGLMFKVIERNTPIPCRVVKEITTLEDYQNAMTIEIFEGERTLTKDNHCLGVFEMQNIPPVPRGVAKLDVIFEVDANGILTVSTVDRTTGNSNSITIENISRLRQQEIRRMISNADRFKEEDMENKRRLEVRNQLESYIYNVKRSVVENLDSLSGEEFRDMIGECEDALTWLDENEDCLREEYERKMSELLQRWSFDIRKLDAAHRAKRHREESVCDQTAIIEELSEER-