Monarch geneset OGS2.0

DPOGS210702
TranscriptDPOGS210702-TA1893 bp
ProteinDPOGS210702-PA630 aa
Genomic positionDPSCF300013 - 496956-500001
RNAseq coverage0x (Rank: top 98%)
Annotation
HeliconiusHMEL0117660.080.85% 
BombyxBGIBMGA006313-TA0.077.90% 
DrosophilaHsp68-PA0.070.30% 
EBI UniRef50UniRef50_P081070.069.29%Heat shock 70 kDa protein 1A/1B n=2302 Tax=root RepID=HSP71_HUMAN
NCBI RefSeqXP_001358499.10.073.29%GA20564 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastpgi|1257744810.073.29%GA20564 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastxgi|1700579860.072.60%heat shock protein 70 B2 [Culex quinquefasciatus]
Group
Gene OntologyGO:00055240ATP binding
KEGG pathwaydpo:Dpse_GA205640.0 
 K03283 (HSPA1_8)maps-> Endocytosis
    MAPK signaling pathway
    Spliceosome
    Antigen processing and presentation
    Protein processing in endoplasmic reticulum
InterPro domain[5-628] IPR0010230Heat shock protein Hsp70
[5-608] IPR0131267.2e-255Heat shock protein 70
Orthology groupMCL10014 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210702-TA
ATGGTCGCCCAAGCGGTGGGTATAGATCTCGGCACTACCTTTTCTTGTGTCGGGGTCTTTCAACATGGAAAGGTGGAGATCATAGCTAACGAACAAGGAAATAGAACAACTCCTTCATACGTTGCTTTCAACGACACTGAAAGACTTATCGGCGATTCGGCAAAAAATCAGATAGCAATGAACCCTAAAAACACAGTGTTCGATGCAAAACGTTTAATAGGAAGAAAATTTAATGACCCCAAAATTCAGCAAGATATGAGACTCTGGCCATTTGAAGTAATATCAGATGGGGGAAACCCAAAGATTGTTGTTGAATACAAAGGAGAGAAACGCAAATTCACTCCGGAAGAAATTTCCTCCATGGTTTTATCAAAAATGAAGGAAATAGCTGAAACATATCTCGGGGGTATTGTCAAAGATGCTGTGATAACCGTCCCAGCATATTTCAATGATGCGCAACGACAAGCAACGAAGGATGCTGGAGCTATAGCCGCGCTAAATGTTCTTAGAATTATTAACGAACCTACAGCCGCTGCACTTGCCTATGGTCTAGATAAAGAACTGAAAGGAGAAAAGAACGTCCTCATATTCGACCTTGGTGGAGGAACATTTGACGTATCCATACTTCAGATTTCCGAGGGCTCGTTATTTGAAGTTAGTTCTACAGCTGGCGACACTCATTTGGGAGGGGAAGATTTCGATTGCCGAATGGTTGATCATTTTTGTCAGGAATTTGAGAGAAAGTACAAGAAGGATATTAAATCTAATCCAAAAGCTTTGAGAAGGCTAAGAACTGCTTGCGAGAGGGCTAAACGAACATTGTCCTCTAATACCGAAGCGAGTTTAGAAGTAGACGCATTACATGAAGGCATAGATTTCTATTCTAAGATCACGAGAGCACGCTTCGAAGAACTTTGTTCAGACTTATTCAGACAAACCTTAGGTCCGGTGGACAGAGCACTTAAAGATGCTGGATTAAACACAAGAGAAATACACGATGTGGTCATGGTCGGTGGTTCTACAAGAATTCCAAAAATTCAAAGATTATTGCAGGATTTCTTTAGTGGAAAAGTGTTGAATCTTTCGATAAATCCCGATGAAGCTGTGGCCTATGGTGCAGCTGTGCAAGCAGCAATTCTTACAGGTTCAAAAGACACTCGCATTCAGGACGTCTTGCTTGTGGATATCACACCGCTATCTCTAGGCATAGAGACGGCAGGAGGAATTATGACGAAACTCGTTGACAGAAATACGAGGATACCGATAGCACAAAAAAAGATATTTACAACATATTCTGATAACCAACCCGCTGTTACAATACAAGTTTTTGAAGGAGAAAGAGCATTGACGAAAGATAATAATTTATTGGGTGTCTTTAACTTGACGGGCATCCCACCAGCTCCTAGAGGAGTACCGCAAGTTGAAGTAACTTTTGACATAGATGCCAATGGCATTCTTAGTGTGAGTGCTCAAGATAGAAGTACGGGCAGATCAGAACAGATTACAATATCAAATGACAGAGGCCGACTTAATAAAAAGGAGATTGAGAAGATGCTGCAGGACGCAGAAAAGTTTAAAGCGGAAGATGAAATGGTGCGGAAGAAAGTAGAAGTTAGAAACCAATTGGAAGCTTACCTTTTTGGATGTAAAACGGCAGCGGAAAGCGCTGGTACCAGATTGACGGATGACGAAAAAGACGCTGTCATAACTGAATGTAGCAATCAATTGCTCTGGCTGGAAACCAACGGAGACGCGTCACTGGCGGAATTAGAAAGCCGTCTGAAGTCAGCTCAAGCTATCTGCCAATCAGCTATGATGAAGTTACACGCTGGTGGTCCAAGTTATGGGAGACCGGTCAGTTCCGGCGGACCGAGGGTCGAGGAAGTAGATTGA

Protein sequence:

>DPOGS210702-PA
MVAQAVGIDLGTTFSCVGVFQHGKVEIIANEQGNRTTPSYVAFNDTERLIGDSAKNQIAMNPKNTVFDAKRLIGRKFNDPKIQQDMRLWPFEVISDGGNPKIVVEYKGEKRKFTPEEISSMVLSKMKEIAETYLGGIVKDAVITVPAYFNDAQRQATKDAGAIAALNVLRIINEPTAAALAYGLDKELKGEKNVLIFDLGGGTFDVSILQISEGSLFEVSSTAGDTHLGGEDFDCRMVDHFCQEFERKYKKDIKSNPKALRRLRTACERAKRTLSSNTEASLEVDALHEGIDFYSKITRARFEELCSDLFRQTLGPVDRALKDAGLNTREIHDVVMVGGSTRIPKIQRLLQDFFSGKVLNLSINPDEAVAYGAAVQAAILTGSKDTRIQDVLLVDITPLSLGIETAGGIMTKLVDRNTRIPIAQKKIFTTYSDNQPAVTIQVFEGERALTKDNNLLGVFNLTGIPPAPRGVPQVEVTFDIDANGILSVSAQDRSTGRSEQITISNDRGRLNKKEIEKMLQDAEKFKAEDEMVRKKVEVRNQLEAYLFGCKTAAESAGTRLTDDEKDAVITECSNQLLWLETNGDASLAELESRLKSAQAICQSAMMKLHAGGPSYGRPVSSGGPRVEEVD-