Monarch geneset OGS2.0

DPOGS208109
TranscriptDPOGS208109-TA1467 bp
ProteinDPOGS208109-PA488 aa
Genomic positionDPSCF300154 - 359509-360975
RNAseq coverage447x (Rank: top 27%)
Annotation
HeliconiusHMEL0062220.084.57% 
BombyxBGIBMGA006574-TA0.076.74% 
DrosophilaCG7182-PA4e-5533.50% 
EBI UniRef50UniRef50_E9GX913e-8140.24%Putative uncharacterized protein n=1 Tax=Daphnia pulex RepID=E9GX91_DAPPU
NCBI RefSeqXP_002734658.13e-7540.34%PREDICTED: heat shock 70kDa protein 2-like [Saccoglossus kowalevskii]
NCBI nr blastpgi|3214648241e-8040.24%hypothetical protein DAPPUDRAFT_306599 [Daphnia pulex]
NCBI nr blastxgi|3214648242e-7840.24%hypothetical protein DAPPUDRAFT_306599 [Daphnia pulex]
Group
Gene OntologyGO:00055241.3e-82ATP binding
KEGG pathwaycal:CaO19.20133e-48 
 K09490 (HSPA5, BIP)maps-> Prion diseases
    Protein processing in endoplasmic reticulum
    Protein export
InterPro domain[1-412] IPR0010231.3e-82Heat shock protein Hsp70
[5-385] IPR0131262.1e-69Heat shock protein 70
Orthology groupMCL19530 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208109-TA
ATGGCGACAGCGTATGGAATACATATAGGAAATAGTTCAGGTTGCCTTGCAACTTTTGTTAATGGGGATGCTTCTGTTTTGGCTAATGATGCAGGAGACAGAGTCACTCCAGCTGTTGTTGCTCTCAATGGTGTGGAATGGGAAATTGGCCTTCCAGCTAAATCGGGACAAGCCTCTTCAAAAGCCATTATAAAAAATAACAAGCGCCTCATGAACTGTGATTTTAGTGAAGATGATATCGCATTCGTGGAAAATTCCTCTTCATGCAGAATTCAAAATGATGAGGAGTTAGTATATGAATTTGAAACAAGTGAAACAAAGTTATACTCAAATCCTGATAATATTGCTACAAAGATTTATGCAAAATTATATACTATTGCCAGCCACGCTGTTCAGAATGAAGGTGACTTAAAATTAGTGTTAGCGGCACCATTGAATTGGTCATCCTTAAGTAGAGAGAGGCTTGTAAAATGTGCCGAGTTAGCAGGTTTTGATGTTCTACAAGTTATCAGTGAACCTGCTGCTGCTCTTCTTGCATACAATGTTGAAGAGTCTGCAGACGATGTGAATGTTTTGGTGTACAGGCTCGGTGGGTCTTCATGTGCTGCCTCTGTAGTTAAAGTATCTTCAGGATTTATGTCTGTGGAAAAAAATGTTTTTAGATCCGATCTCGGAGGACAGTGTCTAACAAAGGACTTGGCAGATTATATTGCACAAGAATTCAAACAAAAGTGGAAATTGGATCCACATGAAAGTAGACGAGCTATGTCAAAATTACTTAACCATGCAGACAACTGTAAACACATTCTGTCAACTTTAAGTTCAGCCCATGTCTTTATTGAGTCTTTATTGGATGGAGTTGATTGGAGTCAAAATGTGACGAGGGCAAGATTTGAAAATATCATATCTTCTAAAATATCTGCATACATAGAACCAGCTAAGCAAGTAATTGATAGTTTTAATGGTAAAATCCATAAAATTATTCTCTGTGGAGGTAGTATGAAGATCCCTAAATTGCAATCGGCTGTGGCAAGTTTGTTGCCAGAAGCGGAAGTTCTTTCAGGCATTAATCCTGATGAAGTAATAGCAGTGGGATGTGCTAGACAAGCTGGGATGATGCTGGACTTACCAGACCTGTCCTTAGCAGATACAAACATGGAGATTGAATTCCTCGGAAAAGACATATACATGAAGTACTTAGATCAAACTGTTAAACTTTTCAAAGAAGGTTCACCACCTTATGCCCAAAATATATGCAGTATTGAATCAGTAAATGACGTAAAGGATATGAGTTTCACATTACATGAGGATCCAAATGGTGAACAATTTGCTACAGAAACCTTTAATATTGAAAATCTAACAAAACCTTTCAAATTAAAAGCTACTCTACAGTCATCAAATATATTAATGCAAGTGGATTTTTTTTTTAAGTTTATTTTTGAAACATTTCTGGATTCTGGACTATAG

Protein sequence:

>DPOGS208109-PA
MATAYGIHIGNSSGCLATFVNGDASVLANDAGDRVTPAVVALNGVEWEIGLPAKSGQASSKAIIKNNKRLMNCDFSEDDIAFVENSSSCRIQNDEELVYEFETSETKLYSNPDNIATKIYAKLYTIASHAVQNEGDLKLVLAAPLNWSSLSRERLVKCAELAGFDVLQVISEPAAALLAYNVEESADDVNVLVYRLGGSSCAASVVKVSSGFMSVEKNVFRSDLGGQCLTKDLADYIAQEFKQKWKLDPHESRRAMSKLLNHADNCKHILSTLSSAHVFIESLLDGVDWSQNVTRARFENIISSKISAYIEPAKQVIDSFNGKIHKIILCGGSMKIPKLQSAVASLLPEAEVLSGINPDEVIAVGCARQAGMMLDLPDLSLADTNMEIEFLGKDIYMKYLDQTVKLFKEGSPPYAQNICSIESVNDVKDMSFTLHEDPNGEQFATETFNIENLTKPFKLKATLQSSNILMQVDFFFKFIFETFLDSGL-