Monarch geneset OGS2.0

DPOGS205238
Transcript        DPOGS205238-TA  2055 bp
Protein           DPOGS205238-PA  684 aa
Genomic position  DPSCF300265 + 271108-282410
RNAseq coverage   5661x (Rank: top 2%)
Annotation
Heliconius      HMEL002731        0.0     92.44%
Bombyx          BGIBMGA007950-TA  4e-168  51.88%
Drosophila      Hsc70-5-PA        0.0     83.44%
EBI UniRef50    UniRef50_P29845   0.0     83.44%  Heat shock 70 kDa protein cognate 5 n=843 Tax=root RepID=HSP7E_DROME
NCBI RefSeq     NP_001153520.1    0.0     82.24%  heat shock protein cognate 5 [Apis mellifera]
NCBI nr blastp  gi|223036830      0.0     91.82%  heat shock protein 70 [Spodoptera exigua]
NCBI nr blastx  gi|223036830      0.0     91.82%  heat shock protein 70 [Spodoptera exigua]
Group
Gene Ontology    GO:0005524  0         ATP binding
                 GO:0006457  6.3e-278  protein folding
                 GO:0051082  6.3e-278  unfolded protein binding
KEGG pathway     ame:408605  0.0
                 K04043 (dnaK)  maps-> RNA degradation
InterPro domain  [42-682]  IPR001023  0         Heat shock protein Hsp70
                 [58-656]  IPR012725  6.3e-278  Chaperone DnaK
                 [58-656]  IPR013126  4.1e-267  Heat shock protein 70
Orthology group  MCL16081  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205238-TA
ATGTTGACCGCGACGCGTGTGGTGAGCCGAAAGGCGTTAGAGTGCTCCGGGACCGAATTCTACACACACAGAAATTTCTCCACATTCCTCAGAAGTACCGCAGCCCCCACAGTTCCAATCTACCAACGTCATGTGCAACACAGGCACAAATCTGAGGGTGTCAGGGGCGCGGTCATCGGCATTGACTTGGGCACAACCAACTCATGTGTGGCCGTCATGGAAGGAAAGACCCCGAAGGTGGTAGAAAACACAGAGGGATCAAGAACAACACCATCACATGTGGCGTTCAGTAAGGAGGGTGAGAGGCTGGTGGGTATGCCGGCTAAGAGGCAGGCTGTCACTAACAGCGGCAACACCTTCTATGCAACCAAGAGACTGATTGGAAGAAGATTTGACGATCCAGAAGTACAGAAAGATATGAAGAATCTTTCATATAAAGTTGTCAAAGCATCCAACGGCGACGCCTGGGTCCAAGGCAGCGATGGCAAAGTGTACTCTCCTAGTCAGATTGGTGCCTTCGTACTGATGAAGATGAGAGAGACGGCAGAGGCCTATCTCAATACTAATGTAAAGAACGCCGTAGTCACAGTACCCGCCTACTTCAACGATTCCCAAAGACAAGCCACCAAAGACGCCGGTCAAATCGCCGGTCTCAATGTACTCCGTGTGATCAACGAGCCGACTGCAGCAGCCCTCGCCTACGGGATGGACAAGACCGACGACAAAATTATCGCTGTATACGATCTGGGCGGTGGCACCTTCGATATATCTGTGCTGGAGATACAGAAAGGGGTGTTTGAGGTCAAGTCCACCAACGGCGACACACTCCTCGGTGGTGAGGACTTTGACAATGTCATTGTCAATTTCCTTGTGGACGAATTCAAACGTGATCAAGGTCTGGACATCCGCAAAGACGCGATGGCCATGCAGAGACTGAAGGAGGCTGCTGAGAAAGCAAAGATTGAACTCTCGGGCTCATTGCAGACGGACATCAACCTGCCGTACCTCACTATGGATTCATCGGGACCGAAACACATGAATCTCAAGATGACACGTTCCAAGCTGGAGTCATTAGTGGAGGGTCTCATCAAGAGGACGGTGAGCCCTTGCCAGAAGGCCCTTCAGGATGCGGAGGTCGCACGAGCTGATGTTGGGGAGGTGCTGCTTGTGGGGGGGATGACTAGGATGCCCAAGGTTCAGCAGACGGTGCAGGAGATCTTCGGTAGGGCTCCGTCGCGAGCTGTCAACCCTGACGAGGCTGTGGCCGTGGGCGCTGCGGTCCAGGGCGGAGTGCTGGCCGGTGACGTCACTGACATCCTACTCCTCGACGTGACACCCCTGTCCCTCGGCATAGAGACGCTCGGAGGAGTGTTCACAAAGCTCATCACAAGGAACACAACCATCCCGACCAAGAAGAGTCAGGTGTTCTCCACAGCCGCCGACGGGCAGACCCAGGTGGAGATCAAAGTGCATCAGGGTGAACGTGAGATGGCCTCGGACAACAAGCTGTTGGGGCAGTTCTCGTTGGTTGGTATACCACCAGCGCCGAGGGGTGTTCCGCAGATTGAGGTGACGTTCGACATTGACGCCAACGGTATCGTGCATGTATCAGCCAGGGACAAGGGTACCGGCAAGGAGCAGCAGATCGTCATCCAATCGTCCGGTGGTCTGTCGAAGGATGAGATCGAGAACATGGTGAAGGCGGCTGAGCAGTTCGCAGCGGCGGATAAGACCAGGCGAGAACGGGTGGAGGCTTGCAACCAGGCGGAGGGAGTGCTCCACGACACAGAGACCAAGATGGACGAATACAAGGCACAGCTACCGCAGGACGAGTGCGACAAGCTTCGCGAGGAAATGGCTAAGCTGAGAGATCTGCTCGCTCAGAAGGACTCCGTTGAACCTGAACCAGTTAGACAAGCGACGGCGTCGTTACAGCAAGCCAGTCTCAAGCTGTTCGAGCAAGCCTACAAGAAGATGGCGGCCGAGCGCGAAGGACAGTC
CCAGACCCAGTCCCAGGCGGAGACGGACGAAAAGAAAGAGGAAAAGAAGAATTGA

Protein sequence:

>DPOGS205238-PA
MLTATRVVSRKALECSGTEFYTHRNFSTFLRSTAAPTVPIYQRHVQHRHKSEGVRGAVIGIDLGTTNSCVAVMEGKTPKVVENTEGSRTTPSHVAFSKEGERLVGMPAKRQAVTNSGNTFYATKRLIGRRFDDPEVQKDMKNLSYKVVKASNGDAWVQGSDGKVYSPSQIGAFVLMKMRETAEAYLNTNVKNAVVTVPAYFNDSQRQATKDAGQIAGLNVLRVINEPTAAALAYGMDKTDDKIIAVYDLGGGTFDISVLEIQKGVFEVKSTNGDTLLGGEDFDNVIVNFLVDEFKRDQGLDIRKDAMAMQRLKEAAEKAKIELSGSLQTDINLPYLTMDSSGPKHMNLKMTRSKLESLVEGLIKRTVSPCQKALQDAEVARADVGEVLLVGGMTRMPKVQQTVQEIFGRAPSRAVNPDEAVAVGAAVQGGVLAGDVTDILLLDVTPLSLGIETLGGVFTKLITRNTTIPTKKSQVFSTAADGQTQVEIKVHQGEREMASDNKLLGQFSLVGIPPAPRGVPQIEVTFDIDANGIVHVSARDKGTGKEQQIVIQSSGGLSKDEIENMVKAAEQFAAADKTRRERVEACNQAEGVLHDTETKMDEYKAQLPQDECDKLREEMAKLRDLLAQKDSVEPEPVRQATASLQQASLKLFEQAYKKMAAEREGQSQTQSQAETDEKKEEKKN-
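
Consistency check (illustrative sketch, not part of the database record): the transcript and protein lengths listed above agree if the transcript is read as a coding sequence of 684 codons plus one stop codon. The function names below are hypothetical, and the genomic span calculation assumes the listed coordinates are 1-based and inclusive, which the record does not state.

```python
def cds_length_from_protein(aa_count: int, has_stop: bool = True) -> int:
    """Nucleotides needed to encode aa_count residues, plus an optional stop codon."""
    return 3 * (aa_count + (1 if has_stop else 0))


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


transcript_bp = 2055                       # from the Transcript row
protein_aa = 684                           # from the Protein row
locus_start, locus_end = 271108, 282410    # from the Genomic position row

# 684 codons + 1 stop codon = 685 codons = 2055 nt
print(cds_length_from_protein(protein_aa) == transcript_bp)  # True

# Genomic span covered by the gene model under the coordinate assumption
print(span_length(locus_start, locus_end))  # 11303
```

Under that coordinate assumption the locus spans 11303 bp, several times the 2055 bp transcript, which would be consistent with a gene model containing introns.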