Monarch geneset OGS2.0

DPOGS210387
TranscriptDPOGS210387-TA1236 bp
ProteinDPOGS210387-PA411 aa
Genomic positionDPSCF300025 + 981654-983276
RNAseq coverage16x (Rank: top 81%)
Annotation
HeliconiusHMEL0087112e-14059.35% 
BombyxBGIBMGA011605-TA2e-15765.97% 
DrosophilaSgt-PA8e-0729.37% 
EBI UniRef50UniRef50_E0VVV31e-7739.01%Heat shock protein 70 HSP70, putative n=1 Tax=Pediculus humanus corporis RepID=E0VVV3_PEDHC
NCBI RefSeqXP_002430247.12e-7839.01%heat shock protein 70 HSP70, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420195995e-7739.01%heat shock protein 70 HSP70, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420195992e-7538.16%heat shock protein 70 HSP70, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00054881.5e-20binding
KEGG pathwayuma:UM02057.19e-07 
 K09553 (STIP1)maps-> Prion diseases
InterPro domain[275-408] IPR0119901.5e-20Tetratricopeptide-like helical
[1-76] IPR0089782.3e-07HSP20-like chaperone
[7-74] IPR0174473.4e-07CS domain
Orthology groupMCL17017 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210387-TA
ATGCCTATATTAGTGAAGGATTATACGTGGATCCAAACTCCTACAAATATAAGCATACGTGTCCCCTTAGATCCAGTTTATAGAGAAAATGTAGATATGTTCACAGCTGATTGCTATATCAAAGCACATTTCAGTCCATATCTTTTGGAATTATTCTTATTACATGACGTAAACATTGAAAAGAGCAAATGTGTTGTAAACAAGAATATGATTTCAATGGATTTAGAAGAAAAGAAAGCAATACGACAACGAGTGTTACAGGAGAGCCAAGAAAAAGCTAAAAAAGAGGCAGAAGAAAGAACAATCAAGAAGAATGAATTGGACAGATTCACCGTGCAAAGGGCCATGGATATAGATTCACAACAACACGCATTGATGGACTCGAGACGAGATGACGAGCGTCAGAAAGCTATGAGCGAGTTGGAAAAGTGGAAGGAGCGATCACATCAAGAAAATGATTTTTACAATGGCAATATGCTTGTTAATAATAAACAGTCAGGTGTTAAAATTGTCGAGTTGCCAGACTCGGACACAGAATCAAAAGATAAAGCGAAAGATGTGGCTCCACCACTTTCCAAACGTCATCTGACTCCAAAAACACCGCCGAAAACTGTCGTGAAGTCTGAATATGTCGATAAGAAAAAGCAGGAAACAGCTACAAGAGTTTTACCTAAGTTAAGGCAAATGATGCAATTGGAGATAACACACACGAAGCGCACCTTTCCCACTCCCAGCAGGGAATCCACGGCGCAGGAAGAAGAAGCGTGGCTTAAGAATATTACGTTAGCTCGTAGGGCTACAGGTTTCGTTTCGGAAGACCTTCGTCCAGAAGAGCAGGATCCTCAGTGGTGTAAAGAAAAAGGTGATGAGTTCTTTCGTTGTGGTAACTTTTTGGGAGCAATCAGCGCTTACACTCATGGTATTACTCTGTCCGAAAAGCTGCCCAGCCTCTATGCAAACAGAGCCGCTGCACATTTCGCTTTAGGGAACTTTAACAAATGTGCGAATGATTCTTCCACGGCGCTGGATTTAATGAAGCCAGCTTGTGAGGGGAACCGTCAGAGTCGTGCCAAATGTATTGCTCGACGAGCGGCCGCGCTTGCTAGGATGGGCTACTTGAACAAAGCCATCGATGAAATGAAGGCAGCTTCAAAGTTGATGCCAGAAGATGAGAAAATCAAAAAGGATATTCATAACATGGAAAGAGCCTGGGAGCAAAATCCAGATTCCGATTAA

Protein sequence:

>DPOGS210387-PA
MPILVKDYTWIQTPTNISIRVPLDPVYRENVDMFTADCYIKAHFSPYLLELFLLHDVNIEKSKCVVNKNMISMDLEEKKAIRQRVLQESQEKAKKEAEERTIKKNELDRFTVQRAMDIDSQQHALMDSRRDDERQKAMSELEKWKERSHQENDFYNGNMLVNNKQSGVKIVELPDSDTESKDKAKDVAPPLSKRHLTPKTPPKTVVKSEYVDKKKQETATRVLPKLRQMMQLEITHTKRTFPTPSRESTAQEEEAWLKNITLARRATGFVSEDLRPEEQDPQWCKEKGDEFFRCGNFLGAISAYTHGITLSEKLPSLYANRAAAHFALGNFNKCANDSSTALDLMKPACEGNRQSRAKCIARRAAALARMGYLNKAIDEMKAASKLMPEDEKIKKDIHNMERAWEQNPDSD-