Monarch geneset OGS2.0

DPOGS215443
TranscriptDPOGS215443-TA1539 bp
ProteinDPOGS215443-PA512 aa
Genomic positionDPSCF300298 + 94299-99370
RNAseq coverage1850x (Rank: top 7%)
Annotation
HeliconiusHMEL0158419e-12471.43% 
BombyxBGIBMGA004541-TA1e-7169.44% 
Drosophilal(2)efl-PA4e-4051.40% 
EBI UniRef50UniRef50_Q5MGN83e-6466.67%Heat shock protein 3 n=7 Tax=Ditrysia RepID=Q5MGN8_LONON
NCBI RefSeqNP_001037038.15e-7069.44%heat shock protein 20.4 [Bombyx mori]
NCBI nr blastpgi|3010701484e-7069.95%small heat shock protein [Spodoptera litura]
NCBI nr blastxgi|3010701482e-6870.49%small heat shock protein [Spodoptera litura]
Group
KEGG pathwaydme:Dmel_CG45333e-38 
 K09542 (CRYAB)maps-> Protein processing in endoplasmic reticulum
InterPro domain[65-159] IPR0020683.9e-29Heat shock protein Hsp20
[16-28] IPR0014361.1e-27Alpha crystallin/Heat shock protein
[50-159] IPR0089781.3e-19HSP20-like chaperone
Orthology groupMCL35059 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215443-TA
ATGTCACTCTTACCATTCATGTTTGACTACGACTGGCCACGGCACAGACCCAGCCGTCTTCTGGACCAAGACTTCGGACTAGCTATAACTCCGGATGATTTATTAACAATCGTCGCTTCTCCCATGCAAACCCGGGACTATTTTAGACCCTGGCGTCAATTGGCAAACATCCGAGACATCGGTTCCAGTCTTAAGGCGGATAAGGATAAATTCCAGATTAATTTAGATGTCCAACACTTTTCGCCTGACGAAATAACAGTGAAGACATCCAATGGTTATATAGTTGTGGAGGGGAAACACGAGGAGAAACAGGATGAACACGGCTACATTTCCCGTCAGTTTGTGAGAAGGTACGCTCTGCCTGATGGCTGTAACCCTGATACTGTTGAATCTCGTTTATCATCTGATGGTGTGCTCACTGTAACCGCCCCAAAACAACCCTCGGCGTTGAAAAACGAGAAAAATATTCCAATACAACAGACAGGCCCAGTACGAAAGGAAGCCAGAGATGAAATAAAAGAATCCGAAGAAAAGAACCCTCTTCTAAACGTTAGTCTGCGCCCACAATTGTGTTCGTGGGTGAAGCAAACGAGAAATTTAAGACCAATCATAAAAATAGGCAAAGAGAGATTTCAGTTGTTCCTAGATGTTCATCAGTTTAACAAGGACGAAATAAGAGTTAAGGCGAGATCCGAATTCGTTATCGTGGAAGGCAAACAAGAAAGGAAAACCAAAAACGGTTGTCTGATACGGACTTTTGTGAGAAGATTCAAGTTACCAGAAGGTTGTAACCCTCAAGATATAAAATCGAAGCTATCTCCAGACGGTTGCTTAATGATCACAGCTCCTAGAAATAAGTGCAGTGTGAACTATCCTTGTGAAACTGTAATACCTATTGCTTCTACTTCAAAAGAATCCGATGTTTTTAAGGAAGATAAGCCTAGTGGTTCATCGAAGCCACCAGAAAGCGGCAAACTAATATTCGAAAGGTATATCGAGATCAAAATGTCTATTTCTCCGTATTTCTTCGACTACGATCTACGATGGCCACGACGGCTGTATGATCAAAACTTTGGCCTAGCTCTAACACCACACGATCTCTTCAATGCAACTGCCAGTCCTGTCATACCACGATACAATTTTTGGTGGCCGAAAGACAGCGGTTCCTCGATCAAATTCGATAAAGACAAATGGCAAATCAGCGTTGATGTGCAGCACTTCGCACCAGACGAGATCACTGTGAAGATTGCGAATGGCAACATAGTAGTTGAAGGCAAACACGAAGAGAAACAAGACGAGCACGGCTTTATATCCAGGCAGTTTGTTAGGCGTTTCAAAATACCAGAAGACACTAATTCAGATGCTATAGAGTCCAGGCTATCTTCTGACGGTGTTCTGACAGTTCTCGCCTCACGTATGGATACGCCCAAAGGTGAGAGAAATGTACCAATAACACACACAGGACCGGTCCGGAAGGATATGAAAAATGAAACGAGCGAGGAAAAGCCTGACAAACATACTATCCCTACAGACCTGTAG

Protein sequence:

>DPOGS215443-PA
MSLLPFMFDYDWPRHRPSRLLDQDFGLAITPDDLLTIVASPMQTRDYFRPWRQLANIRDIGSSLKADKDKFQINLDVQHFSPDEITVKTSNGYIVVEGKHEEKQDEHGYISRQFVRRYALPDGCNPDTVESRLSSDGVLTVTAPKQPSALKNEKNIPIQQTGPVRKEARDEIKESEEKNPLLNVSLRPQLCSWVKQTRNLRPIIKIGKERFQLFLDVHQFNKDEIRVKARSEFVIVEGKQERKTKNGCLIRTFVRRFKLPEGCNPQDIKSKLSPDGCLMITAPRNKCSVNYPCETVIPIASTSKESDVFKEDKPSGSSKPPESGKLIFERYIEIKMSISPYFFDYDLRWPRRLYDQNFGLALTPHDLFNATASPVIPRYNFWWPKDSGSSIKFDKDKWQISVDVQHFAPDEITVKIANGNIVVEGKHEEKQDEHGFISRQFVRRFKIPEDTNSDAIESRLSSDGVLTVLASRMDTPKGERNVPITHTGPVRKDMKNETSEEKPDKHTIPTDL-