Monarch geneset OGS2.0

DPOGS216186
Transcript: DPOGS216186-TA, 1248 bp
Protein: DPOGS216186-PA, 415 aa
Genomic position: DPSCF300080 - 458097-488534
RNAseq coverage: 59x (Rank: top 68%)
Annotation
Heliconius: HMEL015841, 2e-84, 78.38%
Bombyx: BGIBMGA004541-TA, 1e-76, 75.42%
Drosophila: l(2)efl-PA, 2e-39, 51.59%
EBI UniRef50: UniRef50_Q5MGN8, 4e-70, 73.14%, Heat shock protein 3 n=7 Tax=Ditrysia RepID=Q5MGN8_LONON
NCBI RefSeq: NP_001036941.1, 8e-80, 76.57%, heat shock protein hsp20.1 [Bombyx mori]
NCBI nr blastp: gi|323541200, 2e-83, 84.02%, small heat shock protein 19.8 [Cydia pomonella]
NCBI nr blastx: gi|323541200, 1e-79, 84.02%, small heat shock protein 19.8 [Cydia pomonella]
Group
KEGG pathway: dme:Dmel_CG4533, 2e-37
 K09542 (CRYAB) maps-> Protein processing in endoplasmic reticulum
InterPro domain: [61-155] IPR002068, 1.1e-28, Heat shock protein Hsp20
[12-24] IPR001436, 1.5e-25, Alpha crystallin/Heat shock protein
[47-156] IPR008978, 3.3e-18, HSP20-like chaperone
Orthology group: MCL17153, Insect specific

Nucleotide sequence:

>DPOGS216186-TA
ATGTCTCTGTTACCATTCGTGCTGGGTTACGAACGACCTCACCGTATCATCGATCAGGACTTCGGCTTGTCTTTGACTCCCGACGATCTACTGACCGTTGCCGTGTCTCCACTACTATCACGAGACTACTACAGACCCTGGCGTCAGATGGCGGCTGCTGCGAGGGACGTCGGATCCACCATCAAGTCGGACAAAGATAAATTCCAAGTCAACTTGGACGTGCAGCATTTCAAACCTGAGGAAATAACTGTGAAGACTGCAGACGGTTACATAGTCGTGGAAGGAAAACACGAAGAAAAGAAAGATGAACACGGCTTTATATCCCGTCAGTTCACTAGACGATACGCACTCCCGGAAGGTTGTAATCCAGACACAGTAGAGTCACGGCTGTCTTCAGATGGAGTGCTGAGTGTTATTGCTCCGAAAGTGCCATCAGTATCGAAGAATGAACGAAGCGTCCCCATCGCCCAGACCGGACCCGTGAGGAAGGAGATCAAGGACCAGAATTCCCATGGTGAAGAAAATGTTCCGTCCTCCCCACAAAACCTGACAGCGAACAAAGTTACATCAGATGAGATACATTTATTGTGGGCCCCGCCAGCCACCTATACAGTCCAAAGACAGAGCAATGATGATATAAATATCGATAGGCAGACTCTTAAACCCGAACCGGATCTGACGGTCCTCGCTAAAGACACGCAAGCAGATGAAGCCATGAACGATGATGGCAAGGACTACAAAATGGAAGCGTCGAAGGACTCGGACTACACGTACAACTTCTACAAGAGTGACAAACAGAAGTACAGCGAGGATTTTTTAGATAATGATAACTATAGGGTCAAAAGGGACGTTAGATCGCACAGGCATAGGAAAAGAAGACAGGAGAATTCCACGGATACGAAGATAACTATGGAGAAGGGTATGGATGGTGTAGAGACTCACCAGGCTTTCGAACTGCCGATAGAGATCGTGAAGAAATCTATGGCGTTGCCCTCCCGGAAGGATATAACGCAAATAGCATTCGTTCTATACTACGAAGAAGGTGTGACGATAAAGAAGGCGTCCACTGATTCCATCATAGCAACAGTTCAGAGTTCGGAAGAAATCAAACAGAGGAACGTGTTCAGAAGCGATCTGGGCATACAAGATAATTACAACGCCACCAAAAACTTGACCTTACTTAACACCTCCGGGGTCCAGACTAAAGTCGTTGGATTCTCTAACTCCTTAAATTCACTCATCTTTTAA

Protein sequence:

>DPOGS216186-PA
MSLLPFVLGYERPHRIIDQDFGLSLTPDDLLTVAVSPLLSRDYYRPWRQMAAAARDVGSTIKSDKDKFQVNLDVQHFKPEEITVKTADGYIVVEGKHEEKKDEHGFISRQFTRRYALPEGCNPDTVESRLSSDGVLSVIAPKVPSVSKNERSVPIAQTGPVRKEIKDQNSHGEENVPSSPQNLTANKVTSDEIHLLWAPPATYTVQRQSNDDINIDRQTLKPEPDLTVLAKDTQADEAMNDDGKDYKMEASKDSDYTYNFYKSDKQKYSEDFLDNDNYRVKRDVRSHRHRKRRQENSTDTKITMEKGMDGVETHQAFELPIEIVKKSMALPSRKDITQIAFVLYYEEGVTIKKASTDSIIATVQSSEEIKQRNVFRSDLGIQDNYNATKNLTLLNTSGVQTKVVGFSNSLNSLIF-