Monarch geneset OGS2.0

DPOGS216185
Transcript: DPOGS216185-TA, 966 bp
Protein: DPOGS216185-PA, 321 aa
Genomic position: DPSCF300080 - 493582-496026
RNAseq coverage: 294x (Rank: top 38%)
Annotation
Heliconius       HMEL015841        5e-38  81.18%
Bombyx           BGIBMGA004541-TA  1e-35  80.46%
Drosophila       l(2)efl-PA        7e-22  60.76%
EBI UniRef50     UniRef50_Q5MGN8   5e-29  72.09%  Heat shock protein 3 n=7 Tax=Ditrysia RepID=Q5MGN8_LONON
NCBI RefSeq      NP_001036941.1    1e-36  72.83%  heat shock protein hsp20.1 [Bombyx mori]
NCBI nr blastp   gi|310688081      4e-38  85.23%  small heat shock protein [Ostrinia nubilalis]
NCBI nr blastx   gi|310688081      4e-37  85.23%  small heat shock protein [Ostrinia nubilalis]
Group
Gene Ontology    GO:0003676       4.9e-12  nucleic acid binding
KEGG pathway     dme:Dmel_CG4533  5e-20
                 K09542 (CRYAB) maps-> Protein processing in endoplasmic reticulum
InterPro domain  [231-298]  IPR002068  1.6e-16  Heat shock protein Hsp20
                 [8-130]    IPR012337  4.9e-12  Ribonuclease H-like
                 [227-240]  IPR001436  8.3e-11  Alpha crystallin/Heat shock protein
                 [231-299]  IPR008978  2e-09    HSP20-like chaperone
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216185-TA
ATGGCAGTGTCACGTAAGGATATTGTAAAAGCTGAAAGTGGGATTTTACCCCCATTTGGGAAACTGTCCCTGTTGATCAACCCACAAAAAGACATACCAAGTCAAATAGCGAAATCGTCAGGATTATCGAACGTGTATTTAACGAATCAGCCAATTTTTAAAGACAAAGTGAGCACAATCAATTCATTCCTTGAGCTGCCTAAGCCAGTGTGCCTCGTTGCTTGTAAAGGAAACAGATTCGATTATAAAATATTAAGATCGGAGTATGTGGAAGCCAACGCCCAGCTGCCAGATGATCTACTGTGTGTCGATCCAATCAGCTTGCTTACTGAAGAAATTAAAAGTAAGGTTAAAGATACAACAGCTCCATCGAGTGCCAACAATTCATTACTGGACACGCCAAACTTAAACGCCTCAGGGTTCCAGCCCGCTTCCCCATCAGCCCCCAGCAATGAAAACCAGCCAGGACCGAGCAACATCGACGACCTAAATGACGCTTTTGCTGGTATGACGACAAACACAACTCCCAAGAAAGAGAGGTATACCGTACAAAAGATATACAAAAGTGTTATCAACAGAGAGCCATCGAGATCCCAGAGGACTGATGTTACGGCTATGATGCTTTTGGACCCGGCATTCATTACAAGAAAGCGATACGAGAAACACACTAGTGCTCTAGCGCTGCCTGCAGACGGTTACATAGTCGTGGAAGGAAAACACGAAGAAAAGAAAGATGAACACGGCTTTATATCCCGGCAGTTCACTAGACGATACGCACTCCCGGAAGGTTGTAATCCAGACACAGTAGAGTCACGGCTGTCTTCAGATGGAGTGCTGAGTGTTATTGCTCCGAAAGTGCCATCAGTATCTAAGAACGAACGAAGCGTCCCCATCGCCCAGACCGGACCCGTGAGGAAGGAGATCAAGGATCAGAATTCACAAGCCGGAGCTGGTGATAATAAATGA

Protein sequence:

>DPOGS216185-PA
MAVSRKDIVKAESGILPPFGKLSLLINPQKDIPSQIAKSSGLSNVYLTNQPIFKDKVSTINSFLELPKPVCLVACKGNRFDYKILRSEYVEANAQLPDDLLCVDPISLLTEEIKSKVKDTTAPSSANNSLLDTPNLNASGFQPASPSAPSNENQPGPSNIDDLNDAFAGMTTNTTPKKERYTVQKIYKSVINREPSRSQRTDVTAMMLLDPAFITRKRYEKHTSALALPADGYIVVEGKHEEKKDEHGFISRQFTRRYALPEGCNPDTVESRLSSDGVLSVIAPKVPSVSKNERSVPIAQTGPVRKEIKDQNSQAGAGDNK-
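
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record, assuming the transcript is a complete coding sequence translated with the standard genetic code and that the trailing "-" in the protein marks the stop codon. The helper name translate is hypothetical. It translates only the first 15 and last 12 nucleotides of DPOGS216185-TA and confirms that 966 bp corresponds to 321 codons plus one stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: the
# record uses the standard code; "*" marks a stop codon in this table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence, writing stop codons as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First 15 and last 12 nucleotides of DPOGS216185-TA, copied from the record.
start = translate("ATGGCAGTGTCACGT")
end = translate("GATAATAAATGA")
print(start, end)

# Length check: 321 residues plus one stop codon, three bases each.
print(3 * (321 + 1))
```

The two translated fragments can be compared with the first five and last four characters of DPOGS216185-PA. Passing the full transcript to translate would extend the same check to the whole protein.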