Monarch geneset OGS2.0

DPOGS212742
TranscriptDPOGS212742-TA1017 bp
ProteinDPOGS212742-PA338 aa
Genomic positionDPSCF300012 + 420541-421557
RNAseq coverage14x (Rank: top 82%)
Annotation
HeliconiusHMEL0083142e-16080.77% 
BombyxBGIBMGA013135-TA1e-15675.82% 
DrosophilaCG5001-PA9e-6039.09% 
EBI UniRef50UniRef50_E2AS753e-8853.23%DnaJ-like protein subfamily B member 13 n=1 Tax=Camponotus floridanus RepID=E2AS75_CAMFO
NCBI RefSeqXP_001123348.19e-9254.89%PREDICTED: similar to testis spermatogenesis apoptosis-related protein 6 [Apis mellifera]
NCBI nr blastpgi|3407250172e-9154.57%PREDICTED: LOW QUALITY PROTEIN: dnaJ homolog subfamily B member 13-like [Bombus terrestris]
NCBI nr blastxgi|3838546806e-9355.52%PREDICTED: dnaJ homolog subfamily B member 13-like [Megachile rotundata]
Group
Gene OntologyGO:00310722.7e-23heat shock protein binding
GO:00064577.9e-18protein folding
GO:00510827.9e-18unfolded protein binding
KEGG pathway 
InterPro domain[3-84] IPR0016232.7e-23Heat shock protein DnaJ, N-terminal
[6-24] IPR0030957.9e-18Heat shock protein DnaJ
[227-307] IPR0029391.5e-15Chaperone DnaJ, C-terminal
[134-218] IPR0089713.8e-15HSP40/DnaJ peptide-binding
Orthology groupMCL17131 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212742-TA
ATGGGTTTCGATTACTACGGCATACTGGGATTGAAACGATCGTGCAAGCAGAGCGAAGTCAAGAAGGCGTATAGAAGACTCGCCTTGAAGTATAACCCGGAGAGGTACGACAATGATGAGAACATGAAACGAATCTTCGCTCTCATAGGAGAGGCTTACGAGGTTCTGGTGGACCACAAGCACAGGGCGGTGTACGACCAGTACGGGGAGGAGGGACTGAAGAAAGGCGTTCCCGGACCGGAGGACTTCATACACGCCTACACCTACCACGGAGACCCCGTGAGGACCTTCCACGACTTTTTCGGCAGCAGCAACCCCTACGCCGATCTCTTGGATTACTACGAGAACCCGCCGCCAATGTTCGAATCACCGTTGGGGAAAGGATACAAGGAAAAAGATCAAACTATCGTCCGACCGCTAGCGCTGACGCTCGAGGAAGTGTTCAAAGGAGGGCTCAAGAAAATGAAGATACAACGCTTGGTGTTCACAGACGAGACGTGCTCCGAACTGAGGCTGAGGGAGAAGGTTCTATCGATACCTATCAAGCCGGGGATATATCCTGGGACGGAGATCAAGTTCAAAGAGGAAGGAGACCAAGGACCGACCAGGATACCAGCTGACGTGATATTCATAACAGAAGACAGGCCTCACGAGAACTTTATAAGGAGTGGGCTCAGCGACCTCATGATGTCTAGGACGATATCTCTGAAGGAAGCTCTGTGTGGTTTCATGCTGATAGTGAACACGTTGGACGAACGAGTCCTCAGGATCAAAATAACTGACGTCGTCGACCCCACATACGAGAAGGTCATTGAAGACGAGGGTCTACCGATCCCGGCCTGCCCGAATAAGGTCAAAGGCAATCTCAAGATACGCTTCCAAATAACATATCCCATATATCTATCCAAACGCAGCAAAGAAGCTTTCGAAGAAGCCTTCAGGACTACGGAGGATGAAGACAAGAAATTCGACAAGCTCAAGTGCGGAACTCTCTCCACGCCGTCCATTTACAAATAG

Protein sequence:

>DPOGS212742-PA
MGFDYYGILGLKRSCKQSEVKKAYRRLALKYNPERYDNDENMKRIFALIGEAYEVLVDHKHRAVYDQYGEEGLKKGVPGPEDFIHAYTYHGDPVRTFHDFFGSSNPYADLLDYYENPPPMFESPLGKGYKEKDQTIVRPLALTLEEVFKGGLKKMKIQRLVFTDETCSELRLREKVLSIPIKPGIYPGTEIKFKEEGDQGPTRIPADVIFITEDRPHENFIRSGLSDLMMSRTISLKEALCGFMLIVNTLDERVLRIKITDVVDPTYEKVIEDEGLPIPACPNKVKGNLKIRFQITYPIYLSKRSKEAFEEAFRTTEDEDKKFDKLKCGTLSTPSIYK-