Monarch geneset OGS2.0

DPOGS202237
TranscriptDPOGS202237-TA1062 bp
ProteinDPOGS202237-PA353 aa
Genomic positionDPSCF300149 + 612779-614077
RNAseq coverage3205x (Rank: top 4%)
Annotation
HeliconiusHMEL0091700.092.37% 
BombyxBGIBMGA013536-TA0.090.65% 
DrosophilaCG5001-PA1e-13366.01% 
EBI UniRef50UniRef50_Q207742e-9152.84%Protein DNJ-13 n=29 Tax=Bilateria RepID=Q20774_CAEEL
NCBI RefSeqNP_001036990.12e-17588.95%DnaJ (Hsp40) homolog 5 [Bombyx mori]
NCBI nr blastpgi|3784658002e-17890.65%DnaJ-5 [Bombyx mori]
NCBI nr blastxgi|3784658000.090.65%DnaJ-5 [Bombyx mori]
Group
Gene OntologyGO:00310721.4e-34heat shock protein binding
GO:00064578.8e-31protein folding
GO:00510828.8e-31unfolded protein binding
KEGG pathway 
InterPro domain[2-85] IPR0016231.4e-34Heat shock protein DnaJ, N-terminal
[6-24] IPR0030958.8e-31Heat shock protein DnaJ
[258-348] IPR0089718.4e-22HSP40/DnaJ peptide-binding
[265-345] IPR0029393.7e-19Chaperone DnaJ, C-terminal
Orthology groupMCL11710 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202237-TA
ATGGGAAAAGATTACTACAAAATTTTGGGACTTTCCAAGGGTGCATCAGACGACGAAATCAAAAAGGCCTATCGTAAATTAGCTTTGAAATACCATCCAGACAAGAACAAATCAGCAGGCGCCGAAGAAAGATTTAAGGAGGTGGCAGAAGCGTACGAGGTGCTATCGGACAAGAAGAAGCGTGAGATTTATGATACCCTCGGTGAAGAGGGATTGAAGGGAGGAATGGGAGGACAGAACGGACCTGGAAGCGGGCAGTCGTTCTCATACACCTTCCATGGGGACCCACGGGCGACGTTCGCACAGTTCTTTGGATCAGCTAGCCCGTTCCAAGGATTGTTCGACCTCAATGGTGGTTCCGGTGCCTCGACAATGTTTTTCGATCGCGACATGGATGTAGATCTTGATCCATTCGCCAATATCGGAATGGGACAGACGAGACCCGGCGGCGGAAGTGGGGCTTTCAGGAGTCACAGTTTCAATTTCCACGGGTCACCGAACAGGAAGGAAAAAACCCAAGATCCGCCTATAGAACACGATTTGTACGTGTCGTTGGAAGACATCGCTCGAGGATGTGTTAAAAAAATGAAGATTTCTCGTCGTGTTATTCAGCCAGATGGTACATCGAAGAAAGAGGACAAGGTGTTGACCATCCACGTGAAACCTGGTTGGAAAGCCGGAACGAAGATCACGTTCCAGAAGGAAGGTGACCAGGGTAGGAATAAAATCCCTGCTGACATAGTCTTCATTATCAGAGACAAGCCAAACCCATTATTCAAACGAGAAGGCAGTGACATCAGATATACAGCCAAGATATCACTCAAACAGGCTCTGTGCGGGACCATCATTGAAGTGCCTACCATGTCCGGCGAGAAGCTCACAGTTAACTTGCAAGGCGAGGTTGTGAAGCCCTACACTGTTAAGAGATTCCCTGGCTATGGTCTGCCATTCCCCAAGGAACCAACGCGGAAGGGAGATCTTCTGGTGGCTTTTGACATCAAGTTCCCTGACCGTCTCAATTCTGGAGTGAAAGAAATACTCATGGACACCCTACCTAACTAG

Protein sequence:

>DPOGS202237-PA
MGKDYYKILGLSKGASDDEIKKAYRKLALKYHPDKNKSAGAEERFKEVAEAYEVLSDKKKREIYDTLGEEGLKGGMGGQNGPGSGQSFSYTFHGDPRATFAQFFGSASPFQGLFDLNGGSGASTMFFDRDMDVDLDPFANIGMGQTRPGGGSGAFRSHSFNFHGSPNRKEKTQDPPIEHDLYVSLEDIARGCVKKMKISRRVIQPDGTSKKEDKVLTIHVKPGWKAGTKITFQKEGDQGRNKIPADIVFIIRDKPNPLFKREGSDIRYTAKISLKQALCGTIIEVPTMSGEKLTVNLQGEVVKPYTVKRFPGYGLPFPKEPTRKGDLLVAFDIKFPDRLNSGVKEILMDTLPN-