Monarch geneset OGS2.0

DPOGS212273
Transcript         DPOGS212273-TA    1617 bp
Protein            DPOGS212273-PA    538 aa
Genomic position   DPSCF300077 + 9238-11862
RNAseq coverage    158x (Rank: top 52%)
Annotation
Heliconius       HMEL006659               0.0      77.04%
Bombyx           BGIBMGA011435-TA         0.0      72.56%
Drosophila       CG2790-PA                3e-78    49.28%
EBI UniRef50     UniRef50_UPI000224676D   2e-92    54.17%   UPI000224676D related cluster n=1 Tax=unknown RepID=UPI000224676D
NCBI RefSeq      XP_001602150.1           3e-93    54.17%   PREDICTED: similar to DnaJ domain protein [Nasonia vitripennis]
NCBI nr blastp   gi|378466139             0.0      72.56%   DnaJ-16 [Bombyx mori]
NCBI nr blastx   gi|378466139             0.0      72.47%   DnaJ-16 [Bombyx mori]
Group
Gene Ontology    GO:0031072   3.9e-28   heat shock protein binding
                 GO:0006457   1.6e-20   protein folding
                 GO:0051082   1.6e-20   unfolded protein binding
KEGG pathway 
InterPro domain   [2-61]   IPR001623   3.9e-28   Heat shock protein DnaJ, N-terminal
                  [5-23]   IPR003095   1.6e-20   Heat shock protein DnaJ
Orthology group   MCL11841   Single-copy universal gene

Nucleotide sequence:

>DPOGS212273-TA
ATGAAGTGTCACTACGAAGTATTGAGCGTGACGAAGGAGGCGAGCGGATCAGAAATCAAAAAGGCCTATCGCAAACTCGCCCTGCAGTGGCACCCTGACAAAAATCTAGACAATCTACAAGAAGCTAAAGAACAGTTTCAACTTGTACAGAATGCTTACGAAGTGCTCTCCGACCCTCAAGAAAGAGCCTGGTATGACAATCACCGCGAACAACTTTTACGTGGCGCAGGCAGTTCCTATAATGATGATAGTCTTGATGTTTACCCTTACTTTAGTCCGTCCTGCTATAAAGGCTTTGGTGATGATCCACAAGGATTTTTCGCTGTTTATGCAGAGGTTTTCTCCAAATTGGCCTCTGAAGAGGCAGATTTTTTGGAGGATCCCGAAGAGATCTCAAAGATTCCAAAATTTGGTGTATCAACATCACCTTATGAAGATGTTAATGAATTCTATGCGTTTTGGATGTCGTTTTCCACTAATAAGAGTTATGTGTGGTTAGATCAATATGAAATATCCCAAGGTGATAACCGCAGAGTTATCAAACTGATGGAAAAAGAGAATAATAAAATACGGCAGAAGGCCCGCAAGGAAAGAAACGAAGAAATAAGACGCCTTGTTAGTTTCGTCCGTCGGAAGGATAAAAGAGTTATTGAACATACAAAACAACTTCAAGAGAAAGTTGAGGAAAATAAAAAGAAGGCGGAACAACTTAGAAGGAAAAGAATAATTGAAAGACAGAAGGAAATAGAAGAAGCTAAGAAGAAAGAAGGAGAGTCTTCATTCTTACAAAGCGAAGACTATCAAAAGAAATTGAGCGAAATTGAATCTTTGTTGGCCGAAGAATTTGGTTTATCATCTGACGATGATACTATTAGTGAAGGGGTATGGAGAGTAGCAACGAAGAAAGTTCCAAGACTGAGGAGCATGACATTAGGAGATGATATTCCCATTTCTGAAGAAGGGAATGACGTTGAAGAAGCTGAAGCTGATATGATGACGAAAATCAATGGGGATGCAGATGATGGTCAAAATGTAGCCAACTCTACATCTGACACTGAAGATAATGATATCCGGGAGACGCTTAGCGATGATTCAGCACTACCTAAAAGTCAGAAGAAAAAGAAAAACAAGAAGAAATCATTCATACCGATGCCGGAAAGCGAAGGCAATGATAGTCATGATGAAATGAGCTTTAATGAGATCGAGGGACCTGTGAGAAGCCGAAAGGCTAAGAAATTTAATATGCTGAAAAGTCAAATACAGGCTAAAAAGGAGGCTAATCTCAAAAAAGGTTCTCAGTCCCAGACTTCAACTGAGAATTTATATGAAGTATCAAGTACTACACCGACCGATGATGCATTGGCTGATTCCGTCTTACCAAAAGATAGACCAGCATTACCAAAAACACAAAGGAATATAAAAGGTAAAAAATTATTTGAACGGAAGCCATTGAGACCTAAAGCTAGCGAGACAGAAGATTCCAGCAGCGCTGTTAACTTAAGATGTCTGATTTGTCAAACGGATTTCCCATCGAAAAATAAATTATTCGAGCATTTGAAGAAAACAGGCCACTCAGTAGCCCTACCCCAAACTAGTTACACACAAAAAAAAGGGTAA

Protein sequence:

>DPOGS212273-PA
MKCHYEVLSVTKEASGSEIKKAYRKLALQWHPDKNLDNLQEAKEQFQLVQNAYEVLSDPQERAWYDNHREQLLRGAGSSYNDDSLDVYPYFSPSCYKGFGDDPQGFFAVYAEVFSKLASEEADFLEDPEEISKIPKFGVSTSPYEDVNEFYAFWMSFSTNKSYVWLDQYEISQGDNRRVIKLMEKENNKIRQKARKERNEEIRRLVSFVRRKDKRVIEHTKQLQEKVEENKKKAEQLRRKRIIERQKEIEEAKKKEGESSFLQSEDYQKKLSEIESLLAEEFGLSSDDDTISEGVWRVATKKVPRLRSMTLGDDIPISEEGNDVEEAEADMMTKINGDADDGQNVANSTSDTEDNDIRETLSDDSALPKSQKKKKNKKKSFIPMPESEGNDSHDEMSFNEIEGPVRSRKAKKFNMLKSQIQAKKEANLKKGSQSQTSTENLYEVSSTTPTDDALADSVLPKDRPALPKTQRNIKGKKLFERKPLRPKASETEDSSSAVNLRCLICQTDFPSKNKLFEHLKKTGHSVALPQTSYTQKKG-