Monarch geneset OGS2.0

DPOGS214147
TranscriptDPOGS214147-TA1482 bp
ProteinDPOGS214147-PA493 aa
Genomic positionDPSCF300014 - 835683-838734
RNAseq coverage1433x (Rank: top 9%)
Annotation
HeliconiusHMEL0156390.086.84% 
BombyxBGIBMGA006193-TA0.080.57% 
DrosophilaTpr2-PA3e-13549.28% 
EBI UniRef50UniRef50_Q2F6080.080.36%DNAJ9 n=13 Tax=Bilateria RepID=Q2F608_BOMMO
NCBI RefSeqNP_001040185.10.080.36%DnaJ (Hsp40) homolog 9 [Bombyx mori]
NCBI nr blastpgi|3784659180.080.57%DnaJ-9 [Bombyx mori]
NCBI nr blastxgi|3784659180.080.97%DnaJ-9 [Bombyx mori]
Group
Gene OntologyGO:00310724.3e-30heat shock protein binding
GO:00054883.3e-27binding
GO:00064577.3e-20protein folding
GO:00510827.3e-20unfolded protein binding
GO:00055157.9e-07protein binding
KEGG pathway 
InterPro domain[369-490] IPR0016234.3e-30Heat shock protein DnaJ, N-terminal
[253-373] IPR0119903.3e-27Tetratricopeptide-like helical
[379-397] IPR0030957.3e-20Heat shock protein DnaJ
[290-323] IPR0014407.9e-07Tetratricopeptide TPR-1
[59-91] IPR0131051.1e-06Tetratricopeptide TPR2
[59-92] IPR0197346.5e-06Tetratricopeptide repeat
Orthology groupMCL11963 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214147-TA
ATGGCTGAGCCAGAAGTAGTGGATTTGGATCTAACAATCGATGATTTAGTTCCCAAAAGTCCAGAAAGACTGGCTGAGGAAAAAAAGGAGAGCGGAAACCATCTCTATAAATTCAAAAATTATAAGGGGGCATTGGCCATGTATGAAGATGCAATCAAACTCTGTCCTGAAAATGCAGCCTATTATGGCAACAGATCTGCCTGCTACATGATGCTGGGGATGTATAAAAAAGCTTTAGAGGATGCTCAAAAAGCTGTAGCTCTGGACCCAACATTCACTAAAGGATATATTCGTATGGCTAAATGTCATATTGCTGTAGGTGATATATCTGGTGCAGAACAGGCGGTTCGTAGTGCAAGCGAACTCGGTGGGCCAGATTGTGCATCGAACGAACGTCGTGCATTAGAATCACTGCGACGGTTACATGAAGACGCACAGCGTGCCATGGAGGCAGGAGACTACCGTCGTGTGGTCTTCTGCATGGACCGCTGTTTAGAATACAGTCCTTCAAGTATAAAGGCAAAACTTATCAAAGCCGAGTGCCTTGCAATGATTGGACGCTGTCAGGAAGCTCAGGAAATAGCAAATGATTCACTAAGATTTGATAGTTTAGACACAGAGGCAATATATGTACGTGGGTTGTGCCTTTATTTTGAGGACAAAGACGAGCAAGCCTTCAAACACTTCCAGCAGGTTTTGAGACTTGCACCAGATCACAAGAAATCCCTTGAGACTTATAAAAAGGCCAAGCTACTAAAACAAAAGAAAGAGGAAGGCAATGAGGCGTTTAAAATGGGTAGATGGCAACAAGCTTTAAATCTGTATAACGAAGCACTGACTATTGATAAAAATAACAGAAAAGTCAACGCCAAACTATATTTTAATAAAGCCACTGTGTGCTCAAAGTTGAATCAAATAGAAGAAGCAGCAGAGGCTTGCACAGCCGCATTGGAGTTAGATGAGAACTATGTTAAAGCTTTGTTGCGTCGTGCCAAATGTTACGCCGAACTGGGGAATCACGAAGACGCTGTCAAGGACTACGAGAAGCTTTATAAGATCGACAAAAATAAGGAACACAAACAGTTACTCCACGAGGCAAAATTGGCTTTAAAGAAATCCAAACGCAAAGACTACTATAAGATTTTGGGCATTGAAAAAACAGCATCAGAAGACGATATCAAGAAAGCTTATAGAAAGCGCGCTCTAGTTCACCATCCGGACAGACACGCGGGGGCTCCGGACAACGAGCGCAGGGAACAGGAGCGTCGCTTCAAGGAAGTGGGGGAGGCGTATGAAGTGCTCAGTGACCCCAAGAAACGAGCCCGTTACGATCACGGACAGGACCTTGATGATGGTTCCGGTGGTATTAATATTGATCCAAATATGATGTTCCAAACCTATTTTAACGGCGGTGGACAAGGTTTTGACTTTTCTTCAGGTGGAGGCTTCCCGGGATCAGCTTTTAGCTTTCAATTTGGATAG

Protein sequence:

>DPOGS214147-PA
MAEPEVVDLDLTIDDLVPKSPERLAEEKKESGNHLYKFKNYKGALAMYEDAIKLCPENAAYYGNRSACYMMLGMYKKALEDAQKAVALDPTFTKGYIRMAKCHIAVGDISGAEQAVRSASELGGPDCASNERRALESLRRLHEDAQRAMEAGDYRRVVFCMDRCLEYSPSSIKAKLIKAECLAMIGRCQEAQEIANDSLRFDSLDTEAIYVRGLCLYFEDKDEQAFKHFQQVLRLAPDHKKSLETYKKAKLLKQKKEEGNEAFKMGRWQQALNLYNEALTIDKNNRKVNAKLYFNKATVCSKLNQIEEAAEACTAALELDENYVKALLRRAKCYAELGNHEDAVKDYEKLYKIDKNKEHKQLLHEAKLALKKSKRKDYYKILGIEKTASEDDIKKAYRKRALVHHPDRHAGAPDNERREQERRFKEVGEAYEVLSDPKKRARYDHGQDLDDGSGGINIDPNMMFQTYFNGGGQGFDFSSGGGFPGSAFSFQFG-