Monarch geneset OGS2.0

DPOGS205447
TranscriptDPOGS205447-TA1293 bp
ProteinDPOGS205447-PA430 aa
Genomic positionDPSCF300332 + 115904-120323
RNAseq coverage720x (Rank: top 18%)
Annotation
HeliconiusHMEL0147172e-12858.54% 
BombyxBGIBMGA009209-TA1e-9658.13% 
Drosophilal(2)tid-PB1e-10966.43% 
EBI UniRef50UniRef50_G6DRS50.099.72%Putative uncharacterized protein n=2 Tax=Coelomata RepID=G6DRS5_DANPL
NCBI RefSeqXP_001605490.16e-12472.44%PREDICTED: similar to chaperone protein dnaj [Nasonia vitripennis]
NCBI nr blastpgi|3784660679e-15781.33%DnaJ-14 [Bombyx mori]
NCBI nr blastxgi|3784660678e-17581.33%DnaJ-14 [Bombyx mori]
Group
Gene OntologyGO:00064571.2e-18protein folding
GO:00510821.2e-18unfolded protein binding
GO:00310721.3e-16heat shock protein binding
KEGG pathway 
InterPro domain[209-287] IPR0029391.2e-18Chaperone DnaJ, C-terminal
[202-289] IPR0089712.7e-18HSP40/DnaJ peptide-binding
[87-165] IPR0013051.3e-16Heat shock protein DnaJ, cysteine-rich domain
Orthology groupMCL13868 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205447-TA
ATGAGAACAAGCGTAAGCAATATGACACTTATGGCACAACTTCTGAACAAATGGGTATGGGAGGAGCTGGTGGAAGCGATGGTTTTACCCATCAATGGCAGTACAAAATCTACTATAGACCCTGAGGAATTATTCCGGAAAATTTTTGGAGATGCAGGCTTTAAAAGTGAGGCTTTCAGTGACTTTGCAGAGAGTCAATTCGGTTTTGGTGCATCCCAAGAGATAATTGTAAATCTAAAGTTCACTGAGGCAGCCCGTGGTGTTAACAAAGATATTAATCTAAATGTTGTTGACACATGTCCTAAATGTCAGGGTTCGAGATGTGAACTCGGCACTAAAGCCGTCAAGTGCACATATTGTAATGGCACTGGCATGGAGACATTTTCTAGAGGTCCATTTGTTATGAGGTCGACATGTAGACATTGCCATGGTACTCGTATGTTGATTAAATTTCCATGTCTTGAATGTGAAGGAAAAGGCCAGTCGGTTCAACGTAAAAAAGTTACAGTTCCAGTGCCAGCTGGCGTAGAGGACGGTCAGACTGTACGTATGTCTGTTGGAAGTAACGAAGTATTCATTACATTCAAAGTGGAAAGCTCCAAGTACTTCAGACGTGACGGACCCGATGTTCATACTGACTGCGCGATATCTGTGTCCCAAGCGCTGCTCGGTGGTACAGTGAGGATACAAGGACTTTATGAAGATCACACTTTGCAGATCGTGCCTTGCACTTCATCTCACAGCACGATACGTCTTTCTCGCAAAGGCATGAAGCGTGTCAGTCAACATGGTTACGGAGATCATTATGTGCACATTAAAATACAAGTACCAAAATCTTTAAGCGATAAACAGAAGGCACTGATCAGTGCGTATGCTGAACTAGAAGAAGACACACCGGGACAAATACACGGAGTTGCTTTTGACAGAGATGACGGTACAAATAATAGCGGTAGTGATAAGAAAATTCACGAAGCTAATCGTGAGAGCGATTTCAAAGAGGAGACGAAATGGACGTTCTTTGATAGTTTAAGCGAAGCGTTCGCAAAGAATAAGACTAATTTCCTCATAGGTTTTCTATCCTCGGTCATTATAGGATTTTTGGTATTGACGAACGATCCCGCAGATAGGTCGGGCATACAGAGATATATGGAAAGCGAGACGGGTAATAAAAATTCAATAGCGGAGCCGCAGAACTTAGTGGACGCTATAAAAGAAGCACTCAAGGACAAGAAAAGTATAGAAGCGGGCGTCACAGAGGACGACTTGAAAGAGCCAAAGCGCAGCAAAGGATAA

Protein sequence:

>DPOGS205447-PA
MRTSVSNMTLMAQLLNKWVWEELVEAMVLPINGSTKSTIDPEELFRKIFGDAGFKSEAFSDFAESQFGFGASQEIIVNLKFTEAARGVNKDINLNVVDTCPKCQGSRCELGTKAVKCTYCNGTGMETFSRGPFVMRSTCRHCHGTRMLIKFPCLECEGKGQSVQRKKVTVPVPAGVEDGQTVRMSVGSNEVFITFKVESSKYFRRDGPDVHTDCAISVSQALLGGTVRIQGLYEDHTLQIVPCTSSHSTIRLSRKGMKRVSQHGYGDHYVHIKIQVPKSLSDKQKALISAYAELEEDTPGQIHGVAFDRDDGTNNSGSDKKIHEANRESDFKEETKWTFFDSLSEAFAKNKTNFLIGFLSSVIIGFLVLTNDPADRSGIQRYMESETGNKNSIAEPQNLVDAIKEALKDKKSIEAGVTEDDLKEPKRSKG-