Monarch geneset OGS2.0

DPOGS207907
TranscriptDPOGS207907-TA1056 bp
ProteinDPOGS207907-PA351 aa
Genomic positionDPSCF300478 - 37046-42197
RNAseq coverage2642x (Rank: top 5%)
Annotation
HeliconiusHMEL0101625e-16581.07% 
BombyxBGIBMGA014455-TA1e-14378.47% 
DrosophilaCG3061-PA9e-9849.32% 
EBI UniRef50UniRef50_C7AQZ42e-14178.47%DnaJ-6 n=1 Tax=Bombyx mori RepID=C7AQZ4_BOMMO
NCBI RefSeqNP_001157383.14e-14278.47%DnaJ (Hsp40) homolog 6 [Bombyx mori]
NCBI nr blastpgi|2556528858e-14178.47%DnaJ (Hsp40) homolog 6 [Bombyx mori]
NCBI nr blastxgi|2556528852e-15478.75%DnaJ (Hsp40) homolog 6 [Bombyx mori]
Group
Gene OntologyGO:00310723.4e-28heat shock protein binding
GO:00064571.3e-18protein folding
GO:00510821.3e-18unfolded protein binding
KEGG pathwaynvi:1001175257e-102 
 K09518 (DNAJB12)maps-> Protein processing in endoplasmic reticulum
InterPro domain[244-348] IPR0153998.4e-29Domain of unknown function DUF1977, DnaJ-like
[101-202] IPR0016233.4e-28Heat shock protein DnaJ, N-terminal
[109-127] IPR0030951.3e-18Heat shock protein DnaJ
Orthology groupMCL13050 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207907-TA
ATGACCATTGAAGCGAATAAGGATGAAGCTGAGAAATGCATCGAGATTGCACAAGTGGCCTTCAGATCTGGGAATGTATCGAAAGCAGAGAGATTTCTGCTGAAAGCTGAAATGTTATACCCAAGCCCGCATGCCAAGGAGCTTCTGGCGAGAGTTAGGGCTGCTAGTGGCACAGGAAGCGCCTCTAAAAAGACCCCTCCGAGCAGCCCTAGTGCTGACGAGTTACGACGAAGGAAAACTCCAAACCACCAGCCACAACAACAAGAGTATACTACAGAACAGATGGAAGCAGTTAGAAGGATTAACACGAAATGCAAAGATTATTATGAAATACTAGGAGTCACCAAGGAAGCGACGGATTCGGATATCAAAAAGGCTTACAAGAAACTGGCTCTGCAGTTGCATCCTGATAAAAATCACGCTCCGGGGGCTGCCGAGGCATTTAAGGCTATAAGCAACGCGGCAGCCATACTGACGAATCCTGAGAAGAGGAAGCAGTATGATCTCCGTGGGGACGAACCGGCGCCGAGTCACCACCACACGTACTACGCGAGAGGATTCGAGTCGGACCTGACAGCGGAGGAGCTGTTCAACATGTTTTTTGGGGCTACAGCCTTCCCGGGTGGTTCGCCCCCCGCGTACCGCCGCCGGGCCCGCGAGCCGGAGCCCCGGGACAGCCACGCGGGCCTGGTCCAACTGTTGCCGGTGATAGCCCTGGTTCTGTTGTCCATGATGTCTGGTTTCTTCATCAGCGAGCCGGTATACAACCTGGCGCCGTCGCCGAAGTATCCTGTACCTAGGGAGACGGTCAACCTCAAGGTGCCGTACTACGTGAAGGAGAATTTCCACACGGACTACCAGGGATCGCTCAGAAGACTGGAGATGGCTATCGAGGAGGAGTATATAGTGGGGCTCCGTCACGCGTGCCAACGCGAGAGGAACTATCGGGACAACGCCGCTTGGAAGGCCAGGAACTTTGGTGACGCTCGACAACATGCTGAAGCTACCAAGATGAGGCTGCCCTCCTGTGAGAAGCTGCAGGCCTTCCAAAGATAG

Protein sequence:

>DPOGS207907-PA
MTIEANKDEAEKCIEIAQVAFRSGNVSKAERFLLKAEMLYPSPHAKELLARVRAASGTGSASKKTPPSSPSADELRRRKTPNHQPQQQEYTTEQMEAVRRINTKCKDYYEILGVTKEATDSDIKKAYKKLALQLHPDKNHAPGAAEAFKAISNAAAILTNPEKRKQYDLRGDEPAPSHHHTYYARGFESDLTAEELFNMFFGATAFPGGSPPAYRRRAREPEPRDSHAGLVQLLPVIALVLLSMMSGFFISEPVYNLAPSPKYPVPRETVNLKVPYYVKENFHTDYQGSLRRLEMAIEEEYIVGLRHACQRERNYRDNAAWKARNFGDARQHAEATKMRLPSCEKLQAFQR-