Monarch geneset OGS2.0

DPOGS211167
TranscriptDPOGS211167-TA1008 bp
ProteinDPOGS211167-PA335 aa
Genomic positionDPSCF300007 + 287323-289443
RNAseq coverage1194x (Rank: top 11%)
Annotation
HeliconiusHMEL0172214e-17390.75% 
BombyxBGIBMGA003153-TA6e-16989.62% 
DrosophilaCG4164-PA3e-14575.52% 
EBI UniRef50UniRef50_Q9UBS49e-11260.71%DnaJ homolog subfamily B member 11 n=120 Tax=root RepID=DJB11_HUMAN
NCBI RefSeqNP_001157381.12e-17689.25%DnaJ (Hsp40) homolog 3 [Bombyx mori]
NCBI nr blastpgi|2556528814e-17589.25%DnaJ (Hsp40) homolog 3 [Bombyx mori]
NCBI nr blastxgi|2556528810.089.25%DnaJ (Hsp40) homolog 3 [Bombyx mori]
Group
Gene OntologyGO:00310722.9e-30heat shock protein binding
GO:00064572.5e-24protein folding
GO:00510822.5e-24unfolded protein binding
KEGG pathwaytca:6625443e-149 
 K09517 (DNAJB11)maps-> Protein processing in endoplasmic reticulum
InterPro domain[3-96] IPR0016232.9e-30Heat shock protein DnaJ, N-terminal
[9-27] IPR0030952.5e-24Heat shock protein DnaJ
[237-317] IPR0029395.9e-23Chaperone DnaJ, C-terminal
[230-320] IPR0089714.6e-22HSP40/DnaJ peptide-binding
Orthology groupMCL12090 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211167-TA
ATGACCCTGGCTGGTCGAGATTTTTATCAAATACTTGGTGTGTCCCGGTCGGCGAATACCAATGAAATTAAAAAAGCTTACAGAAAATTAGCTAAAGCATTACATCCGGATAAGAATCAAGATGATCCAGATGCTTCCCAAAAGTTTCAAGACCTTGGCGCGGCCTATGAAGCTTTATCGGACCCAGAAAAGAGGGAATTATATGATAGGTGTGGAGAAGATTGTTTGAAAAAAGATGGAATGATGAATAATAACGACCCCTTCGCAAGTTTCTTTGGTGATTTCGGTTTTCATTTTGGTGGGGAATCCCAGCAGCATGAAACACCGAGAGGGGCAGATGTCCTTATGGAGTTAATGGTGTCTCTTGAAGAGCTGTATAATGGAAACTTTATAGAAATAACACGAAATAAGCCAGTAATCAAACCAGCGTCAGGGACACGCAAATGCAACTGTCGCCAGGAGATGGTTACAAGAAATCTTGGCCCTGGCAGGTTCCAGATGATGCAACAAACTGTTTGTGATGAATGTCCTAATGTTAAACTAGTGAATGAAGAGAGACTTCTGGAAATTGAGGTTGAAGTTGGTGCGCCGGATAATCACAAAACAAGATTGAGAGGTGAAGGTGAACCTCATATGGATGGAGAGCCGGGTGACCTGGTTATAGTGTTTAGAACAGAAAAACATCCACAGTTCACCCGTCGCGGCGATGATCTTTATACAAATGTTACCATTTCATTACAAGATGCTCTAACCGGGTTTACATTGGAGCTGCAACATTTAGATGGTCATAAAGTGAATGTGGCGCGCGACAAGGTCACGTGGTCAGGAGCACGCATCCGCAAGAAGGGGGAGGGCATGCCGAACTTTGAGAATAATAATCTGCATGGAAATATGTATATCACTTTTGATATTGAATTCCCCAAGAAAGATTTGAGCGATGATGACAAAGAAGCCCTAAAGAAAATTTTACAACAATCACCAAATAATAAAGTATACAATGGACTTTAG

Protein sequence:

>DPOGS211167-PA
MTLAGRDFYQILGVSRSANTNEIKKAYRKLAKALHPDKNQDDPDASQKFQDLGAAYEALSDPEKRELYDRCGEDCLKKDGMMNNNDPFASFFGDFGFHFGGESQQHETPRGADVLMELMVSLEELYNGNFIEITRNKPVIKPASGTRKCNCRQEMVTRNLGPGRFQMMQQTVCDECPNVKLVNEERLLEIEVEVGAPDNHKTRLRGEGEPHMDGEPGDLVIVFRTEKHPQFTRRGDDLYTNVTISLQDALTGFTLELQHLDGHKVNVARDKVTWSGARIRKKGEGMPNFENNNLHGNMYITFDIEFPKKDLSDDDKEALKKILQQSPNNKVYNGL-