Monarch geneset OGS2.0

DPOGS208881
TranscriptDPOGS208881-TA1212 bp
ProteinDPOGS208881-PA403 aa
Genomic positionDPSCF300009 - 1265045-1267380
RNAseq coverage3385x (Rank: top 4%)
Annotation
HeliconiusHMEL0038910.086.08% 
BombyxBGIBMGA012541-TA0.087.10% 
DrosophilaDroj2-PA2e-13661.38% 
EBI UniRef50UniRef50_Q8WW224e-14163.95%DnaJ homolog subfamily A member 4 n=217 Tax=Opisthokonta RepID=DNJA4_HUMAN
NCBI RefSeqNP_001040292.10.087.34%DnaJ (Hsp40) homolog 2 [Bombyx mori]
NCBI nr blastpgi|1140532030.087.34%DnaJ (Hsp40) homolog 2 [Bombyx mori]
NCBI nr blastxgi|1140532030.087.56%DnaJ (Hsp40) homolog 2 [Bombyx mori]
Group
Gene OntologyGO:00310727.9e-32heat shock protein binding
GO:00064573.9e-22protein folding
GO:00510823.9e-22unfolded protein binding
KEGG pathwaytca:6600936e-173 
 K09502 (DNAJA1)maps-> Protein processing in endoplasmic reticulum
InterPro domain[2-94] IPR0016237.9e-32Heat shock protein DnaJ, N-terminal
[110-254] IPR0089713.9e-22HSP40/DnaJ peptide-binding
[135-209] IPR0013052.2e-19Heat shock protein DnaJ, cysteine-rich domain
[8-26] IPR0030954.6e-19Heat shock protein DnaJ
[262-337] IPR0029396.1e-19Chaperone DnaJ, C-terminal
Orthology groupMCL11103 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208881-TA
ATGGTGAAAGAAACAACCTACTATGACATATTGGGTGTCAAACCCACCTGTACGACGGATGAGTTGAAGAAGGCATATAGAAAACTTGCACTTAAGTATCATCCTGATAAGAATCCTAATGAAGGAGAGCGCTTTAAACAAATCTCACAGGCTTATGAAGTACTTTCTAATCCAGACAAAAGAAGAATATATGATCAAGGTGGTGAACAGGCTTTGAAAGAAGGTGGTGGTGGAGGCAGTGGCTTCTCATCACCGATGGACTTGTTTGATATGTTCTTTGGCAGTGGATACAGTGGAGGAAGACGACGCGGTCGCGAAAGGAAAGGCAAAGATGTCATTCATCAACTATCTGTCACACTGGAAGAGCTTTACAAAGGGGCTGTTCGTAAGCTAGCCTTACAGAAGAATGTCATCTGTGAGAAATGTGAAGGTCGTGGAGGGAAGAAGGGTGCAGTGTTAGTATGCCCGACATGCCGAGGTACAGGAATGCAGGTTCAGATCCAACAACTGGGACCGGGAATGATCCAACAAATCCAAACAGTCTGCTCTGAATGCAGAGGTCAACGTGAAATCATAGATCCTAAAGATCGTTGCAAAGTTTGCCAGGGTCGTAAGACAGTACGAGATCGTAAAATCATTGAAGTGCATATAGACAAGGGTATGACAGACGGACAGAAGATTATGTTTAGCGGTGAGGGTGACCAGGAACCAGAGTTGGAGCCGGGTGATCTTATTATAGTATTAGATGAGAAGGAACATGAGGTTTTCAAACGTACTGGTAATGACCTCATTATAAGAATTAATATAGAATTGGTAGAGGCTCTGTGTGGGTTCCAGAAGGTAATAAGAACTTTAGATGATAGAGATATTGTGATAACTGTGTTACCGGGAGAAGTGACAAAGCATGGTGAAGTGAAGTGTGTTTTGAATGAAGGTATGCCCATGTACAAAAATCCATTTGAAAAAGGCCAGCTGATTATGCAGTTCTTGGTTAATTTCCCCAATCGCATTCCTCCTGAAGTCATTCCAGCATTGGAGAACTGCCTACCACCTAGACCTATGGTGGAGATTCCAGAGTTAGCGGAAGAATGTCAGCTCATGGATCTAGATCCGGAACAGGAGTCTCGCCGTCGACGAGCCCACCAGGGTAATGCATATGAAGAGGACGATGACCATTCGGGCGTCAATAGAGTTCAATGTGCTACTGGCTGA

Protein sequence:

>DPOGS208881-PA
MVKETTYYDILGVKPTCTTDELKKAYRKLALKYHPDKNPNEGERFKQISQAYEVLSNPDKRRIYDQGGEQALKEGGGGGSGFSSPMDLFDMFFGSGYSGGRRRGRERKGKDVIHQLSVTLEELYKGAVRKLALQKNVICEKCEGRGGKKGAVLVCPTCRGTGMQVQIQQLGPGMIQQIQTVCSECRGQREIIDPKDRCKVCQGRKTVRDRKIIEVHIDKGMTDGQKIMFSGEGDQEPELEPGDLIIVLDEKEHEVFKRTGNDLIIRINIELVEALCGFQKVIRTLDDRDIVITVLPGEVTKHGEVKCVLNEGMPMYKNPFEKGQLIMQFLVNFPNRIPPEVIPALENCLPPRPMVEIPELAEECQLMDLDPEQESRRRRAHQGNAYEEDDDHSGVNRVQCATG-