Monarch geneset OGS2.0

DPOGS207146
Transcript: DPOGS207146-TA (1725 bp)
Protein: DPOGS207146-PA (574 aa)
Genomic position: DPSCF300001 + 4022762-4029547
RNAseq coverage: 540x (Rank: top 23%)
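
The genomic position field above packs scaffold, strand, and coordinates into one string. A minimal Python sketch for splitting it apart follows. The function name `parse_position` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re

def parse_position(text):
    """Split a 'SCAFFOLD STRAND START-END' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1. The record itself does not confirm this.
    """
    match = re.fullmatch(r"(\S+)\s+([+-])\s+(\d+)-(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    start, end = int(start), int(end)
    return {
        "scaffold": scaffold,
        "strand": strand,
        "start": start,
        "end": end,
        "span": end - start + 1,
    }

position = parse_position("DPSCF300001 + 4022762-4029547")
print(position["scaffold"], position["strand"], position["span"])
```

Under that assumption the gene spans 6786 bp on the scaffold, well above the 1725 bp transcript length, which is consistent with the locus containing introns.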
Annotation
Heliconius: HMEL012200 (E-value 0.0, identity 70.09%)
Bombyx: BGIBMGA000581-TA (E-value 6e-135, identity 73.65%)
Drosophila: CG8531-PA (E-value 6e-143, identity 43.57%)
EBI UniRef50: UniRef50_E0VZ30 (E-value 3e-150, identity 46.36%), Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VZ30_PEDHC
NCBI RefSeq: XP_966551.1 (E-value 8e-164, identity 49.39%), PREDICTED: similar to DnaJ (Hsp40) homolog, subfamily C, member 11 [Tribolium castaneum]
NCBI nr blastp: gi|378466182 (E-value 0.0, identity 81.04%), DnaJ-17 [Bombyx mori]
NCBI nr blastx: gi|378466182 (E-value 0.0, identity 81.04%), DnaJ-17 [Bombyx mori]
Group
Gene Ontology:
GO:0031072 (E-value 9.5e-24): heat shock protein binding
GO:0006457 (E-value 1e-17): protein folding
GO:0051082 (E-value 1e-17): unfolded protein binding
KEGG pathway: none listed
InterPro domain:
[14-90] IPR001623 (E-value 9.5e-24): Heat shock protein DnaJ, N-terminal
[18-36] IPR003095 (E-value 1e-17): Heat shock protein DnaJ
Orthology group: MCL14426, single-copy universal gene

Nucleotide sequence:

>DPOGS207146-TA
ATGCGAGAAATGGACGAAGAGGGAGATAATATTCTACTTCAAGATAATTATTATCAATTACTCAATGTGGCAAAAACGGCCAGTGTAGAAGAAATAAACAGTGCCTACCGTCGTTTCTCGAGAATATTTCATCCAGACAAACACAGCACAGATCCCAATAAACAGAAATGGGCTGAACAGATATTTAATAAAATCAAAGAAGCATATGAAGTACTCTCTGACTCACACAAACGAGCTATCTACGACACACTCGGTAAAAGGGGGCTAGAAGTAGATGGGTGGGAAGTTATATTCAGAACACGAACTCCTCGGGAGATCAGAGAAGAGTATGAAAGGTTAAAAAGAGAAAGAGAAGAAAGAAGGTTACAGCAGAGTGCCAACCCTCGAGGCACAATCACACTCTCCATAAATGCTACTGATATGTTTATGAAGTACTATGATGAGTATGATATTCTTGAGGAGTCAGCTGTGATCCCAAGTGTGGAAGTCTCGGGGATGACAATCCAGCAATCTATAGATGCTCCAGTCACACTCCGCAACACTGTAACACTGTCGGGGAATATCTCCACACAGAATGGGATTGGCACGGGGTCACTTACGGTGTCAAATCGCCATCTAAGTTCCGACAGAGGCTGGACGGAGTTAGAATGTGGTATAGGAAACGGACCCGTCGTTGGCTTCAAAATATTCAGAACCATATCAAGATTGATGTTTGTTAACTGCGGCACGGGTTTGCAATTCACTTCCAGAGGGATTATGCCAAGTTTTGTATCTACGATGGCCCTTCAGCTGGACGCTCATTCTGTTGGTTATCTGACGTATCGTGCGGGGTCAGGCGCGGGGTCCTCACTCACGAGCACATACGTCCGTGACTCGCAGCGCCACCACGTGTCAGCGTCCGTACAACTCGGCTCGCCTCATTCCTTCGTGTCACTACATCTCGTCCGCAAACTAACAGATCACGATCTCAAGCTGCGCCTGGCTTTCAAAATGGGAACATTTGGTGCTATCGTGGAATATGGCGCGGAGAAGAAAGTGTCTCAGAACAGCAGTGTATCGGCTGCTGTTATGCTAGGGGTGCCCAGCGGTGTGATGCTTAAACTAAAATGGTCGTGCTCATCACAGAGCGTGGTGCTTCCTATTCATCTTTGCGAGGAGGTGATGCCATCTCCCGTTTTCTACGCCACTGCCGTACCTATAATTTCATGGCTGTTGTTGAAGAAACTGCTATTAGACCCTATAGCAAGAGATAAGAGGGAGAGGGAACGGCAGAGATCTATGGAGGCAAACTTTGAAAGGTTACAAGAAATGCAGCGTCAGGCGAGAGCAACTATAGAGCTGATGAGAGAAACGTATTCTAGAATAAAAAGTGACGAAGAAAAGAAGAAAGGCTTGGTTATTATAAGAGCTATGTATGGAAAACTACCTCAAGGCGCGTCAGATCACGATGCGTCTAGTGAGCCGATAGGTGATGGTGTTGAAGTTAGTCACGCTGAGGTTATTGACGTCACTATCCCCATGCAGTGCCTCATCAGAGATAGCCGTCTCGAATTACTAGACGCCAGCAAGTCTGAGCTGCCTGGCTTTTACGATCCGTGTGTCGGTGAAGACAAACATCTCACTGTGCTGTATATGTTCCACGGGAACGAGCATCGTGCCACCGTCCCAGACGATCAGCCGCTAGTACTGCCCCGCAACAACCATCGAATAAAGAACAGATCTTGA

Protein sequence:

>DPOGS207146-PA
MREMDEEGDNILLQDNYYQLLNVAKTASVEEINSAYRRFSRIFHPDKHSTDPNKQKWAEQIFNKIKEAYEVLSDSHKRAIYDTLGKRGLEVDGWEVIFRTRTPREIREEYERLKREREERRLQQSANPRGTITLSINATDMFMKYYDEYDILEESAVIPSVEVSGMTIQQSIDAPVTLRNTVTLSGNISTQNGIGTGSLTVSNRHLSSDRGWTELECGIGNGPVVGFKIFRTISRLMFVNCGTGLQFTSRGIMPSFVSTMALQLDAHSVGYLTYRAGSGAGSSLTSTYVRDSQRHHVSASVQLGSPHSFVSLHLVRKLTDHDLKLRLAFKMGTFGAIVEYGAEKKVSQNSSVSAAVMLGVPSGVMLKLKWSCSSQSVVLPIHLCEEVMPSPVFYATAVPIISWLLLKKLLLDPIARDKRERERQRSMEANFERLQEMQRQARATIELMRETYSRIKSDEEKKKGLVIIRAMYGKLPQGASDHDASSEPIGDGVEVSHAEVIDVTIPMQCLIRDSRLELLDASKSELPGFYDPCVGEDKHLTVLYMFHGNEHRATVPDDQPLVLPRNNHRIKNRS-
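
The transcript and protein records above can be cross-checked by translating the coding sequence. The sketch below assumes the standard genetic code (NCBI translation table 1) and follows this record's convention of writing the stop codon as `-`. The function name `translate` is hypothetical and not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with bases cycling in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol, matching the trailing '-'
    in the protein record above.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)

# First and last codons of DPOGS207146-TA, copied from the record above.
print(translate("ATGCGAGAAATGGACGAAGAG"))  # MREMDEE
print(translate("AACAGATCTTGA"))           # NRS-
```

The two fragments reproduce the start (MREMDEE) and end (NRS-) of DPOGS207146-PA. The lengths also agree: 1725 bp is 575 codons, that is 574 amino acids plus one stop codon.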