Monarch geneset OGS2.0

DPOGS211439
Transcript: DPOGS211439-TA (1422 bp)
Protein: DPOGS211439-PA (473 aa)
Genomic position: DPSCF300223, 146204-149507
RNAseq coverage: 884x (Rank: top 14%)
Annotation
  Heliconius: HMEL013821; 3e-150; 63.49%
  Bombyx: BGIBMGA002188-TA; 0.0; 75.81%
  Drosophila: P58IPK-PA; 6e-151; 56.37%
  EBI UniRef50: UniRef50_Q9VHA8; 8e-149; 56.37%; LD25575p n=20 Tax=Neoptera RepID=Q9VHA8_DROME
  NCBI RefSeq: XP_002074116.1; 1e-154; 55.21%; GK14476 [Drosophila willistoni]
  NCBI nr blastp: gi|378466278; 0.0; 75.10%; DnaJ-20 [Bombyx mori]
  NCBI nr blastx: gi|378466278; 0.0; 75.10%; DnaJ-20 [Bombyx mori]
Group
  Gene Ontology:
    GO:0031072; 1.5e-28; heat shock protein binding
    GO:0005488; 1.5e-17; binding
    GO:0006457; 2.2e-16; protein folding
    GO:0051082; 2.2e-16; unfolded protein binding
    GO:0005515; 2.1e-05; protein binding
  KEGG pathway:
    dwi:Dwil_GK14476; 3e-154
    K09523 (DNAJC3) maps to: Protein processing in endoplasmic reticulum
  InterPro domain:
    [368-460] IPR001623; 1.5e-28; Heat shock protein DnaJ, N-terminal
    [138-259] IPR011990; 1.5e-17; Tetratricopeptide-like helical
    [374-392] IPR003095; 2.2e-16; Heat shock protein DnaJ
  Orthology group: MCL14255; Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211439-TA
ATGGACTTGTTTTTAAATAAAAATTGGAACAAAGTCACGCCATGCCTAGTGTTATTAGCGTTAGAAGTTTTGCTAGAATTTTCAGAATGTGCGACCCAGGCGGAAGTTAACAAGCACCTAGAGCTGGGCCGCGACTTCCTGGCCCGAGGTCAGCTGTCCGACGCACTCACACACTATCACGCCGCTGTTGAGGGTGATCCACATAACTACCTCACATACTTTAAGAGAGGCACTGTGTTAGAAATGAAGGCCGACTTCACAGCCGCCCGACTCCACAGGGCTAATGTGTACCTCAAGCTAGCTCAGTACAGGGAGGCCAAGGAAGACTATCTACAAGTTACTTATAGTGAACCTTACAATGAGGAGGCGATATCCTTGTATCACCGGATGGATGGTCTGTCAGAGGAGCTACAGCTAGCGGAGGCCTACTACCGCGGGCGGGACTTCGCCGCCGCCGCTGAACTCACCTCCCGACTGCTAGAGGCTTCCCCCTGGGCCGCTAACCTCAGACAACTTAGGGCGGAATGCTATATTGCACTAAATGATCTGTTCTCAGCGGTGTCGGATATCAGGTCTGTGAATCGTTTACAGCAGGACTCCACTGACGGCTACCACCGTCTTGCCACACTCCTGTACCAACTGGGACATGTCAGTGACGCTCTCAAGGAAATAAGAGAATGTCTCAAACTAGACCCGGAGCACAAGCTGTGTTTCCCGTTGTACAAGAAATTAAAGAAAGTGGACAAACTGTTATTAGACTGTGAGGAGGCCAGTCAGAACAGAGAGTTTGTGAAGTGTGTGGACAAGGCTGAGGCGGTGCTGAAGGTGGAACAGGAGGTAACGCTGGTGGTGTTTGAGGCCAGGAAGTGGCTGTGCTCTTGTCATGCTAAGGAGGAGCAGTATTCAGAAGCTATCCTGGAGTGTGGCCGAGCTCTGGAACTACAACGAGATGCGGGCGTGTTATGTTCCAGAGGAGACGCCTGGCTCGGACTGGGGGAGTTTGATGACGCTATCAGATCCTACAAGGAGGCGCTGGATATAGACGAGGGGCTGCAGAGAGCCAAGGATGGGATCAGCAGGGCACAGAAACTACAGAAACAGTCGGAGCAGAGAGACTACTACAAGATATTAGGAGTTAAGAGAACGGCGAACAAACAGGAGATCACGAAGGCGTACCGCAAGGCGGCGCAGAAGTGGCACCCGGACAACTTCCAGGGAGACGAGAAGAAACTGGCGGAGAAGAAGTTCATAGACATCGCCGCCGCCAAAGAGGTGCTGACGGACCCCGAGAAGCGCGCCGTGTTCGACGCGGGCGGTGACCCGCTGGACCCCGAGGCGGGTCGCCAGCAGCACGGGTTCAACGCCCCCTTCGGCCACTTCCACCACGGCAGCCCCTTCCAGTTCAAGTTCCACTTCAACTGA

Protein sequence:

>DPOGS211439-PA
MDLFLNKNWNKVTPCLVLLALEVLLEFSECATQAEVNKHLELGRDFLARGQLSDALTHYHAAVEGDPHNYLTYFKRGTVLEMKADFTAARLHRANVYLKLAQYREAKEDYLQVTYSEPYNEEAISLYHRMDGLSEELQLAEAYYRGRDFAAAAELTSRLLEASPWAANLRQLRAECYIALNDLFSAVSDIRSVNRLQQDSTDGYHRLATLLYQLGHVSDALKEIRECLKLDPEHKLCFPLYKKLKKVDKLLLDCEEASQNREFVKCVDKAEAVLKVEQEVTLVVFEARKWLCSCHAKEEQYSEAILECGRALELQRDAGVLCSRGDAWLGLGEFDDAIRSYKEALDIDEGLQRAKDGISRAQKLQKQSEQRDYYKILGVKRTANKQEITKAYRKAAQKWHPDNFQGDEKKLAEKKFIDIAAAKEVLTDPEKRAVFDAGGDPLDPEAGRQQHGFNAPFGHFHHGSPFQFKFHFN-
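
The record lists a 1422 bp transcript and a 473 aa protein, which is consistent with 474 codons where the last one is a stop (shown as "-" at the end of the protein sequence). The sketch below is one possible way to check that relationship. It assumes the standard genetic code and that the nucleotide sequence is a complete coding sequence in frame; the names `translate` and `expected_protein_length` are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = (CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return "".join(stop_symbol if aa == "*" else aa for aa in residues)


def expected_protein_length(cds_bp):
    """Residue count for a CDS of cds_bp bases that ends in one stop codon."""
    return cds_bp // 3 - 1


# First ten codons and last seven codons of DPOGS211439-TA, copied from above.
print(translate("ATGGACTTGTTTTTAAATAAAAATTGGAAC"))  # MDLFLNKNWN
print(translate("TTCAAGTTCCACTTCAACTGA"))           # FKFHFN-
print(expected_protein_length(1422))                # 473
```

Passing the full DPOGS211439-TA sequence to `translate` and comparing the result with DPOGS211439-PA would extend the same check to the whole record.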