Monarch geneset OGS2.0

DPOGS210105
Transcript: DPOGS210105-TA (2553 bp)
Protein: DPOGS210105-PA (850 aa)
Genomic position: DPSCF300017 + 1087302-1092090
RNAseq coverage: 588x (Rank: top 22%)
Annotation
Heliconius: HMEL010420; E-value 0.0; identity 67.77%
Bombyx: BGIBMGA012686-TA; E-value 0.0; identity 66.58%
Drosophila: CG14650-PA; E-value 2e-78; identity 43.41%
EBI UniRef50: UniRef50_F4WM78; E-value 8e-125; identity 38.32%; DnaJ-like protein subfamily C member 14 n=8 Tax=Formicidae RepID=F4WM78_ACREC
NCBI RefSeq: XP_001599524.1; E-value 7e-127; identity 40.05%; PREDICTED: similar to GA13147-PA [Nasonia vitripennis]
NCBI nr blastp: gi|378466337; E-value 0.0; identity 66.24%; DnaJ-22 [Bombyx mori]
NCBI nr blastx: gi|378466337; E-value 0.0; identity 65.93%; DnaJ-22 [Bombyx mori]
Group:
Gene Ontology: GO:0031072; E-value 1.3e-23; heat shock protein binding
Gene Ontology: GO:0006457; E-value 4e-11; protein folding
Gene Ontology: GO:0051082; E-value 4e-11; unfolded protein binding
KEGG pathway:
InterPro domain: [604-668] IPR001623; E-value 1.3e-23; Heat shock protein DnaJ, N-terminal
InterPro domain: [610-628] IPR003095; E-value 4e-11; Heat shock protein DnaJ
Orthology group: MCL17038; Insect specific

Nucleotide sequence:

>DPOGS210105-TA
ATGGCAAGGTCGCCTAAAGATGATCCCAGATTTAATGACAATGCTTGGAATTTCCATACGGAAACCGGCGAGCATTGTTCACCGACAGGAATTAATGAGATAAAACAGCCAACACTATCTTCCAATGGCTCTGTAATACCCGAAAACTTATATGGAAGACCAAATTTTAATAGCGGTGTGTACATTAACAAACCATACATGCAAACTGGAGATAGTACGGCAAATAAAACTGATAACATCCCTTACAACGTTTTGCCTAACGTTAACAGAATTCCACGTCAAAATTATGGCAATGCCATTCTAAACTTGGGAAATGGTCAAGTGGTAAGAGTAATTTATGACGAAAATAATCAACAACTAATCTTTCCCATGTCCGGACAGTACGAATTGTTCAATCATAATCAAGGAATACGCGACGTGCCACTACAAGCTGCCCCACCATCCCATTTGTTTACATTAAACCAAGAGTTTACTGTGAACAATAACGTTAATGTTCAGCCACCTACTGCCACAATACAGCACAGTACTTTCAATCAAGATTGTGTTAGCCAGTCTACCATACCAACATCTAACTTTTTGAAAGAAGTTCTTGGTAACTGGGAACCAAATTCTTCAGGAACATATTCTCCATTTGGACAAAATTATCCCTTAAATCATCCTGATGCCCCAACCATATTAAATAACAATCCTATAGAGTCAACACTAAATCAGATAAAACCAGAAGCTTCGCCCAAAAGAAATGAGAATGGACCATTAGCTGATACGAGTAATAAAAAAAGAATAGTAGCCGAAGTGAAGCCCATGCGACCAAGTTATTCTGAAGTAGCAAAAAACACCAAAAATGCTCAACCAGACCCAGCAAAGAAAACAAAACCACAAAACAACCCTGATAACAAAACACCCAGTAAACCAAATCCCAAACCAGAAAAACCAGTTAATACTAAACATGTGTCTGACGAAACCAAGACTAAGGCAGAAAAGAAACAGCACTCAAACGCTATATCCTCTGGAAGCGAGTCCGGTGACGTTAACAACGATGACACAGAAAAACAACAGAAAACCAACAAGCGGTCCAAAAACAAACGTAATAATATGTCACGTAAATGGTCATCCCTAGACGATATAACCAACGAAGAAGGGAACTTCAACCAGGAGAACGAAAGTCAGTTTCTGTTTATACAAAACAACGCTGAAAAGGTTAAAAAAGATAAAAAATCTGACAGAACTAAAAATATAGACAAAGGTATTGCCGAGGATGAATTGAAGTTTGAAGAAGACGACGAGCAGTCTCAATTTGGGTTCAACGAAGGTCAACCCGATGTAACCAAGGCCAGGAAGAAGAAGGAAACTCGCAACCATCACAAGGCTTCTAAGATAATTCAGGACAAGAAAAAAACTAACCAGACTAAAATAAGAAAAAACAAGCCCGGCTACATGGGTGTAGTCCATAACTACTTGGAGCACTGGGGGGGAGCCACTTGGAAAGCATTTGTTTGGTTTATTTTCTTGCTCTCTGATATCTGCCGAATGAGTCTTCATCTGTCATTCGACTTGTGTACATCTCTATTCAGTCAAACCTATGTTAGCTCGCAGGCTGTGTGGCGCAGCAGCAGGGACTGGCTCTCCAGGATCACTAAAAACAAATACCTCATGTACATCGACAGAAAGTTTGGACACACACAGTTTGCTTTTTGGAGAAAACTTAAATGGTTTAAAAAAGAAAGTGAAACGGACAGTAACGACACCAAGCTGAACTCCAACATCCCGCTGCCGGCCACAGGGGAAGAAGCCATGAAACGCTTACTGGCGTGTAAAGGAAAAGATCCGTACAGTATCCTAGGCGTCAGCGTGTCTTGTAGCGATGAAGACATCAAGCGTTACTACCGACGGCAGGCGTTCCTCGTCCATCCCGACAAGAACCAGCAGCCTGGCGCTGAAGAGGCCTTCAAGATTTTGCAGCATGCGTTCGATCTTATCGGGGAGCCGGAGCGCCGCGAGGC
CTACGAGCGTCGCGCCCGCGAGTCACGACACGCCGAGGCCGCGTGGGGTGAGCTCAGTGTACTGCTGGAACAACTGCACGACAAGATGGAGTTCGCAGCCAACACGATACGCTGTACGAACTGCGGTCGTCGTCACAAGCGAGTGATGACGTCACGGCCGTGCTACGCCGCGCGGTACTGTGGACAGTGCAAGATAAGACACTCCGCCAAGGAGGGTGATATATGGGCGGAGTCCAGTATGCTGGGTCTGCTGGTGATGTACTACGCGTGTATGGACGGAGCCGTCTACCAGATCACACAGTGGGGCAGTTGTCAGAAACGTAACCTGAGACAGCTGCGTCCCGACTGCCACGTGGTGCAGTACAGGATCGTGCTCGGGAACAAAGCGACGGCCGACCCACAGAAAACCACCACCGGACACGATCCAAATCTCGAAGAATTCCTGAACAATCTGTACAGTAAATCTGGTGTGACACCGAACACTACCGGCTGTAAGCCGACGACCGAATCAGCGGACGCAAAGAAACGACGCAATAAAAAACCTAAAGCCTGA

Protein sequence:

>DPOGS210105-PA
MARSPKDDPRFNDNAWNFHTETGEHCSPTGINEIKQPTLSSNGSVIPENLYGRPNFNSGVYINKPYMQTGDSTANKTDNIPYNVLPNVNRIPRQNYGNAILNLGNGQVVRVIYDENNQQLIFPMSGQYELFNHNQGIRDVPLQAAPPSHLFTLNQEFTVNNNVNVQPPTATIQHSTFNQDCVSQSTIPTSNFLKEVLGNWEPNSSGTYSPFGQNYPLNHPDAPTILNNNPIESTLNQIKPEASPKRNENGPLADTSNKKRIVAEVKPMRPSYSEVAKNTKNAQPDPAKKTKPQNNPDNKTPSKPNPKPEKPVNTKHVSDETKTKAEKKQHSNAISSGSESGDVNNDDTEKQQKTNKRSKNKRNNMSRKWSSLDDITNEEGNFNQENESQFLFIQNNAEKVKKDKKSDRTKNIDKGIAEDELKFEEDDEQSQFGFNEGQPDVTKARKKKETRNHHKASKIIQDKKKTNQTKIRKNKPGYMGVVHNYLEHWGGATWKAFVWFIFLLSDICRMSLHLSFDLCTSLFSQTYVSSQAVWRSSRDWLSRITKNKYLMYIDRKFGHTQFAFWRKLKWFKKESETDSNDTKLNSNIPLPATGEEAMKRLLACKGKDPYSILGVSVSCSDEDIKRYYRRQAFLVHPDKNQQPGAEEAFKILQHAFDLIGEPERREAYERRARESRHAEAAWGELSVLLEQLHDKMEFAANTIRCTNCGRRHKRVMTSRPCYAARYCGQCKIRHSAKEGDIWAESSMLGLLVMYYACMDGAVYQITQWGSCQKRNLRQLRPDCHVVQYRIVLGNKATADPQKTTTGHDPNLEEFLNNLYSKSGVTPNTTGCKPTTESADAKKRRNKKPKA-