Monarch geneset OGS2.0

DPOGS207151
TranscriptDPOGS207151-TA1092 bp
ProteinDPOGS207151-PA363 aa
Genomic positionDPSCF300001 + 4250799-4253556
RNAseq coverage236x (Rank: top 43%)
Annotation
HeliconiusHMEL0157954e-15368.00% 
BombyxBGIBMGA000608-TA1e-17278.47% 
Drosophilawus-PA2e-11251.12% 
EBI UniRef50UniRef50_Q9VX953e-11051.12%LD21896p n=15 Tax=Endopterygota RepID=Q9VX95_DROME
NCBI RefSeqXP_971138.12e-12859.76%PREDICTED: similar to wurst CG9089-PA [Tribolium castaneum]
NCBI nr blastpgi|3784662207e-17078.47%DnaJ-18 [Bombyx mori]
NCBI nr blastxgi|3784662204e-17378.47%DnaJ-18 [Bombyx mori]
Group
Gene OntologyGO:00310722.7e-19heat shock protein binding
GO:00064577.7e-12protein folding
GO:00510827.7e-12unfolded protein binding
KEGG pathway 
InterPro domain[246-347] IPR0016232.7e-19Heat shock protein DnaJ, N-terminal
[4-50] IPR0078296.1e-13TM2
[286-304] IPR0030957.7e-12Heat shock protein DnaJ
Orthology groupMCL12099 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207151-TA
ATGCCAAGTCAGAAGTCGGTTATTGTAGCATACATATTTTGGCTATTTGGCGGTATATTCGGAGTTCATCATTTCTATCTACGTAGAGATAAACATGCATTTGTTTGGTGGTCCACCCTCGGGGGTTTTGGTATCGGTTGGCTCGGTGAAATATTTCGAATACCACGGTACGTCAGAGATGCTAATGAAGACCCTCAACATATTCAAGCGTTAGTGAAAAGAATGAAACATAACAAAAAGCCTCCATTTTCTATGAACCGTTTCACTGGAATGTTAATGGTTAGCTACTCATGGGGCCAGATGATGATGCTTGCTGTTCCACCTGAAGAGATATGGGGCATCAATTTTAGATATCTGAACTTCCTTGTACCATTTGTAGTTGCTCTCGGTGTGTGGACAGTGGGAAATATAGGCAGGGAGTGTGGTTCATTCTTGTGGCCTGTACTAGGCGCGTATGTGGCGTACCCGTTGCGTTACTACATTTACGACGAGAGTTTTTGGTTCACGATCATGGTACTGGTATCTGCACTGGCTTTTGACACGTTTTCTAAACAGTGGCGCCGGACTCCATATAAACGCACTCATTTTGTCAAACGGATAATAGTGCTGGGTGTTTGTGCCTCCCTTTACCTGTCTCTCTGGGTAGGTTACCTCTACTTCCATGGCACCATCACTGACAGTGATGGAGACGAAGTTCCCGTGTATGAGGCACTGCATCACTTTTTCACTAGTCCCTGGTGGCTGGATGTAAAGCAATGTATTGTGGATACATACCAATTCGCTCAGCATCATGGATGGTATGAAGTATGGAAGCAGATCATTGATCTTTCTGATCCAAAGGGAGAACAGAATGCTTACAAGGTTCTGGGTCTAGGACCAGATGCGAGTCAGCAGGAAATTACAACAAAGTGGAGGCAGCTATCAAGGGAAAATCATCCAGATAAGGCAAAGCCAGAGTTAAAGAAGGAAGCCCAAGAACGTTTCATGGACATCCAGAAGGCCTACGAGTTGCTATCTAGCCGCAAACACCGCAGACATCGTAGAAACAAGAGGGACAATTCCGACGAATATGGCAACGGCCAGGATCTATAA

Protein sequence:

>DPOGS207151-PA
MPSQKSVIVAYIFWLFGGIFGVHHFYLRRDKHAFVWWSTLGGFGIGWLGEIFRIPRYVRDANEDPQHIQALVKRMKHNKKPPFSMNRFTGMLMVSYSWGQMMMLAVPPEEIWGINFRYLNFLVPFVVALGVWTVGNIGRECGSFLWPVLGAYVAYPLRYYIYDESFWFTIMVLVSALAFDTFSKQWRRTPYKRTHFVKRIIVLGVCASLYLSLWVGYLYFHGTITDSDGDEVPVYEALHHFFTSPWWLDVKQCIVDTYQFAQHHGWYEVWKQIIDLSDPKGEQNAYKVLGLGPDASQQEITTKWRQLSRENHPDKAKPELKKEAQERFMDIQKAYELLSSRKHRRHRRNKRDNSDEYGNGQDL-