Monarch geneset OGS2.0

DPOGS211303
Transcript: DPOGS211303-TA | 1551 bp
Protein: DPOGS211303-PA | 516 aa
Genomic position: DPSCF300125 - 242031-243581
RNAseq coverage: 106x (Rank: top 60%)
Annotation
Heliconius: HMEL011213 | 6e-10 | 24.42%
Bombyx: BGIBMGA007349-TA | 0.0 | 75.94%
Drosophila: Hsp60-PA | 0.0 | 71.26%
EBI UniRef50: UniRef50_P10809 | 0.0 | 66.40% | 60 kDa heat shock protein, mitochondrial n=2607 Tax=cellular organisms RepID=CH60_HUMAN
NCBI RefSeq: XP_318461.2 | 0.0 | 73.48% | AGAP004002-PA [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|116253 | 0.0 | 78.74% | chaperonin isoform [Heliothis virescens]
NCBI nr blastx: gi|116253 | 0.0 | 79.37% | chaperonin isoform [Heliothis virescens]
Group
Gene Ontology: GO:0044267 | 3.9e-250 | cellular protein metabolic process
  GO:0005524 | 3.9e-250 | ATP binding
  GO:0042026 | 2.4e-195 | protein refolding
  GO:0005737 | 2.4e-195 | cytoplasm
KEGG pathway: aga:AgaP_AGAP004002 | 0.0
  K04077 (groEL, HSPD1) maps-> Type I diabetes mellitus
                               RNA degradation
InterPro domain: [1-503] | IPR002423 | 3.9e-250 | Chaperonin Cpn60/TCP-1
  [1-495] | IPR001844 | 2.4e-195 | Chaperonin Cpn60
Orthology group: MCL10648 | Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211303-TA
ATGGGACCTAAAGGACGGAACGTTATATTAGAACAAACATTTGGACCACCTAAAATCACAAAAGATGGTGTAACCGTTGCAAAAGGGATTGAACTAAAAGATAAGTTCCAAAATATTGGCGCAAAGCTTGTGCAAAATGTTGCACATAAAACTAACGAAGAAGCCGGCGATGGAACTACAACAGCTACTGTTCTAGCAAGAGCAATAGCGAGAGAAGGGTTTGAATGCATCTCAAAGGGGGCAAATCCAATAGAAATTCGAAAAGGTGTGATGTTGGCTGTGGAAACAGTTACAGAACATTTGAAAAAGATGTCGAAACCTGTTAAAACCTCTGATGAAATAGAACAAGTCGCAACTATTTCAGCCAATGGTGACAGAAGTATAGGGAAACTCATAGCGGCCGCCATGAATAGGGTTGGAAAAGACGGTGTCATAACTGTCAAAGATGGTAAAACCCTCGATGATGAATTAGAAATAATAGATGGAATGAAACTAGAGAAGGGTTATATATCGCCATATTTTATTAATTCTAGTAAAGGTCCAAAGGTTGAATATAATGACGCGTTGATACTTTTTTCGGATAAAAAAATTTTCAATGCCAATCAAATTGTGCCCGCCTTGGAAATAGCTAATGCTCAAAAAAAACCATTGATAATTATTGCAGAAGATTTTGAGGGCGAACCTCTATCAGTATTAGTAGTAAATAAGTTAAAAATTGGTTTACAAGTAGCTGCTGTGAAAGCACCCGGTTTTGGTGAATATAGAAAAAATACTCTCATTGATTTAGCTATTGCGACTGGTGGTGTAATATTTGAAGATAATGAGAATTTAATTCGTCTTGAAGATTGTCAGCTTGAAAGTCTTGGTCAAGTCGGTGAAGTTTTAATAACAAAAGACACTACACTGTTAATCAAAGGAAAAGGCGATAAAGCTGAAATAGAACAGCGCATTGAACAGATTAGAGCTGAATACGAAGAAACTTCTAGCGAGTTTGAGAAGCAAAGGTTATTAGATCGCATTTCTAGACTTAAATGTGGTGTGGCTATATTACGTATTGGAGGATGTAGTGAAGTAGAAGTTAATGAAAAGAAAGACCGTGTCAACGATGCCCTTAATGCTACAAGAGCTGCTGTTTCGGAGGGAATTGTTCCTGGTGGGGGAGCAGCCCTCGTTAGATGTATACCTATTTTAAATAAACTAAAACCTGCCAATCCAGACCAGGCAGTTGGAATAGATATAGTCAAGAAAGCTTTACGTACTCCATGCTTAACGATCGCAAGTAACGCTGGTTATGATGGTTCGGTTGTGGTTTCGAAAGTTGAGAGCATGGACAAAGATTTTGGTTACGACGCTCTCAACAATGAATACGTTAATATGATAGAAAAGGGCATTATAGATCCTACGAAGGTGGTTAGAAGAGCACTCACTGATGCCAGTGGCGTTGCATCACTTTTGACTACAGCAGAGGCTGTTATTTGTGAACAGAGAATGGATAAAAGCTTATCACCTCCAACACTAGGTCCCGATACACAAGGAGTAACTATCTATTAA
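
The lengths reported in this record can be cross-checked against each other. The following is a minimal sketch, assuming the genomic coordinates are 1-based and inclusive and that the transcript is a single uninterrupted coding sequence ending in a stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS that ends in a stop codon.

    Assumes the whole transcript is coding; the stop codon is not counted.
    """
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# Values taken from the record: DPSCF300125 - 242031-243581, 1551 bp, 516 aa.
genomic_span = span_length(242031, 243581)
print(genomic_span)                      # 1551
print(expected_protein_length(1551))     # 516
```

Under these assumptions the genomic span, the transcript length, and the protein length agree, which is consistent with a single-exon gene model.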

Protein sequence:

>DPOGS211303-PA
MGPKGRNVILEQTFGPPKITKDGVTVAKGIELKDKFQNIGAKLVQNVAHKTNEEAGDGTTTATVLARAIAREGFECISKGANPIEIRKGVMLAVETVTEHLKKMSKPVKTSDEIEQVATISANGDRSIGKLIAAAMNRVGKDGVITVKDGKTLDDELEIIDGMKLEKGYISPYFINSSKGPKVEYNDALILFSDKKIFNANQIVPALEIANAQKKPLIIIAEDFEGEPLSVLVVNKLKIGLQVAAVKAPGFGEYRKNTLIDLAIATGGVIFEDNENLIRLEDCQLESLGQVGEVLITKDTTLLIKGKGDKAEIEQRIEQIRAEYEETSSEFEKQRLLDRISRLKCGVAILRIGGCSEVEVNEKKDRVNDALNATRAAVSEGIVPGGGAALVRCIPILNKLKPANPDQAVGIDIVKKALRTPCLTIASNAGYDGSVVVSKVESMDKDFGYDALNNEYVNMIEKGIIDPTKVVRRALTDASGVASLLTTAEAVICEQRMDKSLSPPTLGPDTQGVTIY-
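
The protein sequence corresponds to a translation of the nucleotide sequence in frame 1. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and writing the stop codon as "-" as this record does; `translate` is an illustrative helper, not part of any tool named here. Only the first 30 and last 12 nucleotides of DPOGS211303-TA are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence, one codon at a time."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 nt and last 12 nt of DPOGS211303-TA, copied from the record.
print(translate("ATGGGACCTAAAGGACGGAACGTTATATTA"))  # MGPKGRNVIL
print(translate("ACTATCTATTAA"))                    # TIY-
```

Both fragments match the start (MGPKGRNVIL) and end (TIY-) of DPOGS211303-PA.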