Monarch geneset OGS2.0

DPOGS210046
TranscriptDPOGS210046-TA1188 bp
ProteinDPOGS210046-PA395 aa
Genomic positionDPSCF300017 - 1182597-1186832
RNAseq coverage9680x (Rank: top 1%)
Annotation
HeliconiusHMEL0053770.085.64% 
BombyxBGIBMGA000475-TA0.084.55% 
DrosophilaCrc-PA1e-16774.52% 
EBI UniRef50UniRef50_P294132e-16574.52%Calreticulin n=180 Tax=Eukaryota RepID=CALR_DROME
NCBI RefSeqNP_001037075.10.084.29%calreticulin precursor [Bombyx mori]
NCBI nr blastpgi|178269330.084.13%calreticulin [Galleria mellonella]
NCBI nr blastxgi|178269330.086.15%calreticulin [Galleria mellonella]
Group
Gene OntologyGO:00064579.4e-291protein folding
GO:00057839.4e-291endoplasmic reticulum
GO:00510829.4e-291unfolded protein binding
GO:00055099.4e-291calcium ion binding
KEGG pathwaydgr:Dgri_GH195791e-167 
 K08057 (CALR)maps-> Chagas disease
    Phagosome
    Antigen processing and presentation
    Protein processing in endoplasmic reticulum
InterPro domain[1-395] IPR0091699.4e-291Calreticulin
[7-374] IPR0015804.5e-242Calreticulin/calnexin
[16-231] IPR0133203.6e-96Concanavalin A-like lectin/glucanase, subgroup
[12-243] IPR0089855.5e-68Concanavalin A-like lectin/glucanase
Orthology groupMCL11755 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210046-TA
ATGAAAAAAATATTCGTAACTATTGCCAGCCTGCTGGCATTAAGTTTAGTTAATTGTGATGTGTACTTTGAAGAGAAATTCCCTGATGACTCCTGGGAATCCAGTTGGGTGTACAGTGAACATCCCGGAAAAGAATTCGGCAAATTCAAACTGACCGCTGGAAAGTTCTACAACGACCAAGAGGAAGATAAAGGTCTTCAGACATCAGAAGATGCGAGGTTTTACGCCCTTTCGCGCAAGTTTGAGCCGTTCTCAAACGAAGGCAAGCCTTTGGTGGTTCAGTTTTCTGTGAAACATGAACAGGACATTGACTGTGGGGGTGGATACCTCAAGGTGTTCGACTGCAAGCTCGACCAGAAAGACATGCATGGAGAGAGTCCCTATGAAATCATGTTTGGACCCGACATCTGCGGACCCGGAACTAAGAAGGTGCACGTGATCTTCAGCTACAAGGGCAAGAACCACCTCATCAAGCAGGACATCCGTTGTAAGGACGACGTCTACACTCACATGTACACACTCATTGTGAAGCCCGATAACACCTACCAGGTCCTCATCGACAATGAAGAGGTCCAAGCCGGCAGCCTCGAAGAACACTGGGACTTCCTCCCACCTAAGATGATCAAGGACCCTGAAGCTAAGAAGCCCGAGGATTGGGATGACCGTGCCACAATCCCTGATCCGGAAGACACTAAGCCAGAAGACTGGGACAAACCTGAACACATCCCTGACCCTGATGCCGCCAAGCCCGAGGATTGGGATGATGAGATGGACGGTGAATGGGAACCACCCATGATCGACAACCCTGACTACAAGGGAACCTGGGCGCCTAAGCAGATCTCCAACCCAGCTTACAAAGGTGCGTGGGTGCACCCCGAGATCGACAACCCTGAATACACTCCGGACCCCAATCTATACAAGAGGGATGAACTGTGCGCCATCGGTCTAGACTTGTGGCAGGTGAAGTCTGGAACAATCTTCGATAACATACTCTTCACCGATGACATAGAACTGGCCAAGGAGAGGGCCGAGGTCGTGAAGAAGACTCAGGAGGGTGAGAAGAAGATGAAGAACGAACAGGATGAGCTGGAGAGAGAGAAGGATAAGGACAAGCCCGAGGAGGAGGATGATGAGGATCTTGATGATGAGGGATTAGCATCACCTGTGGAGGAGCACGATGAGTTGTGA

Protein sequence:

>DPOGS210046-PA
MKKIFVTIASLLALSLVNCDVYFEEKFPDDSWESSWVYSEHPGKEFGKFKLTAGKFYNDQEEDKGLQTSEDARFYALSRKFEPFSNEGKPLVVQFSVKHEQDIDCGGGYLKVFDCKLDQKDMHGESPYEIMFGPDICGPGTKKVHVIFSYKGKNHLIKQDIRCKDDVYTHMYTLIVKPDNTYQVLIDNEEVQAGSLEEHWDFLPPKMIKDPEAKKPEDWDDRATIPDPEDTKPEDWDKPEHIPDPDAAKPEDWDDEMDGEWEPPMIDNPDYKGTWAPKQISNPAYKGAWVHPEIDNPEYTPDPNLYKRDELCAIGLDLWQVKSGTIFDNILFTDDIELAKERAEVVKKTQEGEKKMKNEQDELEREKDKDKPEEEDDEDLDDEGLASPVEEHDEL-