Monarch geneset OGS2.0

DPOGS211902
TranscriptDPOGS211902-TA1815 bp
ProteinDPOGS211902-PA604 aa
Genomic positionDPSCF300011 - 153640-161975
RNAseq coverage1673x (Rank: top 8%)
Annotation
HeliconiusHMEL0177229e-16963.38% 
BombyxBGIBMGA001097-TA0.070.71% 
DrosophilaCnx99A-PC0.053.43% 
EBI UniRef50UniRef50_B7PGQ21e-15053.05%Calnexin, putative n=4 Tax=Ixodidae RepID=B7PGQ2_IXOSC
NCBI RefSeqNP_001036766.10.053.43%calnexin 99A, isoform C [Drosophila melanogaster]
NCBI nr blastpgi|3227902591e-18051.84%hypothetical protein SINV_10264 [Solenopsis invicta]
NCBI nr blastxgi|1160081160.052.76%calnexin 99A, isoform C [Drosophila melanogaster]
Group
Gene OntologyGO:00055099.6e-212calcium ion binding
GO:00064572.6e-58protein folding
GO:00057832.6e-58endoplasmic reticulum
GO:00510822.6e-58unfolded protein binding
KEGG pathwaydme:Dmel_CG119582e-180 
 K08054 (CANX)maps-> Phagosome
    Antigen processing and presentation
    Protein processing in endoplasmic reticulum
InterPro domain[1-604] IPR0015809.6e-212Calreticulin/calnexin
[50-297] IPR0133201.1e-95Concanavalin A-like lectin/glucanase, subgroup
[44-299] IPR0089851.9e-68Concanavalin A-like lectin/glucanase
[257-398] IPR0090332.6e-58Calreticulin/calnexin, P
Orthology groupMCL10647 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211902-TA
ATGATGGCACCTGGTATTATGCGGGTCTTTTTATTAAGCTTCTTGGTAGTCTCTGGCTCGCTGCAAGTTACGGCCGATGTCGACGATGCCGAAGATGGAGTAACTGTTGAGACAGAAGAGGAAATCTACCAAAGTCCTAAGGCCGATCCCAAGAAGGTGTATCTGGCGGAGAACTTTGATGATGTGGCATTGTTCAAGAAGAAGTGGATTAAGTCTGAAGCAAAGAAACAGGGTGTGGACGAAGATATCGCCAAATATGATGGGAAATGGGAGATACAAATACCAACAAGAAAAATATTCAATAGCGACTCAGGGTTGGTGCTGACTACAGAGGCTAAGCATGCAGCTATATCAACACTGCTCGACCGGCCGTTCGAGTTCAAAGACAAACCACTCATTGTACAATACGAAGTGACTATGCAGGAGGGTCAAAATTGTGGTGGTGCTTACCTAAAACTTCTATCACGCGGTGTGAACACGAAAGCAGACCTCAAACAGTTCCACGACCAGACTGCGTACACCATCATGTTTGGGCCCGACAAATGTGGCAACGACAACAAACTGCACTTCATCTTCAGACACAAAAACCCCAAGAATGGGACCATCGAAGAAAAACACTGCAAGAAACCAACCCAACGTCTTGAAGACATCTACAAAGACAAGGAGCCTCACCTGTACACTCTGATAGTGCGGCCAGACAACACATTCTCAGTCCTCGTCGACAACAAGGAGTTCAACGCCGGTTCGTTGCTAGAAGACTTCACCCCACCCGTCAACCCTCCGGAGGAGGTGGACGATCCCAACGACGAGAAGCCAGAGGACTGGGACGAGAGGGAGAAGATCGTGGATCCCTCAGCGAGTAAGCCAGATGACTGGGATGAGAGTGAGCCGGCACAGATCATAGACTTCAACGCTGTCAAACCAGACGGCTGGTTGGAAGACGAGCCTGACATGATACCAGACCCGGAGGCCAAGAAACCTGCGGATTGGGACGAGGAGATGGACGGGGAGTGGGAGGCGCCTCTCGTGGATAACCCTCGCTGTGCCTCCGCACCCGGCTGTGGAACCTGGGCGCCGCCCACCATTCCCAACCCTAAATACAAGGGTATCTGGCGGGCACCTCTCATCCCCAACCCCAACTACAAGGGCAAGTGGAGTCCAAGGCGGATCCCCAACCCGGACTACTTCAACGATGAGCATCCCTTCAGGATGACGCCCATTCACGCTGTTGGATTTGAACTGTGGTCGATGTCGCCCATGCTCTTGTTCGACAACCTGATCATCACGGACGATCCGGCGGTGGCGGAGGCCTGGGCCGCTCAGGGCTTCGCTCTCAAGAAACAGAGGATATCCAGTGACTCGAAAACGTGGTGGGGCAGACTGCTGAGAGCCGTGAAGTACCGGCCGGGCGCGGTGTCGCTGTACGTGGTGTACTGCGCCGTACCTATCGTTATATACGTCGCCTACCTTATAAGGAGATCCTATGAGGAGTCCGTGGTGGAGCTCGTCCTGCGCTCGGTGGGTGACAGACCCTGGCTGTGGGGAGCCGCGCTTCTGGTTTCCTTCGCTGTGTTGGCCTTCGTCGCATACATGTGTTGTGGACCTCGAGTGGATCCGGAAGCGGATGTCAAGAAGACGGACGCGGTTGTAGAGGATGATCCTCATCAAGAAGAAGTTGAAGAAACCAGTGAGAAGACGAGCAAAGCTGATCTGGAAGGCCCCGAGCCTGAGGCTGACACCAGTGATACCACACCCTTAGTGGACTCGGAAGCAGCCGGCGACGGACAGAGGAAGAGGAAACCACGCAAGGAGTGA

Protein sequence:

>DPOGS211902-PA
MMAPGIMRVFLLSFLVVSGSLQVTADVDDAEDGVTVETEEEIYQSPKADPKKVYLAENFDDVALFKKKWIKSEAKKQGVDEDIAKYDGKWEIQIPTRKIFNSDSGLVLTTEAKHAAISTLLDRPFEFKDKPLIVQYEVTMQEGQNCGGAYLKLLSRGVNTKADLKQFHDQTAYTIMFGPDKCGNDNKLHFIFRHKNPKNGTIEEKHCKKPTQRLEDIYKDKEPHLYTLIVRPDNTFSVLVDNKEFNAGSLLEDFTPPVNPPEEVDDPNDEKPEDWDEREKIVDPSASKPDDWDESEPAQIIDFNAVKPDGWLEDEPDMIPDPEAKKPADWDEEMDGEWEAPLVDNPRCASAPGCGTWAPPTIPNPKYKGIWRAPLIPNPNYKGKWSPRRIPNPDYFNDEHPFRMTPIHAVGFELWSMSPMLLFDNLIITDDPAVAEAWAAQGFALKKQRISSDSKTWWGRLLRAVKYRPGAVSLYVVYCAVPIVIYVAYLIRRSYEESVVELVLRSVGDRPWLWGAALLVSFAVLAFVAYMCCGPRVDPEADVKKTDAVVEDDPHQEEVEETSEKTSKADLEGPEPEADTSDTTPLVDSEAAGDGQRKRKPRKE-