Monarch geneset OGS2.0

DPOGS207616
TranscriptDPOGS207616-TA1572 bp
ProteinDPOGS207616-PA523 aa
Genomic positionDPSCF300248 + 225654-227225
RNAseq coverage1328x (Rank: top 10%)
Annotation
HeliconiusHMEL0078620.076.38% 
BombyxBGIBMGA006523-TA3e-8477.47% 
DrosophilaCG6453-PA2e-12344.15% 
EBI UniRef50UniRef50_E0VQQ46e-13445.94%Glucosidase 2 subunit beta, putative n=3 Tax=Coelomata RepID=E0VQQ4_PEDHC
NCBI RefSeqXP_974655.23e-14049.62%PREDICTED: similar to glucosidase 2 subunit beta [Tribolium castaneum]
NCBI nr blastpgi|1892345785e-13949.62%PREDICTED: similar to glucosidase 2 subunit beta [Tribolium castaneum]
NCBI nr blastxgi|1892345788e-15050.10%PREDICTED: similar to glucosidase 2 subunit beta [Tribolium castaneum]
Group
Gene OntologyGO:00055098.3e-05calcium ion binding
KEGG pathwaytca:6635228e-140 
 K08288 (PRKCSH)maps-> Protein processing in endoplasmic reticulum
InterPro domain[356-509] IPR0090112e-18Mannose-6-phosphate receptor, binding
[409-467] IPR0129132.8e-07Glucosidase II beta subunit-like
Orthology groupMCL14838 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207616-TA
ATGTTTTATTCTGCTTGGGACAAGCAATTAAGATCATCTATATTTATAATATTTTTATTAGTAATATCGGCTCAATCAGATGTACCTAGGCCGCGCGGCGTTTCTTTATCAAAAGCTTCCCTTTATTTGCCGACTAAAGATTTTACGTGTTTCGATGGAACCAGCACTATTCCTTTCAGTTACGTAAATGACGATTATTGCGACTGCTTTGATGGTAGCGATGAACCTGGAACATCGGCGTGCTTAAATGGAGTTTTTCATTGTACTAATGCAGGCCATCGACCGCAGAACATTCCAAGCTCCCGAGTCAATGACGGTGTATGTGATTGCTGTGACGGCACTGATGAATATGCAAACCAAGAGACATGTCCAGACATTTGCGAAGAGTTAGGCAAAGAAGCAAGGGTAAAAGCACAACAGTTGGCAGAACTTCACAAAGCTGGTAACTCTATTCGATTAGAACTCATTGAAAAGGGCAACAAAAAACGAAATGATATGGCAGAACAACTTTCCCAATTAGAAAAAGATAAATATGAGGCTCAAAAAATGAAGGAAGAGAAGGAAAGTCTTAAAAATGATCTTGAAGCTAAAGAAAATGAAGCCTTACAAGTCTATAGAGATGCCGAAGAAAAAGAAAGGCAACAAAAAGCACAGTTAGAAAAGCAACAACAAGAAAAAGAAGCAAATGAACAGTTCATCCGATTTGATTCTAATAATGATGGTGTCTTATCAGCTGATGAAATTAAAGTAGTCAATGTATTTGATAAGAACAAGGATGGAGAAGTTGATTCAGAGGAACTTAAATATTTTATAGGTGAAAAAGATAGCATAGAAAAAGAGGAATTTATTGATACAACATACCCATTGCTAAAGCCATTGCTCATGTTAGAACAAGGTATGTTCCGTCCAGCTGAGAATGAAGAAGAAACTGAGGAATCTGAACATGAAGATGAGGAAGAGCCTAAGATAGCAGATATGGAAGATCTAGCAGATGATGAAGGCCATGACGACATTCCGGAAGACGATGAACACCATGAAGAACAAACTGATGATACAAAGAAATATGACGATGATACTCAGAAACTAATTGATGAAGCCACAGAAGCTAGACGTCAATTAGCTGAAGCTGAGCGTGCTGTAAGGGAAATAGAATCAAATATAAGGACATTCCAGCAGAATCTAGAAAAGGATTATGGATTACAACAAGAATTTGCTACCCTTGATGGTGAGTGTATCGAATATGAAGACAAAGAATATGTTTACAAGCTGTGTCTCTTCCAGAAAGTTACACAGAAATCAAAGAATGGCGGCATGGAAATAGGACTCGGGGATTGGGGTGAATGGGTTGGTGAAGATGGCAATAAATATTCTGTTATGAAATACACAAATGGAATAGCGTGCTGGAATGGTCCTAATAGATTAACAATAGTCAATGTGAGCTGTGGCTTGGAAACTAAAATTACATCAGTTACGGAACCATTCCGTTGTGAATATAAAATGAATTTAATCACTCCAGCCGCATGTGACGATTCAAATTACACTCAACAGCAATCATCGCACGATGAACTGTGA

Protein sequence:

>DPOGS207616-PA
MFYSAWDKQLRSSIFIIFLLVISAQSDVPRPRGVSLSKASLYLPTKDFTCFDGTSTIPFSYVNDDYCDCFDGSDEPGTSACLNGVFHCTNAGHRPQNIPSSRVNDGVCDCCDGTDEYANQETCPDICEELGKEARVKAQQLAELHKAGNSIRLELIEKGNKKRNDMAEQLSQLEKDKYEAQKMKEEKESLKNDLEAKENEALQVYRDAEEKERQQKAQLEKQQQEKEANEQFIRFDSNNDGVLSADEIKVVNVFDKNKDGEVDSEELKYFIGEKDSIEKEEFIDTTYPLLKPLLMLEQGMFRPAENEEETEESEHEDEEEPKIADMEDLADDEGHDDIPEDDEHHEEQTDDTKKYDDDTQKLIDEATEARRQLAEAERAVREIESNIRTFQQNLEKDYGLQQEFATLDGECIEYEDKEYVYKLCLFQKVTQKSKNGGMEIGLGDWGEWVGEDGNKYSVMKYTNGIACWNGPNRLTIVNVSCGLETKITSVTEPFRCEYKMNLITPAACDDSNYTQQQSSHDEL-