Monarch geneset OGS2.0

DPOGS202204
TranscriptDPOGS202204-TA1383 bp
ProteinDPOGS202204-PA460 aa
Genomic positionDPSCF300149 - 242219-245462
RNAseq coverage966x (Rank: top 13%)
Annotation
HeliconiusHMEL0092040.072.19% 
BombyxBGIBMGA013498-TA0.065.92% 
DrosophilaCG6370-PA9e-5330.39% 
EBI UniRef50UniRef50_E2BN861e-6135.52%Dolichyl-diphosphooligosaccharide--protein glycosyltransferase subunit 2 n=5 Tax=Formicidae RepID=E2BN86_HARSA
NCBI RefSeqXP_001601669.13e-6836.05%PREDICTED: similar to ribophorin ii [Nasonia vitripennis]
NCBI nr blastpgi|3454871166e-6736.05%PREDICTED: dolichyl-diphosphooligosaccharide--protein glycosyltransferase subunit 2-like [Nasonia vitripennis]
NCBI nr blastxgi|3454871161e-6536.45%PREDICTED: dolichyl-diphosphooligosaccharide--protein glycosyltransferase subunit 2-like [Nasonia vitripennis]
Group
Gene OntologyGO:00182795.2e-66protein N-linked glycosylation via asparagine
GO:00082505.2e-66oligosaccharyltransferase complex
GO:00045795.2e-66dolichyl-diphosphooligosaccharide-protein glycotransferase activity
GO:00057895.2e-66endoplasmic reticulum membrane
KEGG pathwaynvi:1001174268e-68 
 K12667 (SWP1, RPN2)maps-> Protein processing in endoplasmic reticulum
    N-Glycan biosynthesis
InterPro domain[23-453] IPR0088145.2e-66Ribophorin II
Orthology groupMCL11761 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202204-TA
ATGAATAAATTAGCCCAACTTTTTATAGTTTTAAGTATATTAGCAGTCAGTAATGCTATATTACAAGGTATTATAAATGTCAAACACTTTCAATCCCTGCTAGAGGAAGGAATTAAAAGTAAAGATATCAGTACTTTGTATCACTCTATCAAGGGTCTGAAACAACTTAATGTGAAACTTCCTAAACTCTGTGAGGACATCAAAAATTCCAAATATGACATCAAGAACATAGAACATGTATTCTATCTGACGAATGCTGCCGAACTCACCGGTTGTCAAAATTTCCTATTGCCCGATGTCCTGGGTACTCCGGCAAAGGTGCTTAATAACAAAGATGTAACTATACCAGAGATATATTATGCCGTGTATTCCTTGAAAGCTATCGGTAGAGGCTCCGTGTACGACAGAGAGGATGGTTTGAAAAATTTAATCCAAATGCTGAAAAAGGATGACTCTCCGGCTAACTATGGCTATGTATTTGGTTTGTGCGAGCACATGGGATGCCAGGCTTGGACGGTGATGCATGCAGAAAATGCTCTGCTTGCCGCCGATGAGACCGATGGAAGAGCTTTACATTTCGAAGGAGGATTACCGGTTACATCACTCGTACTAAGCACTATCATCAGATCATTTAAATCTGTGAAGAAGCCCTCGCCTCTGACTCCAGACCAGAAACACAAGTTCGGGGCCTACTTGTTATCTCGTCGCTCGGCGTCAGCACCACGCACTGTTGCTTCTCTCATTGAAGCTGCAAAAATTCTTGCCGACGATCAGCCCACTCCCATATGCATCGTGTTTAAAGACAAGAAATATATCACGACAGAGACGGATTCCATTGAATTTTCAGTCACAGATTTAGTCGGTAGAGCCATTCCAAACCTGAAGCCTGATGAGGTGGTCGCCCAGTCGGGCACTAGGCTGGCCGATGACGTCGTCGTACTATCGAAGCAGCCGCTGACACAGAAACCTAACGAGCCATCGACCTTCGTTCTAAATCTCAGCAAGATTAAGTCTCAGTACGGTCTATACAAGATATCTCTCAGTACGGGAAGCAAGTCGGCCAACTTCAACGTAGCAGTGCTCGGAGAGATACAAGTGTCGAGTGTTGAGATCGGAGTGGGGGATGTAGATGGCACCACCAGTCCTAAACTCACGACAGTCGCGTACCCCAACAAGATGGCTGAAGTACTACAAGCTGATCATTTACAAAAAATGAGCGTGAAGTTCTCAATCCGCGACAAGTTCAACAAGCCGGCGCTGGTGCAGCAAGCCTTCCTGCACGTGGCCTCGCTGCAGGACGAGCGCGAGGCGATCTATGTAGCGGAGCCCGACCACGCCAAGAACTATAAAGTGGAACTGATTCAAGATGTGGAACAGAAATGA

Protein sequence:

>DPOGS202204-PA
MNKLAQLFIVLSILAVSNAILQGIINVKHFQSLLEEGIKSKDISTLYHSIKGLKQLNVKLPKLCEDIKNSKYDIKNIEHVFYLTNAAELTGCQNFLLPDVLGTPAKVLNNKDVTIPEIYYAVYSLKAIGRGSVYDREDGLKNLIQMLKKDDSPANYGYVFGLCEHMGCQAWTVMHAENALLAADETDGRALHFEGGLPVTSLVLSTIIRSFKSVKKPSPLTPDQKHKFGAYLLSRRSASAPRTVASLIEAAKILADDQPTPICIVFKDKKYITTETDSIEFSVTDLVGRAIPNLKPDEVVAQSGTRLADDVVVLSKQPLTQKPNEPSTFVLNLSKIKSQYGLYKISLSTGSKSANFNVAVLGEIQVSSVEIGVGDVDGTTSPKLTTVAYPNKMAEVLQADHLQKMSVKFSIRDKFNKPALVQQAFLHVASLQDEREAIYVAEPDHAKNYKVELIQDVEQK-