Monarch geneset OGS2.0

DPOGS201416
TranscriptDPOGS201416-TA1395 bp
ProteinDPOGS201416-PA464 aa
Genomic positionDPSCF300006 - 1688221-1689615
RNAseq coverage2158x (Rank: top 6%)
Annotation
HeliconiusHMEL0090620.082.76% 
BombyxBGIBMGA002570-TA0.080.48% 
DrosophilaCG33303-PA2e-13151.28% 
EBI UniRef50UniRef50_E2A8B37e-13951.40%Dolichyl-diphosphooligosaccharide--protein glycosyltransferase subunit 1 n=6 Tax=Arthropoda RepID=E2A8B3_CAMFO
NCBI RefSeqXP_001663283.12e-15759.23%ribophorin [Aedes aegypti]
NCBI nr blastpgi|1571680073e-15659.23%ribophorin [Aedes aegypti]
NCBI nr blastxgi|1571680079e-15258.96%ribophorin [Aedes aegypti]
Group
Gene OntologyGO:00064861.3e-207protein glycosylation
GO:00057831.3e-207endoplasmic reticulum
GO:00045791.3e-207dolichyl-diphosphooligosaccharide-protein glycotransferase activity
GO:00160211.3e-207integral to membrane
KEGG pathwayaag:AaeL_AAEL0130715e-157 
 K12666 (OST1, RPN1)maps-> Protein processing in endoplasmic reticulum
    N-Glycan biosynthesis
InterPro domain[26-456] IPR0076761.3e-207Ribophorin I
Orthology groupMCL14929 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201416-TA
ATGGCTAAAATACAATTCTTTGCTTTTATTTTATTATATGTTGCTTCTAATTGCATCAGTGTAAACGTCGATAATATTTCTAGTGATGTAAGAATTAAAAATGTTGATAGATCGATCGATATATCTTCACAGCTTGTGAAAACATCGTCTAAAATAACCTTTGAAAATACAGGAAAAGTGCCGGTGAAACAATTTCTTTTAGCTGTTGAAGGTGCGGCTAAAAACAACTTGGCTTTTATTGGTGCCAGAGATAGCAATAACAAAGATTTGCGACTTGTGGAAACAACAGTCAAAGGATATGATAACGTAAAATTTTGGCGTGTAGAGTTAAAGGAATCTGTTAATGCTGCGGCCAGTGCAGTCATTACGTCTGATGCAGTGTTCTCTAAAGCATTGTTACCACACCCAACAGCAATAACTCAGCAAGAAGACCAACTTGTCAAATACAACAGTAATTTGTACTTTTATTCACCATATAAAGTGTTAACACAGAAAACAACAGTCGTATTAAATACTAAATCAGTTGAATCATTCACAAAAGTTAAGCCTTTCTCCCAACAAGATGGCAATATTAACTATGGTCCATACTCAAACATTGAGCCATTTACAGAGAAAGAACTTAGTCTCCATTATAAGAATAACTCCCCGTTCCTTACTGTGACTCGTTTGGAGCGACTTATTGAAGTATCTCATTGGGGTAACATTGCTATTGAAGAAATCATTGAAATTGAACATAGTGGGGCAAAGTTGAAAGGCCCATTTTCAAGGTATGACTATCAACAGGACCACCACAGTGGACCCGCAAGTGTAAGGTCTTACAAGACTCTACTGCCAGCATCAGCCTCTGATGTTTATTACCGAGACACTAATGGTAACATCTCAACATCCAACATGAAAGTAAAGAAAGATTCAGTTGAATTGGACTTAAGGCCCAGATACCCACTGTTTGGTGGTTGGAGGTCACACTACACATTAGGCTACAATGTCCCTAGCTACGAATACCTGTACCACTCTGGAAATGAATATTTACTTAAGATGAGAGCGGTGGATCATGTTTTTGATGATATGCAAATAGATGAACTTGTCACAAAAATTATATTACCCGAAGGCTCAACAAGTATTAAAATTAATTTCCCATTTGCTGTCACCAGACTGCCCGATAGTCTGCACTACACATATCTAGACACCAAGGGTCGTCCAGTGATTACATTCAAAAAGAGCAATGTCGTTGAAAATCACATACAAGATTTTCAACTTCGTTACACATTCCCGAGAATCCTGATGTTACAAGAGCCTCTGTTAGTGGTTGGCTTCCTCTATGCGTTATTCTTGTGTGTTATTGTGTATGTACGGCTGGATTTCTCCATACACAAGCCTGAACATCTAAAAGAGTAA

Protein sequence:

>DPOGS201416-PA
MAKIQFFAFILLYVASNCISVNVDNISSDVRIKNVDRSIDISSQLVKTSSKITFENTGKVPVKQFLLAVEGAAKNNLAFIGARDSNNKDLRLVETTVKGYDNVKFWRVELKESVNAAASAVITSDAVFSKALLPHPTAITQQEDQLVKYNSNLYFYSPYKVLTQKTTVVLNTKSVESFTKVKPFSQQDGNINYGPYSNIEPFTEKELSLHYKNNSPFLTVTRLERLIEVSHWGNIAIEEIIEIEHSGAKLKGPFSRYDYQQDHHSGPASVRSYKTLLPASASDVYYRDTNGNISTSNMKVKKDSVELDLRPRYPLFGGWRSHYTLGYNVPSYEYLYHSGNEYLLKMRAVDHVFDDMQIDELVTKIILPEGSTSIKINFPFAVTRLPDSLHYTYLDTKGRPVITFKKSNVVENHIQDFQLRYTFPRILMLQEPLLVVGFLYALFLCVIVYVRLDFSIHKPEHLKE-