Monarch geneset OGS2.0

DPOGS202247
Transcript  DPOGS202247-TA  1440 bp
Protein  DPOGS202247-PA  479 aa
Genomic position  DPSCF300032 - 818638-823051
RNAseq coverage  9x (Rank: top 85%)
Annotation
Heliconius  HMEL020873  0.0  70.15%
Bombyx  BGIBMGA004898-TA  0.0  84.27%
Drosophila  OstStt3-PA  0.0  68.61%
EBI UniRef50  UniRef50_Q8TCJ2  0.0  66.46%  Dolichyl-diphosphooligosaccharide--protein glycosyltransferase subunit STT3B n=182 Tax=cellular organisms RepID=STT3B_HUMAN
NCBI RefSeq  XP_972341.1  0.0  72.76%  PREDICTED: similar to oligosaccharyl transferase [Tribolium castaneum]
NCBI nr blastp  gi|91086225  0.0  72.76%  PREDICTED: similar to oligosaccharyl transferase [Tribolium castaneum]
NCBI nr blastx  gi|91086225  0.0  72.65%  PREDICTED: similar to oligosaccharyl transferase [Tribolium castaneum]
Group
Gene Ontology  GO:0016020  1.1e-76  membrane
  GO:0006486  1.1e-76  protein glycosylation
  GO:0004576  1.1e-76  oligosaccharyl transferase activity
KEGG pathway  tca:661059  0.0
  K07151 (STT3)  maps-> Protein processing in endoplasmic reticulum
    N-Glycan biosynthesis
InterPro domain  [22-382]  IPR003674  1.1e-76  Oligosaccharyl transferase, STT3 subunit
Orthology group  MCL10156  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202247-TA
ATGGCAATTGGTGCTCTGAAGTATTTACACAGTCTGAATCCTAAAGGACAATGGAAGCAGCTGTTGATTCTTGGTGGTCTGATAGCTGCAGTAACAGTGTTCTTGGCTGTAGTTCTTCTCACATATATGGGAATTATTGCTCCCTGGAGTGGCAGGTTCTACTCCTTATGGGACACAGGATATGCTAAGATTCATATCCCTATAATAGCATCAGTGTCAGAGCATCAACCGACCACATGGTCATCTTTCTTCTTTGACCTCCACGTTCTGGTGTGCACCTTCCCTGTAGGATTGTGGTACTGCATCAAGAATGTCAATGATGAAAGAGTATTTGTGGCGCTGTATGCTCTATCAGCGGTTTACTTCGCTGGTGTAATGGTTCGGTTGATGTTAACTTTAACGCCGGTGGTGTGTGTGTTGGCTGGGATCGCCTTCTCCATACTACTGGACTTGGTGCTTAGAGAGGACGAAACACCAGCACCAGCACAGGACAGCGATGATAAGAACTTGTATGATAAGGCTGGTAAATTAAAGAAGCGTACCCCGGAGCCGGTCGCGGAGACTGGTTTAGGTATGAACGTACGTTCTGGTACCCTCATAGCATTCATGATACTTTTGATGCTGTTCTCCGTTCACTGCACCTGGGCCACCTCCAACGCATACTCCAGCCCTAGCATAGTGCTGGCCAGCTATGGTAATGATGGATCACGTAAAATCTTAGATGACTTCCGAGAAGCTTATGGTTGGCTATCACAGAATACAGCTGAAGATGCCCGGGTGATGTCCTGGTGGGACTACGGTTACCAGATAGCTGGTATGGGCAACAGGACTACCTTAGTGGACAATAATACATGGAATAATTCTCACATAGCGTTGGTCGGGAAAGCCATGGCAAGTAATGAGACAGCTGCCTATGATATCATGACGATGTTGGATGTGGACTATGTGCTGGTGATATTTGGTGGTGCCATCGGTTACTCTGGTGATGATATCAATAAATTCATCTGGATGGTCAGAATAGCTGAGGGCGAACATCCTAAGGATATACATGAAGCGGATTACTTCACAGAGAGAGGAGAGTACAGAATAGACTCAGAGGCATCAAAAACTATGTTAAATTCACTTATGTACAAATTATCATATTATCGTTACGATAGTGGTGGAAGTCCCCCGGGCTTCGATCGCACTCGCGGCGCCCTGCCCGGTCTGCGTGGGTTCAAACTGACGTACCTCGAGGAAGCCTACACCACTGAGCATTGGCTCGTCAGAATATATAGAGTTAAGAAACCAGACGAATTCAACCGACCACGACTACCTCTGGCCAAGAGGACAATACAGACCTCAAATGTCATGTCTAAGAAAACGTCAAAGAGAAGAAAAGGTGCGTTAAAGAACAAGCCAATATTAGTGAAGGGCAAGAAAGTGACCAAGTTGGATTGA

Protein sequence:

>DPOGS202247-PA
MAIGALKYLHSLNPKGQWKQLLILGGLIAAVTVFLAVVLLTYMGIIAPWSGRFYSLWDTGYAKIHIPIIASVSEHQPTTWSSFFFDLHVLVCTFPVGLWYCIKNVNDERVFVALYALSAVYFAGVMVRLMLTLTPVVCVLAGIAFSILLDLVLREDETPAPAQDSDDKNLYDKAGKLKKRTPEPVAETGLGMNVRSGTLIAFMILLMLFSVHCTWATSNAYSSPSIVLASYGNDGSRKILDDFREAYGWLSQNTAEDARVMSWWDYGYQIAGMGNRTTLVDNNTWNNSHIALVGKAMASNETAAYDIMTMLDVDYVLVIFGGAIGYSGDDINKFIWMVRIAEGEHPKDIHEADYFTERGEYRIDSEASKTMLNSLMYKLSYYRYDSGGSPPGFDRTRGALPGLRGFKLTYLEEAYTTEHWLVRIYRVKKPDEFNRPRLPLAKRTIQTSNVMSKKTSKRRKGALKNKPILVKGKKVTKLD-
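
The transcript and protein records above are consistent in length: 1440 bp is 480 codons, that is, 479 residues plus one stop codon, which this record writes as a trailing "-". A minimal sketch of that check follows, assuming the standard genetic code (NCBI translation table 1) applies to this transcript. The names CODON_TABLE and translate are hypothetical helpers for illustration and are not part of the database.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
# Assumption: the transcript is translated with the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` ("-" matches this record's
    protein sequence). Hypothetical helper, not the database's own tool.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(residues)


# First seven codons of DPOGS202247-TA
print(translate("ATGGCAATTGGTGCTCTGAAG"))  # MAIGALK
# Last four codons of DPOGS202247-TA, including the TGA stop
print(translate("AAGTTGGATTGA"))  # KLD-
```

Both fragments agree with the start (MAIGALK) and end (KLD-) of the DPOGS202247-PA sequence. Passing the full nucleotide sequence to translate would give a candidate protein string to compare against the record in the same way.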