Monarch geneset OGS2.0

DPOGS201579
TranscriptDPOGS201579-TA2412 bp
ProteinDPOGS201579-PA803 aa
Genomic positionDPSCF300152 - 369434-377704
RNAseq coverage251x (Rank: top 42%)
Annotation
HeliconiusHMEL0081060.070.92% 
BombyxBGIBMGA012142-TA0.064.59% 
DrosophilaCG1597-PB0.049.44% 
EBI UniRef50UniRef50_A9UNC00.049.56%RE03215p n=26 Tax=Neoptera RepID=A9UNC0_DROME
NCBI RefSeqXP_972740.10.052.18%PREDICTED: similar to CG1597 CG1597-PA [Tribolium castaneum]
NCBI nr blastpgi|910917000.052.18%PREDICTED: similar to CG1597 CG1597-PA [Tribolium castaneum]
NCBI nr blastxgi|910917000.050.92%PREDICTED: similar to CG1597 CG1597-PA [Tribolium castaneum]
Group
Gene OntologyGO:00045732.9e-263mannosyl-oligosaccharide glucosidase activity
GO:00093112.9e-263oligosaccharide metabolic process
GO:00038248.4e-48catalytic activity
KEGG pathwaytca:6614920.0 
 K01228 (GCS1)maps-> Protein processing in endoplasmic reticulum
    N-Glycan biosynthesis
InterPro domain[54-803] IPR0048882.9e-263Glycoside hydrolase, family 63
[331-799] IPR0089288.4e-48Six-hairpin glycosidase-like
Orthology groupMCL15204 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201579-TA
ATGGTCAAACATAGGAAGACTGTCCAGTATAAACATGCCAACAACGAGTCCAGCAGCAGTTCCGGGGCAAGCGATACATCTATTGCAGGACGTGCATTTCACTTGTTATCAGTATGGAAGACTGGTGCCGGCTTCATATGTCTCGCGATAGCTGTGTACGTGGGCACCCTGGGTTACCTGGAAACGAGGGTCAATACTCCGCTCGATGAGGAGAAGGTGGTTCAAGAAACGGGACTGTCGGTCCCAGAGCGTTACTGGGGTTCCTATAGGCCGGGGGTATACTTTGGAATGAAGTGCCGGGAGCCTCGTTCTCCAGTTTTCGGAATGATGTGGTATGAACTAGCGGCAGCAGCTCATAAGGGGATCAGGCATTTGTGCGAACAGAATGATAACCTGCCAACGTACGGCTGGCTGCGTCACGATGGTTTGACCTTCGGTGAGCAGCTGATATCAGATCCGCCCCACCAGATACACACATCCTTCATCAAGACCCCGGGGGGAGAACACGGTGGACATTGGACGGCCAGAATTAATATCACAGCAACGGGTAAATCCGCGCCGCCATTAGTTCTGATCTGGTACGCAGCCCTGGACGAGTCTCTGGGGTCGGGATCTCATTCCCGTCTCTGGGCGGAGCGTGGCACCCTGATGGGGCACACCCCCGCCCTGGGGAGATTCAGGGTGCATCTGGTGCCGCATTCTGGCATACTCATCCACTCGTCGCTCTCGGAGGCTCATTCCGCTGGTCTCCACCTCCTCAAGGAGAAGTTCTACTCGCTGCTGAGGATCGAGGACCAGCCGTACCTTGGAAGGCTGGCTGTGCTGGGGCCGGATGAAGAGATTGCTGATTCGGACAAGGAGGTGAATTTCGTGCCTATCCAGATGTTGGTGGAGACGCCGTTCGTTCTGGACGTGGTGTACACCACCGAGGACCTGCCGACACCGCCTGTCCGAGGAGACGAGTACACCAAAACCATGGAGAAGCTGAAGATGGGTTATGATGAGGAATTCGAGAGGATCTTCAACCTGGAGAAGAAGGGTTATAGCGCCCAGGATATATCGATAGCAAGGGCTGCGCTGTCTAACATGGTGGGGGGTATGGGCTACTTCTATGGCGCGAGCAGGGTGCAGTCCAAGTACACCAGGGAACCGGTCCCGTACTGGAGAGCTGCGCTTCATACGGCTGTACCCAGCAGATCGTTCTTCCCGCGCGGGTTCCTGTGGGATGAGGGTTTCCACCTGCTGCTGGTGTCGTGGTGGTCGGCTGACCTGGCCCTGGACGTGGCCGCTCACTGGCTGGACCTCATCAACGTGGAGGGTTGGATACCGCGGGAACAGATACTAGGAGTCGAGGCGTTGGCGAGGGTGCCCAAGGAGTTCGTGGTGCAGCACAATTCTGCTGCAAACCCTCCAATGCTGCTGCTGTCTCTGGCCAGGCTGGTGAGGTCCAGGCCGCATCTGTTCACCGAGACGCCCTACAGACAGACCTTGGACAGAATGTTCCCTAGGCTGCAGGCGTGGTACCAGTGGTTCCTGACGACTCAGAAGGGAGACGAGCCCACCACGTACAGGTGGCGCGGCCGGGAGGATGACGGGTTCCAGCTCAACCCGAAGACCTTGACGTCTGGACTCGACGATTACCCCAGGGCGTCTCATCCCAGCAGTATCGAGCGCCACGTGGATCTCCGCTGCTGGATGTACGCTGCCGCGGATGCTATGGCGGTCATAGCGCGCGCCCTGGACCGGGATACTGACAAGTTCGAGGATATGAAGGAGCAGCTGGGTAACGAAGACCTGCTGAACGAGTTGCACTGGTCGCCGCACACGCAGACATACGCCGACTACGGTCTACACACGGACGGCGTGAGGTTCGTCCGCCAGCAGGCCAGGGACCCTCAGGAGGGAGCCAGGGTCGTGAGGAGCGTCACCATAGCGCCGCAGCCGAGGCTGGTGACGTCTGCGTTCGGGTACGTGTCACTATTCCCCATGCTGATGAAAGTTCTCAAACCCGAGAGCGACAAGCTGGGGAATATCCTGGAAATGCTGGACAAGCCCGACCTGCTGTGGTCTCCGTACGGACTGAGATCTCTATCCAAGCTGTCTCCGCTGTACATGAAGAGGAACACAGAGCACGACCCCCCGTACTGGCGGGGTCAAGTGTGGATCAACATTAACTACCTGGCCATATCAGCCCTCCACCACTACTCGGTCTCTGGGGGACCACACGCTGCGAGGGCGAAGTCCCTGCACCAGAGACTAAGAGATAATGTTGTCAGTAATATCCTGTCGGAGTACAAAAGAACTGGTTACCTCTGGGAGCAGTACTCGGGTGAGGATGGCAAGGGCAGTGGGTGTAGACCGTTCACAGGGTGGACGGCGTTGGTCGTGCTGTTGATGGCTGATGAGTACTAG

Protein sequence:

>DPOGS201579-PA
MVKHRKTVQYKHANNESSSSSGASDTSIAGRAFHLLSVWKTGAGFICLAIAVYVGTLGYLETRVNTPLDEEKVVQETGLSVPERYWGSYRPGVYFGMKCREPRSPVFGMMWYELAAAAHKGIRHLCEQNDNLPTYGWLRHDGLTFGEQLISDPPHQIHTSFIKTPGGEHGGHWTARINITATGKSAPPLVLIWYAALDESLGSGSHSRLWAERGTLMGHTPALGRFRVHLVPHSGILIHSSLSEAHSAGLHLLKEKFYSLLRIEDQPYLGRLAVLGPDEEIADSDKEVNFVPIQMLVETPFVLDVVYTTEDLPTPPVRGDEYTKTMEKLKMGYDEEFERIFNLEKKGYSAQDISIARAALSNMVGGMGYFYGASRVQSKYTREPVPYWRAALHTAVPSRSFFPRGFLWDEGFHLLLVSWWSADLALDVAAHWLDLINVEGWIPREQILGVEALARVPKEFVVQHNSAANPPMLLLSLARLVRSRPHLFTETPYRQTLDRMFPRLQAWYQWFLTTQKGDEPTTYRWRGREDDGFQLNPKTLTSGLDDYPRASHPSSIERHVDLRCWMYAAADAMAVIARALDRDTDKFEDMKEQLGNEDLLNELHWSPHTQTYADYGLHTDGVRFVRQQARDPQEGARVVRSVTIAPQPRLVTSAFGYVSLFPMLMKVLKPESDKLGNILEMLDKPDLLWSPYGLRSLSKLSPLYMKRNTEHDPPYWRGQVWININYLAISALHHYSVSGGPHAARAKSLHQRLRDNVVSNILSEYKRTGYLWEQYSGEDGKGSGCRPFTGWTALVVLLMADEY-