Monarch geneset OGS2.0

DPOGS208914
TranscriptDPOGS208914-TA1596 bp
ProteinDPOGS208914-PA531 aa
Genomic positionDPSCF300009 - 444352-455291
RNAseq coverage503x (Rank: top 25%)
Annotation
HeliconiusHMEL0146640.084.52% 
BombyxBGIBMGA002486-TA0.076.38% 
Drosophilaalpha-Man-I-PI0.065.32% 
EBI UniRef50UniRef50_O184980.082.66%Alpha 1,2-mannosidase n=8 Tax=Eukaryota RepID=O18498_SPOFR
NCBI RefSeqNP_511105.20.065.53%alpha mannosidase I, isoform K [Drosophila melanogaster]
NCBI nr blastpgi|22455700.082.66%alpha 1,2-mannosidase [Spodoptera frugiperda]
NCBI nr blastxgi|22455700.083.37%alpha 1,2-mannosidase [Spodoptera frugiperda]
Group
Gene OntologyGO:00160202.7e-276membrane
GO:00055092.7e-276calcium ion binding
GO:00045712.7e-276mannosyl-oligosaccharide 1,2-alpha-mannosidase activity
KEGG pathwaydme:Dmel_CG422750.0 
 K01230 (MAN1)maps-> High-mannose type N-glycan biosynthesis
    Protein processing in endoplasmic reticulum
    N-Glycan biosynthesis
InterPro domain[58-531] IPR0013822.7e-276Glycoside hydrolase, family 47
Orthology groupMCL10424 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208914-TA
ATGCGAAAATTAGTTCGGTTGTTGACCGGGGCTCTCTCGTTCATCGTGACTATGATCTTACTGGTCAGGGTTTCCCAACTCGACAGTGAGACGGATTCGGCAGAGCATCGTATGGAAAGATCAGCGCGATGGGGAAATATGAAAACTTTGGAAAACTTTTCTCAAACAAACATCACTGAGCAACGACAGCTCATTGTAGAGATGATGCGTCACGCCTGGAACAACTATAAGTTATACGCATGGGGTAAGAACGAGCTGAAGCCGATGACAAAACGAGCTCATCTGACCAGTGTATTCGGAGGTGGTGATCTAGGAGCCACCATCGTCGATGGTATGGACACGCTCTACGTCATGGGACTGATAGATGAGTTCAGGGAAGGTAGAGACTGGATAGCAGAACATTTCCACATCAATGAAATTGACTCGGATCTGTCGGTGTTCGAGACCACAATCAGGTTCGTGGGTGGTCTCCTGTCCTGCTACGCCCTGACAGGAGACGCCGTGTTCCGTGATAAGGCTGCCGAGGTAGCCGACGCCCTACTGCCGGCCTTCGAGACACCCACTGGATTACCTTACGCCCTCATAAACCCATCGAATAAGGCCAATCGTCAGTATCACTGGGCTGGCGCTAACAGTATACTGTCTGAAGTCGGGACTCTTCATCTAGAATTCACATACCTGAGCGATGTCACCGGCAAAGACGTGTACAGACAGAAGGTCGATCGTATTCGTGATGTTCTTCACAACATCCAGAAGCCAGAAGGTCTCTATCCCAACTACATCAATCCCAGGAACGGACAGTGGGGGCAGAAGCACACATCTCTGGGAGCGCTAGGCGACTCCTTCTACGAGTATCTGCTGAAGGCCTGGATTCTATCAGACAACGAGGACGTCCAGGCCCGCGAGATGTTCGACGAGGCGATGCAGGCAGCTCTCGATAAGATGCTTAGGACTTCGCCCTCAGGGCTCGCCTACCTCGCGGAGGTCAAGTACGGCAGGATATTTGAAGAGAAGATGGACCACTTGTCGTGTTTCGCAGGTGGTATGTTCGCCCTCGCTTCAACAACAATGTCCAACTCTCTGTCGGAGCGGTACATGGACGTGGCCCGCAAATTGACTCACACCTGTCACGAGAGCTACGACCGCTCGGAAACAAAACTCGGACCGGAGGCTTTCAGGTTCTCTGGCGCTGTCGAGGCTCGCGCTATGAAGAGTAACGAGAAGATGTACCTCCTGCGGCCGGAGACGTTCGAGAGCTACTTCATCATGTGGAGACTCACCAAGGAACAGAAATATAGAGATTGGGGCTGGGAGGCTGTTCAGGCTCTCGAGAAGCACTGTCGAGTGGAGGGAGGGTACACGGGTCTGCTGAACGTGTACCACGCCTCGCCTCAGGGCGACGACGTGCAACAGAGCTTCTTCCTTGCGGAGACGCTTAAGTATCTGTACCTGCTGTTCTCCGAGGACTCCTTACTGCCATTAAACGAATGGGTTTTCAACACCGAGGCCCATCCTCTGCCCATCAAGAACAGGAATCCGCTGTACCGAGCCGCCGACAAGACCGCTCACATTGTGAACGAGAGCAATCAAATTTAA

Protein sequence:

>DPOGS208914-PA
MRKLVRLLTGALSFIVTMILLVRVSQLDSETDSAEHRMERSARWGNMKTLENFSQTNITEQRQLIVEMMRHAWNNYKLYAWGKNELKPMTKRAHLTSVFGGGDLGATIVDGMDTLYVMGLIDEFREGRDWIAEHFHINEIDSDLSVFETTIRFVGGLLSCYALTGDAVFRDKAAEVADALLPAFETPTGLPYALINPSNKANRQYHWAGANSILSEVGTLHLEFTYLSDVTGKDVYRQKVDRIRDVLHNIQKPEGLYPNYINPRNGQWGQKHTSLGALGDSFYEYLLKAWILSDNEDVQAREMFDEAMQAALDKMLRTSPSGLAYLAEVKYGRIFEEKMDHLSCFAGGMFALASTTMSNSLSERYMDVARKLTHTCHESYDRSETKLGPEAFRFSGAVEARAMKSNEKMYLLRPETFESYFIMWRLTKEQKYRDWGWEAVQALEKHCRVEGGYTGLLNVYHASPQGDDVQQSFFLAETLKYLYLLFSEDSLLPLNEWVFNTEAHPLPIKNRNPLYRAADKTAHIVNESNQI-