Monarch geneset OGS2.0

DPOGS206782
TranscriptDPOGS206782-TA1578 bp
ProteinDPOGS206782-PA525 aa
Genomic positionDPSCF300001 - 5681267-5688110
RNAseq coverage340x (Rank: top 34%)
Annotation
HeliconiusHMEL0038441e-13179.93% 
BombyxBGIBMGA000564-TA0.084.58% 
DrosophilaCG11874-PA6e-16561.10% 
EBI UniRef50UniRef50_E2BM442e-17061.87%Endoplasmic reticulum mannosyl-oligosaccharide 1,2-alpha-mannosidase n=8 Tax=Formicidae RepID=E2BM44_HARSA
NCBI RefSeqXP_971080.10.066.81%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|910788262e-18066.81%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastxgi|910788262e-17765.67%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
Gene OntologyGO:00160201.5e-272membrane
GO:00055091.5e-272calcium ion binding
GO:00045711.5e-272mannosyl-oligosaccharide 1,2-alpha-mannosidase activity
KEGG pathwaytca:6597090.0 
 K01230 (MAN1)maps-> High-mannose type N-glycan biosynthesis
    Protein processing in endoplasmic reticulum
    N-Glycan biosynthesis
InterPro domain[27-522] IPR0013821.5e-272Glycoside hydrolase, family 47
Orthology groupMCL14464 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206782-TA
ATGGCTAGCTTAGATACAAGGATAGACCTAGGAGGTCAACAAAATTCCTGGAATGGATCTGATGGCTCCTTAAAACATTCCGAACACTTAATCTCACATAGAAAAATAGTTAGGGAGTTGTCAAACGCTTCGTCACTTGTAACCGATCCCCCACCAAATGCATCGCCCCTGAAAAAAATTAATATTGACAGTGAAATTATACAAAAGCCATTATTCACAGGTCCGGGTAACGCTCGTCAGTATGCAGTAGTTGAATCATTCAAGCACGCATGGAAAGGCTATAAGGAACACGCTTGGGGTCACGACAACTTAAAACCAGTCTCCGGCATGGCATTTGATTGGTTTTCACTGGGTTTGACCATAGTCGATGGATTGGACACGGCTTACATCATGGGACTGAATGAAGAATTCCAAGAAGGCAAGGAGTGGATCAACAACGAGCTGATATTCACAAAGCAGAAGGACGTTAATTTCTTTGAAGTGACCATTAGAGTGCTGGGTGCTCTTCTAACTAATTATCACTTTACAGAAGATAAAATGTTTTTAGATAAAGCGAAGGATCTCGGTGAGCGACTGATGTCAGCGTTCTCATCTCCGTCCGGGATCCCGTACTCAGACGTGAACCTCGGCTCGCGGACGGCTCACGCCCCTGAGTGGGCTCACTACAGCACCACCGCTGAGGCCACCACGATACAACTAGAGTTCAGGGAGCTGTCTAGATCATCAAACAATCCTGTATTCGAGGATGCTGCAGCTGCGGTGTCTGAAAAGATTCATCAACTGCCAAAAAAGCACGGCCTGGTGCCCATCTTCATCAACCCTAACACTGGTCAGTTTGCACCTCACGCCACCATCACGTTGGGGGCACGTGGAGACAGCTATTACGAATATCTATTGAAGCAATGGCTTCAAACCGGAAAGACTATAACTTATCTGGTGGATGATTACATGACTGCTATAGAGGGCGTGAGAGAGTACCTCGCTAAACGTTCATCGCCAAACAAAAGATTATTTATCGGCGAATTATCCTCTGGTTCTGAGGCATTCAATCCCAAAATGGACCATTTAACATGTTTCCTCCCCGGTACGTTGGCGCTGGGTCATATGAACGGTCTACCCGACTGGCACATGACCATGGCCGAAGAATTGCTTTACACCTGCTACCTGACTTACGCTGCCCACCCTACGTTCCTAGCCCCGGAGATCACACATTTTAATATGGTGAGTACGACAGAGGACATGTACACAAAGACAGCTGATGCTCATAATCTACTGAGGCCCGAATTCGTCGAAAGCTTATGGTATATGTATCAAATAACTGGCAACACCACATATCAAGACTGGGGATGGCAGATATATCAGAGTTTCGAGAAATACGCGAAAGTACCAAATGGATACACATCCCTTAACAATGTGAAATCTGAGAAACCAGTACTAAGGGACATGATGGAATCATTTTTTCTCTCTGAAACACTCAAATATCTGTACCTTCTGTTTAGTGATGATAGATTTATAATTGATTTGAATAAATACGTCATCACTTCTGAAGCACATCCATTGCCAATACACAAGAATTAG

Protein sequence:

>DPOGS206782-PA
MASLDTRIDLGGQQNSWNGSDGSLKHSEHLISHRKIVRELSNASSLVTDPPPNASPLKKINIDSEIIQKPLFTGPGNARQYAVVESFKHAWKGYKEHAWGHDNLKPVSGMAFDWFSLGLTIVDGLDTAYIMGLNEEFQEGKEWINNELIFTKQKDVNFFEVTIRVLGALLTNYHFTEDKMFLDKAKDLGERLMSAFSSPSGIPYSDVNLGSRTAHAPEWAHYSTTAEATTIQLEFRELSRSSNNPVFEDAAAAVSEKIHQLPKKHGLVPIFINPNTGQFAPHATITLGARGDSYYEYLLKQWLQTGKTITYLVDDYMTAIEGVREYLAKRSSPNKRLFIGELSSGSEAFNPKMDHLTCFLPGTLALGHMNGLPDWHMTMAEELLYTCYLTYAAHPTFLAPEITHFNMVSTTEDMYTKTADAHNLLRPEFVESLWYMYQITGNTTYQDWGWQIYQSFEKYAKVPNGYTSLNNVKSEKPVLRDMMESFFLSETLKYLYLLFSDDRFIIDLNKYVITSEAHPLPIHKN-