Monarch geneset OGS2.0

DPOGS212698
TranscriptDPOGS212698-TA786 bp
ProteinDPOGS212698-PA261 aa
Genomic positionDPSCF300012 - 853210-853995
RNAseq coverage2x (Rank: top 92%)
Annotation
HeliconiusHMEL0095581e-1024.79% 
BombyxBGIBMGA001608-TA3e-3229.28% 
DrosophilaMgat2-PA7e-2345.38% 
EBI UniRef50UniRef50_E2BYB21e-3033.45%Alpha-1,6-mannosyl-glycoprotein 2-beta-N-acetylglucosaminyltransferase n=8 Tax=Formicidae RepID=E2BYB2_HARSA
NCBI RefSeqNP_001014684.19e-3534.13%Mgat2, isoform B [Drosophila melanogaster]
NCBI nr blastpgi|165066111e-3334.13%UDP-GlcNAc:alpha-6-D-mannoside beta-1,2-N-acetylglucosaminyltransferase II [Drosophila melanogaster]
NCBI nr blastxgi|165066112e-3334.13%UDP-GlcNAc:alpha-6-D-mannoside beta-1,2-N-acetylglucosaminyltransferase II [Drosophila melanogaster]
Group
Gene OntologyGO:00084555.7e-45alpha-1,6-mannosylglycoprotein 2-beta-N-acetylglucosaminyltransferase activity
GO:00057955.7e-45Golgi stack
GO:00160215.7e-45integral to membrane
GO:00093125.7e-45oligosaccharide biosynthetic process
KEGG pathwaydme:Dmel_CG79213e-34 
 K00736 (MGAT2)maps-> N-Glycan biosynthesis
InterPro domain[1-251] IPR0077545.7e-45N-acetylglucosaminyltransferase II
Orthology groupMCL27827 Specific divergent
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212698-TA
ATGCAAATTTTTTATCCCTACTCTCTTCAACTTCATCCAAATGAATTTCCCGGTGTAGACCCTAACGACTGCAATTCATACAAGCGACGAATTAAGAATGCGCAAAAATTATCCCATTGCGCCAAAAGAGACGCTAGCATTACGGAACACAAGCAGCATTGGTGGTGGAAGGCAAACTTTGTCTTTGAACACGTCCGTCTGATTCGGAAACATACAGGACCCTTTATATTCCTCGAGGAAAATAATTATGTCGCTCCAGATTTATTAATTATGTTCCATTGCGCTCTAGAAACATTTAACTATTTTTCTCACATAGAAGTATTGTCTTTTGGAGGGCCTTTGAATGTTATAAATATGAATCTATTATCTGTAGAGCCATGGCGGCCGCCGTTTGATTTAGGCTTAGCTTTTAATAAAACTACATGGAGAAAGATATTTTCTTATTCATCCCATTACTGTATGTTCGATGACAGTAGCTGGAGTTACTCCATGTGGAATCTGTTTGGTAACTTTCCGAAGGGTTACGTCACTATGGCGAGGTTTATGACACCGAGAGTATTAAATACCAAAGAGATCGTACACTCTGAACAGAAATTTAAAGAATATGTCGGCGGTTTTAATACATTAAATGTATTTTGTAAGAAACTTAAAGCTGTTTTTCTTTTTGGACCTGAAGGTGTTGTTGAAAGGGCACACAAATGTCCTCCGAAAGGTGACGGTGCCTGGAATGATCTGCGTGATCAGCTGTTATGTTTGGATCCGCTAATGAGCACGACTACTGAGTAG

Protein sequence:

>DPOGS212698-PA
MQIFYPYSLQLHPNEFPGVDPNDCNSYKRRIKNAQKLSHCAKRDASITEHKQHWWWKANFVFEHVRLIRKHTGPFIFLEENNYVAPDLLIMFHCALETFNYFSHIEVLSFGGPLNVINMNLLSVEPWRPPFDLGLAFNKTTWRKIFSYSSHYCMFDDSSWSYSMWNLFGNFPKGYVTMARFMTPRVLNTKEIVHSEQKFKEYVGGFNTLNVFCKKLKAVFLFGPEGVVERAHKCPPKGDGAWNDLRDQLLCLDPLMSTTTE-