Monarch geneset OGS2.0

DPOGS209356
Transcript: DPOGS209356-TA, 1152 bp
Protein: DPOGS209356-PA, 383 aa
Genomic position: DPSCF300118 - 308408-309559
RNAseq coverage: 12x (Rank: top 83%)
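The lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `span_length` and `expected_protein_length` are hypothetical, and it assumes the genomic coordinates are 1-based and inclusive, which the record does not state.

```python
def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_bp: int) -> int:
    """Residues encoded by a complete CDS: one per codon, minus the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Values copied from the record.
transcript_bp = 1152
genomic_start, genomic_end = 308408, 309559

print(span_length(genomic_start, genomic_end))   # genomic span in bp
print(expected_protein_length(transcript_bp))    # expected residues
```

Under that assumption the genomic span equals the transcript length (1152 bp), and 1152 bp corresponds to 384 codons, that is, 383 residues plus a stop codon. Equal span and transcript length would be consistent with a coding region that is not interrupted by introns, although the record itself does not say so.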
Annotation
Heliconius: HMEL009558; 9e-15; 33.33%
Bombyx: BGIBMGA001608-TA; 2e-42; 29.37%
Drosophila: Mgat2-PA; 7e-34; 37.02%
EBI UniRef50: UniRef50_F1KTG8; 3e-48; 32.45%; Alpha-1,6-mannosyl-glycoprotein 2-beta-N-acetylglucosaminyltransferase n=1 Tax=Ascaris suum RepID=F1KTG8_ASCSU
NCBI RefSeq: XP_969675.1; 9e-49; 33.42%; PREDICTED: similar to Mgat2 CG7921-PB [Tribolium castaneum]
NCBI nr blastp: gi|324502668; 1e-47; 32.45%; Alpha-1,6-mannosyl-glycoprotein 2-beta-N-acetylglucosaminyltransferase [Ascaris suum]
NCBI nr blastx: gi|91094433; 6e-48; 33.33%; PREDICTED: similar to Mgat2 CG7921-PB [Tribolium castaneum]
Group
Gene Ontology: GO:0008455; 2.9e-66; alpha-1,6-mannosylglycoprotein 2-beta-N-acetylglucosaminyltransferase activity
 GO:0005795; 2.9e-66; Golgi stack
 GO:0016021; 2.9e-66; integral to membrane
 GO:0009312; 2.9e-66; oligosaccharide biosynthetic process
KEGG pathway: tca:658173; 3e-48
 K00736 (MGAT2) maps-> N-Glycan biosynthesis
InterPro domain: [41-366] IPR007754; 2.9e-66; N-acetylglucosaminyltransferase II
Orthology group: MCL27827; Specific divergent
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209356-TA
ATGATTAGATCAAAGATTGAGAAAGATAAATATGAGTTTCCTAAGCATCTTCATAAATATAGAGCCCGCCATAAAATAAAGAAAGGAACTGGTCCCATCGACGCCGAACAACTTAAATTTAAATTGGAACAGTCAAATTTAAGACCAAATATTACTAATTTAAATAAATACAAAACTTCAGTGAAAAACTCGCCAATCTTCCTCATCCAGGTACACAAAGATATCAACAGGCTCCAATATCTCATTATGTCTCTGTCACAGGTAACTGGAATAGCCTCTAGCATGCTCATATTCTCTCACAGCTTTTATAGTGATTCAATTAACCAACTTATTCTAAGTATCGATTTCTGTCAAGTTGTACAAATATTCTATCCTCATTCCCTTCAACTCAATCCTTCTAAATTCCCTGGCGTGGACCCCGACGACTGCCCCTTGAAGGAAAAGAGACGAAACCCAAACTGCCAGGTACGGAATGCAGAACTAACCGAACACAAGCAGCACTGGTGGTGGAAAGCTAACTTCGTGTTCGAACACATGGCTTGGTTGCATCTATATAAAGCACCCATCATTTTCCTAGAAGAGCACAGTTACGTCGCTCCGAGCTTACTACTCATATATCAATATGCAGTGAAGGCGTTTAGCTACTACCCTCACACTGAAGTCCTTTCCTTTGGAAGACCGTTGACGAGAAATGTTGAAATGGATCTGTTGACTATAGAGCCTTGGCGACCGCCGTTCGACGTAGGCTTGGCTTTTAATAAGACAACGTGGAGGAAGATGGTCTCCTTCTCAGCTCAGTACTGTATGTACGATGACAGCAGCTGGAGTTATTCTTTGTTGAACTTATTCCGAATGTTCCCCAAGAGCCACGTGAGCATGATAGGATGTGTGACGCCTAGAGTGTTAAGCACTCGACACTTTAATAACATAGAAGAAAGTTTTAAAAGATATACCAAAATGTTTGCTACTTTGGACGTCTACCCTAAAAAAGTCAAAACTGTGTTTCTTTTCGGTACTGACGGAATTGTTGAAAACTTATACAAGGAACCTCCTTCCGGTAACGGCGGCTGGAGTGATCTGCGAGATCAAGTATTATGCTTGGACCCGCTCGTGAGTACGACAACTGTGGAACCACGTTACTATGACAACTAA

Protein sequence:

>DPOGS209356-PA
MIRSKIEKDKYEFPKHLHKYRARHKIKKGTGPIDAEQLKFKLEQSNLRPNITNLNKYKTSVKNSPIFLIQVHKDINRLQYLIMSLSQVTGIASSMLIFSHSFYSDSINQLILSIDFCQVVQIFYPHSLQLNPSKFPGVDPDDCPLKEKRRNPNCQVRNAELTEHKQHWWWKANFVFEHMAWLHLYKAPIIFLEEHSYVAPSLLLIYQYAVKAFSYYPHTEVLSFGRPLTRNVEMDLLTIEPWRPPFDVGLAFNKTTWRKMVSFSAQYCMYDDSSWSYSLLNLFRMFPKSHVSMIGCVTPRVLSTRHFNNIEESFKRYTKMFATLDVYPKKVKTVFLFGTDGIVENLYKEPPSGNGGWSDLRDQVLCLDPLVSTTTVEPRYYDN-
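
The relationship between the two sequences above can be illustrated with a minimal translation sketch. It assumes the standard genetic code and assumes that the trailing "-" in the protein sequence marks the stop codon; the function name `translate` and the constant names are hypothetical, and this is not necessarily how the database produced its protein sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# at each position (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First six and last four codons of DPOGS209356-TA, copied from the record.
print(translate("ATGATTAGATCAAAGATT"))  # MIRSKI
print(translate("TATGACAACTAA"))        # YDN-
```

The two fragments reproduce the start (MIRSKI) and end (YDN-) of DPOGS209356-PA, which supports reading the transcript in frame from its first base.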