Monarch geneset OGS2.0

DPOGS201233
TranscriptDPOGS201233-TA1284 bp
ProteinDPOGS201233-PA427 aa
Genomic positionDPSCF300037 - 256499-257869
RNAseq coverage11x (Rank: top 83%)
Annotation
HeliconiusHMEL0095586e-1730.67% 
BombyxBGIBMGA001608-TA6e-4529.60% 
DrosophilaMgat2-PA3e-4044.12% 
EBI UniRef50UniRef50_UPI0002061D5A1e-5033.24%UPI0002061D5A related cluster n=3 Tax=unknown RepID=UPI0002061D5A
NCBI RefSeqXP_969675.18e-5333.70%PREDICTED: similar to Mgat2 CG7921-PB [Tribolium castaneum]
NCBI nr blastpgi|910944332e-5133.70%PREDICTED: similar to Mgat2 CG7921-PB [Tribolium castaneum]
NCBI nr blastxgi|910944331e-5333.15%PREDICTED: similar to Mgat2 CG7921-PB [Tribolium castaneum]
Group
Gene OntologyGO:00084553.1e-53alpha-1,6-mannosylglycoprotein 2-beta-N-acetylglucosaminyltransferase activity
GO:00057953.1e-53Golgi stack
GO:00160213.1e-53integral to membrane
GO:00093123.1e-53oligosaccharide biosynthetic process
KEGG pathwaytca:6581732e-52 
 K00736 (MGAT2)maps-> N-Glycan biosynthesis
InterPro domain[75-241] IPR0077543.1e-53N-acetylglucosaminyltransferase II
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201233-TA
ATGTCCATATTGAAAAGGATAAAGACAAGAGTGATAGACGCAAAGTTTTACCATAGCATCGCATTTGGCAACAGCTTGCTGTTCAGTGTGATATTTTCCATGTTTATTCTATTGTACATGTCTATACGAGCAAATCGGCAAGATGTATTGAAAGATCTTTCGGAAAATTCACAAAAATTCATACTTTACAAAGAGAAGACATATCCGATTTTGAACAGCACTGAAGCGAGACGCCTTATAACTGAATATAATCAGGAAGAAAAAATATCGAATATAGAGACTTTTGGACCTATAAAGAATAATACTGTTGTTCTAGTCGTCACAGTACACAAATTACATGAATCTTTGAAATATCTTATAGCATCACTGAGCGAACTAGACGCCATAGAAGAAACACTATTAGTATTCTCACACTTAACCTATGACCAAGAATTGTACGACTTCATTCAAACCATAGACTTCTGTCGCGTTTGGCAAATATTTTATCCATATACTCTGCAGGTTTACACCGACGCTTTTCCCGGTTTTAGTAAAAATGACTGTCCCACAAATATGAAACACAAGAAAGCCGAAGCACTAAAGTGTATTGGAGCACAAACTCCGGACGTCCATGGAAAATATAGACAGCCATCGAAAGCCCAGGAAAAACATTATTGGTGGTGGACGGCGAATAGGGTGTTCGAACATCTTCTTTCTTTTAACAATCAAAACGGAGTTGTGCTGGAGTTCATATATTTAGGGCTGCACAAGTGGTCGTTGGAAAATGATACATATGGCATAGATATTACGACATGGGATCCCAAACAGCATTCTAGCGTGCTCGCATTTGATGTAAGCGTTTGGAATAGTATCATCCAACACTACGACCTATTCTGTGAGGTGGATGACGCGTCGTGGTCTCGGTCCTTACTGTACATCTCATTGAATAGAAGAGATAATAAAAGATTCAAAGTGGCCTACACTACTATACCTAGAGCTCTAAAGACTACTTTATGCTCATTTAATGGGTTTTATGAATCCTGTAATGTCGAGGATAATGTAGTAAACGCTCTGAGGATGCAAAGACATTTGAAGAGTAATCTTTTTCCGCCATATCTAGAGGTGTACACACATATTGAACTAGAAGACGACGAGTTCATGTTTTTTGACGTTTTAGAAGGCGAGGGCGGCTGGAACGACCCCCGCGACAAAGCTTTGTGCTCGGCCAATATAACAGAAAATACAGTGAAAAAGCTACTTATGGACATGTCCAGGGAATTTAAGGATGCGTATAAAAGTTTTTGA

Protein sequence:

>DPOGS201233-PA
MSILKRIKTRVIDAKFYHSIAFGNSLLFSVIFSMFILLYMSIRANRQDVLKDLSENSQKFILYKEKTYPILNSTEARRLITEYNQEEKISNIETFGPIKNNTVVLVVTVHKLHESLKYLIASLSELDAIEETLLVFSHLTYDQELYDFIQTIDFCRVWQIFYPYTLQVYTDAFPGFSKNDCPTNMKHKKAEALKCIGAQTPDVHGKYRQPSKAQEKHYWWWTANRVFEHLLSFNNQNGVVLEFIYLGLHKWSLENDTYGIDITTWDPKQHSSVLAFDVSVWNSIIQHYDLFCEVDDASWSRSLLYISLNRRDNKRFKVAYTTIPRALKTTLCSFNGFYESCNVEDNVVNALRMQRHLKSNLFPPYLEVYTHIELEDDEFMFFDVLEGEGGWNDPRDKALCSANITENTVKKLLMDMSREFKDAYKSF-