Monarch geneset OGS2.0

DPOGS206745
Transcript: DPOGS206745-TA (1323 bp)
Protein: DPOGS206745-PA (440 aa)
Genomic position: DPSCF300316 - 54082-65360
RNAseq coverage: 173x (Rank: top 50%)
Annotation
Source            Hit                 E-value    Identity  Description
Heliconius        HMEL011176          0.0        75.45%
Bombyx            BGIBMGA009702-TA    8e-175     63.27%
Drosophila        Mgat1-PA            2e-112     50.54%
EBI UniRef50      UniRef50_F8WLR8     2e-177     67.72%    Acetylglucosaminyltransferase n=1 Tax=Bombyx mori RepID=F8WLR8_BOMMO
NCBI RefSeq       XP_393592.3         2e-124     48.86%    PREDICTED: similar to UDP-GlcNAc:alpha-3-D-mannoside-beta-1,2-N-acetylglucosaminyltransferase I CG13431-PA [Apis mellifera]
NCBI nr blastp    gi|350536319        5e-177     67.72%    acetylglucosaminyltransferase [Bombyx mori]
NCBI nr blastx    gi|350536319        2e-175     67.95%    acetylglucosaminyltransferase [Bombyx mori]
Group
Gene Ontology     GO:0003827          2.4e-172             alpha-1,3-mannosylglycoprotein 2-beta-N-acetylglucosaminyltransferase activity
                  GO:0006487          2.4e-172             protein N-linked glycosylation
                  GO:0000139          2.4e-172             Golgi membrane
KEGG pathway      ame:410105          6e-124
                  K00726 (MGAT1)                           maps-> N-Glycan biosynthesis
InterPro domain   [98-439] IPR004139  2.4e-172             Glycosyl transferase, family 13
Orthology group   MCL13216                                 Single-copy universal gene

Nucleotide sequence:

>DPOGS206745-TA
ATGCGTGTGAATCTACGCAAAATCATTTTAATAAGCGGTGGATTCATCTTGCTATGGCTTGTCGTTTCCTATTCCTCGTTTTTCGTTCCCGTCTCAAAGAAACGGGGAAATATTGAAAAAATAGAAACAGAATTAGACAATTTACAAAATAAAATGTTAGATCAGTTAAGTGATAGCTATAAATTGTTAGATAAAGTTAAACAGCATCTCAGGAAGACTGATCATCAAGAGATTTCAAAAGATTCACCAATACCCGGGAACCGAGATGGTTTTGTACAAACTGGCAAGGGTCCTGTCCTGCCGATTTTAGTGATAGCGTGCGACAGAGTCACGGTCAGGAGGTGTTTGGACAATTTAGTGAAATTCAGACCAGACAAAAATACATTTCCTATTATTGTTAGCCAGGACTGTGGGCACAATGAGACCTACCAAATAATAAAATCGTATACGGACACCGATCCTACCATAACGGTAGTGAGACAGCCAGACCTGTCAGAGATTCCTCTCACCAGGGCCAAAGTGAAGTTCAAAGGGTACTACAAAATAGCTAGGCATTTCAGGTTCGCGTTGAACTACGTGTTCGTAACTCTGAAACACGAGGCTGTGATCATTGTTGAAGATGATCTGGACATTTCGCCGGACTTCTTTGAATATTTCCTCGGGACATACCCTCTGTTGTCCAAAGATCCGACGATATGGTGTATATCAGCGTGGAACGACAATGGAAAGAAGCAGTTAATAGATCTTTCTCGGCCGGAGCTCCTTCATAGGACGGATTTCTTCCCAGGACTTGGATGGCTCCTGAAGAGGGAGACCTGGTTACAGCTGGAACCGAAATGGCCGCAAGCCTTCTTTGACGATTGGCTCCGTGATCCAGAGAACACACAAGGTCGGGCTTGCATCAGACCGGAAGTATCAAGGACGTACAGCTTTGGAAAGGTGGGCGTCAGTAAAGGACTGTTTTTCGATATGCACCTGAGATATATGCAGTTGAACATGGAATACATAGAGTTCACCAAGCTGAACCTCACCTACTTACTTAAGGATGTTTACGACGATCACCTCACGACGCTAGTGGCGTCACTTCCGGAACAGACAGCTGAGGAGGCGAAGTCGCGATCGGGAGTCGAACCCGTTAAAGTGACTTACACCAGCGCGAAAACCTACCAGAAGGCGGCCAAGAAATTAGGACTCATGGATGATTTTAGGAGTGGCATACCTCGTACAGCCTACCGTGGCATAGTGACGTGTTACATCCAAGGAAGAAGAGTATACCTCGCGCCCGGGTACCAATGGACCAAATATGACCCAACGTGGGGGTGA

Protein sequence:

>DPOGS206745-PA
MRVNLRKIILISGGFILLWLVVSYSSFFVPVSKKRGNIEKIETELDNLQNKMLDQLSDSYKLLDKVKQHLRKTDHQEISKDSPIPGNRDGFVQTGKGPVLPILVIACDRVTVRRCLDNLVKFRPDKNTFPIIVSQDCGHNETYQIIKSYTDTDPTITVVRQPDLSEIPLTRAKVKFKGYYKIARHFRFALNYVFVTLKHEAVIIVEDDLDISPDFFEYFLGTYPLLSKDPTIWCISAWNDNGKKQLIDLSRPELLHRTDFFPGLGWLLKRETWLQLEPKWPQAFFDDWLRDPENTQGRACIRPEVSRTYSFGKVGVSKGLFFDMHLRYMQLNMEYIEFTKLNLTYLLKDVYDDHLTTLVASLPEQTAEEAKSRSGVEPVKVTYTSAKTYQKAAKKLGLMDDFRSGIPRTAYRGIVTCYIQGRRVYLAPGYQWTKYDPTWG-
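
The transcript and protein records above are consistent with each other: 1323 bp is 441 codons, that is 440 amino acids plus the terminal TGA stop codon, which the protein record writes as a trailing "-". The sketch below is one way to check this by translating the coding sequence. It assumes the standard genetic code (NCBI translation table 1); the function name translate and the stop_symbol parameter are illustrative choices and are not part of the record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position. Assumed here; the record itself
# does not state which table was used.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First eight codons and last four codons of DPOGS206745-TA.
print(translate("ATGCGTGTGAATCTACGCAAAATC"))  # MRVNLRKI
print(translate("ACGTGGGGGTGA"))              # TWG-
```

Passing the full DPOGS206745-TA sequence to translate should give a 441-character string (440 residues plus the stop symbol) that can be compared directly with DPOGS206745-PA.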