Monarch geneset OGS2.0

DPOGS206312
TranscriptDPOGS206312-TA1038 bp
ProteinDPOGS206312-PA345 aa
Genomic positionDPSCF300082 - 707331-712618
RNAseq coverage298x (Rank: top 37%)
Annotation
HeliconiusHMEL0126185e-5460.45% 
BombyxBGIBMGA014130-TA2e-15280.59% 
DrosophilaC1GalTA-PA2e-11257.01% 
EBI UniRef50UniRef50_F4X6V29e-11559.38%Glycoprotein-N-acetylgalactosamine 3-beta-galactosyltransferase 1 n=10 Tax=Bilateria RepID=F4X6V2_ACREC
NCBI RefSeqXP_972808.27e-11458.86%PREDICTED: similar to Glycoprotein-N-acetylgalactosamine 3-beta-galactosyltransferase 1 (Core 1 beta1,3-galactosyltransferase 1) (Core1 UDP-galactose:N-acetylgalactosamine-alpha-R beta 1,3-galactosyltransferase 1) (Core 1 beta3-Gal-T) (C1GalT1) (Core 1 O-glyc [Tribolium castaneum]
NCBI nr blastpgi|3454830173e-11763.27%PREDICTED: glycoprotein-N-acetylgalactosamine 3-beta-galactosyltransferase 1-like [Nasonia vitripennis]
NCBI nr blastxgi|3454830177e-11963.45%PREDICTED: glycoprotein-N-acetylgalactosamine 3-beta-galactosyltransferase 1-like [Nasonia vitripennis]
Group
Gene OntologyGO:00160203.4e-13membrane
GO:00167573.4e-13transferase activity, transferring glycosyl groups
KEGG pathwaynvi:1001206005e-112 
 K00731 (C1GALT1)maps-> O-Glycan biosynthesis
InterPro domain[91-258] IPR0033783.4e-13Fringe-like
Orthology groupMCL14238 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206312-TA
ATGTACCCTGTGCAGGACGGCCGGATGGGGCGGCGTTTCGTTCTAACACTGGTGATCGGAATATCGGCTGGTTTTAGTTTCGCGTACATTCTGTTAACCTCGGCCGGCTTCACACGGGATGTAGCCTGGTCCTACAGAGATTCAGCAAGAGATCTCGAAAAACATCCAATACCGAGCGTCATAGATCACGGCAAAGACGAGCCCGCTCATAGAGATGAAGACAGAACTGTGGCCGATGAATTGGCTAAGCGAGTACGCGTTCTCTGCTGGGTTATGACACAGCCGAGTAACCATAAGAAAAAGGCTATCCATGTTAAAGCTACGTGGGGGAAGAGATGCAATAAACTGTTGTTTATGAGCACCGTCGAAGATGAGAGTTTGCCATCAGTGAAGCTACCAGTGTCAGAAGGAAGGGATTATCTTTGGGCGAAAACTAAAGCTGCCTTCAGATACGTTTACGAACATCACAGGAGAGACGCAGACTGGTTCCTTAAAGCTGATGACGACACGTATGTGGTAGTAGAGAACCTGAGGTACATGCTGTCAGAGCACGACAGCAAGGAACCGATGTATTTCGGATGTAGATTCAAACCATTCACCTCGCAGGGCTACATGAGCGGCGGGGCTGGGTACGTTTTAAGCCGAGCGGCTCTGGACAAGTTCGTGAGGAACGGTCTGCCGTCACCACACCTGTGTAAGGCGGGCGACCACGGGGCCGAGGACGCCGAGATGGGTATATGCCTTCAGCACCTGGGCGTTAAGGCGATGGATTCGCGGGATTCTCTCCAGCGGGGACGATTCTTTCCCTTCGTCCCTAAGGATCATTTGTTCCCCAACAAGGATAAAGGCTTTTGGTACTGGCAGTACATATACTATCCCACTGATGAGGGTCTAGACTGTTGTTCCGACCACGCGGTTTCCTTCCACTACGTGAATCCTGAACAGATGTACGTATTGGACTATCTGATATACCACCTGAGACCATACGGCATCAACTACAGGGGCTCCATACCCAGGAACGACACTGACGTTAGATAG

Protein sequence:

>DPOGS206312-PA
MYPVQDGRMGRRFVLTLVIGISAGFSFAYILLTSAGFTRDVAWSYRDSARDLEKHPIPSVIDHGKDEPAHRDEDRTVADELAKRVRVLCWVMTQPSNHKKKAIHVKATWGKRCNKLLFMSTVEDESLPSVKLPVSEGRDYLWAKTKAAFRYVYEHHRRDADWFLKADDDTYVVVENLRYMLSEHDSKEPMYFGCRFKPFTSQGYMSGGAGYVLSRAALDKFVRNGLPSPHLCKAGDHGAEDAEMGICLQHLGVKAMDSRDSLQRGRFFPFVPKDHLFPNKDKGFWYWQYIYYPTDEGLDCCSDHAVSFHYVNPEQMYVLDYLIYHLRPYGINYRGSIPRNDTDVR-