Monarch geneset OGS2.0

DPOGS203028
Transcript        DPOGS203028-TA  1032 bp
Protein           DPOGS203028-PA  343 aa
Genomic position  DPSCF300068 + 564084-569010
RNAseq coverage   274x (Rank: top 39%)
Annotation
Heliconius        HMEL011050        8e-148  69.28%
Bombyx            BGIBMGA012266-TA  4e-124  65.46%
Drosophila        GlcAT-S-PC        1e-81   50.51%
EBI UniRef50      UniRef50_B5A9M8   6e-117  68.98%  Glucuronyltransferase n=3 Tax=Endopterygota RepID=B5A9M8_BOMMO
NCBI RefSeq       NP_001124376.2    1e-117  68.98%  glucuronyltransferase [Bombyx mori]
NCBI nr blastp    gi|226371720      2e-116  68.98%  glucuronyltransferase [Bombyx mori]
NCBI nr blastx    gi|226371720      3e-114  68.98%  glucuronyltransferase [Bombyx mori]
Group
Gene Ontology     GO:0016020  1.3e-121  membrane
                  GO:0015018  1.3e-121  galactosylgalactosylxylosylprotein 3-beta-glucuronosyltransferase activity
KEGG pathway      aag:AaeL_AAEL004974  4e-86
                  K10812 (GlcAT-SP) maps-> Glycosaminoglycan biosynthesis - chondroitin sulfate
InterPro domain   [68-305]  IPR005027  1.3e-121  Glycosyl transferase, family 43
Orthology group   MCL18917  Insect specific

Nucleotide sequence:

>DPOGS203028-TA
ATGAAACATTTTAAGCTGTTTAAGAATATTTTCGCGATGGTCGTGTTCCTTACTGTTTTGTGGTTATTATTTACTTGGGGAAATGTGGGGATATTGAGTTTACATGCAAATGATCCTCTCAGCATTCCCGTGGCTGTGAGAAACAAGCTGTGCTCGGTGAAGTTGTCGGATGGGCGCTCGCACCTGTCTCAAAGTGGCCGCATGATCTACTACGTCACGCCCACCTACCCTCGCCCTGAACAAGTGCCAGAGTTGACCCGTCTCGCTCACACACTCATGCACGTTCCTCGTCTGCATTGGATTGTAGCTGATGATCAGCCAATTTGCTCGGAACTAGTGGGAAATATTCTTAAGCGCACCAGACTGCCGTTCACTCACATATCCAGCCCAAAACCCTTCATCTACAAGAGCAGCAACTTCCCCCGCGGAGTGGCCAACCGGCGGGCTGCTCTCGATTGGCTCCACGAGAACGTGTCTGAAGGGGTGTTGTACTTCGGAGACGACGATAACACGGTGGACCTGCGGCTGTTCGACGAGATCAGGAACACTGAGAAGGTGTCCATGTTCCCAGTGGGACTGATAGGCGACTACGGCGTGTCCTCCCCCGTCGTCAAAGACGGAAAGGTGGTGGGTTTCTATGATTCCTGGCCGGGTGCTCGGTCGTTCCCGGTGGACATGGCGGGCTTCGCGGTCAATGTGGCGATGCTGCGTGAAGGAGCCACTATGCCGTTCGTGGCCGGCCACGAGGAGGACGGCTTCCTCCGCAGCCTGGCCGTCGAGCTGGCGGACATTCAGCCCCTCGCTAAGAACTGCACCAAGATATTGGTCTGGCATACCAAGACCGTGAAACATAAGAAACCCACCGTCAAGGTGGACCTCGACAAGCTGAAGAATACAGGACGGTACCACAACCTCGCCAGCTTGCTCAGAGAGACGTCGTACATGGGCATGGCGGAGACCAGTGCAGATTCCGGGATCAAATCGTTTATAACCAGCAACAGGAAGACCTTTCAAGCTCTCACCGATTTCTGA

Protein sequence:

>DPOGS203028-PA
MKHFKLFKNIFAMVVFLTVLWLLFTWGNVGILSLHANDPLSIPVAVRNKLCSVKLSDGRSHLSQSGRMIYYVTPTYPRPEQVPELTRLAHTLMHVPRLHWIVADDQPICSELVGNILKRTRLPFTHISSPKPFIYKSSNFPRGVANRRAALDWLHENVSEGVLYFGDDDNTVDLRLFDEIRNTEKVSMFPVGLIGDYGVSSPVVKDGKVVGFYDSWPGARSFPVDMAGFAVNVAMLREGATMPFVAGHEEDGFLRSLAVELADIQPLAKNCTKILVWHTKTVKHKKPTVKVDLDKLKNTGRYHNLASLLRETSYMGMAETSADSGIKSFITSNRKTFQALTDF-
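
The lengths given in this record (1032 bp transcript, 343 aa protein) fit a coding sequence of 344 codons, where the last codon is the stop codon written as "-" at the end of the protein sequence. The following is a minimal sketch of how that relationship could be checked, assuming the standard genetic code applies to this transcript. The names `translate` and `CODON_TABLE` are illustrative only and are not part of the gene set or its tools.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position. Assumed here, not stated in the record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol, matching the trailing "-"
    used in the protein sequence of this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First six codons and last three codons of DPOGS203028-TA, copied from above.
print(translate("ATGAAACATTTTAAGCTG"))  # MKHFKL
print(translate("GATTTCTGA"))           # DF-
print(1032 // 3 - 1)                    # 343 residues once the stop is excluded
```

Passing the full DPOGS203028-TA sequence to `translate` and comparing the result with the DPOGS203028-PA sequence would extend the same check to the whole record.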