Monarch geneset OGS2.0

DPOGS206695
Transcript: DPOGS206695-TA (861 bp)
Protein: DPOGS206695-PA (286 aa)
Genomic position: DPSCF300048 + 1473264-1477400
RNAseq coverage: 238x (Rank: top 43%)
Annotation
Heliconius: HMEL011829 | 4e-121 | 86.70%
Bombyx: BGIBMGA008529-TA | 6e-118 | 86.73%
Drosophila: GlcAT-I-PA | 2e-69 | 46.85%
EBI UniRef50: UniRef50_D2A2P0 | 2e-77 | 60.76% | Putative uncharacterized protein GLEAN_07027 n=1 Tax=Tribolium castaneum RepID=D2A2P0_TRICA
NCBI RefSeq: XP_974644.1 | 4e-78 | 60.76% | PREDICTED: similar to glucuronyltransferase I [Tribolium castaneum]
NCBI nr blastp: gi|91081825 | 7e-77 | 60.76% | PREDICTED: similar to glucuronyltransferase I [Tribolium castaneum]
NCBI nr blastx: gi|91081825 | 4e-76 | 60.76% | PREDICTED: similar to glucuronyltransferase I [Tribolium castaneum]
Group
Gene Ontology: GO:0016020 | 9.7e-136 | membrane
    GO:0015018 | 9.7e-136 | galactosylgalactosylxylosylprotein 3-beta-glucuronosyltransferase activity
KEGG pathway: tca:663511 | 1e-77
    K10158 (B3GAT3) maps-> Glycosaminoglycan biosynthesis - heparan sulfate
    Glycosaminoglycan biosynthesis - chondroitin sulfate
InterPro domain: [43-286] IPR005027 | 9.7e-136 | Glycosyl transferase, family 43
Orthology group: MCL13819 | Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206695-TA
ATGCCTTTTATTAATATCAAGAAACAATATCTAGCAGTCGGAATGTTAGTTTTCGTAGCATTGTTTTTTTTCAATAGCCGACCAGCTTTCCAATGTCAATTTGCTCAGGAAGTGGCCACATATTATCCTACGATATATGGGATTACCCCGACGTATGCAAGACTTGCTCAAAAGGCCGACCTTACAAGATTGTCACAAACATTGATGCTAGTGAAGAACTTTCATTGGATAGTTATAGAGGATTCTGAAACTAAGACTAAGTTAGTCGAAAACCTACTCAAGGAATCCACTTTAAAATACACACACCTTAATGTGAAAACTCAAAAGTCAAAGCTTTCCACGGCTAGCGGAGTGGAACAACGGAATATCGCTTTGAATTGGCTTCGGGATCATTTAAGGAAAGTTGAAGACAAGAGGGGTGTTGTTTACTTCATGGATGACGATAATACATATTCATTGAAAGTCTTCGACGAGATGAGGAAAATTAAGAAAGTTGGAACTTGGCCCGTTGGTATAGTGGGAGGTATGAGGGTAGAAATGCCACTTGTTACTAATGGCAAGGTGTCAGGCTACAACGCTGTTTGGAAGCCTTACAGACCTTTCCCCATAGATATGGCTGGTTTTGGAATCAACGCAACACTGTTCTTAGATCATCCCGAGGCGAAATTCTCCAGAAAAGTTCAATCTGGATTCCAGGAGAGCGAAATATTGAAATACTTTACAAGCAAAGAGGAATTAGAGCCTCTGGCTGAGAATTGCACCAAAGTGTACGTTTGGCACACGAGAACCCAGAAACCATCTATATTGAATCCAAAGAAACTAAAGCATCCGCCCTTACCCGACGATCATATCGAAGTCTGA

Protein sequence:

>DPOGS206695-PA
MPFINIKKQYLAVGMLVFVALFFFNSRPAFQCQFAQEVATYYPTIYGITPTYARLAQKADLTRLSQTLMLVKNFHWIVIEDSETKTKLVENLLKESTLKYTHLNVKTQKSKLSTASGVEQRNIALNWLRDHLRKVEDKRGVVYFMDDDNTYSLKVFDEMRKIKKVGTWPVGIVGGMRVEMPLVTNGKVSGYNAVWKPYRPFPIDMAGFGINATLFLDHPEAKFSRKVQSGFQESEILKYFTSKEELEPLAENCTKVYVWHTRTQKPSILNPKKLKHPPLPDDHIEV-
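
The transcript and protein records above can be cross-checked against each other. The following is a minimal Python sketch, assuming that the transcript is a complete coding sequence with no UTRs, that the standard genetic code applies, and that the trailing "-" in the protein record marks the stop codon. The names translate and protein_length are illustrative only and are not part of the OGS2.0 resource.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon."""
    return cds_length_bp // 3 - 1


# First 8 codons and last 3 codons of DPOGS206695-TA, copied from the record.
print(translate("ATGCCTTTTATTAATATCAAGAAA"))
print(translate("GAAGTCTGA"))
print(protein_length(861))
```

Under these assumptions, the first 24 nucleotides translate to MPFINIKK, which matches the start of DPOGS206695-PA, and an 861 bp coding sequence with one stop codon corresponds to the listed 286 aa.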