Monarch geneset OGS2.0

DPOGS209771
TranscriptDPOGS209771-TA789 bp
ProteinDPOGS209771-PA262 aa
Genomic positionDPSCF300397 - 53589-55787
RNAseq coverage209x (Rank: top 46%)
Annotation
HeliconiusHMEL0226197e-4053.33% 
BombyxBGIBMGA010289-TA1e-2941.38% 
DrosophilaCG10178-PB1e-1550.00% 
EBI UniRef50UniRef50_D6RUU64e-2642.47%UDP-glucosyltransferase n=17 Tax=Obtectomera RepID=D6RUU6_BOMMO
NCBI RefSeqNP_001161187.11e-2842.07%UDP-glucosyltransferase protein 3 [Bombyx mori]
NCBI nr blastpgi|2678448692e-2742.07%UDP-glucosyltransferase protein 3 [Bombyx mori]
NCBI nr blastxgi|2678448691e-2642.07%UDP-glucosyltransferase protein 3 [Bombyx mori]
Group
Gene OntologyGO:00081521.5e-29metabolic process
GO:00167581.5e-29transferase activity, transferring hexosyl groups
KEGG pathwaydme:Dmel_CG101788e-14 
 K00699 (UGT)maps-> Drug metabolism - cytochrome P450
    Starch and sucrose metabolism
    Porphyrin and chlorophyll metabolism
    Steroid hormone biosynthesis
    Pentose and glucuronate interconversions
    Ascorbate and aldarate metabolism
    Drug metabolism - other enzymes
    Metabolism of xenobiotics by cytochrome P450
    Retinol metabolism
InterPro domain[21-260] IPR0022131.5e-29UDP-glucuronosyl/UDP-glucosyltransferase
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209771-TA
ATGTCAGGCGGTCTATCTGCAGACATGTTTGTTACGAGAATATTATTTCTCTTTGTGTTTCTCCCATATTGCGATGGATATAACGTGCTGGTGGTTTTTCCTGTTCCCGGGAAGAGCCATAGCATCCTGGGAGAGGGCTACGTAAGATATTTGCTGGACGCCGGTCACGAGGTAACCTATTTAACACCTATACTTATTAAAAATCCGCCGTCAAGACTTAAGCAAATAGATGTGTCTGAGAATTCCAAGTATTTGCCAGCAGATATTTTCGACGTGAAGAGGTTCATGTATAAGGAATTGAATATGCAGGATGAAGTTAAATACATGGCTTTATTTGACAATCTGCTGAATAACACATTACGAATGGATGTTGTCCAAAAATTTATGAAAGATAGGAGTGTTAAGTTTGATGTCGTGGTTGTGGAGTGGCTTTATACGGAGTTAGGAGTTGGGTGGCATCTTGACGAAAAAGAAAGGATGTTTCGAGAGATATTCGGACCGGCTGCCGAAGAACGAGGTATAAAATTANNNNNNNNNNNNNNNNNACCATTGCCAAAGGATTTACAAAAAATCATGGATGCAGCTAAAGACGGTGTCATATACTTTAGCATGGGCAGTTTGCTGAAAGGCAGTAAAATACCGAGTGCAGTAAGAAAACAGTTCTTAAAAAAGTTTAGTGAATTAAAACAGGAGGTTATTTGGAAGTATGATGAAAAAATTGCAGATTTGCCTAAAAATGTGCATGTCGTAACATGGGCTCCACAACAAAGTATTTTAGGTAAAGATTAA

Protein sequence:

>DPOGS209771-PA
MSGGLSADMFVTRILFLFVFLPYCDGYNVLVVFPVPGKSHSILGEGYVRYLLDAGHEVTYLTPILIKNPPSRLKQIDVSENSKYLPADIFDVKRFMYKELNMQDEVKYMALFDNLLNNTLRMDVVQKFMKDRSVKFDVVVVEWLYTELGVGWHLDEKERMFREIFGPAAEERGIKLXXXXXXPLPKDLQKIMDAAKDGVIYFSMGSLLKGSKIPSAVRKQFLKKFSELKQEVIWKYDEKIADLPKNVHVVTWAPQQSILGKD-