Monarch geneset OGS2.0

DPOGS209773
TranscriptDPOGS209773-TA1221 bp
ProteinDPOGS209773-PA406 aa
Genomic positionDPSCF300397 - 35241-37611
RNAseq coverage597x (Rank: top 21%)
Annotation
HeliconiusHMEL0226193e-13456.51% 
BombyxBGIBMGA010289-TA9e-10750.53% 
DrosophilaUgt86De-PA1e-5431.49% 
EBI UniRef50UniRef50_G6CTZ62e-11656.15%UDP-glucosyltransferase n=3 Tax=Obtectomera RepID=G6CTZ6_DANPL
NCBI RefSeqNP_001037040.16e-11753.11%phenol UDP-glucosyltransferase [Bombyx mori]
NCBI nr blastpgi|1129831381e-11553.11%phenol UDP-glucosyltransferase precursor [Bombyx mori]
NCBI nr blastxgi|1129831382e-11452.36%phenol UDP-glucosyltransferase precursor [Bombyx mori]
Group
Gene OntologyGO:00081524.3e-106metabolic process
GO:00167584.3e-106transferase activity, transferring hexosyl groups
KEGG pathwaydme:Dmel_CG66531e-52 
 K00699 (UGT)maps-> Drug metabolism - cytochrome P450
    Starch and sucrose metabolism
    Porphyrin and chlorophyll metabolism
    Steroid hormone biosynthesis
    Pentose and glucuronate interconversions
    Ascorbate and aldarate metabolism
    Drug metabolism - other enzymes
    Metabolism of xenobiotics by cytochrome P450
    Retinol metabolism
InterPro domain[4-406] IPR0022134.3e-106UDP-glucuronosyl/UDP-glucosyltransferase
Orthology groupMCL23345 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209773-TA
ATGGATGTTGTCCAAAAATTTATAAAAGATAGAAGTGTTAAGTTTGATGTTGTAATAGTGGACTGGCTTTATACGGAGTTAGGAGTTGGATTTTCATCAGTATTCAATTGTCCTCTGATATGGTTTTCGTCTATGGATGTCCATACGACCGTGTTGGCACTCATAGATGACTATTTAAATCCGGCTTACGCTCGACATCATATGAGTACAGAATATTCTTCCAGTTTTTTGAACAGGGTGAAGGAACTTTGGACTATATCGAGAACATGGTTATATCATTGGTGGCATCTTGACGAAAAAGAAAGGATGTTTCGAGAGATATTCGGACCGGCTTCCGAAGAACGAGGTATAAAATTACCACATTTCAACGACGTTCGCTACAATGCATCCCTCATGCTTGGCAATTCACATATAGTGGTTGGAGAAGCAATTGCACTGCCGCAGAATTACTGGCACATCGGAGGATACCACATCAAAAAAACTGTTGAACCATTGCCAAAGGATTTACAAAAAATCATGGATACAGCCAAAGATGGTGTAATATACTTTAGCCTGGGCAGTTTGCTGAAAGGCAGAAAAATACCGAGTGCAGTTAAAAAGCGATTCCTAAACATTTTTAGTGAGTTAAAACAGGAAATTATTTGGAAGTTTGATGAACAAATGACTGATTTGCCTAAAAATGTGCATATCGTTACATGGGCTCCACAACAAAGTATTTTAGCACATCCTAATTGCATACTTTTTATAACACACGGTGGTCTTCTATCAACGTTAGAGACCATTAAATATGGCGTGCCAATTATCGGTATACCATTCTTTGCCGACCAATTCCTTAATGTCAACAAAGTTGTCGCTAAAGGATTCGGCAGGCGTGTAGATATAAGTGAAAACACACCGGAAGAATTGAGAATTGCTATAAGGGAAGTATTAGGAAATACCAGCTACCGCACTCGTGTGAAGGAACTGTCATCTCTGTTCAATGCTGATTCAGATCCAGGACAGCGATTGGTTCAGGGCGTGGAGTTAGTGGTCAGGACTAACGGAGCACCACATCTTCGCTCCGTCGCACTACGCGTGCCGTTCTACCAAAAACTGTACTTGGATGTTTTACTATTAGTTATTGGAATTGTTTTTGGAATTATTATATTAATATCGTATGCTTGTAAATATTTGCGCACTTTCATATCCAGGTCTAATCAGGGTAAGAAGAAATTAAACTAA

Protein sequence:

>DPOGS209773-PA
MDVVQKFIKDRSVKFDVVIVDWLYTELGVGFSSVFNCPLIWFSSMDVHTTVLALIDDYLNPAYARHHMSTEYSSSFLNRVKELWTISRTWLYHWWHLDEKERMFREIFGPASEERGIKLPHFNDVRYNASLMLGNSHIVVGEAIALPQNYWHIGGYHIKKTVEPLPKDLQKIMDTAKDGVIYFSLGSLLKGRKIPSAVKKRFLNIFSELKQEIIWKFDEQMTDLPKNVHIVTWAPQQSILAHPNCILFITHGGLLSTLETIKYGVPIIGIPFFADQFLNVNKVVAKGFGRRVDISENTPEELRIAIREVLGNTSYRTRVKELSSLFNADSDPGQRLVQGVELVVRTNGAPHLRSVALRVPFYQKLYLDVLLLVIGIVFGIIILISYACKYLRTFISRSNQGKKKLN-