Monarch geneset OGS2.0

DPOGS214684
TranscriptDPOGS214684-TA870 bp
ProteinDPOGS214684-PA289 aa
Genomic positionDPSCF300503 - 46422-48540
RNAseq coverage49x (Rank: top 70%)
Annotation
HeliconiusHMEL0226497e-10058.06% 
BombyxBGIBMGA013829-TA8e-9753.17% 
DrosophilaCG4302-PA1e-5034.40% 
EBI UniRef50UniRef50_G9LPP72e-9256.12%UDP-glycosyltransferase UGT33T1 n=3 Tax=Obtectomera RepID=G9LPP7_HELAM
NCBI RefSeqNP_001135960.13e-9553.17%uridine diphosphate glucosyltransferase [Bombyx mori]
NCBI nr blastpgi|3638960681e-9756.54%UDP-glycosyltransferase UGT33J1 [Helicoverpa armigera]
NCBI nr blastxgi|3638960586e-9857.54%UDP-glycosyltransferase UGT33B11 [Helicoverpa armigera]
Group
Gene OntologyGO:00081526.5e-92metabolic process
GO:00167586.5e-92transferase activity, transferring hexosyl groups
KEGG pathwaydme:Dmel_CG43021e-48 
 K00699 (UGT)maps-> Drug metabolism - cytochrome P450
    Starch and sucrose metabolism
    Porphyrin and chlorophyll metabolism
    Steroid hormone biosynthesis
    Pentose and glucuronate interconversions
    Ascorbate and aldarate metabolism
    Drug metabolism - other enzymes
    Metabolism of xenobiotics by cytochrome P450
    Retinol metabolism
InterPro domain[8-285] IPR0022136.5e-92UDP-glucuronosyl/UDP-glucosyltransferase
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214684-TA
ATGGAAAGGGAGGCAAAAGAAAATATCTTACTGGAAAAACTATTTGGCTCTGATATACCCCCATTACATGAATTGGCTAATAATGTAAATTTATTGTTCTTAAATGTTCATCCAATATGGATAGACAATCAGCCTGTGCCACCAAATGTTGTGTTTATTGGAGGTATTCACAAACAGCCACCAGAAGAAATACCAACAGATCTTTTATATTTCTTAAATGCATCTACAAATGGATTTGTTTACATAAGTTTTGGTACAAATGTTAAGCCATCTCTACTACCTCCGGAAAAAATTGATATTATGATAAAGGTCCTTTCTAAGCTACCTTATAGTGTTTTATGGAAATGGGACAAAGAAGGGATGCCACGACAAACGAATAATATAAAATATGTTCCATGGGTACCACAAAAGGATATTCTTATGCATCCAAATATAAAATTATTTGTGACACAATGCGGTCTGCAGTCAACCGAAGAAGCAATAAACGCTTTGGTACCCCTGATTGGAATTCCAGTACTTGGTGATCAATTTTACAATGCGGAAAAATATGTTTACCATGGTATTGGGATAAAACTTGATTTAGATTATCTCAGTGAGGAAGTATTTAGCGGAGCTCTCGAAACAATATTAAATAGTAAAAGTTACCGCGAAAACCTTATACGATTGAGGAAAATAATGAATGATCAGCCTGAGTCAGCATTGCAGCGAGCTATCTGGTGGATAGATTATACATTAAGACATGGTGGCGCTAAACATTTACGAGCACGTGGAGCTAACATCACGTGGGCACAGTACTTAGAGCTGGAATTGGTCTTCACGGTTTTATCAGCGGTTCTTATTACTTACTTAATACTTGGTGGTTTTGCGTAA

Protein sequence:

>DPOGS214684-PA
MEREAKENILLEKLFGSDIPPLHELANNVNLLFLNVHPIWIDNQPVPPNVVFIGGIHKQPPEEIPTDLLYFLNASTNGFVYISFGTNVKPSLLPPEKIDIMIKVLSKLPYSVLWKWDKEGMPRQTNNIKYVPWVPQKDILMHPNIKLFVTQCGLQSTEEAINALVPLIGIPVLGDQFYNAEKYVYHGIGIKLDLDYLSEEVFSGALETILNSKSYRENLIRLRKIMNDQPESALQRAIWWIDYTLRHGGAKHLRARGANITWAQYLELELVFTVLSAVLITYLILGGFA-