Monarch geneset OGS2.0

DPOGS209528
TranscriptDPOGS209528-TA843 bp
ProteinDPOGS209528-PA280 aa
Genomic positionDPSCF300157 - 318308-319610
RNAseq coverage28x (Rank: top 76%)
Annotation
HeliconiusHMEL0226721e-10661.07% 
BombyxBGIBMGA013829-TA2e-8955.26% 
DrosophilaUgt86De-PA3e-4836.53% 
EBI UniRef50UniRef50_G9LPP72e-8754.48%UDP-glycosyltransferase UGT33T1 n=3 Tax=Obtectomera RepID=G9LPP7_HELAM
NCBI RefSeqNP_001135960.15e-8855.64%uridine diphosphate glucosyltransferase [Bombyx mori]
NCBI nr blastpgi|3638960562e-8961.07%UDP-glycosyltransferase UGT33B9 [Helicoverpa armigera]
NCBI nr blastxgi|3638960641e-9258.61%UDP-glycosyltransferase UGT33F2 [Helicoverpa armigera]
Group
Gene OntologyGO:00081521.3e-92metabolic process
GO:00167581.3e-92transferase activity, transferring hexosyl groups
KEGG pathwaydme:Dmel_CG66532e-46 
 K00699 (UGT)maps-> Drug metabolism - cytochrome P450
    Starch and sucrose metabolism
    Porphyrin and chlorophyll metabolism
    Steroid hormone biosynthesis
    Pentose and glucuronate interconversions
    Ascorbate and aldarate metabolism
    Drug metabolism - other enzymes
    Metabolism of xenobiotics by cytochrome P450
    Retinol metabolism
InterPro domain[1-280] IPR0022131.3e-92UDP-glucuronosyl/UDP-glucosyltransferase
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209528-TA
ATGTTATTTTTAAATGAACATCCTATATGGAGTGACAACCATCCAGTTCCTCCAAACGTTATTTATATGGGAGGAATTCATGAAACTCCAAAAAAACCACTACCCCAGGATTTAAAAGAATATTTAGACACATCTGCCAATGGAGTAATTTACATCAGTTTTGGAACCAATGTGCTGCCTTCTGTTCTACCGCCGGAAAAAATAAAAGTTTTTAGAGATGTTTTATCTCAATTACCTTATAATGTGTTATGGAAATGGGATGGAAATAGTTTGCCAGGACATTCGAAAAATATCAAAATATCCAAGTGGTTTCCGCAAGCCGATTTACTCAGACATCCAAATATGAAACTATTTATCACTCAAGGAGGTTTACAATCAACAGATGAAGCTATAAACGCTGAAGTACCTTTGCTTGGTATTCCTTTTTTCGCAGATCAATGGTATAATACAGAAAAATATGTTTACCATAAAATAGGAATGCAACTAGATATCGAAACCCTGAACGAAGATAAGCTAAAACAAGCAATTCTCACTCTTGTTGAGAATGAAAGCTACAAGAGAAATATTGGAAAACTTAGGGAACTGATCGGACAGCATCCGACAGAGCCTTTGAATCTAACTGTATGGTGGATAGAACATCTAATCAAGTACGGAGGAGATCATCTCCAAGCACCAGCCGCTGGGTTATCTTGGATCGAATACTACGAAGTCCACTTGTTATTAATAATTTTAAGTATCTTATTTGTTATTTTAGTTATAGTTATCTCCACTCTTAAATTTTTATTACGTTTCCTCATAAAATCTTTACATATAAAAAAAGAAATTAAGATGAAAAGTAATTGA

Protein sequence:

>DPOGS209528-PA
MLFLNEHPIWSDNHPVPPNVIYMGGIHETPKKPLPQDLKEYLDTSANGVIYISFGTNVLPSVLPPEKIKVFRDVLSQLPYNVLWKWDGNSLPGHSKNIKISKWFPQADLLRHPNMKLFITQGGLQSTDEAINAEVPLLGIPFFADQWYNTEKYVYHKIGMQLDIETLNEDKLKQAILTLVENESYKRNIGKLRELIGQHPTEPLNLTVWWIEHLIKYGGDHLQAPAAGLSWIEYYEVHLLLIILSILFVILVIVISTLKFLLRFLIKSLHIKKEIKMKSN-