Monarch geneset OGS2.0

DPOGS209539
Transcript        DPOGS209539-TA  933 bp
Protein           DPOGS209539-PA  310 aa
Genomic position  DPSCF300157  +  389393-390729
RNAseq coverage   77x (Rank: top 65%)
Annotation
Heliconius       HMEL022633        9e-101  61.54%
Bombyx           BGIBMGA013829-TA  2e-90   51.19%
Drosophila       CG30438-PD        6e-54   36.70%
EBI UniRef50     UniRef50_G9LPP7   1e-86   49.52%  UDP-glycosyltransferase UGT33T1 n=3 Tax=Obtectomera RepID=G9LPP7_HELAM
NCBI RefSeq      NP_001135960.1    2e-88   50.85%  uridine diphosphate glucosyltransferase [Bombyx mori]
NCBI nr blastp   gi|363896058      2e-92   56.55%  UDP-glycosyltransferase UGT33B11 [Helicoverpa armigera]
NCBI nr blastx   gi|363896058      3e-95   56.75%  UDP-glycosyltransferase UGT33B11 [Helicoverpa armigera]
Group
Gene Ontology    GO:0008152  5.2e-108  metabolic process
                 GO:0016758  5.2e-108  transferase activity, transferring hexosyl groups
KEGG pathway     dme:Dmel_CG4302  7e-51
                 K00699 (UGT) maps to:
                     Drug metabolism - cytochrome P450
                     Starch and sucrose metabolism
                     Porphyrin and chlorophyll metabolism
                     Steroid hormone biosynthesis
                     Pentose and glucuronate interconversions
                     Ascorbate and aldarate metabolism
                     Drug metabolism - other enzymes
                     Metabolism of xenobiotics by cytochrome P450
                     Retinol metabolism
InterPro domain  [10-310]  IPR002213  5.2e-108  UDP-glucuronosyl/UDP-glucosyltransferase
Orthology group 

Nucleotide sequence:

>DPOGS209539-TA
ATGTATAATGATCACGTACAATACGAAAATGAGGTGCTGAAAAGGTTGTTTGGGTCTGAAATACCGGATATTAATGAATTAAGGAAAAATATACGTATGGTATTTTTAAATGTCCATCCGATTTGGGACTTCAATAGACCTGTTCCACCAAATGTTATATATCTAGGACAAATGCATTTACAAAAGGAACGAGTAAAAAAACTACCAGAGGAGATAGAGTTGTTCGTGAATTCATCAGTTCATGGCTTTATATACATGAGTTTTGGTTCCAACGTAAAGTTATCCTCGTTACCTCAAGAAAAAATACAAATTTTCTCTAAAATATTTTCCGAAATTCCTTATGAAGTTCTATGGAAACGGGATGGAGAAATACCTGTCAATCTCTCTCAAAATATTAAAATTTCTGAATGGTTTCCTCAGTCTACGCTATTAAGACACCCGAAAATTAAATTATTTATTACTCAAGGAGGCTTGCAGTCGACAGACGAGGCTATATTTGCAGGAGTTCCGCTAATTGTTGTACCGTGTCTTGGTGATCAATGGTATAACGCTGAGCAATATGTTAGGCACGGTATTGGGAGAAAGTTGGAACTAAATAACCTTAACGAAAAACTATTGAAAGAATCCATAGAAGATGTTATACATAACAAAAGCTATCGTGAAAATGTCAAAAAACTTAGACAAATAATAACTGACCAACCACAAACATCATTAGAAAAAGCTGTTTGGTGGACGGAATACGTTTTACGACATAAAGGAGCTAAACATCTTATGTCACCAGCCGCCAACTTGTCGTGGCTTGAGTACTATGAAATCAACTTTGTAATTTTTTTGCTTGGAATTCTATTTCTATGCATTATTTCCATAATATTTATCTTAAGACTACTTGTAACCTTTCTTTTTGTGAGGGATATAAAAATTAAAGAAAACTAA

Protein sequence:

>DPOGS209539-PA
MYNDHVQYENEVLKRLFGSEIPDINELRKNIRMVFLNVHPIWDFNRPVPPNVIYLGQMHLQKERVKKLPEEIELFVNSSVHGFIYMSFGSNVKLSSLPQEKIQIFSKIFSEIPYEVLWKRDGEIPVNLSQNIKISEWFPQSTLLRHPKIKLFITQGGLQSTDEAIFAGVPLIVVPCLGDQWYNAEQYVRHGIGRKLELNNLNEKLLKESIEDVIHNKSYRENVKKLRQIITDQPQTSLEKAVWWTEYVLRHKGAKHLMSPAANLSWLEYYEINFVIFLLGILFLCIISIIFILRLLVTFLFVRDIKIKEN-
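
Consistency check (illustrative sketch, not part of the original record): the lengths reported above can be cross-checked with simple arithmetic. The sketch assumes that the transcript is entirely coding sequence ending in a stop codon, and that the genomic coordinates are 1-based and inclusive. Neither assumption is stated in the record. The function names are hypothetical.

```python
def cds_length_consistent(transcript_bp: int, protein_aa: int,
                          has_stop: bool = True) -> bool:
    """Return True if a coding sequence of transcript_bp bases can encode
    protein_aa residues (plus one stop codon when has_stop is True)."""
    codons = protein_aa + (1 if has_stop else 0)
    return transcript_bp == 3 * codons


def genomic_span(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
print(cds_length_consistent(933, 310))   # True: 3 * (310 + 1) = 933
span = genomic_span(389393, 390729)
print(span)                              # 1337
print(span - 933)                        # 404 genomic bases not in the transcript
```

Under these assumptions the 933 bp transcript matches the 310 aa protein, and the genomic span exceeds the transcript by 404 bp, which would be consistent with intron sequence in the gene model.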