Monarch geneset OGS2.0

DPOGS204212
TranscriptDPOGS204212-TA921 bp
ProteinDPOGS204212-PA306 aa
Genomic positionDPSCF300493 - 25192-26879
RNAseq coverage17x (Rank: top 80%)
Annotation
HeliconiusHMEL0226499e-9855.17% 
BombyxBGIBMGA013860-TA6e-9557.95% 
DrosophilaCG15661-PB2e-5636.82% 
EBI UniRef50UniRef50_G9LPP74e-9559.78%UDP-glycosyltransferase UGT33T1 n=3 Tax=Obtectomera RepID=G9LPP7_HELAM
NCBI RefSeqNP_001040425.12e-9357.30%antennal-enriched UDP-glycosyltransferase [Bombyx mori]
NCBI nr blastpgi|3638960423e-9758.45%UDP-glycosyltransferase UGT33B1 [Helicoverpa armigera]
NCBI nr blastxgi|3638960522e-9858.68%UDP-glycosyltransferase UGT33B7 [Helicoverpa armigera]
Group
Gene OntologyGO:00081525.1e-104metabolic process
GO:00167585.1e-104transferase activity, transferring hexosyl groups
KEGG pathwaydpo:Dpse_GA138787e-56 
 K00699 (UGT)maps-> Drug metabolism - cytochrome P450
    Starch and sucrose metabolism
    Porphyrin and chlorophyll metabolism
    Steroid hormone biosynthesis
    Pentose and glucuronate interconversions
    Ascorbate and aldarate metabolism
    Drug metabolism - other enzymes
    Metabolism of xenobiotics by cytochrome P450
    Retinol metabolism
InterPro domain[18-294] IPR0022135.1e-104UDP-glucuronosyl/UDP-glucosyltransferase
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204212-TA
ATGAATGAATGGTGGATAAAAAATGTAAGACAGGAGCTCGAAATAGAAGAAAATAGAATAATAAGAAACATCTTCGGACCAGAAACTCCAAATATTAATGAGTTAAAAAAAAATGTTGACATGCTGTTTTTGAATATCCATCCGATATTTGTTGATAATCAACCTGTTCCTCCCGATGTCATATATGTAGGAGGGATACATATAAAGCCTAGAAAAGAATTGCCAAAGGATTTAAGTAAAGTATTGGATTCTTCAAAAAGCGGGGTCATTTATTTTAGTATGGGTACTAACATTAAGAAGTCACATTTACCATCTGAAACAATCCAAATGTTTATAAATACTTTTTCAAGCTTACCGTATGATATTTTATGGAAATGCGATGAAGATATACAAATAACTTCGAAAAATATCAAAATATTGAAATGGTTTCCACAATCAGATCTTTTAGCACATCCAAAAGTTAAACTTTTTATAACCCAAGGAGGTCTTCAATCTACAGATGAAGCTATAAACGCTGGAGTACCACTTATAGGTTTACCTATGATAGCCGATCAATGGTATAATGTTGAGAAGTATGTACATCATAAAATTGGCCTCAAACTTGATATATCTACGCTAACTAAAGAAGGTTTAATCAATGCTATTGAAACAGTAATAACAAATAACAGCTATCGCCAAAATATATTGCGCTTACGCGCTTTAATGCAGGACCAAAAGGAAAAACCATTGGAGAGAGCTGTCCGTTGGATAGAATATACTCTTCGACATGGTGGAACAAAGCATATGAGATCTGGTGCAGGAAATTTAACGTGGCAGCAGTATTATGAGCTGGAACTAATTATTATAGCTGTTGCAATTATCTTAATATCAATGGGACTATTTTTGGATAGTCAGCTGGAGGTAAACTGGCAGTTGGAATGA

Protein sequence:

>DPOGS204212-PA
MNEWWIKNVRQELEIEENRIIRNIFGPETPNINELKKNVDMLFLNIHPIFVDNQPVPPDVIYVGGIHIKPRKELPKDLSKVLDSSKSGVIYFSMGTNIKKSHLPSETIQMFINTFSSLPYDILWKCDEDIQITSKNIKILKWFPQSDLLAHPKVKLFITQGGLQSTDEAINAGVPLIGLPMIADQWYNVEKYVHHKIGLKLDISTLTKEGLINAIETVITNNSYRQNILRLRALMQDQKEKPLERAVRWIEYTLRHGGTKHMRSGAGNLTWQQYYELELIIIAVAIILISMGLFLDSQLEVNWQLE-