Monarch geneset OGS2.0

DPOGS209772
TranscriptDPOGS209772-TA1488 bp
ProteinDPOGS209772-PA495 aa
Genomic positionDPSCF300397 - 45240-49510
RNAseq coverage1110x (Rank: top 11%)
Annotation
HeliconiusHMEL0226195e-16256.14% 
BombyxBGIBMGA010289-TA4e-12950.00% 
DrosophilaCG15661-PB3e-6030.52% 
EBI UniRef50UniRef50_G9LPQ67e-13351.30%UDP-glycosyltransferase UGT40Q1 n=1 Tax=Helicoverpa armigera RepID=G9LPQ6_HELAM
NCBI RefSeqNP_001037040.11e-13451.24%phenol UDP-glucosyltransferase [Bombyx mori]
NCBI nr blastpgi|1129831382e-13351.24%phenol UDP-glucosyltransferase precursor [Bombyx mori]
NCBI nr blastxgi|1129831386e-13350.40%phenol UDP-glucosyltransferase precursor [Bombyx mori]
Group
Gene OntologyGO:00081523.1e-118metabolic process
GO:00167583.1e-118transferase activity, transferring hexosyl groups
KEGG pathwaydme:Dmel_CG156612e-58 
 K00699 (UGT)maps-> Drug metabolism - cytochrome P450
    Starch and sucrose metabolism
    Porphyrin and chlorophyll metabolism
    Steroid hormone biosynthesis
    Pentose and glucuronate interconversions
    Ascorbate and aldarate metabolism
    Drug metabolism - other enzymes
    Metabolism of xenobiotics by cytochrome P450
    Retinol metabolism
InterPro domain[2-495] IPR0022133.1e-118UDP-glucuronosyl/UDP-glucosyltransferase
Orthology groupMCL23345 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209772-TA
ATGGTGTTCCCTGTGCCCGGAAAGAGTCACTCCATCCTGGGGGAGGGTTATGTAAGACATTTGTTAGCGGCTGGACATGAGGTAACCTATTTAACTCCGATACCGATTAAAAACCCGCCAGACAGACTTCGACAGATAGATGTGTCTGAAAACATAAAGTATATGTCAGAAGAACTTTTTGATGTAAAGAAGTACATGTATAAAGAAGTTAATTTGGTTCATTTGGAACTCACTGAACTGTTCGACAATCTCTGTTATAATACCTTCAAAATTGACAGCGTCCAAAGATTTATGAGAGACAAAGACGTTGATTTTGATGTCGTCATTGTCGAGTGGCTGTATTCTGAACTAGGTGTTGGGTTTTCATCAGTCTTTAATTGTCCTCTGGTATGGTCATCGTCTTTGGATGTTCACACTGAGGTGCTAGGTCTCATAGATGGGTACACAAACCCGGCGTACACCAAACATTTCTTCTCTACTGATTATTCATTCACGTTTTGGGATAGAGTGAATGAACTTTGGAGGGTATCCCGATTACTGTTATATAAATGGTGGCACATTGACGAGAACGATAAGATGTTTCGAGAGATATTCGGACCGGCTGCCGAAGAACGAGGTATAAAATTACCACATTTCAACGACGTGCGCTACAATGCATCCCTCATGCTTGGCAATTCACATATAGTGATTGGAGATGCAATCGCACTGCCGCAGAATTACCTGCATATCGGAGGTTACCACATTAAAAACGTTTTGGAACCGCTACCAAAGGATCTACAACAAATCATGGATAAGGCCAAAAATGGTGTAATATACTTCAGTTTGGGCAGTACGTTACAAGGCAGTAAAATACCAAGTAACGTTAAAAGGAAATTTCTTGACATGTTTGGTGAATTAAGCCAAAACGTTATTTGGAAATTGGATGGAAAAATTACAGATTTACCTAAAAATGTGCATATCGTTGATTGGGCTCCGCAACAAAGTATTTTGGCACATCCTAATTGCGTACTTTTTATAACACACGGTGGTCTTCTATCAACGTTAGAGACCATTAAATATGGCGTGCCAATTATCGGTATACCATTCTTTGCCGACCAATTCCTTAATGTCAACAAAGTTGTCGCTAAAGGATTCGGCAGGCGTGTAGATATAAGTGAAAACACACCGGAAGAATTGAGATTTGCTATAAGGGAAGTATTAGGAAATACCAGCTACCGCACTCGTGTGAAGGAACTGTCATCTCTGTTCATCGCTGATTCAGATCCAGGACAGCGATTGGTTCAGGGCGTGGAGTTAGTGGTCAGGACAAACGGAGCACCACATCTTCGTTCCGTCGCACTACGCGTGCCGTTCTACCAAAAACTGTACTTGGATGTTTTACTATTAGTTATTGCAATCGTTTTTGGACTTCCTCTTGTCATATATTATACGTGTAAACACTTATTGTTGGATGGCACTAAGTCTAATCTTAATAAGAAGAGAAACTAG

Protein sequence:

>DPOGS209772-PA
MVFPVPGKSHSILGEGYVRHLLAAGHEVTYLTPIPIKNPPDRLRQIDVSENIKYMSEELFDVKKYMYKEVNLVHLELTELFDNLCYNTFKIDSVQRFMRDKDVDFDVVIVEWLYSELGVGFSSVFNCPLVWSSSLDVHTEVLGLIDGYTNPAYTKHFFSTDYSFTFWDRVNELWRVSRLLLYKWWHIDENDKMFREIFGPAAEERGIKLPHFNDVRYNASLMLGNSHIVIGDAIALPQNYLHIGGYHIKNVLEPLPKDLQQIMDKAKNGVIYFSLGSTLQGSKIPSNVKRKFLDMFGELSQNVIWKLDGKITDLPKNVHIVDWAPQQSILAHPNCVLFITHGGLLSTLETIKYGVPIIGIPFFADQFLNVNKVVAKGFGRRVDISENTPEELRFAIREVLGNTSYRTRVKELSSLFIADSDPGQRLVQGVELVVRTNGAPHLRSVALRVPFYQKLYLDVLLLVIAIVFGLPLVIYYTCKHLLLDGTKSNLNKKRN-