Monarch geneset OGS2.0

DPOGS206679
TranscriptDPOGS206679-TA1548 bp
ProteinDPOGS206679-PA515 aa
Genomic positionDPSCF300048 + 1112859-1115515
RNAseq coverage25x (Rank: top 77%)
Annotation
HeliconiusHMEL0226580.061.64% 
BombyxBGIBMGA014622-TA1e-13556.08% 
DrosophilaCG17323-PA2e-8735.36% 
EBI UniRef50UniRef50_G9LPR22e-14250.44%UDP-glycosyltransferase UGT42B2 n=6 Tax=Obtectomera RepID=G9LPR2_HELAM
NCBI RefSeqXP_317561.41e-8934.66%AGAP007920-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3796989924e-16455.93%UDP-glycosyltransferase UGT42A1 [Bombyx mori]
NCBI nr blastxgi|3796989921e-16054.38%UDP-glycosyltransferase UGT42A1 [Bombyx mori]
Group
Gene OntologyGO:00081528.4e-139metabolic process
GO:00167588.4e-139transferase activity, transferring hexosyl groups
KEGG pathwaydme:Dmel_CG173232e-85 
 K00699 (UGT)maps-> Drug metabolism - cytochrome P450
    Starch and sucrose metabolism
    Porphyrin and chlorophyll metabolism
    Steroid hormone biosynthesis
    Pentose and glucuronate interconversions
    Ascorbate and aldarate metabolism
    Drug metabolism - other enzymes
    Metabolism of xenobiotics by cytochrome P450
    Retinol metabolism
InterPro domain[20-501] IPR0022138.4e-139UDP-glucuronosyl/UDP-glucosyltransferase
Orthology groupMCL34671 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206679-TA
ATGTCAGTTGTTACAACGTTAATTAAAATTTCAATATTACTATCGTTAAGTAATAATATTGGAAAAGTGCAAACATTAAATATTTTAGGTGTTTTTCCTGCTTCAATTAAAAGTCATTTCTTTGTTTTTGAACCATTTCTAAAAGAGTTAGTAAACCGCGGCCATAACCTGACTGTCATATCTTATTATCCTCAAAAGGAACCGTTGAAGAATTATAACGACATAGATTTGTCACAAAATGCACAGGCCCTAGAATTGAGCCATCCTATTACAGATTCTCTGTTTATGACTGTGATGTCTATGACAGTTGGTCACATAGTTGTTGCACCTCTTTCTTGTAGAATTATGTTAGATGATGAAAATGTGCAGAATTTGTGGGAAAGTGAAACGAGATTTGATTTAGTGATTGTCGAACAATTTAACAGTGATTGTGCTTTAGCATTAGCGCATAGATTAAAGGCACCAGTAATTGGTATTAGCTCTCATATGATTTTACCATGGCACTACGACAGATATGGTATTATATACAATCCCTCATTTTCTTTATTCGATTTCATGGAGGGTGGAACTAAGCCCACATTTTTACAACGGCTAAAAAGAAGTTTTTTATACCACTATGTCAATATTGTTAATAGATACATTACACAAAGGTTGGAGTATAGCATTGTACAGGAATATTTTGGAAATTTACCTCCACTACACGAATTAGGATCGGACATTAAGTTGATATTAGTGTATCAGAACTTTATTTTCACCGGATCCTCAATATTACCCCCAAATATCATAGAAGTCGGGGGCTATCATGTTAAGAAGCCCAAAGAACTTTCTGGCGAATTATTAAAATTTATCGAAGATTCCGAACATGGAGTAATTTATATAAGTTTTGGCACAATACTAAAACCATCATCAATAAAACCAGAAAAATTAAAAAGTATTATAGAGGCACTAGAGGAATTACCACAACGAGTTGTCTGGAAATGGAATAAAAGGACTTTACCAGGAAATCCAAAGAACATCTATTTATCAAAATGGTTGCCACAAAACGACATTTTAGCTCATCCAAAAACTGTTGCATTCTTTTCACACTGTGGACTCTTGGGAACTACGGAAGCTATTTCCCACGGAGTCCCTATAATCGGTTTACCAATTTTTGGTGACCAGCCCGCTAACGCCGCAGCTATTGAAGAAAGTGGACTAGGCGTTAAAATCACATTAAATCTACTGAACAAAGACAATCTTCTCAAGAAATTGAGAACCGTTTTACACTCAGAATTTCGAGAAAACGTGAAAAGAGTTTCTGCTATGTGGCATGATCGTCCGATACGAGCCATGGACAGTGCCATATTTTGGACAGAATATGCAGCTAAATACCAAAACATCTCACTCAGACCTCCAATTGTAGATGTTCCGCTATATCAATATCTTTGTTTGGATATTATCTTCACATTTACCTGTTTTTTAATTTTTTTCATACTTTTCATCAAGTTTTCCGTGCTCTTTTTGATGTTTCCTTTTAGTAACCAAACATACAATAAAAAATTAAAGTGA

Protein sequence:

>DPOGS206679-PA
MSVVTTLIKISILLSLSNNIGKVQTLNILGVFPASIKSHFFVFEPFLKELVNRGHNLTVISYYPQKEPLKNYNDIDLSQNAQALELSHPITDSLFMTVMSMTVGHIVVAPLSCRIMLDDENVQNLWESETRFDLVIVEQFNSDCALALAHRLKAPVIGISSHMILPWHYDRYGIIYNPSFSLFDFMEGGTKPTFLQRLKRSFLYHYVNIVNRYITQRLEYSIVQEYFGNLPPLHELGSDIKLILVYQNFIFTGSSILPPNIIEVGGYHVKKPKELSGELLKFIEDSEHGVIYISFGTILKPSSIKPEKLKSIIEALEELPQRVVWKWNKRTLPGNPKNIYLSKWLPQNDILAHPKTVAFFSHCGLLGTTEAISHGVPIIGLPIFGDQPANAAAIEESGLGVKITLNLLNKDNLLKKLRTVLHSEFRENVKRVSAMWHDRPIRAMDSAIFWTEYAAKYQNISLRPPIVDVPLYQYLCLDIIFTFTCFLIFFILFIKFSVLFLMFPFSNQTYNKKLK-