Monarch geneset OGS2.0

DPOGS204213
Transcript: DPOGS204213-TA, 2520 bp
Protein: DPOGS204213-PA, 839 aa
Genomic position: DPSCF300493 + 11806-17247
RNAseq coverage: 24x (Rank: top 77%)
Annotation
Heliconius: HMEL022633; 4e-125; 47.99%
Bombyx: BGIBMGA013860-TA; 0.0; 45.39%
Drosophila: Ugt35a-PA; 9e-56; 32.01%
EBI UniRef50: UniRef50_G6CZJ2; 1e-175; 41.85%; Antennal-enriched UDP-glycosyltransferase n=11 Tax=Obtectomera RepID=G6CZJ2_DANPL
NCBI RefSeq: NP_001040425.1; 4e-125; 47.10%; antennal-enriched UDP-glycosyltransferase [Bombyx mori]
NCBI nr blastp: gi|363896064; 6e-127; 48.31%; UDP-glycosyltransferase UGT33F2 [Helicoverpa armigera]
NCBI nr blastx: gi|363896064; 9e-128; 48.31%; UDP-glycosyltransferase UGT33F2 [Helicoverpa armigera]
Group
Gene Ontology: GO:0008152; 1.2e-109; metabolic process
    GO:0016758; 1.2e-109; transferase activity, transferring hexosyl groups
KEGG pathway: ame:408788; 6e-54
    K00699 (UGT) maps to:
    Drug metabolism - cytochrome P450
    Starch and sucrose metabolism
    Porphyrin and chlorophyll metabolism
    Steroid hormone biosynthesis
    Pentose and glucuronate interconversions
    Ascorbate and aldarate metabolism
    Drug metabolism - other enzymes
    Metabolism of xenobiotics by cytochrome P450
    Retinol metabolism
InterPro domain: [371-838] IPR002213; 1.2e-109; UDP-glucuronosyl/UDP-glucosyltransferase
Orthology group: MCL10114; Insect specific

Nucleotide sequence:

>DPOGS204213-TA
ATGACGATAAATGTTTTCAGCCGCGCTATGGAAAAATATCTAAGTTTAGGCCTACTGGACGAATTTTTGAATAACAAGACACATCAATTTGATTTGGTAGTAGTGGAGGCTAGCGTAAAGCAAGCTTTAGTATTTTCCCATGTATTTCAAGCACCAGCTATTCAATTGAGTTCATTTGGTGGATACCTCGGAACATTTGAAGCTGTGGGAGCCTCAACTCACCCGTTATTATACCCATCAATAGTAAGACAAAGATTTCAAAATCTAACAATGTGGAATAAGTTACTGGAATACTTTAATCATTATTACATGATGTGGATATATTATCGTGCCGAAAATTATAACAATATGGTCCTTAAAAAATATTTTGGCCCTAACGTTCCAGAATTATCAAAATTGAATAAGAACATTGCCCTACTTTTCTTAAACATCCACTCTGTTTGGGATAGTAATCGTCCTGTACCACCTAACGTTATATATTTAGGTGAATTGAATAAAAACAAACCGAAACGCTTACAACAGGAAATTCAATCGTATTTAGATTCGTCTAAGCACGGAGTTATATATGTTAGTTTCGGAACAAATATTAACAAAGGCATATTGACACATGAAAGACTACAGATTATTATAAAAGTATTATCTGAACTACACTATGATGTATTAATGAAGAATGACGGCGTGGAAGCTATGGACCCTTCAATTAAAAACATTAGACTGTTCGATTGGGTTCCTCAGACCGGTGTTTTAAATCATCCAAAAGTAAAACTGTTCATCACACAAGGAGGACTGCAATCATCACATGAAGCTATAGAAGCCGGTGTACCATTGCTTGGTATTCCATTGATGTGGGATCAAATGTTGAATGTCGATAAATACGTCAAATTTAAAATAGGACTTCAATTAGATATATACTCTTTAAATGAAGCTACGTTTAAGAAAAGCGTAGAAACAGTTCTAGGCAATGGAAGTTTTAGAACAAACATAGAAAAACTTCGCACTATAATGAATGATCAACCTCAAACACCAGTAGAAAAAGCTGTATGGTGGACAGAATACGTTTTAAAATACGGCGGCGAACATCTTGGATCAGAATCAGCGTACGTGGAATGCCATCAAGCTGCGTTCCGTCCATATACGCAAGAATTAGCAAAACGTGGGCACAAAGTTACGGTCATAACGACACACCCAGCTTTTTCTAAGAAAGAATTACCTGATAATCTTACAGAAATAAATATAAGTGGACCAGGTGAAGAAGTGAGAACAGATATTTTTTTGAAATTAGATAAAGCAAATAGCTTATTAGAACAACAAGCAATAGGTATTAAATTCTTAAACGTATTAATGAAAAAATGTTTAGATTCTGGTTTACTAGATCAGTATTTAAACAATAAGGTGCATAAATTCGATTTAGTTGTAGTGGAAGCTACAGCGACACAAGCATTAGTATTTTCGCATGTTTTTAATGCACCTGCTGTTCAAATCAGTTCATTTGGTGGAAACTATGGGACGTTTGAAGCAGTTGGAGCTTCATCTCACCCTTTAATATACCCATCAGCAGCAAGGCAAAGATTTCAAGGTCTTACAATATGGAATAGATTGTTGGAGTATTTTTCTCATTACTACCTTATGTGGACGTATTACGAATCTGAAAAGTACGACGATATTTTGCTTAAGGAATACTTTGGTCCTGATACCCCAGTAATGGCCAAACTAAAACATAATATTGCTCTAGTATTACTAAACATACATCATGTTTGGGATGCTAATCGGCCTGTTCCACCCAATGTGGTGTATCTCGGCGAATTGAATCAAAATAAAAGAAAAGAGTTACCAAAGGAATTGAAAGAGTATCTAGATTCATCGAAAAATGGAGTAATATATGTGAGTTTTGGGACAAATGTTAATAGAGGTATACTCACACCTGAAAAATTAAAAATTATGATAAAAGTATTTCATAGTCTACCGTATGACATTTTAATGAAGAGTGACAACACAACTGA
TATGAATTCATCAAAAAATATCAGAATGTTTAATTGGATTCCTCAAACCAATGTGCTACATCATCCTAAACTTAAATTATTCATCACTCAAGGCGGCCTGCACTCATCGCAAGAAGCTATAGATGCTGGAGTGCCTCTGATCGGTATTCCAATGATGTGGGACCAGTGGCTTAATGTAGATAGATATGTTAAATTTAAAATAGGACTTCAATTAGATATAAATACACTCAATGAAGAAACGATGAGAAAAGCTATAGAAACAATTGTAAACAACGAAAGCTATAAAAATAATATATTGAAACTCCGTAATTTCTTATATGATCAACCTCAAAAACCGTTGGAAAAGGCTATTTGGTGGACAGAATATGTTTTAAAATATGGCGGAGAACATTTGCGTACCCCAGCGGCAACAATAGAATGGTCAGAATATTACGAAATTGAGATAATAATTTCTATTGCATTGACGATCTTGTGTGTTGTGTTTGCTAGTGTTTTTGTAGTTAAGAGATTCATAAATTGA

Protein sequence:

>DPOGS204213-PA
MTINVFSRAMEKYLSLGLLDEFLNNKTHQFDLVVVEASVKQALVFSHVFQAPAIQLSSFGGYLGTFEAVGASTHPLLYPSIVRQRFQNLTMWNKLLEYFNHYYMMWIYYRAENYNNMVLKKYFGPNVPELSKLNKNIALLFLNIHSVWDSNRPVPPNVIYLGELNKNKPKRLQQEIQSYLDSSKHGVIYVSFGTNINKGILTHERLQIIIKVLSELHYDVLMKNDGVEAMDPSIKNIRLFDWVPQTGVLNHPKVKLFITQGGLQSSHEAIEAGVPLLGIPLMWDQMLNVDKYVKFKIGLQLDIYSLNEATFKKSVETVLGNGSFRTNIEKLRTIMNDQPQTPVEKAVWWTEYVLKYGGEHLGSESAYVECHQAAFRPYTQELAKRGHKVTVITTHPAFSKKELPDNLTEINISGPGEEVRTDIFLKLDKANSLLEQQAIGIKFLNVLMKKCLDSGLLDQYLNNKVHKFDLVVVEATATQALVFSHVFNAPAVQISSFGGNYGTFEAVGASSHPLIYPSAARQRFQGLTIWNRLLEYFSHYYLMWTYYESEKYDDILLKEYFGPDTPVMAKLKHNIALVLLNIHHVWDANRPVPPNVVYLGELNQNKRKELPKELKEYLDSSKNGVIYVSFGTNVNRGILTPEKLKIMIKVFHSLPYDILMKSDNTTDMNSSKNIRMFNWIPQTNVLHHPKLKLFITQGGLHSSQEAIDAGVPLIGIPMMWDQWLNVDRYVKFKIGLQLDINTLNEETMRKAIETIVNNESYKNNILKLRNFLYDQPQKPLEKAIWWTEYVLKYGGEHLRTPAATIEWSEYYEIEIIISIALTILCVVFASVFVVKRFIN-
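
The lengths and coordinates in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the 2520 bp transcript is coding sequence only (no UTRs) and ends in a stop codon, which fits the trailing "-" in the protein sequence, and that the genomic and InterPro coordinates are 1-based and inclusive. The function names are hypothetical helpers introduced for illustration.

```python
def protein_length_from_cds(cds_bp: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by a coding sequence of cds_bp nucleotides.

    Assumption: the sequence is pure CDS; if includes_stop is True, the
    final codon is a stop codon and does not contribute a residue.
    """
    if cds_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if includes_stop else codons


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 2520
protein_aa = 839
genomic_start, genomic_end = 11806, 17247
domain_start, domain_end = 371, 838

# 2520 / 3 = 840 codons; minus the stop codon gives 839 residues.
print(protein_length_from_cds(transcript_bp) == protein_aa)  # True

# Genomic span on the scaffold versus transcript length.
genomic_bp = span_length(genomic_start, genomic_end)
print(genomic_bp)                  # 5442
print(genomic_bp - transcript_bp)  # 2922

# InterPro domain length, and a check that it lies within the protein.
print(span_length(domain_start, domain_end))  # 468
print(1 <= domain_start <= domain_end <= protein_aa)  # True
```

Under these assumptions, the 2922 bp difference between the genomic span and the transcript would correspond to intronic sequence, and the IPR002213 domain covers 468 of the 839 residues.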