Monarch geneset OGS2.0

DPOGS205723
TranscriptDPOGS205723-TA1272 bp
ProteinDPOGS205723-PA423 aa
Genomic positionDPSCF300250 + 422947-424458
RNAseq coverage56x (Rank: top 69%)
Annotation
HeliconiusHMEL0226266e-14260.53% 
BombyxBGIBMGA013860-TA4e-12652.82% 
DrosophilaCG6475-PC9e-6130.88% 
EBI UniRef50UniRef50_G6CZJ20.0100.00%Antennal-enriched UDP-glycosyltransferase n=11 Tax=Obtectomera RepID=G6CZJ2_DANPL
NCBI RefSeqNP_001135960.17e-12248.39%uridine diphosphate glucosyltransferase [Bombyx mori]
NCBI nr blastpgi|3638960681e-12551.96%UDP-glycosyltransferase UGT33J1 [Helicoverpa armigera]
NCBI nr blastxgi|3638960523e-12752.39%UDP-glycosyltransferase UGT33B7 [Helicoverpa armigera]
Group
Gene OntologyGO:00081524.4e-115metabolic process
GO:00167584.4e-115transferase activity, transferring hexosyl groups
KEGG pathwaydme:Dmel_CG156611e-56 
 K00699 (UGT)maps-> Drug metabolism - cytochrome P450
    Starch and sucrose metabolism
    Porphyrin and chlorophyll metabolism
    Steroid hormone biosynthesis
    Pentose and glucuronate interconversions
    Ascorbate and aldarate metabolism
    Drug metabolism - other enzymes
    Metabolism of xenobiotics by cytochrome P450
    Retinol metabolism
InterPro domain[2-423] IPR0022134.4e-115UDP-glucuronosyl/UDP-glucosyltransferase
Orthology groupMCL18547 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205723-TA
ATGACCTTTTCTGACGTAAGGGGAACTTCTATGTTCATGAGGGCGATATTTCGAGCACAGTTGGAAACAGAAGAGGTGCAGAAAATAATATCTGAGAGGCCTAAATTTGATTTGATTTTAATAGAATCAATTAATCGTCTGGGTTTATCGTACTCGCATCTATTTAAGGCACCGGTTATATTAGTTAGCTCATTCACAGCTGTTTTCGATAATCATAATGTTATGGGATCTCAAACGCATCCTTTTTTGTATCCAATATCTTTTCGTGATCGGATTTACAATCTCTCGCTTACTGAAAAGTTAAAACAATTATATATCCATTTTTATGTTGAATACGCAGATTATTTAAATCGAAAAGAAGAAAATTCTTTTCTAAAGGAAATTTTCGGGCCTCAATGTCCATCGCTGAATGAAATGAATAAAAATGTTGACATGTTGCTTTTAAATATTCATCCTATGTGGGTAGACAATCAGCCTGTTGCCTCCAATGTAATTTATATGGGTGGTATACATCAGTTACCTGAAAAAAAACTACCACAGGAACTTCAAAAATATTTAGATTCATCTAAAAAAGGAGTCATTTATGTGAGTTTCGGAACCAACGTGCTGTCGCAAGTTTTTCCTGAAGATAAACTTAAAATTATTATCAATGTTGTATCAAGACTTCCTTACGATATACTATGGAAATGGGATAAGGATGAACTACCTATAAAAGCCAGCAATATCAAATTATCAAAATGGTTGCCACAATCTGATTTATTAAGGCACAAGAATGTTAAACTTTTCATAACACAAGCTGGTCTCCAGTCTACCGATGAAGCCATTACAGCAGGAGTTCCTCTGGTTGCGATTCCAATGTTAGGAGACCAATGGTTTAATGCTGAGAAATATGAAAAGTTCGGTATCGGTATTAAATTAGATGTTAAGACCTTGACAACGGATCAACTATCCAAAGCCATTGAGACCGTTATAAGTGATGAAAGCTATCGTCACAATATATCAAAACTTCGAGGTCTAATGCATGATCAGCCCGAACCACCTCTTAATCGGACCATGTGGTGGATTGAATATGTATTAAGACATGGTGGCGCAAAACATTTACGATCGGCTGGAGCTAATATGTCATATTGGGAATATTTTGAAACGGAATTAATATTAGTGATTCTCTTAGGAATATTTATAATTGTAGCAGGGATTTCTGTTGTAGGTTTTATGCTTATACATTTTATTTCACAATTTTCCAAAACTACGAAGAAATTAAAAACAAATTAA

Protein sequence:

>DPOGS205723-PA
MTFSDVRGTSMFMRAIFRAQLETEEVQKIISERPKFDLILIESINRLGLSYSHLFKAPVILVSSFTAVFDNHNVMGSQTHPFLYPISFRDRIYNLSLTEKLKQLYIHFYVEYADYLNRKEENSFLKEIFGPQCPSLNEMNKNVDMLLLNIHPMWVDNQPVASNVIYMGGIHQLPEKKLPQELQKYLDSSKKGVIYVSFGTNVLSQVFPEDKLKIIINVVSRLPYDILWKWDKDELPIKASNIKLSKWLPQSDLLRHKNVKLFITQAGLQSTDEAITAGVPLVAIPMLGDQWFNAEKYEKFGIGIKLDVKTLTTDQLSKAIETVISDESYRHNISKLRGLMHDQPEPPLNRTMWWIEYVLRHGGAKHLRSAGANMSYWEYFETELILVILLGIFIIVAGISVVGFMLIHFISQFSKTTKKLKTN-