Monarch geneset OGS2.0

DPOGS205683
TranscriptDPOGS205683-TA1272 bp
ProteinDPOGS205683-PA423 aa
Genomic positionDPSCF300250 - 417437-418925
RNAseq coverage8x (Rank: top 85%)
Annotation
HeliconiusHMEL0226267e-14258.37% 
BombyxBGIBMGA013860-TA2e-12852.99% 
DrosophilaUgt86Da-PA1e-5830.66% 
EBI UniRef50UniRef50_G6CZJ25e-16264.78%Antennal-enriched UDP-glycosyltransferase n=11 Tax=Obtectomera RepID=G6CZJ2_DANPL
NCBI RefSeqNP_001040425.15e-12450.24%antennal-enriched UDP-glycosyltransferase [Bombyx mori]
NCBI nr blastpgi|3638960682e-13053.12%UDP-glycosyltransferase UGT33J1 [Helicoverpa armigera]
NCBI nr blastxgi|3638960681e-12753.12%UDP-glycosyltransferase UGT33J1 [Helicoverpa armigera]
Group
Gene OntologyGO:00081527e-109metabolic process
GO:00167587e-109transferase activity, transferring hexosyl groups
KEGG pathwaydme:Dmel_CG66442e-55 
 K00699 (UGT)maps-> Drug metabolism - cytochrome P450
    Starch and sucrose metabolism
    Porphyrin and chlorophyll metabolism
    Steroid hormone biosynthesis
    Pentose and glucuronate interconversions
    Ascorbate and aldarate metabolism
    Drug metabolism - other enzymes
    Metabolism of xenobiotics by cytochrome P450
    Retinol metabolism
InterPro domain[2-421] IPR0022137e-109UDP-glucuronosyl/UDP-glucosyltransferase
Orthology groupMCL18547 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205683-TA
ATGACATTTGAAGATATAAGAGAAGGGTGTAAATTTTTTGTCTCTTTATTCCAACTACAATTTGAAACAGAAGCAGTCCAAGAAGTTATTTCAAAGAATGAAAAATTTGATTTGATACTTATTGAATCTATCGTTCGACCAGCTTTGGCATACTCTTATGTATTTAAAGCGCCAGTTATTTTAGTCAGTTCGTTTACTGCTGTTTTTAGTAACAATAAAATTATAGGATCACCCACACACCCTTTGCTGTATCCAATTTGTTTTCGAAATCGTTTGTACAATTTATCATTTGCTGAAAAAATACATCAGTTGTACTTACATTATATGTATGAATACGCGGATTATTTGAATGAAAAAGAAGAATCGAATATATTAAGAAAAATATTTGGATCTGAGTTTCCATCTTTACACGAATTAGGTAATAATGTTGATATGTTGCTATTAAACATACATTCTCTATGGGCGGACAATCAGCCAGTTGCTCCCAACGTTATTTATATGGGTGGTATTCATCAATTGCCACGGAAGGAACTACCAAAGGATCTCAAATCTTACTTAGATTCTTCTAAAAGTGGTGTCATATACGTCAGCTTTGGTACCAATGTATTATCTAATATGATTCCTGAAAAGCAAATAGTTGCAATTATAAATGTGTTATCTAAGTTACCCTACGATGTTCTTTGGAAATGGGATGGTGATTCATTACCTTTGACATCTACAAACATAAGAACTTCGAAATGGTTTCCTCAATCGGACTTGTTAAGACATCCGGCTATAAAACTTTTTATAACACAAGCCGGTCTTCAATCAACTGACGAAGCAATCACTGCAGAGGTTCCTCTTATTGCATTCCCAATGCTTGCTGATCAGTGGTTTAATGCAGAAAAATACGAAAAATTTAATATTGGTATTAAGTTACACATTTTATCGTTCACGGAAAAACAACTAGAGACTGCTATTGACGACGTCATTAATAATAAAAGTTATCGACGAAATATTATCAAACTTCGTCATTTAATGCGTGATCAACCCGAAACACCTTTGAACCGGACGATATGGTGGATAGAGCACGTGTTGCGGCACGGAGGGGCGAAGCATCTACGATCACCCGCAGCTAATATGTCGTACATTGATTATTTCGAAGTCAAATTAATGTTATTTATTCTTTTATTAATCACAGTATTCATATTAGCATTCATTTTAGTGCTTAAATGTTTTACAAAATTTATAATAAGATTTTTTTCGAATAAAAATAAAATCAAATATAACTAA

Protein sequence:

>DPOGS205683-PA
MTFEDIREGCKFFVSLFQLQFETEAVQEVISKNEKFDLILIESIVRPALAYSYVFKAPVILVSSFTAVFSNNKIIGSPTHPLLYPICFRNRLYNLSFAEKIHQLYLHYMYEYADYLNEKEESNILRKIFGSEFPSLHELGNNVDMLLLNIHSLWADNQPVAPNVIYMGGIHQLPRKELPKDLKSYLDSSKSGVIYVSFGTNVLSNMIPEKQIVAIINVLSKLPYDVLWKWDGDSLPLTSTNIRTSKWFPQSDLLRHPAIKLFITQAGLQSTDEAITAEVPLIAFPMLADQWFNAEKYEKFNIGIKLHILSFTEKQLETAIDDVINNKSYRRNIIKLRHLMRDQPETPLNRTIWWIEHVLRHGGAKHLRSPAANMSYIDYFEVKLMLFILLLITVFILAFILVLKCFTKFIIRFFSNKNKIKYN-