Monarch geneset OGS2.0

DPOGS214686
TranscriptDPOGS214686-TA1827 bp
ProteinDPOGS214686-PA608 aa
Genomic positionDPSCF300503 + 54579-64802
RNAseq coverage128x (Rank: top 57%)
Annotation
HeliconiusHMEL0226264e-11257.98% 
BombyxBGIBMGA013860-TA3e-10052.65% 
DrosophilaUgt35b-PA1e-4744.28% 
EBI UniRef50UniRef50_G6CZJ23e-10955.56%Antennal-enriched UDP-glycosyltransferase n=11 Tax=Obtectomera RepID=G6CZJ2_DANPL
NCBI RefSeqNP_001040425.12e-9852.65%antennal-enriched UDP-glycosyltransferase [Bombyx mori]
NCBI nr blastpgi|3638961284e-9852.65%UDP-glycosyltransferase UGT33D4 [Bombyx mori]
NCBI nr blastxgi|3638961284e-9750.90%UDP-glycosyltransferase UGT33D4 [Bombyx mori]
Group
Gene OntologyGO:00081521.5e-93metabolic process
GO:00167581.5e-93transferase activity, transferring hexosyl groups
KEGG pathwaydme:Dmel_CG66331e-43 
 K00699 (UGT)maps-> Drug metabolism - cytochrome P450
    Starch and sucrose metabolism
    Porphyrin and chlorophyll metabolism
    Steroid hormone biosynthesis
    Pentose and glucuronate interconversions
    Ascorbate and aldarate metabolism
    Drug metabolism - other enzymes
    Metabolism of xenobiotics by cytochrome P450
    Retinol metabolism
InterPro domain[1-608] IPR0022131.5e-93UDP-glucuronosyl/UDP-glucosyltransferase
Orthology groupMCL10114 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214686-TA
ATGCTTCGTAATATGAATCCCCTCTTTATGAAGGTCATTGAGTATCATTTTCAATCCAAAGAAGTCCAGGAAATCGTAGCTAATAATAAATATGATCTGATATTGTTAGAATCTATTGTTCTCTCGGGATTGATATACTCACACATATTCAAGGCTCCAGTGATATTAGTGAGTTCATTCGGAGGTTATATAAATGAACATAAAATAATGGGGACACCGACTGCACCTATTTTGTATCCATTGCCTCTGCGAAATAAAATTTACAATCTTAATTTTTTTGAAAAGATCCGAGAAATATACAGACATTATTCAAACGAATATGCAGAATATTTGAATGACCTTGACATTGATAAATTTTTGAAAGATAGATTTGGTTCCCAAACTCCAACTATAAATGAATTGAGTGATAATATTCATATGCTCTTTTTAAATGTTCACACCATTTGGGCAGATCATAAGCCCAGCACTCCGAATATTGTCTATATGGGTGGTATACACCAAGTACCACAGAAAGATTTACCGAAGGCCCTTGAGACATTCCTCAATTCTTCTAAACACGGAGTCATATATGTAAGCTTTGGGACAAATGCTTTGTCATATATGATTCCGTCAGATAAAATAGAAAATGTGGTAAAAGTTCTATCAAAACTTCCCTACGATGTGTTATGGAAATGGGATGGAGAGGAATTGCCGGGAAAGACAGACAATATTAGGTCATCCAAATGGTTCCCACAATCTGATCTGTTGAGACATCCAAATATAAAACTTTTTATAACACAAGCTGGACTGCAATCTACTGATGAAGCTATAACTGGTGGGGTACCGTTAGTTGCCATACCAATGTTTGGCGATCAATGGTACAATGCTGAAAAATTTGAAAAATTCGGCATTGGTATTCAACTAGACATTACAAGCTTTACAGAAGAAGAACTGCATAATGCTGTAATTAACGTCATAAATAATGAAAGCCATCAAATCGTTTTCCGTAAGATTACTCAAGAACTGCATAAACGAGGACATGAACTGACAGTGTTAACACCAGACCCAGCTTATCCAAAAGGCACTGCACCCGCAAACACTCAAATTATAAAATTGAAGGATCTTGAGATATATCTAAATTCTTCCCAACATGGAGTGATATATGTGAGCTTTGGGACAAATGTTTTATCAAACATGATTTCTACAGATAAAATAGACAATATTGTAAAAGTTCTATCAAAACTTCCCTATGATGTGTTATGGAAATGGGATGGAAAGGAATTGCCGGGAAAGTCAGAAAATATTAGGATATCCAAATGGTTTCCCCAGTCTGATCTTCTGAGACATCCTAAAATAAAACTCTTTATAACTCAAGCTGGCCTGCAATCTACTGATGAGGCCATTACTGCTGGAGTGCCGTTAATTGCCATACCATCTTTTGCTGATCAATGGTATAATGCGGAAAAATATGAAAAATTCGGTATTGGCATTCCATTGGATATAAAAACCTTTACAGAGGAAGAACTGCATCATGCAGTAATTACCGTTATAAATAACGAAAGCTATCGGCGCAACGTTATAAAACTTCGTGAAACAATTCTTGATCAACCAATGAGTTCTATAGAACGTGCAATGTGGTGGACAGAATATGTATTAAGACACAGAGAAAAGAATCATTTTCGTACTCTAGCTAGTAACTTGTCATACATGGATTACTTCGATGTAAAGTTTTGGATGACTATTTTTGCAATCATTGGTATCTTTTTAACTTTATTTGTGGTAACGATTGCATATGTTATTAAGTTACTGAATAAAATGTGGATTTATAATAAGGTTAAAACACACTAA

Protein sequence:

>DPOGS214686-PA
MLRNMNPLFMKVIEYHFQSKEVQEIVANNKYDLILLESIVLSGLIYSHIFKAPVILVSSFGGYINEHKIMGTPTAPILYPLPLRNKIYNLNFFEKIREIYRHYSNEYAEYLNDLDIDKFLKDRFGSQTPTINELSDNIHMLFLNVHTIWADHKPSTPNIVYMGGIHQVPQKDLPKALETFLNSSKHGVIYVSFGTNALSYMIPSDKIENVVKVLSKLPYDVLWKWDGEELPGKTDNIRSSKWFPQSDLLRHPNIKLFITQAGLQSTDEAITGGVPLVAIPMFGDQWYNAEKFEKFGIGIQLDITSFTEEELHNAVINVINNESHQIVFRKITQELHKRGHELTVLTPDPAYPKGTAPANTQIIKLKDLEIYLNSSQHGVIYVSFGTNVLSNMISTDKIDNIVKVLSKLPYDVLWKWDGKELPGKSENIRISKWFPQSDLLRHPKIKLFITQAGLQSTDEAITAGVPLIAIPSFADQWYNAEKYEKFGIGIPLDIKTFTEEELHHAVITVINNESYRRNVIKLRETILDQPMSSIERAMWWTEYVLRHREKNHFRTLASNLSYMDYFDVKFWMTIFAIIGIFLTLFVVTIAYVIKLLNKMWIYNKVKTH-