Monarch geneset OGS2.0

DPOGS206438
TranscriptDPOGS206438-TA1470 bp
ProteinDPOGS206438-PA489 aa
Genomic positionDPSCF300070 - 741386-744043
RNAseq coverage872x (Rank: top 15%)
Annotation
HeliconiusHMEL0225550.065.23% 
BombyxBGIBMGA005443-TA0.064.33% 
DrosophilaUgt86Da-PA2e-8434.80% 
EBI UniRef50UniRef50_G6CJU00.0100.00%UGT35E1 n=4 Tax=Obtectomera RepID=G6CJU0_DANPL
NCBI RefSeqXP_308743.34e-9636.85%AGAP007029-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3796990140.064.33%UDP-glycosyltransferase UGT39B1 precursor [Bombyx mori]
NCBI nr blastxgi|3796990140.064.33%UDP-glycosyltransferase UGT39B1 precursor [Bombyx mori]
Group
Gene OntologyGO:00081522.1e-142metabolic process
GO:00167582.1e-142transferase activity, transferring hexosyl groups
KEGG pathwaydpo:Dpse_GA101354e-86 
 K00699 (UGT)maps-> Drug metabolism - cytochrome P450
    Starch and sucrose metabolism
    Porphyrin and chlorophyll metabolism
    Steroid hormone biosynthesis
    Pentose and glucuronate interconversions
    Ascorbate and aldarate metabolism
    Drug metabolism - other enzymes
    Metabolism of xenobiotics by cytochrome P450
    Retinol metabolism
InterPro domain[1-488] IPR0022132.1e-142UDP-glucuronosyl/UDP-glucosyltransferase
Orthology groupMCL10161 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206438-TA
ATGTTAAAACCAATTGGCTTAGAATTAGCACGAAAAGGTCACAATGTTACTGTGATCACACCTATCAGGGATAATAATCCACCTTCGAATTATCATCAAGTTTTAGTAGATGATAAAAAAATTTGGGATATTCTAGGAATGGAGCGTCCAAATATTTTTACTATGGTCGATATGTCTGCAGAGGAGTTCCACGATAAGATTTTATGGTCAGGTGGCGTAGCGTTTACTGAGTTAGCATTGAATTCTACCGAAGTAAGGAAGTTCTTGAAAGCAGACAATACTTTTGATCTAGTTCTTTGTGAACAGTTTTTCCAGGAGGCTATGTATGCTCTCGCATACAAATATAACGCGCCTTTAGCATTAGTGACTACTTTTGGTTCTTGTATGAGGCATAATATTGCAACGGGTAATCCACTTCAAATACCCAACATTCTAGCCGAATTCCTGGATATCAAAAATCCAACAAGTTTTTTGGGAAGGATGAGAAACATTTATTTCACTTTGTACGAATTTATTTGGTGGCGTTATTGGTATCTCGAAAAACATGAAAATTTAGTGAAAAAATATTTACCAGAACTGTCTGGAAAAGTACCAAAATTATATGAAATACAAAAAAACGTCTCTTTAATGTTAATAAATAGCCACTATAGTGCGGAAATTCCTGCTGCTTTTCTACCAAATATTGTTGAAATCGGAGGTGTTCATTTGACTCGAAGTAATACGTCTCTTCCTAAAGATCTCCAAAAGATTCTGGATGATTCAAAGTATGGAGTTGTTTACATGAGTTTGGGTTCTAACGTAAAAAGTGCTGAGTTGCCAGACTCAAAAAGGGAAGCTTTCCTAAAAGTATTTTCTAGTCTTAATCAGACCGTTCTATGGAAGTGGGAAGATGATAATTTGGAAAATAAACCGAAAAATTTAATTACACGCCAATGGCTGCCGCAAAAAGAAATTCTTGCTCATCCGAACGTCAAGGTATTTATATCCCATGGCGGTTTAATAGGAACACAAGAAGCAATATTTAATGGAGTGCCACTTGTCGGCGTTCCCATATACGCTGATCAATATAATAATTTGTTATATGCAGAAAAAGCTGGTTTCGGTAAAATTTTGCAATACCACGAAATTAATGAAAATCATTTGTTTCAAACTCTCAGCGAGGTTCTCACTAATGATTCTTATATGCAAAAAGCTAAGGAGGTTTCTAGAAGATTTAAAGATCGCCCTATGACACCTCTTGATACAGCTGTGTTCTGGTTGGAATATGTCATAAGAAACAATGGCTCCGAGTTCATGAAAAACCCGACACGGAATTTAAATTGGTTTTCATTTTACATGCTCGATGTTTATGCTTTATTCCTTTTGATTGTCTTTTTGTTTATTATGATTTTTTATAAAATTGTGATGTTCATAGCCCAGATGTACGTTGATTATAAAGTTAAAGTCACTATTGTAAAAAAAGAACAATGA

Protein sequence:

>DPOGS206438-PA
MLKPIGLELARKGHNVTVITPIRDNNPPSNYHQVLVDDKKIWDILGMERPNIFTMVDMSAEEFHDKILWSGGVAFTELALNSTEVRKFLKADNTFDLVLCEQFFQEAMYALAYKYNAPLALVTTFGSCMRHNIATGNPLQIPNILAEFLDIKNPTSFLGRMRNIYFTLYEFIWWRYWYLEKHENLVKKYLPELSGKVPKLYEIQKNVSLMLINSHYSAEIPAAFLPNIVEIGGVHLTRSNTSLPKDLQKILDDSKYGVVYMSLGSNVKSAELPDSKREAFLKVFSSLNQTVLWKWEDDNLENKPKNLITRQWLPQKEILAHPNVKVFISHGGLIGTQEAIFNGVPLVGVPIYADQYNNLLYAEKAGFGKILQYHEINENHLFQTLSEVLTNDSYMQKAKEVSRRFKDRPMTPLDTAVFWLEYVIRNNGSEFMKNPTRNLNWFSFYMLDVYALFLLIVFLFIMIFYKIVMFIAQMYVDYKVKVTIVKKEQ-