Monarch geneset OGS2.0

DPOGS206437
Transcript        DPOGS206437-TA  1569 bp
Protein           DPOGS206437-PA  522 aa
Genomic position  DPSCF300070 - 764026-767848
RNAseq coverage   141x (Rank: top 55%)
Annotation
Heliconius      HMEL022555        0.0    65.17%
Bombyx          BGIBMGA005443-TA  0.0    62.50%
Drosophila      Ugt86Da-PA        2e-84  34.84%
EBI UniRef50    UniRef50_G6CJU0   0.0    64.47%  UGT35E1 n=4 Tax=Obtectomera RepID=G6CJU0_DANPL
NCBI RefSeq     XP_001663166.1    2e-97  38.05%  glucosyl/glucuronosyl transferases [Aedes aegypti]
NCBI nr blastp  gi|363896076      0.0    63.99%  UDP-glycosyltransferase UGT39B2 [Helicoverpa armigera]
NCBI nr blastx  gi|363896076      0.0    61.69%  UDP-glycosyltransferase UGT39B2 [Helicoverpa armigera]
Group
Gene Ontology    GO:0008152  6.4e-156  metabolic process
                 GO:0016758  6.4e-156  transferase activity, transferring hexosyl groups
KEGG pathway     dpo:Dpse_GA10135  1e-82
                 K00699 (UGT) maps->
                     Drug metabolism - cytochrome P450
                     Starch and sucrose metabolism
                     Porphyrin and chlorophyll metabolism
                     Steroid hormone biosynthesis
                     Pentose and glucuronate interconversions
                     Ascorbate and aldarate metabolism
                     Drug metabolism - other enzymes
                     Metabolism of xenobiotics by cytochrome P450
                     Retinol metabolism
InterPro domain  [14-521] IPR002213  6.4e-156  UDP-glucuronosyl/UDP-glucosyltransferase
Orthology group  MCL10161  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206437-TA
ATGGCGAATCTACATCCAGTTTTATTTTTGATTTTTCTAACAAAACTCTCGGACTCGGCGAACATACTTTACGTGATGCCATTCTCTGCGACATCTCATTACATAATGTTGAAACCGATCGGAACAGAACTAGCTCGGAGAGGACACAACGTCACTGTTATTACTTCAATAAGAGACAACAACCCTCCTCCCAACTATCATCAGATTTTAGTTAATGATACAAAGATATGGGACCTTCTTGGTATCGAGCGACCTGTTGTTTTTAGTATGGTGGATTTAAGTACAGAAGAATTGTACAATAAAATTGTATGGCCCGGGTTAATAGCTTTCACAGAACTTACATTCAAATCCCGTGAATTTATGACATTTTTGAAAAAAGATAATGCTTTTGATCTCGTCATAAGTGAACAGTTCTATCATGAAGCGTTGTATGCTCTGGCTTATAAATATAATGCACCTCTAGCGCTAGTCACGACTTTTGGAAATTGCATGAGACACAATTATATTACAAGAAATCCTTTACAAATGGCTACAGTCACGTCAGAACTACTAGTTGTAGAAGATCCTACAAGCTTTTGGGGAAGACTGCGTAATTTGTTATTCAATGTTTACGATTATACTTTTTGGAGGTATTGGTATTTGGAGGAGCAAGAGAAATTAGTACGAAAGTATTTACCAGAATTGACCGGTAAAGTACCTTCGTTGTACGAAATGCAAAAAGAAACTGCTTTGATGTTGATTAATAGCCACTTTAGCTATGACACACCTGCAGCAATTTTACCAAATATAGTTGAAATCGGTGGACTTCATTTTACAAAAAGTAACTTAAGCCTTCCCGAGGATCTTCAAAAGGTCTTGGACGAAGCACAGGAAGGTGTTGTTTATGTTAACTTTGGTTCAAATGTAAGAAGTATTGAATTACCGGTAGAAAAGAAAAATGCGTTTTTAAATGTATTTCGCCAGTTGAAACAAACTGTGTTGTGGAAATGGGAAGATGATGTGTTAGATGACAAACCGTCCAATTTATTTACCCGTAAATGGTTTCCTCAAAAGGATATTCTCCAACATCCTAATATCAAAGTGTTTGTCTCCCATGGTGGTTTGATTGGAATGCAAGAAGCTATAATCAATGGAGTTCCTGTCGTGGGTGTTCCTGTATTTGGTGATCAATTTAATAATGTTTTGCTAGCTCAAGAAGCAGGTTTCGGTAAACTCTTAAGGTACCATGATATAAACGAAAAAACATTGAGTGCAGTTCTCAATGAAGTGCTCTATAATGCTTCCTACATGGAAACTGCCAAAGAAGTCTCCCGTAGATTTCTGGACAGACCTTTATCTCCTATGGATACAGCTATTTATTGGCTTGAATATGTAATTAGAAATAAGGGAGCGGAATATTTAAAGAATCCTGCTCGTAACCTGAGTTGGATTGCTTATAATATGCTTGATGTATATTCATTTATAGCTATAATTTTAATAGTAATAATGTGTGTCTTTATAAAATCGTGGTTTTGTATAAAGGCGTCAATAACTTTTAAGCAAATATCAAAATCTAAAAAGCAGTCATAA

Protein sequence:

>DPOGS206437-PA
MANLHPVLFLIFLTKLSDSANILYVMPFSATSHYIMLKPIGTELARRGHNVTVITSIRDNNPPPNYHQILVNDTKIWDLLGIERPVVFSMVDLSTEELYNKIVWPGLIAFTELTFKSREFMTFLKKDNAFDLVISEQFYHEALYALAYKYNAPLALVTTFGNCMRHNYITRNPLQMATVTSELLVVEDPTSFWGRLRNLLFNVYDYTFWRYWYLEEQEKLVRKYLPELTGKVPSLYEMQKETALMLINSHFSYDTPAAILPNIVEIGGLHFTKSNLSLPEDLQKVLDEAQEGVVYVNFGSNVRSIELPVEKKNAFLNVFRQLKQTVLWKWEDDVLDDKPSNLFTRKWFPQKDILQHPNIKVFVSHGGLIGMQEAIINGVPVVGVPVFGDQFNNVLLAQEAGFGKLLRYHDINEKTLSAVLNEVLYNASYMETAKEVSRRFLDRPLSPMDTAIYWLEYVIRNKGAEYLKNPARNLSWIAYNMLDVYSFIAIILIVIMCVFIKSWFCIKASITFKQISKSKKQS-
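
Consistency check (illustrative):

The two sequences above can be checked against each other. The 1569 bp transcript starts with ATG and ends with TAA, so it reads as 523 codons: 522 residues plus one stop codon, which agrees with the 522 aa protein. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1). The function name `translate` is an illustrative choice and is not part of any database tooling. The sketch writes the stop codon as `*`, whereas the record above writes it as `-`. Only the first and last few codons of DPOGS206437-TA are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First seven codons and last four codons of DPOGS206437-TA.
print(translate("ATGGCGAATCTACATCCAGTT"))  # MANLHPV
print(translate("AAGCAGTCATAA"))           # KQS*

# Length check: 522 residues plus one stop codon, three bases each.
print((522 + 1) * 3)                       # 1569
```

Running `translate` on the full nucleotide sequence and replacing the trailing `*` with `-` should give a string that can be compared directly with the protein sequence listed above.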