Monarch geneset OGS2.0

DPOGS211520
Transcript: DPOGS211520-TA (6735 bp)
Protein: DPOGS211520-PA (2244 aa)
Genomic position: DPSCF300354 + 66101-81749
RNAseq coverage: 9x (Rank: top 85%)
Annotation
Heliconius: HMEL008125  7e-165  45.98%
Bombyx: BGIBMGA003817-TA  5e-114  47.92%
Drosophila: CG15661-PB  4e-58  29.96%
EBI UniRef50: UniRef50_G9LPR1  1e-131  49.89%  UDP-glycosyltransferase UGT41D1 n=7 Tax=Obtectomera RepID=G9LPR1_HELAM
NCBI RefSeq: NP_001182386.1  5e-79  36.52%  UDP-glucosyltransferase [Bombyx mori]
NCBI nr blastp: gi|379698990  2e-132  48.23%  UDP-glycosyltransferase UGT41A3 precursor [Bombyx mori]
NCBI nr blastx: gi|363896100  3e-130  48.21%  UDP-glycosyltransferase UGT41D1 [Helicoverpa armigera]
Group
Gene Ontology:
    GO:0008152  7.9e-114  metabolic process
    GO:0016758  7.9e-114  transferase activity, transferring hexosyl groups
    GO:0005515  2.1e-08  protein binding
KEGG pathway: dpo:Dpse_GA13878  7e-57
    K00699 (UGT) maps to:
        Drug metabolism - cytochrome P450
        Starch and sucrose metabolism
        Porphyrin and chlorophyll metabolism
        Steroid hormone biosynthesis
        Pentose and glucuronate interconversions
        Ascorbate and aldarate metabolism
        Drug metabolism - other enzymes
        Metabolism of xenobiotics by cytochrome P450
        Retinol metabolism
InterPro domain:
    [1751-2244]  IPR002213  7.9e-114  UDP-glucuronosyl/UDP-glucosyltransferase
    [556-1638]  IPR011046  2.1e-08  WD40 repeat-like-containing domain
Orthology group: MCL17542 (Lepidoptera specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211520-TA
ATGAAAACTTCAAAGAGAAGAACTGATAAGGAACGAGAACACTCCAAAGATGATATTTTAAATAAGAAGCAGGAAGAGAGATCCAAGAAAAGTGTGTACAGCGTGTCTACACCACGAAATTTTGTACAGAAATCATCTGCCGCTGATATTTACGCGAGAAAAGATGCCGTTAAAGCTCCGAAAAAGATCGCATCGAATAGAAATAACGACGTGTCTCCAATGAAAGATTTACTTAAATCATCATCTTCATCAGTAAGCCAAGTAAGCACAAGGAGTTATAAAAGCAAAACGCCTGTCAAAGACGTGAAGGCAATACCGCTATCAGCCAGATCTGCAGTTAAATCTGTTAATGCACCGAACAGAAAGTTACCTTTCACCAACGTGACCGTAAATTCACCGATAGTTAAACGAAAATTGAATCTAGAAGACGAAGGAGCTAAGGTTTTAACAAAAACTGAAATTAAAGTCGCGGGGAAAAGAGAGAGAACAAATACAAGGACTTTAGATGAAGCTGAAATAAAAGTTTTAACTAGTGATGTGGTGGACAACAACACGGAGCTATTAAAATTGAACCAAACATTGAGCGCTCAGCCGAAAGCATTTTTCATTGATCTAGAAGACAAGAAAATAAAAGATAAATCCAGCGACGAAGAGGTTAGTTATGAGGATGATTTTGAGAGCTACGAGTCCGATTTCGATTCTTACCACAGTGAATCCAATCGCAGTTCCGCTGGGGATGCGAATGCCAGTGATGATGATGATGGATGTGATAATAATGTTCATCACGAATCATCATCCAGGGAGGAAAGCAAATTGGATTCCGGTAACTTCGACCTCCCGGAACGGCACAAAGATAGGATAAATCCCATGGAATATATAGACGAGGTTCCAGAGAGTGATAAGAAGATCTCTCTCACGGATGAGGGCTTCCAGGATATGTCATCAGGATTGTCAAGTATGAAGACGCTGCATGTGGACATATTAGACAGACCTCTGTTCATTGACTTCACGGAATCCGATGCTAACAGAAAGAGACGGAAGATATTCGAAAGGCTCAGAAAAAGAGCAGAGGACATCACGAGCATGGTGAAGTTTCATGAAATTTCGTACAATTTGTTTGAAATGAAACCTATCCCATACGAATTATACATGGCGAGCTTCGGACGGTACAACTACACGCAGACATCCGTGCAGACCTTCGACGACGGGGTCTCGGAAGAGGTCCAGACTGAGGAGGTTTTAATGTGCAGCAAATGGACCCAGCATCCAATCGAATTTTCCAATCAAATCATACATCTGAACGGACAGGATTACTTAAACAACACGACCGAAAAATTTCACAACAAAGATGTGATGAATAAAGTCAATATCAGTGACACATACATAAACAATCCTCTGAGAATATATTTCGAGCAAAAAGATGGCGTTGGTAACGACACAGTGTTGCCATATGATGGATATAAAAATAAACTAAAGAAAAATCAATACGACGCTAATAAGCTGAGGAAGTTTATGAAAAGAGTCGAGTCCAGGGTATGCAATGTCCTGAGCAAGAATGCTGGCAACATTAACATGTCGAATCTAGTGAGGACCTCAAAATTTCCGTTCAGCACCGGGTACATGGCTATATGTACGAAGAGTTTGAACGAGGAGTCGTTAGTGAAGGACACTGAGATAACCGGTGTCGTTTTCTCTGATAGTAAAAGTAATTTGATTATCACAATACATAAGAAGTTTACTGTGGGCGTGTTGAAAAATAAATGCGCTCTGTGTCTTTGGGATTTGAGTGTCGCGAGAGTGGAGCCATTGAAGACCTTGATGGCGTCTGATGATGTCAGAATCGGTAAATTCAGAGGCAGCACCGATGGCTATTTCGTTGGAGCTTTGGAAGATGGGTCCATAAGTCTCTGGGATTTATCAGAGGAGGCGTGGTGGAGTCACGACGCTAACACGTTGGCTGATGATAGGGTCAGAGAATCAAATCTCAGTCAGTGTGAGTT
GGACAGAGAGTGGAACTTGAGGAATAACAGCAATTATGTGAAGGTGGGTAACTTCGACCTCCCGGAACGGCACAAAGATAGGATAAATCCCATGGAATATATAGACGAGGTTCCAGAGAGTGATAAGAAGATCTCTCTCACGGATGAGGGCTTCCAGGATATGTCATCAGGATTGTCAAGTATGAAGACGCTGCATGTGGACATATTAGACAGACCTCTGTTCATTGACTTCACGGAATCCGATGCTAACAGAAAGAGACGGAAGATATTCGAAAGGCTCAGAAAAAGAGCAGAGGACATCACGAGCATGGTGAAGTTTCATGAAATTTCGTACAATTTGTTTGAAATGAAACCTATCCCATACGAATTATACATGGCGAGCTTCGGACGGTACAACTACACGCAGACATCCGTGCAGACCTTCGACGACGGGGTCTCGGAAGAGGTCCAGACTGAGGAGGTTTTAATGTGCAGCAAATGGACCCAGCATCCAATCGAATTTTCCAATCAAATCATACATCTGAACGGACAGGATTACTTAAACAACACGACCGAAAAATTTCACAACAAAGATGTGATGAATAAAGTCAATATCAGTGACACATACATAAACAATCCTCTGAGAATATATTTCGAGCAAAAAGATGGCGTTGGTAACGACACAGTGTTGCCATATGATGGATATAAAAATAAACTAAAGAAAAATCAATACGACGCTAATAAGCTGAGGAAGTTTATGAAAAGAGTCGAGTCCAGGGTATGCAATGTCCTGAGCAAGAATGCTGGCAACATTAACATGTCGAATCTAGTGAGGACCTCAAAATTTCCGTTCAGCACCGGGTACATGGCTATATGTACGAAGAGTTTGAACGAGGAGTCGTTAGTGAAGGACACTGAGATAACCGGTGTCGTTTTCTCTGATAGTAAAAGTAATTTGATTATCACAATACATAAGAAGTTTACTGTGGGCGTGTTGAAAAATAAATGCGCTCTGTGTCTTTGGGATTTGAGTGTCGCGAGAGTGGAGCCATTGAAGACCTTGATGGCGTCTGATGATGTCAGAATCGGTAAATTCAGAGGCAGCACCGATGGCTATTTCGTTGGAGCTTTGGAAGATGGGTCCATAAGTCTCTGGGATTTATCAGAGGAGGCGTGGTGGAGTCACGACGCTAACACGTTGGCTGATGATAGGGTCAGAGAATCAAATCTCAGTCAGTGTGAGTTGGACAGAGAGTGGAACTTGAGGAATAACAGCAATTATGTGAAGTTACAGCAACAAAGGTACGTCGCACAAGTCAGCGCTTTCACCAGCAGCGGTATAAACGTCTGTGAGGACGTCTCGGTAGATAGCATCGTGGGCTTGGAGTTCACAGACTACACAGGTTCCGTTGGAAGCGATGTTATTGGACAGATTTGTTCTCTACAACGTATAGGTGTTTTGACGATCTGGTCGATTATAAGGGAGAGGGCGGTAAAATCGGATATAGGCAAGGCTTTGTGGTCGAAAATTAAACTTAGGAGGTCCCAAGTGATAGTTCTGTCAGAATATCTAGAAGCTAATAGAACAGTTGGCACCGATTTCAATTTGAACTCGGCGAAGAAAAGAATCGCAGCGAGGAAACGGGAGAAAAATATCATTAAACGGCAACACTCGCGACCGAAATCTAGTTCCACATTCAATTTGGATAGGACAGATAGTGTTGCCATCAATAAAACTGACACAAATTTTTGGGAGAATGGAATCATTTGCAGCGATTTGAAAATAATACATTTGAAAGTTGACAACTATCTGGTGGGGAAGAATTTGGGTGAAGTGTTGTGTTGTATGAAGAATATGGGGGGAGTTAAAATTAATAGATTCACAGTAGCTAGTAAACTCCTCAACCATGGTTACGTGCTTGCAAACTACGAAACTACCCTACTTTCTGGCGGCCACCGATACTGGAACCATCAACTTGTGTTCTCTCATAGATTACAGAGTTCTACTCACTTTGGATTGC
AGTTACAGCAACAAAGGTACGTCGCACAAGTCAGCGCTTTCACCAGCAGCGGTATAAACGTCTGTGAGGACGTCTCGGTAGATAGCATCGTGGGCTTGGAGTTCACAGACTACACAGGTTCCGTTGGAAGCGATGTTATTGGACAGATTTGTTCTCTACAACGTATAGGTGTTTTGACGATCTGGTCGATTATAAGGGAGAGGGCGGTAAAATCGGATATAGGCAAGGCTTTGTGGTCGAAAATTAAACTTAGGAGGTCCCAAGTGATAGTTCTGTCAGAATATCTAGAAGCTAATAGAACAGTTGGCACCGATTTCAATTTGAACTCGGCGAAGAAAAGAATCGCAGCGAGGAAACGGGAGAAAAATATCATTAAACGGCAACACTCGCGACCGAAATCTAGTTCCACATTCAATTTGGATAGGACAGATAGTGTTGCCATCAATAAAACTGACACAAATTTTTGGGAGAATGGAATCATTTGCAGCGATTTGAAAATAATACATTTGAAAGTTGACAACTATCTGGTGGGGAAGAATTTGGGTGAAGTGTTGTGTTGTATGAAGAATATGGGGGGAGTTAAAATTAATAGATTCACAGTAGCTAACTCCTCAACCATGGTTACGTGCTTGCAAACTACGAAACTACCCTACTTTCTGGCGGCCACCGATACTGGAACCATCAACTTGTGTTCACTCATAGATTACAGAGTTCTACTCACTTTGGATTGCAGAAATATATCAACGCCCGCAACAGACAAGTGTCTGTCCGACAACAAAGGCAGGTTCGTTGAAAGCAGAACTGTTAGGAAAACACCGCTAGATGGCCATCCTATATCGTCTCTGTACTGGTCGTACACAAATCCTCTCCGTATTTTGTGCGTACAGTCTAGAGTGTGTGTGTGGTCGCTCGCACAGAGTGATGTGAACGCTCTCTGTGCAGACCGCGCCATATGCTGTGCTGGCAACGATAGAGCTTTTATCCGCAAGAAGGATCCAAGGATTATCGGTGCCATCAGCTACAGGCCGTACTGTTTCAGTGCCCAGGATTATGACGCGGAAAACGGGCCGACGAATCCAAGGCATGGTGACAGTATCCTCCTCCACCTTCTCCTCCGCACCGCTGACTGGGTCCACTCGCTGCTGTGGCGCTGGAGTGCTAGATGGTGCGCCGTCAGCGCCGTACTGCTTCATAAGGATATTGAAGTATGTTTCATAGCCATGAAGGTCACTCTGCTGCTGCTGATGATGACCTCGTCAGTCATCGGGTACAACATTCTTTGCCTCCATCACATCCCATCCATCAGTCATCATAACTTAGCCAAGGGTATTGTACAGCCGTTATTGAAGGCTGGTCATAAGGTGACCTGGGTTACTCCGTATCCTGACAAGACACCAACGAACAGTAACTTGACTGTCATTGATGTCAGCTTTTTGACAGCGTTCTCACAATCCATGGAAACCAAGGTCACCAGTTTCTCTGCCATCAAAACCCTGACTCGCAACATATCCACGGCGACTATCCATCACCCTGAGGTCAGATCAGCGCTGGTGAGAAATAAATATGACGCTGTCGTCACAGAGTGGTTCTTATCAGACATGGAGGCTGGATATGCAGCTGTCCAGCAAGCTCCTTGGATCCTGTTCAGCGGAGTCATCTACCATCCGCACTTGGAGTACCTCATTGACACCACACGCTCCATCCCAGTGCATCCCACTATGATATTTCACTTCCCCATCCCCATGAGCTTTTTACAGAGATCGTTGAACACTATCATTTATGTTATAACAAAATTGGACTCTATTAAGGAATCTTTTGTCCATGCATCTCTATACGACGAGTGGTTCTCTCCACTCGCCGCTGCCCGCGGAGTCACTCTCCCTCCGTTTTCAGAGGCTGTTCATAATATATCAATATTATTTATCAATTCTCATCCGTCGTACTCTACTCCGAATGTACTTCCACCTAATGCCATAGAAATCGGAGGCTTCTTTGTCGAC
GAAACTCAGGAACTGCCAAAGGACCTCCGAAACCTCGTCGATGGATTCCGACAAGGCTTCATATACTTTAGCATGGGCTCGCTGCTGAAATCCTCCAATTTTCCACAAAAGATGAAACAGGAACTGATCAAAGTTTTGGGAGAACTTCCTTTTCCGGTTTTATGGAAATACGAAGAAGACATTGAAAACTTACCAAAAAATATACACTTGAGGAAATGGATCCCCCAAGTTAGCGTGCTGGCACATCCAAATATCAAACTCTTCATCACGCATTGCGGTCTGCTGAGTTCTCTGGAAGCTCTTCATCATGGAGTACCGATGTTGGCTGTTCCTGTATTTGGCGACCAGCCTCATAACGCTGATACCGCGACGAGAGAGGGAAGAGCTATTAGAGTCACCTTCGATGAGAACTTGCCAGAAAACTTGCAGGCCGGCTTGAAGCAAATGTTGAGCGATGACAACTACAACCAGCGCGCCAAGTACCTCTCCAAGTTGTTCAGGAATCGTCCCGTGTCGCCAGCGTCGCTCATCAATCATTACATTGAACTGGCCATTGAGACCAGAGGAGCTCAACATCTCCGCTCTAAAGCTCAACTGTACTCCTGGTACCAACTTCTGATGCTGGACCAGCTAGCTTTCTTCAGCCTCATCTTTTATCTAATATTCAAGATGATCAAAATGTTCATATTATTTGTAAAAAAAATGTTTAAAAAGGATAAGAAAAAAACAGAATAG

Protein sequence:

>DPOGS211520-PA
MKTSKRRTDKEREHSKDDILNKKQEERSKKSVYSVSTPRNFVQKSSAADIYARKDAVKAPKKIASNRNNDVSPMKDLLKSSSSSVSQVSTRSYKSKTPVKDVKAIPLSARSAVKSVNAPNRKLPFTNVTVNSPIVKRKLNLEDEGAKVLTKTEIKVAGKRERTNTRTLDEAEIKVLTSDVVDNNTELLKLNQTLSAQPKAFFIDLEDKKIKDKSSDEEVSYEDDFESYESDFDSYHSESNRSSAGDANASDDDDGCDNNVHHESSSREESKLDSGNFDLPERHKDRINPMEYIDEVPESDKKISLTDEGFQDMSSGLSSMKTLHVDILDRPLFIDFTESDANRKRRKIFERLRKRAEDITSMVKFHEISYNLFEMKPIPYELYMASFGRYNYTQTSVQTFDDGVSEEVQTEEVLMCSKWTQHPIEFSNQIIHLNGQDYLNNTTEKFHNKDVMNKVNISDTYINNPLRIYFEQKDGVGNDTVLPYDGYKNKLKKNQYDANKLRKFMKRVESRVCNVLSKNAGNINMSNLVRTSKFPFSTGYMAICTKSLNEESLVKDTEITGVVFSDSKSNLIITIHKKFTVGVLKNKCALCLWDLSVARVEPLKTLMASDDVRIGKFRGSTDGYFVGALEDGSISLWDLSEEAWWSHDANTLADDRVRESNLSQCELDREWNLRNNSNYVKVGNFDLPERHKDRINPMEYIDEVPESDKKISLTDEGFQDMSSGLSSMKTLHVDILDRPLFIDFTESDANRKRRKIFERLRKRAEDITSMVKFHEISYNLFEMKPIPYELYMASFGRYNYTQTSVQTFDDGVSEEVQTEEVLMCSKWTQHPIEFSNQIIHLNGQDYLNNTTEKFHNKDVMNKVNISDTYINNPLRIYFEQKDGVGNDTVLPYDGYKNKLKKNQYDANKLRKFMKRVESRVCNVLSKNAGNINMSNLVRTSKFPFSTGYMAICTKSLNEESLVKDTEITGVVFSDSKSNLIITIHKKFTVGVLKNKCALCLWDLSVARVEPLKTLMASDDVRIGKFRGSTDGYFVGALEDGSISLWDLSEEAWWSHDANTLADDRVRESNLSQCELDREWNLRNNSNYVKLQQQRYVAQVSAFTSSGINVCEDVSVDSIVGLEFTDYTGSVGSDVIGQICSLQRIGVLTIWSIIRERAVKSDIGKALWSKIKLRRSQVIVLSEYLEANRTVGTDFNLNSAKKRIAARKREKNIIKRQHSRPKSSSTFNLDRTDSVAINKTDTNFWENGIICSDLKIIHLKVDNYLVGKNLGEVLCCMKNMGGVKINRFTVASKLLNHGYVLANYETTLLSGGHRYWNHQLVFSHRLQSSTHFGLQLQQQRYVAQVSAFTSSGINVCEDVSVDSIVGLEFTDYTGSVGSDVIGQICSLQRIGVLTIWSIIRERAVKSDIGKALWSKIKLRRSQVIVLSEYLEANRTVGTDFNLNSAKKRIAARKREKNIIKRQHSRPKSSSTFNLDRTDSVAINKTDTNFWENGIICSDLKIIHLKVDNYLVGKNLGEVLCCMKNMGGVKINRFTVANSSTMVTCLQTTKLPYFLAATDTGTINLCSLIDYRVLLTLDCRNISTPATDKCLSDNKGRFVESRTVRKTPLDGHPISSLYWSYTNPLRILCVQSRVCVWSLAQSDVNALCADRAICCAGNDRAFIRKKDPRIIGAISYRPYCFSAQDYDAENGPTNPRHGDSILLHLLLRTADWVHSLLWRWSARWCAVSAVLLHKDIEVCFIAMKVTLLLLMMTSSVIGYNILCLHHIPSISHHNLAKGIVQPLLKAGHKVTWVTPYPDKTPTNSNLTVIDVSFLTAFSQSMETKVTSFSAIKTLTRNISTATIHHPEVRSALVRNKYDAVVTEWFLSDMEAGYAAVQQAPWILFSGVIYHPHLEYLIDTTRSIPVHPTMIFHFPIPMSFLQRSLNTIIYVITKLDSIKESFVHASLYDEWFSPLAAARGVTLPPFSEAVHNISILFINSHPSYSTPNVLPPNAIEIGGFFVD
ETQELPKDLRNLVDGFRQGFIYFSMGSLLKSSNFPQKMKQELIKVLGELPFPVLWKYEEDIENLPKNIHLRKWIPQVSVLAHPNIKLFITHCGLLSSLEALHHGVPMLAVPVFGDQPHNADTATREGRAIRVTFDENLPENLQAGLKQMLSDDNYNQRAKYLSKLFRNRPVSPASLINHYIELAIETRGAQHLRSKAQLYSWYQLLMLDQLAFFSLIFYLIFKMIKMFILFVKKMFKKDKKKTE-