Monarch geneset OGS2.0

DPOGS209776
Transcript        DPOGS209776-TA  3078 bp
Protein           DPOGS209776-PA  1025 aa
Genomic position  DPSCF300397 + 68246-78233
RNAseq coverage   404x (Rank: top 30%)
Annotation
Heliconius      HMEL022619        4e-149  51.00%
Bombyx          BGIBMGA010289-TA  4e-132  48.10%
Drosophila      Ugt86Di-PA        1e-70   32.79%
EBI UniRef50    UniRef50_G6CTZ6   0.0     100.00%  UDP-glucosyltransferase n=3 Tax=Obtectomera RepID=G6CTZ6_DANPL
NCBI RefSeq     NP_001182386.1    3e-128  46.89%   UDP-glucosyltransferase [Bombyx mori]
NCBI nr blastp  gi|363896086      3e-166  57.38%   UDP-glycosyltransferase UGT40L1 [Helicoverpa armigera]
NCBI nr blastx  gi|363896086      5e-163  56.85%   UDP-glycosyltransferase UGT40L1 [Helicoverpa armigera]
Group
Gene Ontology    GO:0008152  2.1e-129  metabolic process
                 GO:0016758  2.1e-129  transferase activity, transferring hexosyl groups
KEGG pathway     dme:Dmel_CG6658  9e-69
                 K00699 (UGT) maps to:
                     Drug metabolism - cytochrome P450
                     Starch and sucrose metabolism
                     Porphyrin and chlorophyll metabolism
                     Steroid hormone biosynthesis
                     Pentose and glucuronate interconversions
                     Ascorbate and aldarate metabolism
                     Drug metabolism - other enzymes
                     Metabolism of xenobiotics by cytochrome P450
                     Retinol metabolism
InterPro domain  [13-510]  IPR002213  2.1e-129  UDP-glucuronosyl/UDP-glucosyltransferase
Orthology group  MCL10161  Insect specific

Nucleotide sequence:

>DPOGS209776-TA
ATGAAGTTACAAATCATATTTTCTTTGGTCTTCTTGGCCTCATTCACCCAAGCTTTAAGACTTCTTGTATTCTTTCCTATGATGTCTAAGAGCCACAGCATTTTAGGCCAAGGTATGGTGAGCCGATTACTGGAAGCTGGACACGAGGTGGTCCATGTGACAGCTTTCCCTAGAGACAAGCCACAAGCAAATTTGACAGAAATAAACTTAATTGAGGCAAGCAACCGAATGAAAGAGCAGTTTCAAGGCGACGATAATTTCAAATTGAAAAACCTCATTGGGAAAGAGAATATCGGTGATTCCGTATTTTTTGTCTATATGACATATGAGATAAGTAAAAATATTATTGAAGACCAAACTTTTATCAAGTTTCTTTTGAACCCAAATGAAAAATTTGACGCCGTCATAATTGAGTGGTTCTTTTCTGATATTTTTGCCGGAATTGCACCACTTTTTAAATGTCCCCTTATCTGGTTTGGATCGACTGAAGCTCATTGGCAAATCCTCCAAGTAGTTGATGAAATTCCTAACCCATCCTACAGTGTAGACATTTTCTCAGTCAGCAAACCACCTCTCAATTTTATGGAACGTTTAAGAGAACTCTATAGGATTGCAAAAAAATATATTTTTATCAGTTTATTGAGCACTCCATTTGAAAGATCACTTTACAACAGCGTTTTTTCTGACATAGCAAACAAAAGAGGGGTCACATTACCTTCTTACGACGAAGTAATTTATAACGCTTCCCTATTGCTCATCAACTCTCACCCATCTATAGGCACGCCTTTTAGACTCCCACAGAATGCTAAATATATCGCTGGGTACCACATTGACAGAGAAGTGAAACCGCTACCAAAGGATCTACAAAAATTAATGGATGAAGCAAAACATGGCGTTATATATTTTAGTATGGGAAGTAATCTGAAGAGTGAAGACATGTCTGAATCGATGAAAAAATCGTTACTAGCAATGTTTAGTAAACTAAAACAAACAGTGATATGGAAATTCGAAAGCGATTTGGATAAAGTTCCAGCTAATGTCCATCTGGTGAAATGGGCACCACAGCAAAGTATTCTCGCTCATCCTAATCTTAAATTCTTCATGACACACGGAGGACAGCTCTCCACGACTGAGGCTATACACTTCGCTGTACCGGTTATTGGTATACCAGTTGCAGCTGATCAGCACGTCAACATGAGGTCCGTTGCCAATAAAGGTTTCGGTATTTATATAAAAATAACAGAAGATATCACCGATGACTTGTATCCAGCGATTCAAGAAATGCTGCAAAATCCATCATACAAGTCAAAAGCGAAGGAACTGTCTTTCATCTACCATAATCGTCCTCTGACGCCAGGAGATGAATTGGTATTCTGGACAGAGCATGTTGTCCATACACGCGGCGCCCTACATTTACGATCGCCAGCTTTACAATTACCATTCTACCAGAAATTCTTTTTAGATCTCCTTATTCTCATCCTCGTGGCAATAGTATTATGCGTTCTTATGATCAGCAAATGTACGATGTACTACACGTGTTTATTGTTATTTATAAAGTTGTATTTATGTCACGCTTACAAAGTGTTGGTGGTATTCCCAATACCGTGGAAGAGCCATTACATTCTCGGAGAAGCTACCGTATGGCATTTGGTCAAGGCTGGACACGAGGTGACTTACCTCACACCTTCTCCGATAGAAAATGCTCCGAAAGGACTTCGACAAATCGACATATCCGCAAGTAGTCGGTTTTTCCCTAAAGGCCGCTTCGATATTGTTTCTTTGATAAACAAAAAAGGACGCCTTATGACCCAAGAACAGAAGATCAGGACAACTCATGATTATCTTGTGAAGAGTTTAGAGTTAGACAATGTTCAGAATTTCATCAAAGATTCCAACGAACAGTACGATGTTGTGGTAGTTGAATGGTTACACACAGAAACAATGGCTGGTTTTGCTTCGTTATTTAGCTGTCCTTTGATCTGGGTTTCAACAATGGAAGC
TCACACATTAGTTCTGTCTCTTATAGATGAGCACTTGAATCCTAATTATAACGTTCTTTTCTACTCTACAAACTTCTCTAGAACATTGTGGAATAGGGCTAAACAATTATGGTCTCTAACTAAAATACTATTCTACACCTGGTATAGACAAAATCGAGAAAACGAGGATTTTAAAAACATTTTTGGGTCAGCAATCCTGAAAAGAGGCAGAGAATTACCATACTTTAGAGATGTTAAATATAACGCTTCCTTGATGTTTGGAAACTCTGATGTTGTAACGGGGGACGCAATATCTTTACCACAGAACTATATTCACATAGGAGGCTATCATATAAAAGAACCCATAGAACCGTCTCCCTCATTCGATTTGAAAGGTCTGATGGACGAATCATCGAACGGCGTAATTTATTTTAGCTTAGGTTCATCGCTGAATATAACAAGAATACCAAGATATTTAAAAAAAGGTATACTTAAGAGCCTCGGTGAAGTAGACCAAACTGTGATATTAAAAATGGATCATATTCCAGAAGATCAGCCCAAAAATGTTCATACTGTACCTTGGGCACCTCAACAGTATATTTTAGCACACCCTAACTGCAAACTCTTTATAACTCACGGGGGTCAGCTGTCGATTATTGAGACACTGTATTTTGGAATACCTATAATTGGAATTCCATTATTTGCTGATCAATACAATAATGTTAATAGAGCCGTTGCTAAAGGATTCGGGAAACAGATTGATTTCAACTCTAATACACCGGAAGTTTTGAAGAACACAATAAAGGAAATGATGACTAACTCCAGCTATCGAGCCACAGCGAAACACTTATCGTCCCTCTTCATCAGAAGCCCAACTCCAGGCCAAAGGCTTGTAAAATCCGTGGAACTGGTTGCAAGAACTGGTGGAGCGCAACATTTACGCTCTGTCGCATTAAATGTGCCTTTGTACCAAAAGCTTTATTTAGATCTGATATTAGTGTTTATAATTGGTGTCCTCGGTTTTTTGTTTGTAATAACATATCTGTATCATTTTGTTACCAATATACGGAGACCTAAAACATTAAAGAAAAAAGATTGA

Protein sequence:

>DPOGS209776-PA
MKLQIIFSLVFLASFTQALRLLVFFPMMSKSHSILGQGMVSRLLEAGHEVVHVTAFPRDKPQANLTEINLIEASNRMKEQFQGDDNFKLKNLIGKENIGDSVFFVYMTYEISKNIIEDQTFIKFLLNPNEKFDAVIIEWFFSDIFAGIAPLFKCPLIWFGSTEAHWQILQVVDEIPNPSYSVDIFSVSKPPLNFMERLRELYRIAKKYIFISLLSTPFERSLYNSVFSDIANKRGVTLPSYDEVIYNASLLLINSHPSIGTPFRLPQNAKYIAGYHIDREVKPLPKDLQKLMDEAKHGVIYFSMGSNLKSEDMSESMKKSLLAMFSKLKQTVIWKFESDLDKVPANVHLVKWAPQQSILAHPNLKFFMTHGGQLSTTEAIHFAVPVIGIPVAADQHVNMRSVANKGFGIYIKITEDITDDLYPAIQEMLQNPSYKSKAKELSFIYHNRPLTPGDELVFWTEHVVHTRGALHLRSPALQLPFYQKFFLDLLILILVAIVLCVLMISKCTMYYTCLLLFIKLYLCHAYKVLVVFPIPWKSHYILGEATVWHLVKAGHEVTYLTPSPIENAPKGLRQIDISASSRFFPKGRFDIVSLINKKGRLMTQEQKIRTTHDYLVKSLELDNVQNFIKDSNEQYDVVVVEWLHTETMAGFASLFSCPLIWVSTMEAHTLVLSLIDEHLNPNYNVLFYSTNFSRTLWNRAKQLWSLTKILFYTWYRQNRENEDFKNIFGSAILKRGRELPYFRDVKYNASLMFGNSDVVTGDAISLPQNYIHIGGYHIKEPIEPSPSFDLKGLMDESSNGVIYFSLGSSLNITRIPRYLKKGILKSLGEVDQTVILKMDHIPEDQPKNVHTVPWAPQQYILAHPNCKLFITHGGQLSIIETLYFGIPIIGIPLFADQYNNVNRAVAKGFGKQIDFNSNTPEVLKNTIKEMMTNSSYRATAKHLSSLFIRSPTPGQRLVKSVELVARTGGAQHLRSVALNVPLYQKLYLDLILVFIIGVLGFLFVITYLYHFVTNIRRPKTLKKKD-