Monarch geneset OGS2.0

DPOGS207651
TranscriptDPOGS207651-TA2484 bp
ProteinDPOGS207651-PA827 aa
Genomic positionDPSCF300133 - 148056-157203
RNAseq coverage536x (Rank: top 23%)
Annotation
HeliconiusHMEL0226450.059.15% 
BombyxBGIBMGA010433-TA6e-10361.19% 
DrosophilaCG17323-PA2e-7833.14% 
EBI UniRef50UniRef50_G9LPR61e-17957.36%UDP-glycosyltransferase UGT46A3 n=6 Tax=Obtectomera RepID=G9LPR6_HELAM
NCBI RefSeqXP_971626.27e-9038.82%PREDICTED: similar to glucosyl/glucuronosyl transferases [Tribolium castaneum]
NCBI nr blastpgi|3638961104e-17957.36%UDP-glycosyltransferase UGT46A3 [Helicoverpa armigera]
NCBI nr blastxgi|3638961106e-17457.36%UDP-glycosyltransferase UGT46A3 [Helicoverpa armigera]
Group
Gene OntologyGO:00081524.4e-126metabolic process
GO:00167584.4e-126transferase activity, transferring hexosyl groups
KEGG pathwayame:4087887e-80 
 K00699 (UGT)maps-> Drug metabolism - cytochrome P450
    Starch and sucrose metabolism
    Porphyrin and chlorophyll metabolism
    Steroid hormone biosynthesis
    Pentose and glucuronate interconversions
    Ascorbate and aldarate metabolism
    Drug metabolism - other enzymes
    Metabolism of xenobiotics by cytochrome P450
    Retinol metabolism
InterPro domain[316-827] IPR0022134.4e-126UDP-glucuronosyl/UDP-glucosyltransferase
Orthology groupMCL20396 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207651-TA
ATGCGACCGCTTATTTATTTCGCTGCGATAATCGCTCTCTCCACCGTTGAAGGGGGACGGATTCTTGGGCTCTTCCCGCACACAGGCAAGAGCCACCAGATGGTCTTCGAGCCGCTGTTAAAGAAACTCGCTGAAAGAGGACACCAAGTCACAGTAGTGTCGTTCTTCCCCTTAAAAAAACCAATCGCAAATTACACTGATGTCAGCCTTGAGGGGATCGCGACTTTAGGTGTGGAAACTATTAATCTAGGCTGGTATGAGGAATTTAGTATTTGGACAAAGCTACCAAAAATAGGTAGAGTAATCAAGCAACTGTCGGAGTTCCAACCTTTAGCAGGAATGGCGTTAGACGTTTGCAGTAAGGCTGTAGATTGGGTTCCTTTAAAGGAAGCTCTGCAGAAGAATTACGATTTAGTGCTGGTTGAAAATTTCAACAGTGATTGTATGATGGGTCTGTTGCACATTTACGATATAAAGGCTCCTGTAATTGCATTACTGTCAAGTGCCATGATGCCATGGTCATCGGACCGTATTGGCGTAACAGATAACCCCTCTTACGTGCCAATTATAAGTTCAACCTTTACCCCGACAATGAATTTAATGGATCGAATTGAAAACACGTTCCTTAACGCCTATCATAAGATATGGTTCCGTTACGCAATACAATTAAAAGAACAAGCACTCATAGAAACACATTTTAGAACGAAAATACCAGATCTGGACATATTATCACGAAACATAACATTGATGTTCTTGAATACGTTTCATGCTTTGAATGGTGTAAAACCACTTGTACCAGGTATCGTGGAAGTTGGGGGGATGCACTTGGATCCTACGCGAAAAAACATCCCCGAGATAATTTTGAACAGTCTACTGCACGCCAGTCGGGTGCGGGCGCGCAAAATGATTTTAACGAAAATCATTATATTGTCATTCTTATCGTTACCCAATGATGTGTATTGTGCGAGAATATTAGGCCTATTTCCTCATCCCGGCAAAAGTCACTTCATGGTATTCGAGCCCCTACTGAAAAGATTATCCGAGTTAGGACACCATGTCACTGTAGCATCCTTCTTTCCACCGGAAAACCCACCCGCTAACTACACTCACATAAAGTTTGATGGAGCTGGTGAACCCAGATTGGATATATTAGATTTAAAAACGAACGATAATGTAAATTTTGTTAGGAGAATACCCATCCTCGGTGGCATTATCCAGCAGATGTCGGATTTCAATCTGTTAGCTGAACTAGCATTAACTAAGTGTCAGCAAATAATAGAATTTCAGCCATTAGCTAATGTTCTGAAAGAGGACTACGATTTGGTGTTCATAGAGATTTTCAACAGTGACTGTGCTCAAGGTCTCATACATCAGTATGGAATTAAGGCACCCATTATCGGCCTGTCGTCCTGTACAATAATGCCATGGACAGCTTACCGTATAGGAGTTTCTGATAATCCTGCGTATGTGCCAGTTATGGGCACAACTCATACTCCTACAATGTCATTATTACAACGGATGGAGAACACATTTATGTTGCTCTACCATAATTTGTGGTACCGGTACAAAGTCCAAGTAAAAGAACAGGCTATAATTGAGAATTACTTCGGACGGAAGATGGCCGACTTAGATTTACTGTCTCGGAATATATCATTACTGCTTGTGAACACGTTCCATCCCCTGAACGGTGTCAAACCACTCGTGCCAGGTGTGGTGGAAATCGGAGGAATACATTTAAATCCAAATAAAAAAAGTATTCCAGGGTACATCGAAAGATTCCTTAACGAATCAAAACACGGCGTCATCCTTTTGAGTTTCGGATCTCTCATCAAAACATCGACTATACCTAAGTACAAGGAAGAAATCATCGTTAATACTCTATCAAAATTCAAACAGCGCGTCATATGGAAGTATGAAGAGAGCGAGCCAGAGGGTACACTTGTGGGCAACATTCTGAAAGTAAGATGGTTGCCGCAATTTGAACTTTTACAACATGAAAAAGTTGTGGCTTTCATAGCCCATGGTGGGTTGTTGGGGATGACGGAGTCCGTGTATTCTGGGAAGCCGATGGTGGTGGTGCCTTTCTTCGGAGACCAACCCTCAAACGCCGCGGCCGCTGCCAACGCCGGCTTCGCTAAGATCATCTCCTATATAGACATGACAGAGAAAGATTTAGGTGATGCAGTTAGGAGCGTCCTGAGTGAAGAAATGCAACTGAATGCACGTCGAGTTTCAAAAATGTGGCAGGACAGAGAGTCAGCTCCACTAGATACGGCTGTATACTGGACTGAGCGTGTTTTAAGATGGGGACACTCAGGTCAACTTCATACGGCCGCAAGAGACTTGTCACTGTATGAACTTGCCCTTATAGATGTTTTTGCTGCGTATGCCCTTGCCTTAACAGTTATTTTGTTATCTGTGTGGTTCATTCTCACAAAGTTGATGAGGTTAATTATAAAGGAAAGTAAACAAAAAATACATTAG

Protein sequence:

>DPOGS207651-PA
MRPLIYFAAIIALSTVEGGRILGLFPHTGKSHQMVFEPLLKKLAERGHQVTVVSFFPLKKPIANYTDVSLEGIATLGVETINLGWYEEFSIWTKLPKIGRVIKQLSEFQPLAGMALDVCSKAVDWVPLKEALQKNYDLVLVENFNSDCMMGLLHIYDIKAPVIALLSSAMMPWSSDRIGVTDNPSYVPIISSTFTPTMNLMDRIENTFLNAYHKIWFRYAIQLKEQALIETHFRTKIPDLDILSRNITLMFLNTFHALNGVKPLVPGIVEVGGMHLDPTRKNIPEIILNSLLHASRVRARKMILTKIIILSFLSLPNDVYCARILGLFPHPGKSHFMVFEPLLKRLSELGHHVTVASFFPPENPPANYTHIKFDGAGEPRLDILDLKTNDNVNFVRRIPILGGIIQQMSDFNLLAELALTKCQQIIEFQPLANVLKEDYDLVFIEIFNSDCAQGLIHQYGIKAPIIGLSSCTIMPWTAYRIGVSDNPAYVPVMGTTHTPTMSLLQRMENTFMLLYHNLWYRYKVQVKEQAIIENYFGRKMADLDLLSRNISLLLVNTFHPLNGVKPLVPGVVEIGGIHLNPNKKSIPGYIERFLNESKHGVILLSFGSLIKTSTIPKYKEEIIVNTLSKFKQRVIWKYEESEPEGTLVGNILKVRWLPQFELLQHEKVVAFIAHGGLLGMTESVYSGKPMVVVPFFGDQPSNAAAAANAGFAKIISYIDMTEKDLGDAVRSVLSEEMQLNARRVSKMWQDRESAPLDTAVYWTERVLRWGHSGQLHTAARDLSLYELALIDVFAAYALALTVILLSVWFILTKLMRLIIKESKQKIH-