Monarch geneset OGS2.0

DPOGS207652
TranscriptDPOGS207652-TA1473 bp
ProteinDPOGS207652-PA490 aa
Genomic positionDPSCF300133 - 141473-145323
RNAseq coverage146x (Rank: top 54%)
Annotation
HeliconiusHMEL0226451e-17257.64% 
BombyxBGIBMGA005046-TA4e-6833.19% 
DrosophilaCG17323-PA2e-7031.29% 
EBI UniRef50UniRef50_G9LPR68e-16053.64%UDP-glycosyltransferase UGT46A3 n=6 Tax=Obtectomera RepID=G9LPR6_HELAM
NCBI RefSeqXP_001811749.16e-8136.02%PREDICTED: similar to AGAP007920-PA [Tribolium castaneum]
NCBI nr blastpgi|3796990181e-15953.14%UDP-glycosyltransferase UGT46A1 [Bombyx mori]
NCBI nr blastxgi|3796990181e-15453.14%UDP-glycosyltransferase UGT46A1 [Bombyx mori]
Group
Gene OntologyGO:00081522.9e-129metabolic process
GO:00167582.9e-129transferase activity, transferring hexosyl groups
KEGG pathwayame:4087881e-79 
 K00699 (UGT)maps-> Drug metabolism - cytochrome P450
    Starch and sucrose metabolism
    Porphyrin and chlorophyll metabolism
    Steroid hormone biosynthesis
    Pentose and glucuronate interconversions
    Ascorbate and aldarate metabolism
    Drug metabolism - other enzymes
    Metabolism of xenobiotics by cytochrome P450
    Retinol metabolism
InterPro domain[1-490] IPR0022132.9e-129UDP-glucuronosyl/UDP-glucosyltransferase
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207652-TA
ATGGTTTTCGAACCATTACTGCGACGGTTCGCTGAGAGGGGTCACAATGTTACCGTAGCATCATTTTTCCCCATGGAGAATCCACCCGAAAACTACGATCAGATCAGCTTCCTAGGACTGGCTGAATTGAGATTGGAATCGCTCGACTTGGAAATTTTTGAACGGACAAATTTACTCAATAAGATTCCTATCATTGGTAACGTCGCTAAAGCTCTCTCCGCCATCCCATCTTTGGCTAAATCGGCCTTGGATGTTTGTGAACGTGTCGTAAAACATCAAGAATTGAGTCAGGCTCTTAAGAACAAATACGATGCAGTCATAACCGAAAACTTTAACAGTGACTGCATGTTAGGTCTCCTTTATGCCTATGAAGTAGATGCTCCTGTAATTTCTATACTATCTGGAACTCCTATGCCATGGACCGCTCAACGTATTGGAGCTGACGATAACCCAGCTCATGTTCCGGTAATTTTATCGAAATTCACTTCTAGAATGTCGTTTACGGAGAGATTGGAAAATGCTTTAATCAATCTACTATCGAAGTATTTGTTTTACCATGAAATCCAAACGAAAGAGAGAGCATTCATTGAGAAAAGATTAGGAAAAATTCCACATCCACATGATCTTAGTAAAAATATGTCTTTAATATTATTAAATTCATTCCACCCTTTAAATGGTGTTAAGCCCTCCGTGCCTGGGATGATTGAAGTCGGTGGGATTCACTTGGCCGCTGAGAGAAAACCCTTGCCTACTTTCATTGAAAAATTCATCAACGAATCGGAACACGGCGTGATTGTGTTCAGCTTCGGTTCACATATAAAAACTAAAACACTTCCTAAATACAAAGAGGAAATTTTCTTAAGGGCACTTTCAAAAACAAAACAAAGGGTGATATGGAAGTTTGAAGAGAGCGACGAGGAAGGAACGCTAATTGGCAATATCCTCAGGGTCAATTGGATTCCGCAATATGAACTACTAAACCATGATAAAGTAGTGGCTTTCATATGCCACGGAGGTTTATTGGGAATGACAGAGGCTGTATCCTCAGGCAAGCCAATGCTCGTTCTGCCATTCTTTGGAGATCAATTCACGAATGCCGCAGCTGCAAGCGAAGCCGGGATCGCGAGGGTTGTATCTTACAACGATCTGTCTGAGGACACCTTTACCGACGCACTAAATGAAGTATTGGGTGCAAAAACTCGTGAAACCGCCCAGCGATTATCGAAAATTTGGAAGGACAGGGAATCTTCGCCGCTGGACACAGCTGTTTTTTGGACGGAGAGAGTGATTAGATGGGGCAAGGCGGCGCCACTACATTCCACATCAAGAGATCTACCATTCTACCAACTAGCCCTATTAGATGTAGCGGCAGCTGTCATTGTAGTTACGATTTTATTCATTACAGCCATATGTTACATTCTGGTTAAAATTTTACGTGTCATTACAAAAAGTTCTAAGGAAAAAATCCATTAA

Protein sequence:

>DPOGS207652-PA
MVFEPLLRRFAERGHNVTVASFFPMENPPENYDQISFLGLAELRLESLDLEIFERTNLLNKIPIIGNVAKALSAIPSLAKSALDVCERVVKHQELSQALKNKYDAVITENFNSDCMLGLLYAYEVDAPVISILSGTPMPWTAQRIGADDNPAHVPVILSKFTSRMSFTERLENALINLLSKYLFYHEIQTKERAFIEKRLGKIPHPHDLSKNMSLILLNSFHPLNGVKPSVPGMIEVGGIHLAAERKPLPTFIEKFINESEHGVIVFSFGSHIKTKTLPKYKEEIFLRALSKTKQRVIWKFEESDEEGTLIGNILRVNWIPQYELLNHDKVVAFICHGGLLGMTEAVSSGKPMLVLPFFGDQFTNAAAASEAGIARVVSYNDLSEDTFTDALNEVLGAKTRETAQRLSKIWKDRESSPLDTAVFWTERVIRWGKAAPLHSTSRDLPFYQLALLDVAAAVIVVTILFITAICYILVKILRVITKSSKEKIH-