Monarch geneset OGS2.0

DPOGS214629
TranscriptDPOGS214629-TA1611 bp
ProteinDPOGS214629-PA536 aa
Genomic positionDPSCF300050 + 424444-428509
RNAseq coverage1372x (Rank: top 9%)
Annotation
HeliconiusHMEL0225920.074.37% 
BombyxBGIBMGA005046-TA0.063.58% 
DrosophilaCG17324-PB9e-7431.11% 
EBI UniRef50UniRef50_G9LPW30.063.58%UDP-glycosyltransferase UGT47A1 n=3 Tax=Obtectomera RepID=G9LPW3_BOMMO
NCBI RefSeqXP_001811749.13e-10842.25%PREDICTED: similar to AGAP007920-PA [Tribolium castaneum]
NCBI nr blastpgi|3638961160.071.40%UDP-glycosyltransferase UGT47A2 [Helicoverpa armigera]
NCBI nr blastxgi|3638961160.070.22%UDP-glycosyltransferase UGT47A2 [Helicoverpa armigera]
Group
Gene OntologyGO:00081522.7e-138metabolic process
GO:00167582.7e-138transferase activity, transferring hexosyl groups
KEGG pathwayame:4093044e-84 
 K00699 (UGT)maps-> Drug metabolism - cytochrome P450
    Starch and sucrose metabolism
    Porphyrin and chlorophyll metabolism
    Steroid hormone biosynthesis
    Pentose and glucuronate interconversions
    Ascorbate and aldarate metabolism
    Drug metabolism - other enzymes
    Metabolism of xenobiotics by cytochrome P450
    Retinol metabolism
InterPro domain[11-516] IPR0022132.7e-138UDP-glucuronosyl/UDP-glucosyltransferase
Orthology groupMCL12826 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214629-TA
ATGCAGGCGCTGCTGTGTCTGGCGCTATGCTGCGCTACAGTCCACGCTGCCAGCATATTGGCAGTGCTGCCCACCAACACCAAGAGCCATTACGCGATGTACGGCCGGCTCATAGAAGCTCTAGCGATCAATGGGCATAATATAACTGTGATCACGCATTTCGCTATGAAAAATCCGCTGCCGAACGTGGACGAGATCAACCTAGCTGGCACGATTCCAGATTTAGCCAACAACCTGACGAAACAGCAGTCGACCTTCAAACCGGACACTATAAGAAATTTGGAGAATATCATCAAGGAATGCGTACACGCTTGTGATGTTGTTTCCCAACATGCTGAAGTCAAAGCACTAGTAAATTCCTCCAAGACATTCGACCTGGTGATAATCGAAGTGTTTGGTAGCGAGTGCTTTCTGCCTTTCGGTAAGAGATTTGATGCACCCGTCGTGGGGTTGCTCTCGAGTGTTCCTTTGCCCTGGTTGAACGATCAATTGGGAAATCCAGAAGAAACTGCCTATGTACCAGCCTACATGATGGGTTACGGACAACATATGAACTTATTTCAACGTTTTATTAATACCGTAGCAGTGATATGGGCTAAGGCGTTTTACAGGAACAAATCACAGATACCATCACAGATAATCGCTGACAGGTTGTTTGGTCCAGGTCCGAGGCTTGAGAGTTTAGCTCAGAACTATAGCCTTGTATTGTCCAACAGTCACTTTAGTATAAACGAAGTTAGACCATTAGTACCAGCTCTGGTGGAGGTCGGGGGCTTACATCTTGACACCACGCAACAGTTACCGAAGGAATTAAGAAATCTCCTGGACAATGCTGACGAGGGAATCATATATTGGAGCTTCGGTTCTATGTCCCGCATCGAAACAATACCTTACGTACAGCTGACACAAATATTCGCTGCTTTATCTGAACTGCCACAGACCGTTCTGGTGAAAATGAACAAGAAGATGCTGCAGGGGAATCTGACGGTACCAGACAACATTTATGCAATGGATTGGATACCGCAATACAAAACTTTATGCCATCCAAACGTTAAATTATTCATATCTCACGGTGGTCTACTCGGTACGCAAGAGGCTGTTGCGTGCAGTGTTCCTATACTGATGGTGCCGTTGTACGCTGATCAGGCTTTAAACGCACGTGCTATGAGCGATCGAGGCGTCGCTAGGATTGTGACATTACGCGATTCGACCACCGAGATATGGAGAGACGCGTTAAGACAGCTATTGACAAATACGAGGTACAAACAGAAAGCTATCGAACTTAGAGATAAATTCTTGGACCGGCCTCTACCACCTCTGGAGACTGGGATTTACTGGATCGAATACGTCATAAGACACAGGGGTGCACATCACCTACGGTCCCCAGCTCTCGACTTGACCTACGCCCAGTACCACCTGCTCGATGTGGCAGCCCTAATCATAGCCATCACCGCCACCATCACATACATACTACATAAGCTGTTCAGATACCTATGCACCCGTTGCGTTCGGTGGTGCGAGAAACACACCGTTATAGAGAAGAGACTCTTTATAAGGAACAGTAGTTTGTTCCAGTGTTTTCTTTGGTTATACAAAGTGAAGCCTAATTAG

Protein sequence:

>DPOGS214629-PA
MQALLCLALCCATVHAASILAVLPTNTKSHYAMYGRLIEALAINGHNITVITHFAMKNPLPNVDEINLAGTIPDLANNLTKQQSTFKPDTIRNLENIIKECVHACDVVSQHAEVKALVNSSKTFDLVIIEVFGSECFLPFGKRFDAPVVGLLSSVPLPWLNDQLGNPEETAYVPAYMMGYGQHMNLFQRFINTVAVIWAKAFYRNKSQIPSQIIADRLFGPGPRLESLAQNYSLVLSNSHFSINEVRPLVPALVEVGGLHLDTTQQLPKELRNLLDNADEGIIYWSFGSMSRIETIPYVQLTQIFAALSELPQTVLVKMNKKMLQGNLTVPDNIYAMDWIPQYKTLCHPNVKLFISHGGLLGTQEAVACSVPILMVPLYADQALNARAMSDRGVARIVTLRDSTTEIWRDALRQLLTNTRYKQKAIELRDKFLDRPLPPLETGIYWIEYVIRHRGAHHLRSPALDLTYAQYHLLDVAALIIAITATITYILHKLFRYLCTRCVRWCEKHTVIEKRLFIRNSSLFQCFLWLYKVKPN-