Monarch geneset OGS2.0

DPOGS204754
TranscriptDPOGS204754-TA1497 bp
ProteinDPOGS204754-PA498 aa
Genomic positionDPSCF300231 - 125784-128344
RNAseq coverage15x (Rank: top 81%)
Annotation
HeliconiusHMEL0225454e-9540.16% 
BombyxBGIBMGA002854-TA1e-6434.76% 
DrosophilaCG15661-PB2e-3525.45% 
EBI UniRef50UniRef50_G9LPS01e-8137.25%UDP-glycosyltransferase UGT48A1 n=1 Tax=Helicoverpa armigera RepID=G9LPS0_HELAM
NCBI RefSeqNP_001037040.12e-6932.72%phenol UDP-glucosyltransferase [Bombyx mori]
NCBI nr blastpgi|3638961184e-8137.25%UDP-glycosyltransferase UGT48A1 [Helicoverpa armigera]
NCBI nr blastxgi|3638961183e-8437.25%UDP-glycosyltransferase UGT48A1 [Helicoverpa armigera]
Group
Gene OntologyGO:00081521.7e-50metabolic process
GO:00167581.7e-50transferase activity, transferring hexosyl groups
KEGG pathwaydme:Dmel_CG156611e-33 
 K00699 (UGT)maps-> Drug metabolism - cytochrome P450
    Starch and sucrose metabolism
    Porphyrin and chlorophyll metabolism
    Steroid hormone biosynthesis
    Pentose and glucuronate interconversions
    Ascorbate and aldarate metabolism
    Drug metabolism - other enzymes
    Metabolism of xenobiotics by cytochrome P450
    Retinol metabolism
InterPro domain[11-474] IPR0022131.7e-50UDP-glucuronosyl/UDP-glucosyltransferase
Orthology groupMCL34563 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204754-TA
ATGGACAAATTATTGTTACTATCGTGTTGTGTGGTCGCGAGTCACTGTTCCAACATCTTAGTAGTGTTCCCAGTACCGGAGAGGAGTCATCGTGTTCTGGGGGAATCTATAGTTAAAATTTTATTGGAGGCCGGTCATGAGGTGACCTGGGTGACTCCTTTCCCGAGGGACATAAAATGGTCTAAATTAGATTACATTGACATTAGCTCGGCTCTGATACACGAGACAGAGACGGGAAATAATGACGACCATGGCCCGAGTATTTCAGAACTGAGACTTAATATTCAGCAGCTGGGGTCTCGGTACGCCGGTCTGGCTCTCCGACACCCCGCCCTTCAGGAGCTGATGGTGAATACCACCGTACGCTTCGAGGCCGTGGTCGCGGAATGGTATCACTCTGGACTACTGGCTCCACTGGCATCCGTCTTCGATTGTCCATTAGTCTGGTATACTCCTGAAGATATCTCTTGGCAGACATATGGCCTCGTGCATGGAGATAGCAGTCTCGATTTTATGGCTTCCACTCTTCAATCCCCGAGCTACTCTCTCCCAGAGAGATTAAGATATTTGTGGTCAAAACTCTGTTTTGGTGTTCGAAATTACTTTCACATAAGCGCGACTGAATTGCCTGAATATGAAGCATGTTACCTCCGCGCGTTTCAGTCCCGGAGGCGTGTCTTACCAAACTATGAAGAATTGGTTTACCAGGGATCAGCACTTTTAATTAATTCTCATCCGCCACTCGGACACAAAATACCTCTACCATTGAATGCGAAATTTAATATTGCAAAATTATTAGATGGATCGAAGGCAGGAGTTATTTATGTAAACTTAGAATCGCATGTCACAAGCGGAGAGGTGTCACATCATGTTATACAGGAGCTTATAGAGATATTTGGTGTAGTTCAGCAAACTGTGATATGGAAGAGCGAGGAAATCCAGTGGAGCCTTCCACAAAACGTGTTCATGATGAAGAATCCACCTCAAAATATTATACTGAATCACACAAACACCATAGCATATATAAACCACGGCCAGATGCTTTCAATCGTGGATGCGATCAACTTTGGAGTGCCGGTCATCGGTATACCACTTCTAGAAGATCACATTGTCAACATGGACTCTGTAGTGAAAAGAGGATGCGGCATTAAAGTTGACTATACCAACGAGTTTGCCTGGAAGGTTAAAGACGCTGTCAACAGGATACTTAAAATGTCAAGTTATCGTGAGCAATCAAATAAAGATAAATTGATATTCCGCAACCGTGTGGCTACTCCTCAATCAGAAGTCCTGCACTTGATGCAACTGGTGCTGGACTCAGATGGAGCTGGACACCTGCAGTCCTCGACGCTGTTTCTTTCAGTCATGGAGAGACATAACTTGGACATTATTATACTAGTGTTGATGTTCTTTTGGTTCCTGAACAGGGCATGGAGCTTGTTCGGTGCGTACTTTGTTTGGGGGGAAGACGACAGTGATGATAAAAAGTACCAGTAA

Protein sequence:

>DPOGS204754-PA
MDKLLLLSCCVVASHCSNILVVFPVPERSHRVLGESIVKILLEAGHEVTWVTPFPRDIKWSKLDYIDISSALIHETETGNNDDHGPSISELRLNIQQLGSRYAGLALRHPALQELMVNTTVRFEAVVAEWYHSGLLAPLASVFDCPLVWYTPEDISWQTYGLVHGDSSLDFMASTLQSPSYSLPERLRYLWSKLCFGVRNYFHISATELPEYEACYLRAFQSRRRVLPNYEELVYQGSALLINSHPPLGHKIPLPLNAKFNIAKLLDGSKAGVIYVNLESHVTSGEVSHHVIQELIEIFGVVQQTVIWKSEEIQWSLPQNVFMMKNPPQNIILNHTNTIAYINHGQMLSIVDAINFGVPVIGIPLLEDHIVNMDSVVKRGCGIKVDYTNEFAWKVKDAVNRILKMSSYREQSNKDKLIFRNRVATPQSEVLHLMQLVLDSDGAGHLQSSTLFLSVMERHNLDIIILVLMFFWFLNRAWSLFGAYFVWGEDDSDDKKYQ-