Monarch geneset OGS2.0

DPOGS214685
TranscriptDPOGS214685-TA1095 bp
ProteinDPOGS214685-PA364 aa
Genomic positionDPSCF300503 - 29884-32513
RNAseq coverage1830x (Rank: top 7%)
Annotation
HeliconiusHMEL0226335e-10255.31% 
BombyxBGIBMGA013836-TA8e-8948.26% 
DrosophilaUgt35b-PA3e-3431.25% 
EBI UniRef50UniRef50_G9LPT04e-8245.91%UDP-glycosyltransferase UGT33K1 n=1 Tax=Bombyx mori RepID=G9LPT0_BOMMO
NCBI RefSeqNP_001135960.14e-8345.48%uridine diphosphate glucosyltransferase [Bombyx mori]
NCBI nr blastpgi|3638960621e-9151.60%UDP-glycosyltransferase UGT33F1 [Helicoverpa armigera]
NCBI nr blastxgi|3638960623e-9251.60%UDP-glycosyltransferase UGT33F1 [Helicoverpa armigera]
Group
Gene OntologyGO:00081523.4e-56metabolic process
GO:00167583.4e-56transferase activity, transferring hexosyl groups
KEGG pathwaydpo:Dpse_GA101354e-31 
 K00699 (UGT)maps-> Drug metabolism - cytochrome P450
    Starch and sucrose metabolism
    Porphyrin and chlorophyll metabolism
    Steroid hormone biosynthesis
    Pentose and glucuronate interconversions
    Ascorbate and aldarate metabolism
    Drug metabolism - other enzymes
    Metabolism of xenobiotics by cytochrome P450
    Retinol metabolism
InterPro domain[1-312] IPR0022133.4e-56UDP-glucuronosyl/UDP-glucosyltransferase
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214685-TA
ATGAAAGAATTAGTTAACCGTGGCCATGAAGTTGTAGTTATCACAACAGACCCTATTTATCCCAGCGGTCAAGGCCCAAAAAATTTGACGGAAATCGATTTGCACGACATATCGTATGATATCTGGACTAAATTAACTAAATCTGGCGAGTTTAATATTGATGATCCAAGACATCAAGCAAAAATCATTTTTGAAAAAATAGCTTTGGTTTTCGAAAAACAAATGAAAACGCCATTAGTGCAAAAGCTTTTAAATGATAAAAATAATAATTTTGACCTAGTGTTTACTGAATCTTGTATAAGGTTATCGTTAGTTTTTTCGCATTTATACAAAGCGCCTTTAATTGAAATAAGTTCTCTGGGTGGCTTGTATGGAACTTTCGACGGCATAGTTTCTCCAATACATCCATTGCTTTATCCCATAGCAAATCAACAAAAAACCTATCAGCTTTCAATGTGGGAAAAAATATATCAATTGTATGTCTATTATGGCATCGGTAGTGCATATCAAAATTTAGAAAAAATGGAAAACAATTTGCTAAAAACTTTATTTGGACCCAATACACCAGATTTACAAGACTTAAAAAAGAATGTTGATATGCTTTTCCTGAATATACATTCTGTATGGGATTTCAACCGACCTGTTCCGCCTAATGTTTTATACTTAGGAGGTTTGCACTTGCAAAGAAAACCGGTTAAGGAATTACCCAAGGATTTAAAGAATTTTTTAGATTCATCCAGTGAAGGTGTCATTTATATGAGCTTCGGGACAAACGTTTTGCCGTCAGCACTACCAGCAGAGAGGATTAAAATAATTACCAACGTCTTCTCTGATCTTCCTTATAAAATTTTATGGAGATGGGATAGTGACAAAATTCCTGAGCATTCTAAAAACGTTCTTATTTCAAAATGGTTTCCACAGTCAGACTTACTTGGTTGGTACATAATTCTGCAATGTTTAAATAAATCTCACGAATTTGTTTCGACGCAGATATCAAAGCTCATTTTATTTGTTTATTTTGTGTTGGTCAGAAAAAAGACGTTCCCAAAACATCTAATATTCTATTTAAGTCTGTGTATATTGTACATTGTTTAA

Protein sequence:

>DPOGS214685-PA
MKELVNRGHEVVVITTDPIYPSGQGPKNLTEIDLHDISYDIWTKLTKSGEFNIDDPRHQAKIIFEKIALVFEKQMKTPLVQKLLNDKNNNFDLVFTESCIRLSLVFSHLYKAPLIEISSLGGLYGTFDGIVSPIHPLLYPIANQQKTYQLSMWEKIYQLYVYYGIGSAYQNLEKMENNLLKTLFGPNTPDLQDLKKNVDMLFLNIHSVWDFNRPVPPNVLYLGGLHLQRKPVKELPKDLKNFLDSSSEGVIYMSFGTNVLPSALPAERIKIITNVFSDLPYKILWRWDSDKIPEHSKNVLISKWFPQSDLLGWYIILQCLNKSHEFVSTQISKLILFVYFVLVRKKTFPKHLIFYLSLCILYIV-