Monarch geneset OGS2.0

DPOGS211314
TranscriptDPOGS211314-TA1386 bp
ProteinDPOGS211314-PA461 aa
Genomic positionDPSCF300125 - 54339-56768
RNAseq coverage1x (Rank: top 94%)
Annotation
HeliconiusHMEL0226070.073.54% 
BombyxBGIBMGA004965-TA6e-17366.26% 
DrosophilaUgt86Dd-PB1e-3930.07% 
EBI UniRef50UniRef50_B5AU201e-17066.26%UDP-glycosyltransferase UGT34A2 n=5 Tax=Ditrysia RepID=B5AU20_BOMMO
NCBI RefSeqNP_001127730.12e-17166.26%uridine diphosphate glucosyltransferase [Bombyx mori]
NCBI nr blastpgi|2241847313e-17066.26%UDP-glucosyl transferase [Bombyx mori]
NCBI nr blastxgi|2241847311e-16666.42%UDP-glucosyl transferase [Bombyx mori]
Group
Gene OntologyGO:00081525.3e-65metabolic process
GO:00167585.3e-65transferase activity, transferring hexosyl groups
KEGG pathwaydpo:Dpse_GA197419e-38 
 K00699 (UGT)maps-> Drug metabolism - cytochrome P450
    Starch and sucrose metabolism
    Porphyrin and chlorophyll metabolism
    Steroid hormone biosynthesis
    Pentose and glucuronate interconversions
    Ascorbate and aldarate metabolism
    Drug metabolism - other enzymes
    Metabolism of xenobiotics by cytochrome P450
    Retinol metabolism
InterPro domain[76-418] IPR0022135.3e-65UDP-glucuronosyl/UDP-glucosyltransferase
Orthology groupMCL21039 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211314-TA
ATGACACCTTATCCTGGGTATTTTGACTATCCAGATGTGGACCGAATTGTTGAGTTAAGTGTTGCTCAAGAATCAGCAAAGTATTGGGACGAATACAAGATCACAATGACCAACACCGATGACTATTACAAAAGAATGAGAGCTATAAATGATCTTTCCATAAAAATTGCTGTAACGCAACTAACATCGAAGCCTATGACAGCTCTGTTTGCTAATCCAAACGTGAAATTCGATCTAGTGATAACTGAAGCCGATGTCCCGATACTGTATGGTGCTGCTGAAAAGTATCGAGCCCCTCATATAGCAATAACTCCAACTAGCGGAAGAATCACACAATACGAATCCAAGGGTTCACCAATACATCCAATACTGTATCCAGACGTGAACACCTTGAATTATAGAAACTTAACGCGTTGGCAGAAAGTTGTCGAGTTGTATCGCTACTTTCAAACAAGACATGAATATTATAACAACTATCTCCCCCTCTGTGAAATAGCAGCTAAAAAGATTTTTGGGCTTAAAAGAAATATTCTTGAAGTTGAATATGATATCGATTTATTATTTGTTGCTGGAAATCCTCTTTTGACAGGCAACAGACCCACTTCATATGGCATTAAATATGTTGATCGGCTACATATTAAACCAGGTTTTGGATTGTCGGATGATCTCAATGGGATTTTAGATTCAGCCACGAAAGGAGCTATTTACTTTAGTTTAGGGGCCCTGCAGGAATCTGAACACCTGTCTGATAATATTTTACAGACATTAGCTGATGCATTCAGAGAATTGCCTTATTTAGTGTTATGGAAAATTGGTAACACGACTATGATAAATAAACCAGACAACGTTCTAGCGAATGCGTGGTTCCCTCAACAAGAAATTTTGGCACATCCTAATATTAAAGCCTTTATTACTCATGGAGGTCCACGTTCCCTAGAAGAGGCTCTATTTTATCAAGTTCCAATAATTGGTTTACCAACTGTTAAGTCTAGAGCGGTATTTATTAAGGAAATAACTCGCTATGGAGCAGGAGAGGTACTGGATGTAAATTATTTAGACAAGGACAAACTGAAAGAAGTAATCAATGTAGTTGCATCTACAGATAGCTATAAAAATGCAATGATCAAGTTAAAAAGTATGGTCGTGGATCCGTTAATATCGGGGCCAGAGAATGCTCAATGGTGGACGGAATATGTATTACGACACGGAGGTGGTAAACATTTACGATCACCAGCAAGTATGCTGCCTATAATAACTAAAGCCGATGTCCCGATACTGTATGGTGCTGCTGAAAAGTATCGAGCGCCTCATATATCAATAACTCCAACTAGCGGAAGAATCACACAATACGAATCCAAGGGTTCACCAATACATCCAATACTGTAG

Protein sequence:

>DPOGS211314-PA
MTPYPGYFDYPDVDRIVELSVAQESAKYWDEYKITMTNTDDYYKRMRAINDLSIKIAVTQLTSKPMTALFANPNVKFDLVITEADVPILYGAAEKYRAPHIAITPTSGRITQYESKGSPIHPILYPDVNTLNYRNLTRWQKVVELYRYFQTRHEYYNNYLPLCEIAAKKIFGLKRNILEVEYDIDLLFVAGNPLLTGNRPTSYGIKYVDRLHIKPGFGLSDDLNGILDSATKGAIYFSLGALQESEHLSDNILQTLADAFRELPYLVLWKIGNTTMINKPDNVLANAWFPQQEILAHPNIKAFITHGGPRSLEEALFYQVPIIGLPTVKSRAVFIKEITRYGAGEVLDVNYLDKDKLKEVINVVASTDSYKNAMIKLKSMVVDPLISGPENAQWWTEYVLRHGGGKHLRSPASMLPIITKADVPILYGAAEKYRAPHISITPTSGRITQYESKGSPIHPIL-