Monarch geneset OGS2.0

DPOGS211510
Transcript: DPOGS211510-TA; 924 bp
Protein: DPOGS211510-PA; 307 aa
Genomic position: DPSCF300354; 100186-101901
RNAseq coverage: 103x (Rank: top 60%)
Annotation
Heliconius: HMEL022609; 2e-112; 65.20%
Bombyx: BGIBMGA001338-TA; 1e-102; 58.90%
Drosophila: CG15661-PB; 5e-51; 36.79%
EBI UniRef50: UniRef50_G9LPR1; 4e-103; 59.42%; UDP-glycosyltransferase UGT41D1 n=7 Tax=Obtectomera RepID=G9LPR1_HELAM
NCBI RefSeq: NP_001182388.1; 9e-73; 46.41%; UDP-glucosyltransferase [Bombyx mori]
NCBI nr blastp: gi|363896100; 2e-102; 59.42%; UDP-glycosyltransferase UGT41D1 [Helicoverpa armigera]
NCBI nr blastx: gi|363896100; 6e-105; 61.41%; UDP-glycosyltransferase UGT41D1 [Helicoverpa armigera]
Group
Gene Ontology: GO:0008152; 2.6e-102; metabolic process
Gene Ontology: GO:0016758; 2.6e-102; transferase activity, transferring hexosyl groups
KEGG pathway: dpo:Dpse_GA10135; 3e-49
K00699 (UGT) maps to:
    Drug metabolism - cytochrome P450
    Starch and sucrose metabolism
    Porphyrin and chlorophyll metabolism
    Steroid hormone biosynthesis
    Pentose and glucuronate interconversions
    Ascorbate and aldarate metabolism
    Drug metabolism - other enzymes
    Metabolism of xenobiotics by cytochrome P450
    Retinol metabolism
InterPro domain: [12-307] IPR002213; 2.6e-102; UDP-glucuronosyl/UDP-glucosyltransferase
Orthology group: MCL17542; Lepidoptera specific

Nucleotide sequence:

>DPOGS211510-TA
ATGGAAGCAAATTATGTGTCAGACTTCTCCGCTATATCTTCAGCTCGCGGAGTTCCCCTCCCTCCGTACAACGACGCTCTGTATAACGTGTCAGTACTGCTGGTCAACTCTCACCCGTCCTTCGCCCCAGCTAGGAGTCTGCCACCCAACGTGGTCGATATAGCGGGCTACCACATGGATGAGAACATAGCACCACTGCCCAAGGATCTCCAAGACCTCTTGGACTCTTCCACCAAAGGAGTCGTGTACTTCAGTATGGGATCCGTTTTGAAGTCAGCTAATTTACCGGAGAAGACTAAGGAGGGTCTTATCAAAGTGTTCAGCGAGTTGCCTTACACGGTGCTCTGGAAGTTTGAGGAGAAGATCGAGGGTCTTCCCAAGAACGTGCACGTCAGGCCCTGGATGCCACAGTCCAGTATCTTATCTCATCCAAATGTACTAGTATTTATAACACACGGTGGTCTTCTCTCGACCCTGGAGTCTCTATACCATGGAATACCCATCATCGCGATCCCCGTGTTCGGAGACCAGCCTGGGAACGCCAAGCGATGTGTAAGGGAGGGCAGAGCTCTCATGGTCACCATCGGCCCAGATATGGCTCAAGACCTGGAAAAAGCTCTCAAGGAGATGCTCGGGAATGACAGTTATTATAAAAAAGCAAAGGAGCTGTCCAAGCTGTTCCGGAGCCGGCCGGTGAAGCCCAACAAGCTGATTCAGCATTACGTGGAGTTAGCTATTGAGAGTAAAGGAGCGTACCATCTCCGTTCCAAGACGCATCTCTACAAGTGGTACGAGCTGTGGATGCTGGATCAGATCGCATTCGTCTTAGCTGTGCTCGCGATAGCGTTTAGTCTCCTCAAGAAGGTGGCTGGACTGTTCATGACGAAACAGAAGCAGAAGGGGAAGAAGGAAAAGAAGAATTAA

Protein sequence:

>DPOGS211510-PA
MEANYVSDFSAISSARGVPLPPYNDALYNVSVLLVNSHPSFAPARSLPPNVVDIAGYHMDENIAPLPKDLQDLLDSSTKGVVYFSMGSVLKSANLPEKTKEGLIKVFSELPYTVLWKFEEKIEGLPKNVHVRPWMPQSSILSHPNVLVFITHGGLLSTLESLYHGIPIIAIPVFGDQPGNAKRCVREGRALMVTIGPDMAQDLEKALKEMLGNDSYYKKAKELSKLFRSRPVKPNKLIQHYVELAIESKGAYHLRSKTHLYKWYELWMLDQIAFVLAVLAIAFSLLKKVAGLFMTKQKQKGKKEKKN-
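
The transcript and protein lengths listed in this record can be cross-checked with simple codon arithmetic. The sketch below is illustrative only: the function name `expected_protein_length` is hypothetical and not part of any MonarchBase tooling, and it assumes the transcript is a complete coding sequence whose final codon is a stop codon (consistent with the nucleotide sequence ending in TAA and the protein sequence ending in "-").

```python
def expected_protein_length(transcript_bp: int, has_stop_codon: bool = True) -> int:
    """Return the number of amino acids encoded by a coding sequence.

    Assumes the whole transcript is coding sequence (no UTRs), which is an
    assumption made for this record, not something the database states.
    """
    if transcript_bp % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = transcript_bp // 3
    # The stop codon occupies one codon but contributes no amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above: 924 bp transcript, 307 aa protein.
print(expected_protein_length(924))  # 307
```

Under these assumptions, 924 bp gives 308 codons, and dropping the stop codon leaves 307 amino acids, matching the protein length reported above.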