Monarch geneset OGS2.0

DPOGS211512
TranscriptDPOGS211512-TA1284 bp
ProteinDPOGS211512-PA427 aa
Genomic positionDPSCF300354 - 90532-95775
RNAseq coverage28x (Rank: top 76%)
Annotation
HeliconiusHMEL0226092e-11164.03% 
BombyxBGIBMGA001338-TA2e-10157.28% 
DrosophilaCG15661-PB3e-5137.14% 
EBI UniRef50UniRef50_G9LPR13e-10257.41%UDP-glycosyltransferase UGT41D1 n=7 Tax=Obtectomera RepID=G9LPR1_HELAM
NCBI RefSeqNP_001182388.14e-7345.11%UDP-glucosyltransferase [Bombyx mori]
NCBI nr blastpgi|3638961001e-10157.41%UDP-glycosyltransferase UGT41D1 [Helicoverpa armigera]
NCBI nr blastxgi|3638961001e-10359.38%UDP-glycosyltransferase UGT41D1 [Helicoverpa armigera]
Group
Gene OntologyGO:00081524e-106metabolic process
GO:00167584e-106transferase activity, transferring hexosyl groups
KEGG pathwaydme:Dmel_CG156613e-49 
 K00699 (UGT)maps-> Drug metabolism - cytochrome P450
    Starch and sucrose metabolism
    Porphyrin and chlorophyll metabolism
    Steroid hormone biosynthesis
    Pentose and glucuronate interconversions
    Ascorbate and aldarate metabolism
    Drug metabolism - other enzymes
    Metabolism of xenobiotics by cytochrome P450
    Retinol metabolism
InterPro domain[32-427] IPR0022134e-106UDP-glucuronosyl/UDP-glucosyltransferase
Orthology groupMCL17542 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211512-TA
ATGTTAACTGACGTACTGACAGTATGGAAGTCTTTCGTTCACGTCTACATGAGAACAATGACCATGAAGCGTCTAATACTATTCCTGTCGCTTCTGTCCGCGGTGTTCGGACACCACATCCTCTGTATACAGAACGTGCCCTCAAGGAGCCATCACACCCTCATGATGGGGATCGTGAAGCCGCTGCTGGAAGCTGGTCACCAGGTCACAATGATAACGACGTTCCCGGAGGAGAAGCCGGCGAAGAACCTCAGATACATAGACGTCGGCCACTTGAGGGACCTGGTGCCGAAGAGTGGCTTTAAAAAACTGGGTTGCAATGTAAGTGGAAACGTTCATTTCAGACTGGACCAATCTAAGATGGAAGCAAATTATGTGTCAGACTTCTCCGCTATATCTTCAGCTCGCGGAGTTCCTCTCCCTCCGTACAACGACGCTCTTTACAACGTGTCAGTACTGCTGGTCAACTCTCACCCGTCCTTCGCCCCAGCTAGGAGTCTGCCACCCAACGTGGTCGATATAGCGGGCTACCACATGGATGAGAACTTAGCACCACTGCCCAAGGATCTCCAAGACCTCTTGGACTCTTCCACCAAAGGAGTCGTGTACTTCAGTATGGGATCCGTTTTGAAGTCAGCTAATTTACCGGAGAAGACTAAGGAGGGTCTTATCAAAGTGTTCAGCGAGTTGCCTTACACGGTGCTCTGGAAGTTTGAGGAGAAGATCGAGGGTCTTCCCAAGAACGTGCACGTCAGGCCCTGGATGCCACAGTCCAGTATCTTATCTCATCCAAATGTACTAGTATTTATAACACACGGTGGTCTTCTCTCGACCCTGGAGTCTCTATACCATGGAATACCCATCATCGCGATCCCCGTGTTCGGAGACCAGCCCGGGAACGCCAAGCGATGCGTACAAGAGGGCAGAGCTCTCATGGTCAGCATCGGTGAAAACATGGCTGAAGACCTCAATAACGCTCTCAAAGACATGCTCGGGAATGACAGTTATTATAAAAAAGCAAAGGAGCTGTCCAAGCTGTTCCGGAGCCGGCCGGTGAAGCCCAACAAGCTGATTCAGCATTACGTGGAGTTAGCTATTGAGAGTAAAGGAGCGTACCATCTCCGTTCCAAGACGCATCTCTACAAGTGGTACGAGCTGTGGATGCTGGATCAGATCGCATTCGTCTTAGCTGTGCTCGCGATAGCGTTTAGTCTCCTCAAGAAGGTGGCTGGACTGTTCATGACGAAACAGAAGCAGAAGGGGAAGAAGGAAAAGAAGAATTAA

Protein sequence:

>DPOGS211512-PA
MLTDVLTVWKSFVHVYMRTMTMKRLILFLSLLSAVFGHHILCIQNVPSRSHHTLMMGIVKPLLEAGHQVTMITTFPEEKPAKNLRYIDVGHLRDLVPKSGFKKLGCNVSGNVHFRLDQSKMEANYVSDFSAISSARGVPLPPYNDALYNVSVLLVNSHPSFAPARSLPPNVVDIAGYHMDENLAPLPKDLQDLLDSSTKGVVYFSMGSVLKSANLPEKTKEGLIKVFSELPYTVLWKFEEKIEGLPKNVHVRPWMPQSSILSHPNVLVFITHGGLLSTLESLYHGIPIIAIPVFGDQPGNAKRCVQEGRALMVSIGENMAEDLNNALKDMLGNDSYYKKAKELSKLFRSRPVKPNKLIQHYVELAIESKGAYHLRSKTHLYKWYELWMLDQIAFVLAVLAIAFSLLKKVAGLFMTKQKQKGKKEKKN-