Monarch geneset OGS2.0

DPOGS200225
Transcript: DPOGS200225-TA (1410 bp)
Protein: DPOGS200225-PA (469 aa)
Genomic position: DPSCF300414 + 5918-9358
RNAseq coverage: 63x (Rank: top 68%)
Annotation
Heliconius      HMEL022531        0.0      82.20%
Bombyx          BGIBMGA008381-TA  5e-112   72.11%
Drosophila      CG30438-PD        7e-148   54.25%
EBI UniRef50    UniRef50_G9LPS1   0.0      66.92%   UDP-glycosyltransferase UGT50A2 n=2 Tax=Obtectomera RepID=G9LPS1_HELAM
NCBI RefSeq     XP_001816444.1    2e-161   60.14%   PREDICTED: similar to CG30438 CG30438-PB [Tribolium castaneum]
NCBI nr blastp  gi|363896120      0.0      66.92%   UDP-glycosyltransferase UGT50A2 [Helicoverpa armigera]
NCBI nr blastx  gi|363896120      0.0      66.92%   UDP-glycosyltransferase UGT50A2 [Helicoverpa armigera]
Group
Gene Ontology
    GO:0008152   2.6e-126   metabolic process
    GO:0016758   2.6e-126   transferase activity, transferring hexosyl groups
KEGG pathway: dme:Dmel_CG6658   4e-61
K00699 (UGT) maps ->
    Drug metabolism - cytochrome P450
    Starch and sucrose metabolism
    Porphyrin and chlorophyll metabolism
    Steroid hormone biosynthesis
    Pentose and glucuronate interconversions
    Ascorbate and aldarate metabolism
    Drug metabolism - other enzymes
    Metabolism of xenobiotics by cytochrome P450
    Retinol metabolism
InterPro domain: [5-427] IPR002213   2.6e-126   UDP-glucuronosyl/UDP-glucosyltransferase
Orthology group: MCL13802 Single-copy universal gene

Nucleotide sequence:

>DPOGS200225-TA
ATGGGAGGTACAAAATCTCATAAAATACCATTCTTGGAACTGGCCAGAGGTCTGCTTAGAAGGAATCACAATATTACTTTAATAAGTCCTTTCACGCCTGATTTTCATATTGAGGGTCTGGAGGAGATCACTCCGGAAGGCTTGATGTCCTTTGTAAGAGGATATATGACCTTTGACCTGGTGGGTGCAAGGATGAGAGGTGAAGAACCATTGCCATATATGGATATTATAAGATATGGTTATGAGGCATGCGAAGCTTTTCTAAGCGACGTCGAGACAAAATCCTTTTTGAGATCAGGAAGGAATTACGATCTTATTGTTCTAGACGGGGCATATCCTGAGTGCGCTTTAGGCCTTGTCTATCGAATGAAAGTGCCTTTCATGTATATTAATACGGTGGGATTTTACGCAGGACCAACGAGTATTTCTGGTAGTCCGGCACCATACTCTGTCACTCCGATTTTTGCCAAAGCTTTCACAGATAATATGGGATTCATAGATAGAACACTCAATACTATGTGGAATATTGGAGCCATGTTAGGGCACATCGCAAGTGTTACAGTTTTGCAAGGAGTTTTAAAGAGACATTTTGGCTCAAAGATCCCTCACGTGTATGATATGGGCAAAAATGTCAGTTTTATTCTCCAAAACGGACATTATTCGGTTTCGTATCCGCGGCCGTTTTTACCAAATGTTGCTGAAGTAGCCTGTATTCACTGCAAGGAACCCAAGGTTTTAAGTTCGGATTTAGAAGAGTGGATAGCTGGTGCTGGTGAAGCTGGTTTTGTGTACGTATCTATGGGATCATCAGTTCGAACGAACAAGATGCCATTATCAGCGCACCGTCTGTTCGTGAAGGCATTGGGAAGACTGCCGCAGCGAGTTCTTTGGAAACAGGACGGGGAGCAAAACATGACGGACATACCTACCAACACCAGGATATACAACTGGCTACCACAGCAAGATTTATTAGGTCATCCGAAAATAAAGGCGTTTGTGACTCACGGAGGTCTTCTCAGTATGTTCGAGACGGTGTACCACGGAGTCCCCATCATCAGTATACCAGTTTTCTGCGACCACGACGCAAACGCAGCTAAGGCTGAACTCGATGGGTTTGCAAAAAAGTTAGATCTTCAGCAGCTCACCTCAGATAAACTATACAAAGCTATCAAAGAAGTGATCAATGACCCAAAATATAAGAAGCAAGTCAAAATTAGACAAACACTCCTGAGAGATCAAAAGGAATCACCCTTGGAACGGGCTATTTATTGGACGGAGTTCGGTTTTAAACGTGCCGTCGAATATTTACACAACAAGAGAATGGATAAGTTGATCAACAAGTCTAGCGATTTGTTAAAGAGATCCTCTAAATTAATAGGTGAAGCCACAATGAAGAAGAAACGGTTGTAA

Protein sequence:

>DPOGS200225-PA
MGGTKSHKIPFLELARGLLRRNHNITLISPFTPDFHIEGLEEITPEGLMSFVRGYMTFDLVGARMRGEEPLPYMDIIRYGYEACEAFLSDVETKSFLRSGRNYDLIVLDGAYPECALGLVYRMKVPFMYINTVGFYAGPTSISGSPAPYSVTPIFAKAFTDNMGFIDRTLNTMWNIGAMLGHIASVTVLQGVLKRHFGSKIPHVYDMGKNVSFILQNGHYSVSYPRPFLPNVAEVACIHCKEPKVLSSDLEEWIAGAGEAGFVYVSMGSSVRTNKMPLSAHRLFVKALGRLPQRVLWKQDGEQNMTDIPTNTRIYNWLPQQDLLGHPKIKAFVTHGGLLSMFETVYHGVPIISIPVFCDHDANAAKAELDGFAKKLDLQQLTSDKLYKAIKEVINDPKYKKQVKIRQTLLRDQKESPLERAIYWTEFGFKRAVEYLHNKRMDKLINKSSDLLKRSSKLIGEATMKKKRL-
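
The transcript and protein lengths listed in this record are consistent with each other: 469 codons plus one stop codon give 3 x (469 + 1) = 1410 bp, and the protein sequence above ends in "-" at the stop position. The following is a minimal Python sketch of that check, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The function name translate and the stop_symbol parameter are illustrative choices, not part of the database; only the first and last codons of DPOGS200225-TA are used as inputs.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the first base varying slowest. '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1.

    stop_symbol defaults to '-' to match the convention used in this
    record's protein sequence (an assumption made for illustration).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First eight codons and last seven codons of DPOGS200225-TA.
first_codons = "ATGGGAGGTACAAAATCTCATAAA"
last_codons = "ATGAAGAAGAAACGGTTGTAA"

print(translate(first_codons))  # start of DPOGS200225-PA
print(translate(last_codons))   # end of DPOGS200225-PA, including the stop

# Length check using the values listed in the record.
transcript_bp = 1410
protein_aa = 469
print(transcript_bp == 3 * (protein_aa + 1))
```

Passing the full nucleotide sequence from the record to translate and comparing the result with the protein sequence would extend the same check to the whole coding region.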