Monarch geneset OGS2.0

DPOGS206678
TranscriptDPOGS206678-TA1401 bp
ProteinDPOGS206678-PA466 aa
Genomic positionDPSCF300048 + 1106142-1109696
RNAseq coverage426x (Rank: top 29%)
Annotation
HeliconiusHMEL0226540.076.07% 
BombyxBGIBMGA014622-TA2e-6636.68% 
DrosophilaCG17323-PA1e-7734.01% 
EBI UniRef50UniRef50_G9LPR50.072.65%UDP-glycosyltransferase UGT44A2 n=2 Tax=Obtectomera RepID=G9LPR5_HELAM
NCBI RefSeqXP_392319.31e-7735.58%PREDICTED: similar to CG17323-PA [Apis mellifera]
NCBI nr blastpgi|3638961080.072.65%UDP-glycosyltransferase UGT44A2 [Helicoverpa armigera]
NCBI nr blastxgi|3638961080.072.65%UDP-glycosyltransferase UGT44A2 [Helicoverpa armigera]
Group
Gene OntologyGO:00081521e-121metabolic process
GO:00167581e-121transferase activity, transferring hexosyl groups
KEGG pathwayame:4087884e-77 
 K00699 (UGT)maps-> Drug metabolism - cytochrome P450
    Starch and sucrose metabolism
    Porphyrin and chlorophyll metabolism
    Steroid hormone biosynthesis
    Pentose and glucuronate interconversions
    Ascorbate and aldarate metabolism
    Drug metabolism - other enzymes
    Metabolism of xenobiotics by cytochrome P450
    Retinol metabolism
InterPro domain[44-466] IPR0022131e-121UDP-glucuronosyl/UDP-glucosyltransferase
Orthology groupMCL34670 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206678-TA
ATGCCAAACCCGCCTGTCAACTATCATGAAGTTCTACTTAATGACAAACAAGATAATAAAGGCCTGTCATTCGAATCAGTTATTGTGAATGAAGTATCGCGTGTGCCGTTCGAAACTCTGGTATCAACTAAAGAAGGAAACGATGATTGCAAAACGCTAATGAACAATCACGAGGTACTGCACATGATAAGAACGCAACCTAAATATTCTGTGATAATAGTCGAATCATACAATAGTGACTGCGCCCTTGCGTTGGCGGCAAATTTAAGTTCTCCGTACATAGCGTTTAATCCTCAGTCGATACACCCTTGGCATTTCAGTAGACTAGGAATACATTTCAACTCAGCTTACGTCCCCCAATCTCTACTGCCGTTTGGAAAAGAACCATGGTTTTTTGATAGAGTCAAAGGTTTTATATTGTACCACGTAGCGAACTGGGTGTATTATATCGGTTCGCAAGTAACGGATCACGTGTACCTCTATAAATATTTAGGGGATAATCTACCAGCGTTGGAGAGCATAGCGTCAAATGCCAGCCTCGTGTTTGTGAACACCCACAAATCTGTTTTCGGGGGTGTGGTGCGAGCTGATAATGTTGTCGACATCGGAGGAATACATATCAGACCACCCAAAAGTATACCTACGCATATAGAAAGATTTATTAACGAAGCTGAAAACGGAGTTATCTACGTCAACTTGGGGTCAACCGTCAAAGATTTCACATTACCGAGCGACAAACTCACAGAACTAATATCAACGTTCAGAAAATTACAACTCCGAATATTATGGAAATGGGATGGAGACAGCGTGGAAAATCTGCCAAGAAACGTTATGACTATGAAATGGTTTCCGCAGTATGATATTTTAAAACATGACAACGTAAAGGCGTTTATCTCCCACGGTGGTATTCTAAGTTGTACAGAGGCGTTGGATGCCGGCGTGCCAGTGGTAGCTATTCCTTTGTTTGGCGAACAGTATGGCAATTCCGCAGCCCTAGTTGATGCTGGCATTGCCAGTATAGTCACATATGAGAATCTTAAAGATGAACTACTGTTAGACGCCATCAATGAGGTCTTGGATCCAAGATGCCAGCAACAAGCTAAGCTTGTTTCTCGAATGTGGCACGACCGTCCGATGAATGCCTTAGAAACCGCCATCTATTGGATTGAATACGTAGCTCGATACAATGGTTCGCCAAATATGGGAGCGCCATCTGTTAAAGTACCTTGGTACCAACAACTGCAACTAGATGTCCTCGCATTTATTTTTATAGTATTTTATATTGTAATGTACGCTTTTTACAAAGTATTAAAAGTTTGCTGCTGTTGTTGTTGTCAACCGGAACCCCCAGTTGAAAAAATATCACGAGAACGAACGACAAGAAGAGTCAAATTTGAATAA

Protein sequence:

>DPOGS206678-PA
MPNPPVNYHEVLLNDKQDNKGLSFESVIVNEVSRVPFETLVSTKEGNDDCKTLMNNHEVLHMIRTQPKYSVIIVESYNSDCALALAANLSSPYIAFNPQSIHPWHFSRLGIHFNSAYVPQSLLPFGKEPWFFDRVKGFILYHVANWVYYIGSQVTDHVYLYKYLGDNLPALESIASNASLVFVNTHKSVFGGVVRADNVVDIGGIHIRPPKSIPTHIERFINEAENGVIYVNLGSTVKDFTLPSDKLTELISTFRKLQLRILWKWDGDSVENLPRNVMTMKWFPQYDILKHDNVKAFISHGGILSCTEALDAGVPVVAIPLFGEQYGNSAALVDAGIASIVTYENLKDELLLDAINEVLDPRCQQQAKLVSRMWHDRPMNALETAIYWIEYVARYNGSPNMGAPSVKVPWYQQLQLDVLAFIFIVFYIVMYAFYKVLKVCCCCCCQPEPPVEKISRERTTRRVKFE-