Monarch geneset OGS2.0

DPOGS200186
Transcript: DPOGS200186-TA (1452 bp)
Protein: DPOGS200186-PA (483 aa)
Genomic position: DPSCF300128 + 721174-725523
RNAseq coverage: 84x (Rank: top 64%)
Annotation
Heliconius       HMEL022507          5e-156   57.49%
Bombyx           BGIBMGA014622-TA    5e-75    37.40%
Drosophila       CG17323-PA          5e-67    30.63%
EBI UniRef50     UniRef50_G9LPR2     4e-79    35.84%   UDP-glycosyltransferase UGT42B2 n=6 Tax=Obtectomera RepID=G9LPR2_HELAM
NCBI RefSeq      XP_001662128.1      2e-75    33.18%   glucosyl/glucuronosyl transferases [Aedes aegypti]
NCBI nr blastp   gi|379698994        2e-84    36.44%   UDP-glycosyltransferase UGT42A2 precursor [Bombyx mori]
NCBI nr blastx   gi|379698994        3e-83    37.11%   UDP-glycosyltransferase UGT42A2 precursor [Bombyx mori]
Group
Gene Ontology    GO:0008152   4.1e-102   metabolic process
                 GO:0016758   4.1e-102   transferase activity, transferring hexosyl groups
KEGG pathway     ame:408788   3e-69
 K00699 (UGT) maps-> Drug metabolism - cytochrome P450
    Starch and sucrose metabolism
    Porphyrin and chlorophyll metabolism
    Steroid hormone biosynthesis
    Pentose and glucuronate interconversions
    Ascorbate and aldarate metabolism
    Drug metabolism - other enzymes
    Metabolism of xenobiotics by cytochrome P450
    Retinol metabolism
InterPro domain  [1-446] IPR002213   4.1e-102   UDP-glucuronosyl/UDP-glucosyltransferase
Orthology group  MCL34341   Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200186-TA
ATGGCATTCCAACCACTATTCCAGGAATTAGCTGTTAGAGGACATCGTGTTACTGTTATAAATAATTATCCGGATAGTAATCCTGAACCAAATTTAAAATACGTCGACATCAGTTTACCGTCTGCTAGAAAAATGCCTGTCATGTCCGAATTTGAGAATATCAGTTCTAAATATTTACATATCAGTAATTATTTAAAACATTTTATTCTTAGCAAAAATAACGTAGAGGGTGATTGTGATAATCTTTTCACGAATCCTAATATGCGAACATTTTTAAGTGAGGGTGAACGGTTTGATGTGATTTTCGTTGAGCAGTTTATGAGTGATTGTGGACTGGTTTTAGCTGGAACGATGTACGATGCTCCGATAATTGGTATAACGTCACATACACTTCTGCCCTGGGCTTACTCAAGGCTCGGTATATCTTTTGACGTTCTGTCAGATGCATTTTATTTTTCAAACGTTGGCACACAGCCGTCACTCTTAAAATCCATAGAAAACTATTATATGCATCTTTATATGAACACAATTGGAAGTTGGAAAATACAACGAATAATTAGCGATGTATTCAAACGACACGCTCCGAATGCTACTTTAGATTTTGAAATGATAGTTAGGGATAAAATGAAAATGATGTTTGTTTATCAACACTTTTCGGTGACGGGGGCCCGTTCGCTTCCGCCGCAGTTGGTCGAAATAGCCGGCATTCATATAAAAAAACCGAAACCAGTGTCACGGGACATAGAAAAATTTTTATCATCTGCGAAACACGGTGCGATTTACGTCAGCTTCGGTTCAAATTTGAAATCTAGTCTGATGTCCGAAAAGAGAAGACAAGCGTTTTTGGATGCGTTCAAAAAAATCCCCCAAAAGATTTTATGGAAATTAGAAAACGGTACTCTACCAGACGGCAACGACAATATACTCACAAGTTCTTGGTTTCCTCAACTAGATGTTTTATGTCACCCTAAGGTGAGAGGGTTCATATCTCACGGGGGAATGTTGAGTCTATCTGAAGCCGCCTACTGTGGCAAACCAATACTGGCTATGCCTTTCTTTGGAGATCAATTTTCAAATGTAGCGGCGATCGAAGAGAGCGGCCTCGGTCTGTCTATGTATTTTAATGAAGCTGACTCCGCGAGTCTAGTTGCAGCTATAAACAAATTAACATCAAACGAGATGCAACACAAAGCTGAACGCATATCAAAAATCTTCAATGACAGACCTTTAAACGTTATGGATAACGCCATCTATTGGACAGAGTACGTAGCTAAATATAGAGATGTACCAAAGCTCCCTGCTTTTTATACGCCCTGGTACAGGAGGCTAATACTAGATGCCGCTATAATCACAGCTGACGTTGGGATGACTTCCGGTTCATCCACTTACCTGGTAGGTCTGGGACTGGAAGGCCAACAGCCGACCCTGGCCACCGACCCCGCGTTGATCTAA

Protein sequence:

>DPOGS200186-PA
MAFQPLFQELAVRGHRVTVINNYPDSNPEPNLKYVDISLPSARKMPVMSEFENISSKYLHISNYLKHFILSKNNVEGDCDNLFTNPNMRTFLSEGERFDVIFVEQFMSDCGLVLAGTMYDAPIIGITSHTLLPWAYSRLGISFDVLSDAFYFSNVGTQPSLLKSIENYYMHLYMNTIGSWKIQRIISDVFKRHAPNATLDFEMIVRDKMKMMFVYQHFSVTGARSLPPQLVEIAGIHIKKPKPVSRDIEKFLSSAKHGAIYVSFGSNLKSSLMSEKRRQAFLDAFKKIPQKILWKLENGTLPDGNDNILTSSWFPQLDVLCHPKVRGFISHGGMLSLSEAAYCGKPILAMPFFGDQFSNVAAIEESGLGLSMYFNEADSASLVAAINKLTSNEMQHKAERISKIFNDRPLNVMDNAIYWTEYVAKYRDVPKLPAFYTPWYRRLILDAAIITADVGMTSGSSTYLVGLGLEGQQPTLATDPALI-
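
Consistency check (illustrative sketch, not part of the original record):

The lengths listed above can be cross-checked with a few lines of Python. The sketch assumes that the transcript contains only coding sequence (no UTRs), that the trailing "-" in the protein sequence marks the stop codon, and that the genomic coordinates are 1-based and inclusive. None of this is stated in the record itself. The function names are hypothetical helpers written for this example.

```python
def coding_length_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """True if the transcript is exactly the protein's codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


def genomic_span(start: int, end: int) -> int:
    """Length of an inclusive coordinate range (assumed 1-based)."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 1452
protein_aa = 483
span = genomic_span(721174, 725523)

# 483 codons + 1 stop codon = 484 codons = 1452 bp
print(coding_length_consistent(transcript_bp, protein_aa))

# Genomic span, and the part of it not covered by the transcript
print(span, span - transcript_bp)
```

Under these assumptions the transcript and protein lengths agree, and the genomic span is longer than the transcript, as would be expected if the gene model contains introns.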