Monarch geneset OGS2.0

DPOGS206680
TranscriptDPOGS206680-TA2079 bp
ProteinDPOGS206680-PA692 aa
Genomic positionDPSCF300048 + 1117241-1121092
RNAseq coverage109x (Rank: top 59%)
Annotation
HeliconiusHMEL0226590.068.03% 
BombyxBGIBMGA014622-TA2e-14055.88% 
DrosophilaCG17323-PA5e-8634.53% 
EBI UniRef50UniRef50_G9LPR20.071.75%UDP-glycosyltransferase UGT42B2 n=6 Tax=Obtectomera RepID=G9LPR2_HELAM
NCBI RefSeqXP_317561.42e-9237.83%AGAP007920-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3638961020.071.75%UDP-glycosyltransferase UGT42B2 [Helicoverpa armigera]
NCBI nr blastxgi|3638961020.070.70%UDP-glycosyltransferase UGT42B2 [Helicoverpa armigera]
Group
Gene OntologyGO:00081522.8e-134metabolic process
GO:00167582.8e-134transferase activity, transferring hexosyl groups
KEGG pathwaydme:Dmel_CG173234e-84 
 K00699 (UGT)maps-> Drug metabolism - cytochrome P450
    Starch and sucrose metabolism
    Porphyrin and chlorophyll metabolism
    Steroid hormone biosynthesis
    Pentose and glucuronate interconversions
    Ascorbate and aldarate metabolism
    Drug metabolism - other enzymes
    Metabolism of xenobiotics by cytochrome P450
    Retinol metabolism
InterPro domain[14-465] IPR0022132.8e-134UDP-glucuronosyl/UDP-glucosyltransferase
Orthology groupMCL14581 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206680-TA
ATGAGTTTGTATAGGTTATGTGTTTGTTTTCTATTAATATGGCAAAGTGTCATTGCTCTTAACATTTTAGCAATATACCCTTACCATGGGAAGAGCCATTTCATTGTGTTCAAAGTTTATTTACAAGAACTGGCAAGGAGAGGTCACAATGTTACTGTGATTTCGCACTTTCCAGAAGTTAATCCATCAAAGAACTACCACGACATCAGTTTAGCTGGAAGTATGCAAGAAATGGAAGATAGATTACCTTTCCACAGATCATATTTAACTGTTCTTGCCACTGCTCTTTATCTTACAAAATTTGGAACCGATAACTGTAAGGTTATGTTAGAAAACGAAAGGGTTCGTAATCTTATAAAGGATAAACCTAAGTTTGATGTAATTGTGTTAGAACAATTTAACAGTGATTGTGCTCTGGGTATAGCATACAAATTGGGTGCACCAGTTGTTGGCACTACATCCCATGTTTTAATGCCGTGGCATTATAATAGATTAGGTATTCCGAACAATCCCTCTTATGTCTCCTTCCATTTTTTGGAAGGTGGCACTAAGCCAACACTGTTCCAAAGAATAGAAAGATTTATTTTTAATTTATACTTTAACACTGTATACTATTATACATCACAAAGAGTAGATCAACAAACACTTGCTAATTATTATGATGATATACCCCCACTGGAAGATTTGGGTCGTCAAATGAAGTTTTTAATGCTATATCATAACTTTATCCTAACTGGATCAAGACTGTTTCCAGCTAATGTTATTGAAATAGGTGGTTACCATGTCAAAGAAGCAAAACCCTTGACTGGGGATTTATTAAAGTTTGTAGAAGAAGCAGAACATGGTGTGATATATGTTAGTTTTGGATCAGTTGTAAAAAGTTCAACAATGCCTGCCGACAAATTGAATGCTGTATTAGAAGCTATGACAGAACTGCCCCAAAGATTCATCTGGAAATGGGAGACAGATGTAGTACTTCTAGACAAAAAGAAGCTTTACATCTCAAGTTGGTTACCACAGGTTGATATTTTAGGCAATCCAAAAACATTGGCATTCCTGTCACATTCTGGTATGGGTGGGACAACGGAAGCAATACATTTTGGAGTGCCAGTGGTAGCAATGCCCGTCGTTGGTGACCAACCATCCAATGCAGCAGCAGTGGAAGAAAGTGGCCTAGGAGTGACGCTTCAGATACGTGATCTCACTAAGGAAAATCTTCTGGCTGCTTTCAGAAAAGTTTTAGATCCTAAATTTCGAGAAAACGTCAAAAGCATATCTAAAGCATGGCACGATCGTCCACTGTCCCCTCTAGACACTGCTGTGTATTGGACAGAGTTTGCTGCGCGGTATCCAAACCAAACGTTCAGAACCCCTGCGGCTGATGTCGCTTATATCATAATAATGGCATGCTGCTGTAAACCTGGTCGATCCAACAAGTCCTCATTAAGTTGCTTAGGGCGCTGTGACCTTCCCAAACATCAATGTTTGGACAAACACGAAGTAAGACCCGGACCAGTACCTAAATACATGGGATGGCACTATACAGCAAAAGATGCCCCTGAACATCAGTTGCGTTGTGGTTGGCGACCCGGCCATATTGCCTGTCCAGTTTTACGTAGAATCGAACAACACAATTGCATGAAATCCAGCGAATGTTCACCATGCGTCTCGAGTTGCGCAAAACCGCCATGTTACAAACCGTGTTGCATGCCACCGAAGTGTGTCGCCGTGTGCCCCTCTATACCAACATGCTGTAAAACTCCTCCTTGCAATCAACCATCAATAAGAATAGTAAGACATGGCGGTCGTTACACTATCAGCACCAAGCCTTCCAACATAAGCAATGCACCTTCTGGACCATACCCTCTAAAATATGTTCTAAACACTGATGATGACGATGAAACCAATGCTAAATTTAGTGTGGAAGCAAAATTCAACGACTACAATTTCGACTACACCAGCTCGCAGTCGTCTTTTGTTTTGGATTTTTCACCTCCAGAAGCGAGGTGCTTCAAGCCATGTCCTAAGAGGAGCACATCCTGTTTGCTTTGTAAAATAGATGGGAAGTGTCCCAAGTAG

Protein sequence:

>DPOGS206680-PA
MSLYRLCVCFLLIWQSVIALNILAIYPYHGKSHFIVFKVYLQELARRGHNVTVISHFPEVNPSKNYHDISLAGSMQEMEDRLPFHRSYLTVLATALYLTKFGTDNCKVMLENERVRNLIKDKPKFDVIVLEQFNSDCALGIAYKLGAPVVGTTSHVLMPWHYNRLGIPNNPSYVSFHFLEGGTKPTLFQRIERFIFNLYFNTVYYYTSQRVDQQTLANYYDDIPPLEDLGRQMKFLMLYHNFILTGSRLFPANVIEIGGYHVKEAKPLTGDLLKFVEEAEHGVIYVSFGSVVKSSTMPADKLNAVLEAMTELPQRFIWKWETDVVLLDKKKLYISSWLPQVDILGNPKTLAFLSHSGMGGTTEAIHFGVPVVAMPVVGDQPSNAAAVEESGLGVTLQIRDLTKENLLAAFRKVLDPKFRENVKSISKAWHDRPLSPLDTAVYWTEFAARYPNQTFRTPAADVAYIIIMACCCKPGRSNKSSLSCLGRCDLPKHQCLDKHEVRPGPVPKYMGWHYTAKDAPEHQLRCGWRPGHIACPVLRRIEQHNCMKSSECSPCVSSCAKPPCYKPCCMPPKCVAVCPSIPTCCKTPPCNQPSIRIVRHGGRYTISTKPSNISNAPSGPYPLKYVLNTDDDDETNAKFSVEAKFNDYNFDYTSSQSSFVLDFSPPEARCFKPCPKRSTSCLLCKIDGKCPK-