Monarch geneset OGS2.0

DPOGS205682
Transcript: DPOGS205682-TA (2697 bp)
Protein: DPOGS205682-PA (898 aa)
Genomic position: DPSCF300250 - 440976-451287
RNAseq coverage: 6x (Rank: top 87%)
Annotation
Heliconius: HMEL022626; 1e-133; 48.82%
Bombyx: BGIBMGA013860-TA; 0.0; 45.44%
Drosophila: Ugt35b-PA; 1e-58; 33.59%
EBI UniRef50: UniRef50_G6CZJ2; 0.0; 49.50%; Antennal-enriched UDP-glycosyltransferase n=11 Tax=Obtectomera RepID=G6CZJ2_DANPL
NCBI RefSeq: NP_001040425.1; 9e-124; 44.66%; antennal-enriched UDP-glycosyltransferase [Bombyx mori]
NCBI nr blastp: gi|363896128; 9e-123; 44.47%; UDP-glycosyltransferase UGT33D4 [Bombyx mori]
NCBI nr blastx: gi|114051706; 5e-123; 44.74%; antennal-enriched UDP-glycosyltransferase precursor [Bombyx mori]
Group
Gene Ontology:
    GO:0008152; 6.7e-104; metabolic process
    GO:0016758; 6.7e-104; transferase activity, transferring hexosyl groups
KEGG pathway: dpo:Dpse_GA19751; 2e-56
    K00699 (UGT) maps to:
        Drug metabolism - cytochrome P450
        Starch and sucrose metabolism
        Porphyrin and chlorophyll metabolism
        Steroid hormone biosynthesis
        Pentose and glucuronate interconversions
        Ascorbate and aldarate metabolism
        Drug metabolism - other enzymes
        Metabolism of xenobiotics by cytochrome P450
        Retinol metabolism
InterPro domain: [1-388] IPR002213; 6.7e-104; UDP-glucuronosyl/UDP-glucosyltransferase
Orthology group: MCL19122 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205682-TA
ATGCTTCGTAATATGAATCCCCTCTTTATGAAGGTCATTGAGTATCATTTTCAATCCAAAGAAGTCCAGGAAATCGTAGCTAATAATAAATATGATCTGATATTGTTAGAATCTATTGTTCTCTCGGGATTGATATACTCACACATATTCAAGGCTCCAGTGATATTAGTGAGTTCATTCGGAGGTTATATAAATGAACATAAAATAATGGGGACACCGACTGCACCTATTTTGTATCCATTGCCTCTGCGAAATAAAATTTACAATCTTAATTTTTTTGAAAAGATCCGAGAAATATACAGACATTATTCAAACGAATATGCAGAATATTTGAATGACCTTGACATTGATAAATTTTTGAAAGATAGATTTGGTTCCCAAACTCCAACTATAAATGAATTGAGTGATAATATTCATATGCTCTTTTTAAATGTTCACACCATTTGGGCAGATCATAAGCCCAGCACTCCGAATATTGTCTATATGGGTGGTATACACCAAGTACCACAGAAAGATTTACCGAAGGACCTTGAGACATTCCTCAATTCTTCTAAACACGGAGTCATATATGTAAGCTTTGGGACAAATGCTTTGTCATATATGATTCCTTCAGATAAAATAGAAAATGTGGTAAAAGTTCTATCAAAACTTCCCTACGATGTGTTATGGAAATGGGATGGAGAGGAATTGCCGGGAAAGTCAGACAATATTAGGTTATCCAAATGGTTCCCACAATCTGATCTGTTGAGACATCCAAATATAAAACTTTTTATAACACAAGCTGGACTGCAATCTACTGATGAAGCTATAACAGGTGGGGTACCGTTAGTTGCCATACCAATGTTTGGCGATCAATGGTACAATGCTGAAAAATTTGAAAAATTCGGTATTGGTATTCAACTAGACATTACAAGCTTTACAGAGGAAGAACTGCATAATGCTGTAATTACCGTAATAAATAATGAAAGCTACCGGAACAACGTTTTTAAACTTCGTGAAATAATTCTTGATCAACCAATGAGTTCTATAGAACGTGCAATGTGGTGGACAGAATATGTATTAAGACACAGAGAAAAGAATCATTTTCGTACTCTAGCTAGTAACTTGTCATACATGGATTACTTCGATGTAAAGTTTTGGATGACTATTTTTGCAATCATTGTAGACATAAGTAAAAACATGAAAATTGAAATACTAACAATTTTTTTTATATTGGTATGGGTATATCAAGTAAAATCAGCAAGAATATTAGGTGTATTCCCAGTACCATCACTTAGCCATCAAATCGTTTTCCGTAAGATTACTCAAGAACTCCATAAACGAGGACATGAAATGACAGTGTTAACACCAGACCCAGCTTATCCAAAAGGAACTGCACCCGCAAATTATACCGAAATAGATTTTCACGATGCATCATACAAAATATTCAAAGCAAATATTTATGCCAGTTATAAAAGCGAAGGTTTAGCAATTAACTTCGACGCGGTTAGAGAAATATACAACCATTATTCAAACGAATATGCAGAATATTTGAATGACCTTGATCGTGATAAATTTTTAAAAGAGAGTTTTGGTCCCCAAACTCCAACTATGAATGAATTGAGTGAAAATATTCATATGGTCTTTCTAAATGTTCATACCATTTGGGCCGATAACAAACCTACTACTCCGAATATTCTCTACCTGGGTGGCATACACCAAGTACCGCAAAAAAAATTGCCAAAGATCAGAGAAATATACAGACATTATTCAAACGAATATGCAGAATATTTGAATGACCTTGACAATGATAAATTATTGAAAGAGAGATTTGGTTCCCAAACTCCAACTATAAATGAATTGAGTGATAATATTCATATGCTCTTTTTAAATGTTCACACCATTTGGGCCGATCATAAGCCCAGTACTCCGAATATTGTCTATATGGGTGGTATACACCAAGTACCACAGAAAGATTTACCGAAGGACCTTGAGACATTCCTCAATTCTTCTAAACATGG
AGTGATATATGTAAGCTTTGGGACAAATGCTTTGTCATATATGATTCCTTCAGATAAAATAGAAAATGTGGTAAAAGTTCTATCAAAACTTCCCTACGATGTGTTATGGAAATGGGATGGAGAGGAATTGCCGGGAAAGACAGACAATATTAGGTTATCCAAATGGTTCCCACAATCTGATCTATTGAGACATCCAAATATAAAACTTTTTATAACACAAGCTGGACTGCAATCTACTGATGAAGCTATAACAGGTGGGGTACCGTTAGTTGCCATACCAATGTTTGGCGATCAATGGTACAATGCTGAAAAATTTGAAAAATTCGGTATTGGTATTCAACTAGACATTACAAGCTTTACAGAGGAAGAACTGCATAATGCTGTAATTACCGTAATAAATAATGAAAGCTACCGGAACAACGTTTTTAAACTTCGTGAAATAATTCTTGATCAACCAATGAGTTCTATAGAACGTGCAATGTGGTGGACAGAATATGTATTAAGACACAGAGAAAAGAATCATTTTCGTACTCTAGCTAGTAACTTGTCATACATGGACTACTTCGATGTAAAGTTTTGGATGACTATTTTTGCAATCATTGGTATCTTATTAACGTTATTTGTGGTAACGATTGCATATGTCATAAAATTTCTTATTAAGATATGGCTTGCATATAATAAGGTAAAAAAACAATAA

Protein sequence:

>DPOGS205682-PA
MLRNMNPLFMKVIEYHFQSKEVQEIVANNKYDLILLESIVLSGLIYSHIFKAPVILVSSFGGYINEHKIMGTPTAPILYPLPLRNKIYNLNFFEKIREIYRHYSNEYAEYLNDLDIDKFLKDRFGSQTPTINELSDNIHMLFLNVHTIWADHKPSTPNIVYMGGIHQVPQKDLPKDLETFLNSSKHGVIYVSFGTNALSYMIPSDKIENVVKVLSKLPYDVLWKWDGEELPGKSDNIRLSKWFPQSDLLRHPNIKLFITQAGLQSTDEAITGGVPLVAIPMFGDQWYNAEKFEKFGIGIQLDITSFTEEELHNAVITVINNESYRNNVFKLREIILDQPMSSIERAMWWTEYVLRHREKNHFRTLASNLSYMDYFDVKFWMTIFAIIVDISKNMKIEILTIFFILVWVYQVKSARILGVFPVPSLSHQIVFRKITQELHKRGHEMTVLTPDPAYPKGTAPANYTEIDFHDASYKIFKANIYASYKSEGLAINFDAVREIYNHYSNEYAEYLNDLDRDKFLKESFGPQTPTMNELSENIHMVFLNVHTIWADNKPTTPNILYLGGIHQVPQKKLPKIREIYRHYSNEYAEYLNDLDNDKLLKERFGSQTPTINELSDNIHMLFLNVHTIWADHKPSTPNIVYMGGIHQVPQKDLPKDLETFLNSSKHGVIYVSFGTNALSYMIPSDKIENVVKVLSKLPYDVLWKWDGEELPGKTDNIRLSKWFPQSDLLRHPNIKLFITQAGLQSTDEAITGGVPLVAIPMFGDQWYNAEKFEKFGIGIQLDITSFTEEELHNAVITVINNESYRNNVFKLREIILDQPMSSIERAMWWTEYVLRHREKNHFRTLASNLSYMDYFDVKFWMTIFAIIGILLTLFVVTIAYVIKFLIKIWLAYNKVKKQ-