Monarch geneset OGS2.0

DPOGS208154
Transcript: DPOGS208154-TA (1038 bp)
Protein: DPOGS208154-PA (345 aa)
Genomic position: DPSCF300058 - 14772-28080
RNAseq coverage: 500x (Rank: top 25%)
Annotation
Heliconius       HMEL003128         8e-131   65.85%
Bombyx           BGIBMGA013774-TA   1e-122   62.61%
Drosophila       CG6330-PA          2e-88    51.89%
EBI UniRef50     UniRef50_F4X7J5    2e-88    51.96%   Uridine phosphorylase 1 n=22 Tax=Coelomata RepID=F4X7J5_ACREC
NCBI RefSeq      XP_001651486.1     8e-90    53.00%   uridine phosphorylase [Aedes aegypti]
NCBI nr blastp   gi|350402278       3e-89    51.30%   PREDICTED: uridine phosphorylase 1-like isoform 2 [Bombus impatiens]
NCBI nr blastx   gi|383854140       7e-86    51.80%   PREDICTED: uridine phosphorylase 1-like [Megachile rotundata]
Group
Gene Ontology:
  GO:0004850   5.2e-129   uridine phosphorylase activity
  GO:0005737   5.2e-129   cytoplasm
  GO:0009166   5.2e-129   nucleotide catabolic process
  GO:0009116   1.2e-21    nucleoside metabolic process
  GO:0003824   1.2e-21    catalytic activity
KEGG pathway:
  aag:AaeL_AAEL005839   2e-89
  K00757 (E2.4.2.3, udp) maps to:
    Drug metabolism - other enzymes
    Pyrimidine metabolism
InterPro domain:
  [36-338]   IPR010059   5.2e-129   Uridine phosphorylase, eukaryotic
  [36-338]   IPR018017   5.2e-129   Nucleoside phosphorylase
  [90-311]   IPR000845   1.2e-21    Nucleoside phosphorylase domain
Orthology group: MCL11127 (Multiple-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208154-TA
ATGATAGAAGCTAGTAACTTTGTCGTTTCTGAATTTAGGATGGATTGCGACTGTGATTTTCTTTTGCCACCTAGAGACAGTTGGGAGGGACATTACGGGAACTGCTCGTCTATTTGGTGTAAGGAACACGACGAACCTGCAAAGAATAAAGATGGCACTATCAGACTGAGAAACAAACATCTCGTGGCACTTGAATACGACGTGTTGTATCACCTCGGGCTTGATACCAAGAGCAATGATCTTCAAACCATGTTCGGGGATGTCAAGTTCGTTTGTATGGGTGGAACTAAAAAAAGAATGAAGGATTTCGCTGAATACATTGCCAATGTGCTCCAACTACCTAATGAGGGGCTGGTTAATATTACAAAGAATTCCCATAGATACGCCATGTATAAAATTGGACCGGTGTTGTCGGTTAATCACGGCATGGGAGTTCCCTCTATGACAATACTTCTGCAGGAAATCATAAAAATGCTTTTCTACGCTAAAGCCAAAGACCCAATATTCTTCAGGATCGGTACTTGTGGAGGATTAAAGATACCAGCGGGATCTGTCGTCATATCCTCCTGGGCACTAAACGGGACCATGGAGAAGTCATATAACTTACCGATCATGGGTGAAGTGCACAAACTGCCATCGTTTTTTGATAAGCGCCTTAACCAAGAATTGCACTTTCTTGCGTCCGAGGAGACGGGCTTCGAAACGTTCATAGCTGGTACTATGGCCGCTGATGACTTTTATCAAGGTCAAGCGCGGCTGGATGGTCCTTTCTGTGACTACTCAGAGGCAGACAAGATGTCATTTCTGAATCAGTTGTTCGATATCGGGGTCAGGAACATTGAAATGGAGGCTACCGCCTTCGCTGCGTTGACTTCACAGGCTGGTATAAGAGCGGCCGATGTTTGCGTCACATTTCTCGACAGACTTAAAGGGGATCAGGTAACGTCAAGCAAGTCTAAGCTAGTCGAGCTTCAGGAAAGACCTATGGTGCTTGTCGGAAAATATATCTCCAGATATTATGTGACGAAATTAAAATAG

Protein sequence:

>DPOGS208154-PA
MIEASNFVVSEFRMDCDCDFLLPPRDSWEGHYGNCSSIWCKEHDEPAKNKDGTIRLRNKHLVALEYDVLYHLGLDTKSNDLQTMFGDVKFVCMGGTKKRMKDFAEYIANVLQLPNEGLVNITKNSHRYAMYKIGPVLSVNHGMGVPSMTILLQEIIKMLFYAKAKDPIFFRIGTCGGLKIPAGSVVISSWALNGTMEKSYNLPIMGEVHKLPSFFDKRLNQELHFLASEETGFETFIAGTMAADDFYQGQARLDGPFCDYSEADKMSFLNQLFDIGVRNIEMEATAFAALTSQAGIRAADVCVTFLDRLKGDQVTSSKSKLVELQERPMVLVGKYISRYYVTKLK-
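
The transcript and protein records above are related by translation: 1038 bp gives 346 codons, which is 345 amino acids plus one stop codon. The sketch below is one way to check that relationship. It assumes the standard genetic code (NCBI translation table 1) and that the record writes the stop codon as "-". The names `CODON_TABLE` and `translate` are illustrative and do not come from the database.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1).
# Codons are enumerated in TCAG order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, writing stop codons as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First six codons and last five codons of DPOGS208154-TA, copied from the record.
print(translate("ATGATAGAAGCTAGTAAC"))
print(translate("ACGAAATTAAAATAG"))
```

Passing the full DPOGS208154-TA sequence to `translate` and comparing the result with DPOGS208154-PA would extend the same check to the whole record.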