Monarch geneset OGS2.0

DPOGS208576
Transcript: DPOGS208576-TA, 1065 bp
Protein: DPOGS208576-PA, 354 aa
Genomic position: DPSCF300064 + 1599760-1604466
RNAseq coverage: 265x (Rank: top 40%)
Annotation
Heliconius:      HMEL003128        1e-75   43.61%
Bombyx:          BGIBMGA010642-TA  0.0     86.78%
Drosophila:      CG6330-PA         1e-147  73.11%
EBI UniRef50:    UniRef50_F4X7J5   2e-149  77.81%  Uridine phosphorylase 1 n=22 Tax=Coelomata RepID=F4X7J5_ACREC
NCBI RefSeq:     XP_395069.2       1e-163  80.42%  PREDICTED: similar to CG6330-PA, isoform A [Apis mellifera]
NCBI nr blastp:  gi|66512834       2e-162  80.42%  PREDICTED: uridine phosphorylase 1-like [Apis mellifera]
NCBI nr blastx:  gi|66512834       3e-159  80.42%  PREDICTED: uridine phosphorylase 1-like [Apis mellifera]
Group
Gene Ontology:
  GO:0004850  3.2e-172  uridine phosphorylase activity
  GO:0005737  3.2e-172  cytoplasm
  GO:0009166  3.2e-172  nucleotide catabolic process
  GO:0009116  1.7e-28   nucleoside metabolic process
  GO:0003824  1.7e-28   catalytic activity
KEGG pathway: ame:411599  4e-163
  K00757 (E2.4.2.3, udp) maps-> Drug metabolism - other enzymes
                                Pyrimidine metabolism
InterPro domain:
  [14-328]  IPR010059  3.2e-172  Uridine phosphorylase, eukaryotic
  [14-328]  IPR018017  3.2e-172  Nucleoside phosphorylase
  [68-317]  IPR000845  1.7e-28   Nucleoside phosphorylase domain
Orthology group: MCL11127  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS208576-TA
ATGCCTCAGAGCAACGGTGCAACGAATGGACATACCAAACACAATGGTACATCCGACGAAATCGGTGCAGACACTAGGTACCCCGATGGTACGGTGCGTCTGCGGAACCCAAACATCGAACTCATGGACCAAGATATTCTTTATCATTTAGCTCTTGGCAGCGGATCTCATGATCTCGTTGAAATGTTCGGAGATGTTAAGTTCGTCTGTATGGGGGGTACTCCGAAACGCATGGAACAGTTCGCATACACTGTGATGGCTGAGATAGGTCACAAGCTTCCGTGCGGTACAACCTTACAGGATATAAGCCAGTTCTCATATAGATACTCCATGTACAAAGTTGGCCCTGTACTATGTATCAGTCACGGTATGGGCATCCCATCCGTGGGTATCTTACTACACGAAGTCATTAAGCTGATGTACCACGCCAAGGTACGAGATCCTGTGTTTTTCCGCATCGGAACTTGTGGGGGCGTGGGTTATGAAGGTGGAACGGTCATCATATCGGAGGATACTGTTGATGGCGCGCTCAGGAATGTTCTAGAATTGACAGTTTTGGGCAAGTCCGTACAACGTCCGGCAAAGTTGGACAGGAGGCTGGCCCGCGAGCTGAAGGCTCTATCCGACCCTGAAGATCCTTACGACACGGTCATGGGCAAAACTATGTGCACCTACGACTTTTATGAAGGTCAAGGTCGTCTGGACGGTGCCTTCTGTGACTTCACGGAGGCTGACAAGATGGAATATCTCGAGAGCATCCACAAATCTGGCGTCGTCAATATAGAGATGGAGTCACTGGCCTTTGCCGCTCTCACACACCACGCTGGGGTTAAGGCGGCCGTCGTGTGCGTCACGCTACTAGATAGGCTTAAGGGGGACCAGGTTCTAGCGCCCAAAGAGGTCTTAGACGAGTGGCAACAACGTCCAACCAAACTTGTATGCCGTTACATGAAAAGGTATTTACAAATAAAAGGACGTCTCTCGTTGGACGGTCACGGATCAGTGGCTGTCAAAAGTCCGCGGCGGTTCAAGCTAGTGCAGCAAGAATCGGAGACGTACGATTAA

Protein sequence:

>DPOGS208576-PA
MPQSNGATNGHTKHNGTSDEIGADTRYPDGTVRLRNPNIELMDQDILYHLALGSGSHDLVEMFGDVKFVCMGGTPKRMEQFAYTVMAEIGHKLPCGTTLQDISQFSYRYSMYKVGPVLCISHGMGIPSVGILLHEVIKLMYHAKVRDPVFFRIGTCGGVGYEGGTVIISEDTVDGALRNVLELTVLGKSVQRPAKLDRRLARELKALSDPEDPYDTVMGKTMCTYDFYEGQGRLDGAFCDFTEADKMEYLESIHKSGVVNIEMESLAFAALTHHAGVKAAVVCVTLLDRLKGDQVLAPKEVLDEWQQRPTKLVCRYMKRYLQIKGRLSLDGHGSVAVKSPRRFKLVQQESETYD-
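
Consistency check (illustrative sketch):

The listed lengths agree with each other: 1065 bp = 3 x (354 aa + 1 stop codon). The Python sketch below shows one way to check that a coding sequence translates to the listed protein. It is not part of the original record. It assumes the standard nuclear genetic code and assumes that the trailing "-" in the protein record marks the stop codon. The names translate, BASES, AMINO_ACIDS and CODON_TABLE are hypothetical helpers introduced only for this example.

```python
from itertools import product

# Standard genetic code (assumed), laid out in the conventional TCAG order:
# first base varies slowest, third base varies fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol; "-" is used here on the
    assumption that it matches the convention of the protein record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 nt and last 12 nt of DPOGS208576-TA, copied from the record above.
print(translate("ATGCCTCAGAGCAACGGTGCAACGAATGGA"))  # MPQSNGATNG
print(translate("ACGTACGATTAA"))                    # TYD-
```

To check the whole record, pass the full DPOGS208576-TA sequence to translate and compare the result with the DPOGS208576-PA sequence.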