Monarch geneset OGS2.0

DPOGS211622
TranscriptDPOGS211622-TA975 bp
ProteinDPOGS211622-PA324 aa
Genomic positionDPSCF300325 - 260396-265420
RNAseq coverage949x (Rank: top 13%)
Annotation
HeliconiusHMEL0063042e-15780.56% 
BombyxBGIBMGA011774-TA2e-14578.12% 
DrosophilaCG16758-PG2e-9758.66% 
EBI UniRef50UniRef50_E2BJE62e-11055.85%Purine nucleoside phosphorylase n=3 Tax=Endopterygota RepID=E2BJE6_HARSA
NCBI RefSeqXP_391850.23e-11463.09%PREDICTED: similar to CG16758-PD, isoform D [Apis mellifera]
NCBI nr blastpgi|3320176485e-12065.00%Purine nucleoside phosphorylase [Acromyrmex echinatior]
NCBI nr blastxgi|3320176487e-11865.20%Purine nucleoside phosphorylase [Acromyrmex echinatior]
Group
Gene OntologyGO:00167631.5e-134transferase activity, transferring pentosyl groups
GO:00047311.9e-102purine-nucleoside phosphorylase activity
GO:00061391.9e-102nucleobase, nucleoside, nucleotide and nucleic acid metabolic process
GO:00091162.9e-49nucleoside metabolic process
GO:00038242.9e-49catalytic activity
KEGG pathwayame:4082999e-114 
 K03783 (punA)maps-> Purine metabolism
    Nicotinate and nicotinamide metabolism
    Pyrimidine metabolism
InterPro domain[21-315] IPR0013691.5e-134Purine phosphorylase, family 2
[55-313] IPR0112701.9e-102Purine nucleoside phosphorylase I, inosine/guanosine-specific
[55-313] IPR0112689.8e-97Inosine guanosine/xanthosine phosphorylase
[55-302] IPR0008452.9e-49Nucleoside phosphorylase domain
Orthology groupMCL12172 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211622-TA
ATGGCTCCAGTGAACGCTGATGTTGTTAATGGTAGCAAGGTGACTTCACCGGTCAAGAAAACACTACCCGAAGTTGAAGACAGCAATGGAAAAAGATATTCATACGAAATGCTCGTGGAGACAGCCAACTTCCTCCTATCCAAAACGGATGTGCGTCCCATCATTGGGGTTATTTGCGGATCTGGGATGGGTTCGTATTGGAACGAGGGTTCACTGGCTGAGAACATCGCGCAACCTGAATGTATCGCTTATGAAGACATACCAAATTTTCCTATCAGCACGGTTGAAGGTCATCACGGAAAACTAGTTTTTGGGTATATAGGTGAAGTTCCGGTTGTTGCTATGCAGGGCAGATTCCACTATTACGAAGGATACCCCCTTTGGAAGTGTTGTTTGCCAGTACGAGTTATGAAACTCCTTGGAGTAAAAGCTCTGATTGCGACGAATGCAGCGGGAGGTCTGAATCCAAATTACAAAATTGGGGATGTGATGATTGTGAGAGATCATATTAATATGATGGGTTTTGCTGGAAATAACCCTTTACATGGACCTAACGATGAACGCTTCGGTCCACGATTCCCACCGATGAACAAAGCCTATAATTATGAGTTCAGAAAAGTTGCTAAAGAGGTGGCTAAAGAATTAAATATCGATAATATAGTAAGGGAAGGTGTGTACACATGTTTAGGAGGACCGAATTTTGAAACTGTAGCAGAATTAAACATGTTGAAAATGCTTGGGGTGGATGCGGTTGGGATGTCTACTGTTCATGAGGTCATAACAGCAAGACACTGCGACATGAGCGTCTTCGCTCTGTCTTTAATAACGAATGAGTGTGTTACAAGCTACGATCATGATGCTGAAGCTAATCACGAGGAAGTTTTGGACGTTGGACGTATGCGCCAGGGCATACTTAGGGACTACGTATCGAAACTAGTAAACAGATTCGTAAAATATTTACCTCAAAACCCTTGA

Protein sequence:

>DPOGS211622-PA
MAPVNADVVNGSKVTSPVKKTLPEVEDSNGKRYSYEMLVETANFLLSKTDVRPIIGVICGSGMGSYWNEGSLAENIAQPECIAYEDIPNFPISTVEGHHGKLVFGYIGEVPVVAMQGRFHYYEGYPLWKCCLPVRVMKLLGVKALIATNAAGGLNPNYKIGDVMIVRDHINMMGFAGNNPLHGPNDERFGPRFPPMNKAYNYEFRKVAKEVAKELNIDNIVREGVYTCLGGPNFETVAELNMLKMLGVDAVGMSTVHEVITARHCDMSVFALSLITNECVTSYDHDAEANHEEVLDVGRMRQGILRDYVSKLVNRFVKYLPQNP-