Monarch geneset OGS2.0

DPOGS208676
TranscriptDPOGS208676-TA1062 bp
ProteinDPOGS208676-PA353 aa
Genomic positionDPSCF300043 - 868340-871493
RNAseq coverage106x (Rank: top 60%)
Annotation
HeliconiusHMEL0041785e-10658.79% 
BombyxBGIBMGA003330-TA1e-9254.37% 
DrosophilaCG11158-PA1e-2228.70% 
EBI UniRef50UniRef50_UPI00021A822E1e-3733.14%UPI00021A822E related cluster n=2 Tax=unknown RepID=UPI00021A822E
NCBI RefSeqXP_001952791.13e-3733.92%PREDICTED: similar to inosine-uridine preferring nucleoside hydrolase [Acyrthosiphon pisum]
NCBI nr blastpgi|3504217241e-3734.29%PREDICTED: pyrimidine-specific ribonucleoside hydrolase rihA-like [Bombus impatiens]
NCBI nr blastxgi|3504217246e-3834.29%PREDICTED: pyrimidine-specific ribonucleoside hydrolase rihA-like [Bombus impatiens]
Group
KEGG pathwayehi:EHI_1999602e-25 
 K01239 (E3.2.2.1, iunH)maps-> Purine metabolism
    Nicotinate and nicotinamide metabolism
InterPro domain[41-347] IPR0019103.7e-52Inosine/uridine-preferring nucleoside hydrolase domain
[41-299] IPR0231869.5e-49Inosine/uridine-preferring nucleoside hydrolase
Orthology groupMCL17222 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208676-TA
ATGTTTAATAAATGGCATTACTCCGTGGCGTCCGTCCTTTTAGTCATGATCTTTGTAACTTTGATAGTATTATTTGCCGTGTCAAGCAGTGCTGGGCCAGCTCCTGACGGCCAAAAAGTGAAAAAAAAGTTAATTATCGACCATGACGGCGGAGCAGATGATGCCATGGCAATATTTATCGCCGTGCTATATGAGAAATACTTTGATGGGCCAGAGGTAGTCGCACTCACAACAACCTTTGGCAATGTTGGGGAAGATCAAGTTTTCAACAACAGCCAAAGAATCCTCAGTGTTGCTGATAGAAGAGATGTTCCAATTTATAGAGGAAGTAAAGTTTCATTGATACAGACTATTCCCACCGATGCTTTTTTTGGTTATGATGGACTTGGAGACAATGAAATAATTGAACATTTTTCGCCAATTGAAGCCCAGAAGGACTGTGCTGCCATCGCCCTTATTGAATTATCAAAGAAATATAAAGGTGATTTAATAATAGTTTCTATAGGTCCCCTCACAAATATAGCACTAGCTATGAGACTGGATCCTTTATTCCTATCTAGACTTTCACATATTTACATTGGAGCCGGAAACGTCTATGGTGACAATTATAAGAATGCCGAGTTCAACGCAGCAATGGATGTCGAAGCTTATTATATTGTAACACAGAGTGGAATTCAAGAAAAGATGACTGTAGTACCGTTTTCACAAATTATAGATTATTTACCATTAACTCAGGATTGGAGAGTAAAGGAACTGGGATCAATCACGACGAGAATAATGAAACATCAAAATGACTTCGAACGGACGTCTATAAATAGTAGTACGACATGGTGTCTCTTAGATCCAGCTGTTATGGCAATAGCATTAGAGGAAACCTCAATAGTAGAAGAGATACGTTATTCTAATCACAGTGTGATGATTTGCAATGCAGACAGAGGCCGAAACACTAACATATACTCAACAAAAGAAGATGCAAATGTAGCTTTAGTGTACAGAGTGAGGAAGGAGGCCTATGAAGATTTCCTTTATTCCGTGTTTGCTTCTGAACTAAGATCTACCTAG

Protein sequence:

>DPOGS208676-PA
MFNKWHYSVASVLLVMIFVTLIVLFAVSSSAGPAPDGQKVKKKLIIDHDGGADDAMAIFIAVLYEKYFDGPEVVALTTTFGNVGEDQVFNNSQRILSVADRRDVPIYRGSKVSLIQTIPTDAFFGYDGLGDNEIIEHFSPIEAQKDCAAIALIELSKKYKGDLIIVSIGPLTNIALAMRLDPLFLSRLSHIYIGAGNVYGDNYKNAEFNAAMDVEAYYIVTQSGIQEKMTVVPFSQIIDYLPLTQDWRVKELGSITTRIMKHQNDFERTSINSSTTWCLLDPAVMAIALEETSIVEEIRYSNHSVMICNADRGRNTNIYSTKEDANVALVYRVRKEAYEDFLYSVFASELRST-