Monarch geneset OGS2.0

DPOGS210370
TranscriptDPOGS210370-TA1665 bp
ProteinDPOGS210370-PA554 aa
Genomic positionDPSCF300025 + 612210-619646
RNAseq coverage745x (Rank: top 17%)
Annotation
HeliconiusHMEL0138460.089.86% 
BombyxBGIBMGA011932-TA0.090.24% 
Drosophilal(2)k01209-PC0.073.65% 
EBI UniRef50UniRef50_Q8MQK40.073.65%Uridine kinase n=19 Tax=Opisthokonta RepID=Q8MQK4_DROME
NCBI RefSeqNP_001153854.10.077.89%uridine-cytidine kinase 1-like 1 isoform 1 [Acyrthosiphon pisum]
NCBI nr blastpgi|3320315820.076.22%Uridine-cytidine kinase-like 1 [Acromyrmex echinatior]
NCBI nr blastxgi|3071717070.075.27%Uridine/cytidine kinase-like 1 [Camponotus floridanus]
Group
Gene OntologyGO:00055241.7e-74ATP binding
GO:00167731.7e-74phosphotransferase activity, alcohol group as acceptor
GO:00081521e-59metabolic process
GO:00163011e-59kinase activity
KEGG pathwayapi:1001660320.0 
 K00876 (E2.7.1.48, udk)maps-> Drug metabolism - other enzymes
    Pyrimidine metabolism
InterPro domain[115-316] IPR0007641.7e-74Uridine kinase
[116-303] IPR0060831e-59Phosphoribulokinase/uridine kinase
Orthology groupMCL14079 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210370-TA
ATGTCTCTGAAGCTGAACAAATTTGAGACTCCAAGCTCTGCTAGCTCAGATAGTGACACAGAATTGGCCCTAGAGCCGGAGCAGGTATTCCTAGATGATGAGGACTACACACGAGCCCCGGCCTCACCTACAACCGTACCCAATGTACGGACATATAGACCACCTTCCACAGGTTCGAATATAATGAAGTCCCCCCGAACGCCTCGTGTTCGTACAGCGTCTATGTCGCAGTCGAACAAACGGACAGCGGCGGGTAGCATTTTACACGCGGACAGACGGACTATATATACAGCTGGCAGGCCCCCTTGGTATAACTGTACCGGGGGTCAGGAGGTGGAGCCCTTTCTTATAGGTATCTGCGGAGCGAGCGCGTCGGGGAAGACGACTGTCGCCGAAAAAATAGTAGAATCCCTTAACATACCCTGGGTCACCATCGTCAGCATGGACTCCTTCTATAAGGTGCTGACGGAGAAGCAGCACATAGCCTCGATGCATAATGAATACAACTTCGATCACCCGGACGCGTTCGACATGGACCTGCTGGTGGGGGTGCTGCAGAGGCTGCGCGAGGGGAAGAAGGTCGAGGTGCCCATATACAACTACGTGACGCACTCCAGGGAGAACAGGACGAAAACAATGTACGGTGCGAACGTCATAATATTCGAGGGTATTCTGGCGTTTTATAACACGGAGGTGCTGAAGTTGCTGGATATGAAGGTGTTCGTTGACACTGACGCGGACATCAGGCTCGCGAGGAGACTGAGGAGAGATATAGTCCAACGAGGCCGAGACCTCGAAGGCGTCCTGAAGCAGTACATGACGTACGTGAAGCCTTCCTACCAGAGTTACATAGCGCCGTGCATGGCACACGCGGACATTATAGTGCCGAGGGGCGGAGAGAACAAGGTGGCCATCAGCCTCATAGTACAACACGTCCACAAACAACTACAACTGCGAGGCTTCAAGGTCCGGGAGAAGTTGGCGGTGGCGCACATAGGCCAGCCAGTGCCGGACTCGTTGTACGTACTCAAAGATACCCCGCAGGTCCAGGGTCTGCACACGTTCATCCGCAACAAGGACACTCCCCGTGACGAGTTCATCTTCTACTCCAAGCGTCTCATGCGCCTGGTCATAGAGTTCGCCCTGTCCCTCATGCCCTACTCCGACCACTCCGTGGACACGCCGCAGGGGATTCCCTACACCGGTCGGAAGTGCGACGTGGAGAAGATCTGCGGCGTGTCCATCCTCAGGGCGGGAGAGACGATGGAGCAGGCCGTCTGTGATGTTTGTAAGGACATACGGATCGGAAAGATCCTCATCCAGACCAACCAGCAGACGGACGAGCCAGAGCTGTACTATCTGCGCCTGCCGAAGGACATCAAGGACTACCAGGTGATCCTGATGGACGCCACGGTGGCGACGGGCGCCGCCGCCATCATGGCCATCCGCGTGCTGCTCGACCACGACGTGCCCGAGACCAACATCTCGCTCGTGTCGCTGCTCATGGCCGAGATCGGCGTGCACTCCATAGCGTACGCCTTCCCGCAGGTAAAAATCGTCACATCAGCCCTGGACCCGGAGATAAACGAAAAGTTCTACGTGCTGCCGGGGATCGGCAACTTCGGGGATCGATACTTCGGTACGGAACCCGCCGACGACGAGTGA

Protein sequence:

>DPOGS210370-PA
MSLKLNKFETPSSASSDSDTELALEPEQVFLDDEDYTRAPASPTTVPNVRTYRPPSTGSNIMKSPRTPRVRTASMSQSNKRTAAGSILHADRRTIYTAGRPPWYNCTGGQEVEPFLIGICGASASGKTTVAEKIVESLNIPWVTIVSMDSFYKVLTEKQHIASMHNEYNFDHPDAFDMDLLVGVLQRLREGKKVEVPIYNYVTHSRENRTKTMYGANVIIFEGILAFYNTEVLKLLDMKVFVDTDADIRLARRLRRDIVQRGRDLEGVLKQYMTYVKPSYQSYIAPCMAHADIIVPRGGENKVAISLIVQHVHKQLQLRGFKVREKLAVAHIGQPVPDSLYVLKDTPQVQGLHTFIRNKDTPRDEFIFYSKRLMRLVIEFALSLMPYSDHSVDTPQGIPYTGRKCDVEKICGVSILRAGETMEQAVCDVCKDIRIGKILIQTNQQTDEPELYYLRLPKDIKDYQVILMDATVATGAAAIMAIRVLLDHDVPETNISLVSLLMAEIGVHSIAYAFPQVKIVTSALDPEINEKFYVLPGIGNFGDRYFGTEPADDE-