Monarch geneset OGS2.0

DPOGS208830
Transcript: DPOGS208830-TA; 789 bp
Protein: DPOGS208830-PA; 262 aa
Genomic position: DPSCF300036 + 703206-708766
RNAseq coverage: 476x (Rank: top 26%)
Annotation
Heliconius: HMEL004189; 2e-103; 80.77%
Bombyx: BGIBMGA007936-TA; 1e-136; 87.97%
Drosophila: CG6364-PB; 2e-89; 72.09%
EBI UniRef50: UniRef50_D6WCQ7; 2e-89; 72.89%; Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WCQ7_TRICA
NCBI RefSeq: XP_393563.1; 4e-100; 71.15%; PREDICTED: similar to Probable uridine-cytidine kinase (UCK) (Uridine monophosphokinase) (Cytidine monophosphokinase) isoform 1 [Apis mellifera]
NCBI nr blastp: gi|345493729; 8e-101; 72.33%; PREDICTED: probable uridine-cytidine kinase-like isoform 2 [Nasonia vitripennis]
NCBI nr blastx: gi|345493729; 2e-97; 72.33%; PREDICTED: probable uridine-cytidine kinase-like isoform 2 [Nasonia vitripennis]
Group
Gene Ontology: GO:0008152; 1.6e-47; metabolic process
GO:0005524; 1.6e-47; ATP binding
GO:0016301; 1.6e-47; kinase activity
GO:0016773; 5.5e-25; phosphotransferase activity, alcohol group as acceptor
KEGG pathway: ame:410076; 1e-99
K00876 (E2.7.1.48, udk) maps to: Drug metabolism - other enzymes; Pyrimidine metabolism
InterPro domain: [20-214] IPR006083; 1.6e-47; Phosphoribulokinase/uridine kinase
[18-35] IPR000764; 5.5e-25; Uridine kinase
Orthology group: MCL14870; Single-copy universal gene

Nucleotide sequence:

>DPOGS208830-TA
ATGAGTGAAAAACCAACAGTTCCCATGACTAACGGCTTCGAGAGCAAAACGCCATTTCTAATCGGAGTTGCTGGCGGTACGGCAAGTGGCAAGTCAACAGTATGCCAAAGGATTATGGAGAAACTGGGACAGCAGCACAAAGAACAAACTGAACGGCGAGTTGTCTGCATCAGTCAGGACTCCTTCTACCGCACTCTCACCACGTCAGAGAGGCTGAGGGCCGAGCGGGGACAGTTCAACTTCGACCACCCGGACGCATTTGACGACAAGAAGCTCCTCGCCGTGCTCAAAGACATCCTGGACGGAAAGAAGGTGGAGGTGCCAGAGTATGATTATATCACAAACTCTATAAGCAACCGCTCGCACACCATCTACCCGGCGGACGTGGTCCTGATCGAGGGCATCCTTGTTTTCTACTTCAAGGAAGTTAGAGAACTCTTTCACATGAAGTTGTTTGTTGACACGGATTCCGACACCCGGCTCGCTAGGAGAGTACCTCGCGACATCATGGAGCGTGGCCGCGACCTGGAACAGGTCCTCAACCAGTACATGAACTTTGTGAAGCCCGCCTTCGAGGAGTTCTGTCTGCCGACAAAGAAGTTTGCTGACGTCATCATACCGAGAGGCGCTGACAATCTAGTGGCCATTGACCTTATCGTGCATCACATTTGGGATATTATGTATAAGAAGCGACCGGCGAAGATTAGCAACGGTTGTAACGGTCATATCGAGGAGGAGGGGAACGGCAGGCGGCTGTCCGGCAGCTCGGAGGACACGCTGAGCCGATAG

Protein sequence:

>DPOGS208830-PA
MSEKPTVPMTNGFESKTPFLIGVAGGTASGKSTVCQRIMEKLGQQHKEQTERRVVCISQDSFYRTLTTSERLRAERGQFNFDHPDAFDDKKLLAVLKDILDGKKVEVPEYDYITNSISNRSHTIYPADVVLIEGILVFYFKEVRELFHMKLFVDTDSDTRLARRVPRDIMERGRDLEQVLNQYMNFVKPAFEEFCLPTKKFADVIIPRGADNLVAIDLIVHHIWDIMYKKRPAKISNGCNGHIEEEGNGRRLSGSSEDTLSR-
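
Consistency check:

The transcript and protein lengths listed above agree with each other: 262 codons plus one stop codon give 3 x 262 + 3 = 789 bp. The following is a minimal sketch, not part of the original record, that translates the two ends of DPOGS208830-TA with the standard genetic code. The `translate` helper and the variable names are illustrative assumptions; the trailing `-` in the protein record is taken to mark the stop codon, which this sketch prints as `*`. The last 12 nucleotides are read in frame on the assumption that the transcript is exactly 789 bp, as stated in the record.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(nt: str) -> str:
    """Translate an in-frame nucleotide string; stop codons become '*'."""
    nt = nt.upper()
    if len(nt) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, len(nt), 3))


# First 30 and last 12 nucleotides copied from the DPOGS208830-TA record.
first_30 = "ATGAGTGAAAAACCAACAGTTCCCATGACT"
last_12 = "CTGAGCCGATAG"

print(translate(first_30))  # compare with the start of DPOGS208830-PA
print(translate(last_12))   # compare with the end of DPOGS208830-PA
print(3 * 262 + 3)          # expected transcript length in bp
```

The first fragment should reproduce the opening residues of the protein record (MSEKPTVPMT), and the second should end in the stop codon that follows the final LSR residues.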