Monarch geneset OGS2.0

DPOGS200496
Transcript: DPOGS200496-TA (708 bp)
Protein: DPOGS200496-PA (235 aa)
Genomic position: DPSCF300158 + 150572-155774
RNAseq coverage: 69x (Rank: top 66%)
Annotation
Heliconius       HMEL012806              3e-73   79.61%
Bombyx           BGIBMGA010540-TA        1e-84   75.00%
Drosophila       nmdyn-D6-PA             2e-22   37.86%
EBI UniRef50     UniRef50_UPI00022C9CB9  4e-49   50.25%   UPI00022C9CB9 related cluster n=1 Tax=unknown RepID=UPI00022C9CB9
NCBI RefSeq      XP_002730778.1          7e-51   49.49%   PREDICTED: non-metastatic cells 5, protein expressed in (nucleoside-diphosphate kinase)-like [Saccoglossus kowalevskii]
NCBI nr blastp   gi|291221542            1e-49   49.49%   PREDICTED: non-metastatic cells 5, protein expressed in (nucleoside-diphosphate kinase)-like [Saccoglossus kowalevskii]
NCBI nr blastx   gi|291221542            2e-48   50.00%   PREDICTED: non-metastatic cells 5, protein expressed in (nucleoside-diphosphate kinase)-like [Saccoglossus kowalevskii]
Group
Gene Ontology
GO:0006228   9.3e-44   UTP biosynthetic process
GO:0005524   9.3e-44   ATP binding
GO:0006241   9.3e-44   CTP biosynthetic process
GO:0006165   9.3e-44   nucleoside diphosphate phosphorylation
GO:0004550   9.3e-44   nucleoside diphosphate kinase activity
GO:0006183   9.3e-44   GTP biosynthetic process
KEGG pathway
spu:585473   5e-48
K00940 (E2.7.4.6, ndk) maps to:
    Purine metabolism
    Pyrimidine metabolism
InterPro domain
[16-153]    IPR001564   9.3e-44   Nucleoside diphosphate kinase
[166-204]   IPR007858   9.5e-11   Dpy-30 motif
Orthology group: MCL16542 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200496-TA
ATGTCTGTGGCGTCGTCTGACTCCGGTGCGGAGTATCACACGTTCTCAGAACGAACGCTGGCGATCATCAAGCCGGAGGCCTTTGATGACGCTGATGCTATCGAGGACCATATAGTCGATAACGGGTTCATGATTCTTGCTCGCCGCAAAGTCAAACTGACACCAGAGCAAGCCGCGGAGTTGTACCGCGGGCACTATGGAAGACATCACTTCCCTCACTTGGTCGCTCACATGTCTAGTGGACCTATTATAGCTCTGGTATTGGCGGCACAGAACTGCATTCACAAGTGGAGGGTGTTAATGGGACCTGCTAGAGTGGTCGAAGCGCAGGCCTATTGGCCGGACAGTCTGCGGGCTTGTTACGGACGTCGCACTAAATACGGGGACTACTTCAACGCTCTTCACGGAAGCGAAAACTACGGGGAGGCGATACGGGAAATACATTTCTTCTTTCCCGAAATGATCGTGGGTCCGCTGCTCCGTCAGTGGCAGGTAGGTGACTATATCCTGAAGTACATATCCCCGACCCTCGCCCCGGCCCTGACCACGCTCGCCCACGACCGGCCCGCGGAACCCTTGTTGTGGCTGGCCGACTACCTGCGGAGACACAACCCCAACCAGCCGGAACTGGCGCCGCAACCGACTGACATGAGGGAAGAGAGGAAATGCCAAACGCCCACGCCGTCAGAAGTCACACCAGACAAATGA

Protein sequence:

>DPOGS200496-PA
MSVASSDSGAEYHTFSERTLAIIKPEAFDDADAIEDHIVDNGFMILARRKVKLTPEQAAELYRGHYGRHHFPHLVAHMSSGPIIALVLAAQNCIHKWRVLMGPARVVEAQAYWPDSLRACYGRRTKYGDYFNALHGSENYGEAIREIHFFFPEMIVGPLLRQWQVGDYILKYISPTLAPALTTLAHDRPAEPLLWLADYLRRHNPNQPELAPQPTDMREERKCQTPTPSEVTPDK-
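
The transcript and protein records above can be checked against each other. The following is a minimal sketch, not part of the original record: it assumes the standard genetic code (NCBI translation table 1) and assumes that the trailing "-" in the protein sequence marks the stop codon. The names `CODON_TABLE` and `translate` are hypothetical and chosen only for illustration.

```python
# Sketch only: translate a coding sequence with the standard genetic code
# (NCBI translation table 1). Assumption: "-" in the protein record denotes
# the stop codon, so stops are rendered with that symbol by default.

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the 64-entry codon table from the conventional TCAG ordering.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon, starting at position 0."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First three and last three codons of DPOGS200496-TA, as listed above.
print(translate("ATGTCTGTG"))  # MSV
print(translate("GACAAATGA"))  # DK-

# Length check from the record: 235 residues plus one stop codon.
print(3 * (235 + 1))  # 708
```

Passing the full DPOGS200496-TA sequence to `translate` and comparing the result with the DPOGS200496-PA string would extend the same check to the whole record.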