Monarch geneset OGS2.0

DPOGS200333
TranscriptDPOGS200333-TA1350 bp
ProteinDPOGS200333-PA449 aa
Genomic positionDPSCF300026 + 358309-359869
RNAseq coverage258x (Rank: top 41%)
Annotation
HeliconiusHMEL0000600.077.06% 
BombyxBGIBMGA005628-TA0.071.65% 
Drosophilar-l-PA2e-13252.37% 
EBI UniRef50UniRef50_Q016372e-13052.37%Uridine 5'-monophosphate synthase n=17 Tax=Endopterygota RepID=UMPS_DROME
NCBI RefSeqNP_001040160.10.071.88%uridine 5'-monophosphate synthase [Bombyx mori]
NCBI nr blastpgi|2613359240.078.17%putative uridine 5'-monophosphate synthase [Heliconius melpomene]
NCBI nr blastxgi|2613359240.078.17%putative uridine 5'-monophosphate synthase [Heliconius melpomene]
Group
Gene OntologyGO:00081521.5e-80metabolic process
GO:00038241.5e-80catalytic activity
GO:00045901.4e-67orotidine-5'-phosphate decarboxylase activity
GO:00062071.4e-67'de novo' pyrimidine base biosynthetic process
GO:00442053.6e-56'de novo' UMP biosynthetic process
GO:00062211.6e-35pyrimidine nucleotide biosynthetic process
GO:00045881.6e-35orotate phosphoribosyltransferase activity
GO:00091161.2e-09nucleoside metabolic process
KEGG pathwayaag:AaeL_AAEL0113091e-137 
 K13421 (UMPS)maps-> Drug metabolism - other enzymes
    Pyrimidine metabolism
InterPro domain[191-444] IPR0137851.5e-80Aldolase-type TIM barrel
[217-429] IPR0017541.4e-67Orotidine 5'-phosphate decarboxylase domain
[219-430] IPR0147323.6e-56Orotidine 5'-phosphate decarboxylase
[188-448] IPR0110608.7e-54Ribulose-phosphate binding barrel
[3-151] IPR0044671.6e-35Orotate phosphoribosyl transferase, clade 1
[16-116] IPR0008361.2e-09Phosphoribosyltransferase
Orthology groupMCL12216 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200333-TA
ATGACGAAAAGTGGAATAAAGTCACCGGTCTACTTTGATTTAAGGGTCATTGTTAGTTACCCGGAAGTTATGGAGTTGATTACCGATTTACTTTACGATTTAGCTCTCAAAGATGATCAGTATGACCATGTGTGTGGAGTTCCATATACAGCTTTACCAATAGCAACATTGTTAGGGGTTAAAGCGAGAAAATCTATGTTAATGAGAAGGAAGGAAGCAAAGGCTTATGGAACAAAAAAGATGATAGAAGGACATTTTAAAAAAGGAGATTCATGCTTAATAATAGAAGATGTTGTAACCTCTGGATCAAGCATATTAGAAACTGTAAGTGATTTAAAAAGGGAAGGACTAGTTGCAAATCAGGCAGTTATAATATTAGATAGAGAACAAAACGGGAAACAGAATTTAGCAGCCAATGGTGTTTACATAAAATCACTTTTCACTATGTCAGAGATACTAAAAATTCTGGTTGATAACGAGAAAATAACAAAGCAAATGGGCATTGATGTTATTAACTATTTGAATGATGTTAAAGCGCCTGCATCAGTTCCAGTTGTAGACAGAATAATACTTCCATACGAGGAAAGAGCGAAGTTAGCTGTGAATCCTGTTGCAAAACAATTGTTTTCGATCATGGCCACAAAAAAATCTAACCTGTGTCTATCTGTTGATCTGACTTCGGCAGTTGATATACTTGATTTATTGGAAAAAGTTGGTGAACATGTATGCTTAGTGAAGACTCACATTGATATTATTGAAGATTTCTCTGATAATTTTGTAACGCAACTAAAACAGTTAGCACAAAGATATAATTTCCTAATTCTGGAAGACAGAAAGTTTGCAGACATTGGGCATACAGTATCATTGCAATATTCAAAGGGTGTATATAAAATAGGAGAATGGGCTGATTGTGTGACATCACACTCTCTACCTGGAGATGGTATCCTGAAAGCTCTAAACAATGTAATGGATTGTGCTAATAGGGGTGTATTTCTGCTAGCTGAAATGAGTAGTGCAGGTAATCTCATTTCTTCAGACTATACTGAGGCAACAGTTAAAATGGCTGCCAAATATCCTGAGGCTATACTTGGATTTGTTTGTCAAAATAAGAAAACATTTAATGAACCTGGTTTCATCCAACTCACTCCAGGAGTTCAGCTCGAAAGTTCAAAGGATGAGCTTGGACAAGTCTATAACACTCCAGAAAAGGTTATATTAGAAAATGGTGCTGATGTTGTGGTAGTGGGGAGGGGGATAGTAGCTGCTAAAAGTCCTGAAACTCAAGCGGTAATCTATAAGGATACTCTATGGAAATGTTACATGAAAAGAATTTCTGGCAAATTGGAGTAG

Protein sequence:

>DPOGS200333-PA
MTKSGIKSPVYFDLRVIVSYPEVMELITDLLYDLALKDDQYDHVCGVPYTALPIATLLGVKARKSMLMRRKEAKAYGTKKMIEGHFKKGDSCLIIEDVVTSGSSILETVSDLKREGLVANQAVIILDREQNGKQNLAANGVYIKSLFTMSEILKILVDNEKITKQMGIDVINYLNDVKAPASVPVVDRIILPYEERAKLAVNPVAKQLFSIMATKKSNLCLSVDLTSAVDILDLLEKVGEHVCLVKTHIDIIEDFSDNFVTQLKQLAQRYNFLILEDRKFADIGHTVSLQYSKGVYKIGEWADCVTSHSLPGDGILKALNNVMDCANRGVFLLAEMSSAGNLISSDYTEATVKMAAKYPEAILGFVCQNKKTFNEPGFIQLTPGVQLESSKDELGQVYNTPEKVILENGADVVVVGRGIVAAKSPETQAVIYKDTLWKCYMKRISGKLE-