Monarch geneset OGS2.0

DPOGS211261
TranscriptDPOGS211261-TA1329 bp
ProteinDPOGS211261-PA442 aa
Genomic positionDPSCF300506 - 46576-48039
RNAseq coverage1772x (Rank: top 7%)
Annotation
HeliconiusHMEL0038130.074.33% 
BombyxBGIBMGA001609-TA0.068.87% 
Drosophilammy-PA1e-13852.54% 
EBI UniRef50UniRef50_Q8IPJ63e-13652.54%Mummy, isoform B n=40 Tax=Coelomata RepID=Q8IPJ6_DROME
NCBI RefSeqXP_001844758.17e-14453.14%UDP-n-acteylglucosamine pyrophosphorylase [Culex quinquefasciatus]
NCBI nr blastpgi|2239514420.070.42%UDP-N-acetylglucosamine pyrophosphorylase [Spodoptera exigua]
NCBI nr blastxgi|2239514420.070.14%UDP-N-acetylglucosamine pyrophosphorylase [Spodoptera exigua]
Group
Gene OntologyGO:00081521.8e-171metabolic process
GO:00167791.8e-171nucleotidyltransferase activity
KEGG pathwaycqu:CpipJ_CPIJ0031702e-143 
 K00972 (E2.7.7.23, UAP1)maps-> Amino sugar and nucleotide sugar metabolism
InterPro domain[1-416] IPR0026181.8e-171UTP--glucose-1-phosphate uridylyltransferase
Orthology groupMCL12255 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211261-TA
ATGTCGTACGAGAAGTTAAAAGAACAGCTCGCCCTCCACGGCCAGGAACACCTCGTGAAATACTGGCCGGAGTTGACCGAAAACGAGCGGGAACAGTTAGCGAGCGAAATTCAAAACCTCGATCTGGCGGAAGTGAACGAAATATTTCGGCGCGCAACGGAATCAACAAAAGTTATTCTAGAAAAGTTCGACGATGATCTGAAGCCGATCCCGCAGGCGCATTATGAATCTGTGCCTGGACTGTCGCAGGAGAAAGTTTTGGAGTATGAGAATATAGGTTTTCAACAAATCAGTGATGATAAAGTGGGTGTTCTGTTGCTGGCTGGTGGACAGGCGACTAGATTAGGCTTCGGATACCCCAAAGAACACACTATGGGCCCTACGGCAGACTTCTTCAAAAGCCACAACTACTTTGGCTTGGATGAAGATAACATTATCTTCTTCAACCAAGGAAGATTGCCCTGCTTCGACTTCAACGGCAAGATATTTCTGGATGAAAAATATCATTTATCGACTGCACCTGACGGAAATGGTGGTATCTATAGGGCTCTGAAAACTCAAGGAATATTGGATGATATCGCTAGACGGGGAGTTGAACACCTACATGCCCATTCTGTTGATAATCTCCTCATCAAAGTCGCTGATCCAGTATTCATTGGTTATTGCAAGAGTAAAAACGCCGATTGTGCCGCCAAAGTAGTTAGTAAGTCATCACCAAGTGAAGCTGTGGGTGTTGTATGTAGAGTGAACGGCTACTACAAAGTAGTGGAATATTCCGAGCTCACAGAGGAAGCAGCAAACAGAAGGAATCCTGACGGCAGGCTAACATTCTCAGCCGGAAGCATATGCAACCATTACTTCTCGGCTCAGTTCCTACAGAAAATATGCAATTATGAATCAAAACTCAAACACCACATATCAAACAAGAAGATCCCCTTCATCAACGAGGATGGGGTCCGCGTGAAACCGTCCGAACCTAATGGAATCAAGTTGGAGAAATTTATCTTTGACGTGTTTGAGTTCGCCGAGAACTTTATATGTCTGGAAGTAGCGAGGGACGTTGAGTTCTCCGCGCTGAAGAACTCCGACTCCGCTAAAAAGGATTGCCCCTCGACCGCCCGGGAGGATTTATTAAGACTGCACAGGAAATACATCAGAGAGGCCGGTGGTGTTATAGACGATAATGTAGACGTAGAAATATCTCCATTGCTCTCCTATGGCGGCGAAGATCTTAAGGATCTGGTGGAGAACGAAGCATTTGTGATATCCCCATTCCACTTGAAGAGTATGATGGAATCTTCCAGCAACGGAGTCAACGGCAACCACTAG

Protein sequence:

>DPOGS211261-PA
MSYEKLKEQLALHGQEHLVKYWPELTENEREQLASEIQNLDLAEVNEIFRRATESTKVILEKFDDDLKPIPQAHYESVPGLSQEKVLEYENIGFQQISDDKVGVLLLAGGQATRLGFGYPKEHTMGPTADFFKSHNYFGLDEDNIIFFNQGRLPCFDFNGKIFLDEKYHLSTAPDGNGGIYRALKTQGILDDIARRGVEHLHAHSVDNLLIKVADPVFIGYCKSKNADCAAKVVSKSSPSEAVGVVCRVNGYYKVVEYSELTEEAANRRNPDGRLTFSAGSICNHYFSAQFLQKICNYESKLKHHISNKKIPFINEDGVRVKPSEPNGIKLEKFIFDVFEFAENFICLEVARDVEFSALKNSDSAKKDCPSTAREDLLRLHRKYIREAGGVIDDNVDVEISPLLSYGGEDLKDLVENEAFVISPFHLKSMMESSSNGVNGNH-