Monarch geneset OGS2.0

DPOGS215218
TranscriptDPOGS215218-TA1314 bp
ProteinDPOGS215218-PA437 aa
Genomic positionDPSCF300143 + 265092-269036
RNAseq coverage3x (Rank: top 91%)
Annotation
HeliconiusHMEL0092650.092.44% 
BombyxBGIBMGA008725-TA0.088.38% 
DrosophilaUGP-PA2e-16770.35% 
EBI UniRef50UniRef50_C0KJJ66e-16668.78%UDP-glucose pyrophosphorylase n=3 Tax=Opisthokonta RepID=C0KJJ6_LOCMI
NCBI RefSeqXP_001847482.12e-17172.80%utp-glucose-1-phosphate uridylyltransferase 2 [Culex quinquefasciatus]
NCBI nr blastpgi|2625300800.090.15%UDP-glucose pyrophosphorylase [Spodoptera exigua]
NCBI nr blastxgi|2625300800.090.15%UDP-glucose pyrophosphorylase [Spodoptera exigua]
Group
Gene OntologyGO:00081522.3e-217metabolic process
GO:00167792.3e-217nucleotidyltransferase activity
KEGG pathwaycqu:CpipJ_CPIJ0060686e-171 
 K00963 (E2.7.7.9, galU)maps-> Starch and sucrose metabolism
    Galactose metabolism
    Pentose and glucuronate interconversions
    Amino sugar and nucleotide sugar metabolism
InterPro domain[17-435] IPR0026182.3e-217UTP--glucose-1-phosphate uridylyltransferase
[1-437] IPR0162677e-214UTP--glucose-1-phosphate uridylyltransferase, subgroup
Orthology groupMCL11868 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215218-TA
ATGGATCATGTACAGTTTTGGATCCGAAGTCATCAACGTACGCCGTCTGGGTCCCGGGACTTTAAAGAGGCGACCAAACGTGATGCTCTGGCTCGTCTGGAAGTGGAACTCGAGAGACTTCTGTCTACCGTCCCCGAGCCCAAACAGCCCCTAGTGGAGAAAGAGTTCGCGGGATTCAAGAATCTATTCAGCAGGTTCCTCGCTGAACAGGGTCCGTCTGTTACGTGGGAGAAAATCCAGAAGCTTCCAGAACATGCCGTCATTGATTACACCACGCTTCAGACTCCTACCACCGATAGCATTCACCACATGCTGGATAAGCTGGTGGTGGTGAAGCTGAACGGTGGCCTGGGAACCTCCATGGGCTGTAAAGGACCCAAGTCGGTCATCCAAGTCAGAAATGAACTGACCTTCCTCGATCTGACTGTGCAACAAATTGAGCATCTGAACAAGACGTACAAATGCAACGTTCCCCTGGTGCTCATGAACTCTTTCAACACTGACGAGGACACGCTCAAGGTCATCCGCAAGTACCGCGGTCTGAAGCTCGATATCCACACCTTCAACCAGTCCTGCCATCCCAGGATCAACAGGGAATCCTTACTGCCGCTGGCCAAAGACGCTGACGTACACTCGGATATCGAGGCTTGGTACCCGCCTGGTCACGGAGACTTCTACGAATCTTTCTACAATTCTGGTCTTCTGAATAAATTTATTAAAGAGGGCAGGACGTACTGCTTCATCAGCAACATAGATAATCTGGGGGCGAACGTCGATCTGAACATCCTCAACCTGTTGTTGAATCCGGACCAGAAGGAGCAATCGGAATTCGTCATGGAGGTCACCGATAAAACCAGAGCCGACGTCAAAGGTGGCACTCTCATACAGTACGAGGATAAACTTCGTCTCCTGGAAATCGCTCAGGTGCCCAAAGAACACGTGGACGACTTTAAATCGGTGAGCCAGTTCAAATTCTTCAACACCAACAATCTTTGGGCGAAGCTGGACGCCATCAAGAGGGTCGTCGAACGAGGGTCCCTGAACATGGAGATAATCGTGAACAATAAGAGTCTAGCTGACGGAGTGAACGTCATTCAACTGGAAACGGCCGTGGGCGCGGCCATGAAGTGCTTCGAAGGCGGCATCGGTGTCAACGTCCCACGAAGCAGATTCTTGCCGGTCAAGAAGACCTCGGACCTGTTGTTGGGCACGGTTATAATAATAGCCAACCACGGCGAGCGCATCGACATCCCCTCCGGGGCGCTGCTGGAGAACAAAATAGTCTCAGGAAATCTAAGGATATTGGACCATTAG

Protein sequence:

>DPOGS215218-PA
MDHVQFWIRSHQRTPSGSRDFKEATKRDALARLEVELERLLSTVPEPKQPLVEKEFAGFKNLFSRFLAEQGPSVTWEKIQKLPEHAVIDYTTLQTPTTDSIHHMLDKLVVVKLNGGLGTSMGCKGPKSVIQVRNELTFLDLTVQQIEHLNKTYKCNVPLVLMNSFNTDEDTLKVIRKYRGLKLDIHTFNQSCHPRINRESLLPLAKDADVHSDIEAWYPPGHGDFYESFYNSGLLNKFIKEGRTYCFISNIDNLGANVDLNILNLLLNPDQKEQSEFVMEVTDKTRADVKGGTLIQYEDKLRLLEIAQVPKEHVDDFKSVSQFKFFNTNNLWAKLDAIKRVVERGSLNMEIIVNNKSLADGVNVIQLETAVGAAMKCFEGGIGVNVPRSRFLPVKKTSDLLLGTVIIIANHGERIDIPSGALLENKIVSGNLRILDH-