Monarch geneset OGS2.0

DPOGS215219
Transcript: DPOGS215219-TA (1539 bp)
Protein: DPOGS215219-PA (512 aa)
Genomic position: DPSCF300143 + 277305-281728
RNAseq coverage: 318x (Rank: top 36%)

Annotation
Heliconius: HMEL009265; 0.0; 93.64%
Bombyx: BGIBMGA008725-TA; 0.0; 90.04%
Drosophila: UGP-PA; 0.0; 73.61%
EBI UniRef50: UniRef50_C0KJJ6; 0.0; 74.39%; UDP-glucose pyrophosphorylase n=3 Tax=Opisthokonta RepID=C0KJJ6_LOCMI
NCBI RefSeq: XP_001847482.1; 0.0; 76.14%; utp-glucose-1-phosphate uridylyltransferase 2 [Culex quinquefasciatus]
NCBI nr blastp: gi|262530080; 0.0; 91.43%; UDP-glucose pyrophosphorylase [Spodoptera exigua]
NCBI nr blastx: gi|262530080; 0.0; 91.43%; UDP-glucose pyrophosphorylase [Spodoptera exigua]
Group:
Gene Ontology: GO:0008152; 3e-285; metabolic process
    GO:0016779; 3e-285; nucleotidyltransferase activity
KEGG pathway: cqu:CpipJ_CPIJ006068; 0.0
    K00963 (E2.7.7.9, galU) maps to:
        Starch and sucrose metabolism
        Galactose metabolism
        Pentose and glucuronate interconversions
        Amino sugar and nucleotide sugar metabolism
InterPro domain: [5-512] IPR016267; 2.5e-287; UTP--glucose-1-phosphate uridylyltransferase, subgroup
    [21-510] IPR002618; 3e-285; UTP--glucose-1-phosphate uridylyltransferase
Orthology group: MCL11868; Single-copy universal gene

Nucleotide sequence:

>DPOGS215219-TA
ATGGAGTTTGAAATGGATCATGTACAGTTTTGGATCCGAAGTCATCAACGTACGCCGTCTGGGTCCCGGGACTTTAAAGAGGCGACCAAACGTGATGCTCTGGCTCGTCTGGAAGTGGAACTCGAGAGACTTCTGTCTACCGTCCCCGAGCCCAAACAGCCCCTAGTGGAGAAAGAGTTCGCGGGATTCAAGAATCTATTCAGCAGGTTCCTCGCTGAACAGGGTCCGTCTGTTACGTGGGAGAAAATCCAGAAGCTTCCAGAACATGCCGTCATCGATTACACCACGCTTCAGACTCCTACCACCGATAGCATTCACCACATGCTGGATAAGTTGGTGGTGGTGAAACTGAACGGTGGCCTGGGAACCTCCATGGGCTGTAAAGGACCCAAGTCGGTCATCCAAGTCAGAAATGAACTGACCTTCCTCGATCTGACTGTGCAACAAATTGAGCATCTGAACAAGACGTACAAATGCAACGTTCCCCTGGTGCTCATGAACTCTTTCAACACTGACGAGGACACGCTCAAGGTCATCCGCAAGTACCGCGGTCTGAAGCTCGATATCCACACCTTCAACCAGTCCTGCCATCCCAGGATCAACAGGGAATCCTTACTGCCGCTGGCCAAAGACGCTGACGTACACTCGGATATCGAGGCTTGGTACCCGCCTGGTCACGGAGACTTCTACGAATCTTTCTACAATTCTGGTCTTCTGAATAAATTTATTAAAGAGGGCAGGACGTACTGCTTCATCAGCAACATAGATAATTTGGGGGCGAACGTCGATCTGAACATCCTCAACCTGTTGTTGAATCCGGACCAGAAGGAGCAATCGGAATTCGTCATGGAGGTCACCGATAAAACCAGAGCCGACGTCAAAGGTGGCACTCTCATACAGTACGAGGATAAACTGCGTCTCCTGGAAATCGCTCAGGTGCCCAAAGAACACGTGGACGACTTCAAATCGGTGAGCCAGTTCAAATTCTTCAACACCAACAATCTTTGGGCGAAGCTGGACGCCATCAAGAGGGTCGTCGAACGAGGGTCCCTGAACATGGAGATAATCGTGAACAATAAGAGTCTAGCTGACGGAGTGAACGTCATTCAACTGGAAACGGCCGTGGGCGCGGCCATGAAGTGCTTCGAAGGCGGCATCGGTGTCAACGTCCCACGAAGCAGATTCCTGCCGGTCAAGAAGACCTCGGACCTGTTGTTGGTGATGTCGAATCTATACAGCCTGTCGCACGGGTCGCTGGTGATGTCGTCTCAGAGGATGTTCCCATCGACGCCTCTAGTGAAACTCGGTGACAACCACTTCGCCAAGGTGAAGGAGTTCCTGAACAGGTTCGCTACGATCCCCGACCTCATCGAGCTCGACCACCTCACCGTCTCCGGAGACGTGACCTTCGGCCGCGGCGTGTCTTTGAAGGGCACTGTTATAATAATAGCCAACCACGGCGAGCGCATCGACATCCCCTCCGGGGCGCTGCTCGAGAACAAAATAGTCTCAGGAAATCTAAGGATATTGGACCATTAG

Protein sequence:

>DPOGS215219-PA
MEFEMDHVQFWIRSHQRTPSGSRDFKEATKRDALARLEVELERLLSTVPEPKQPLVEKEFAGFKNLFSRFLAEQGPSVTWEKIQKLPEHAVIDYTTLQTPTTDSIHHMLDKLVVVKLNGGLGTSMGCKGPKSVIQVRNELTFLDLTVQQIEHLNKTYKCNVPLVLMNSFNTDEDTLKVIRKYRGLKLDIHTFNQSCHPRINRESLLPLAKDADVHSDIEAWYPPGHGDFYESFYNSGLLNKFIKEGRTYCFISNIDNLGANVDLNILNLLLNPDQKEQSEFVMEVTDKTRADVKGGTLIQYEDKLRLLEIAQVPKEHVDDFKSVSQFKFFNTNNLWAKLDAIKRVVERGSLNMEIIVNNKSLADGVNVIQLETAVGAAMKCFEGGIGVNVPRSRFLPVKKTSDLLLVMSNLYSLSHGSLVMSSQRMFPSTPLVKLGDNHFAKVKEFLNRFATIPDLIELDHLTVSGDVTFGRGVSLKGTVIIIANHGERIDIPSGALLENKIVSGNLRILDH-
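
Consistency check (illustrative):

The transcript length (1539 bp) and the protein length (512 aa) agree if the final codon is a stop: (1539 - 3) / 3 = 512. The Python sketch below is one way to check this, assuming the standard genetic code (NCBI translation table 1) applies and that the trailing "-" in the protein record marks the stop codon. The `translate` function and the `first_codons` / `last_codons` variables are hypothetical names for illustration and are not part of the record; the two short strings are copied from the start and end of DPOGS215219-TA.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered TCAG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`; "-" is assumed here to
    match the convention used in the protein record above.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First 30 and last 15 nucleotides of DPOGS215219-TA.
first_codons = "ATGGAGTTTGAAATGGATCATGTACAGTTT"
last_codons = "ATATTGGACCATTAG"

print(translate(first_codons))  # MEFEMDHVQF
print(translate(last_codons))   # ILDH-
print((1539 - 3) // 3)          # 512
```

Passing the full nucleotide sequence to `translate` in the same way should give a string that can be compared directly with the DPOGS215219-PA record.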