Monarch geneset OGS2.0

DPOGS204233
TranscriptDPOGS204233-TA5475 bp
ProteinDPOGS204233-PA1824 aa
Genomic positionDPSCF300046 - 590904-606161
RNAseq coverage293x (Rank: top 38%)
Annotation
HeliconiusHMEL0151600.074.29% 
BombyxBGIBMGA007512-TA0.075.94% 
DrosophilaUgt-PA0.063.72% 
EBI UniRef50UniRef50_Q093320.063.72%UDP-glucose:glycoprotein glucosyltransferase n=14 Tax=Diptera RepID=UGGG_DROME
NCBI RefSeqXP_969332.20.065.30%PREDICTED: similar to UDP-glucose glycoprotein:glucosyltransferase [Tribolium castaneum]
NCBI nr blastpgi|1892373480.065.30%PREDICTED: similar to UDP-glucose glycoprotein:glucosyltransferase [Tribolium castaneum]
NCBI nr blastxgi|2700070940.064.51%hypothetical protein TcasGA2_TC013545 [Tribolium castaneum]
Group
Gene OntologyGO:00064861.3e-63protein glycosylation
GO:00039801.3e-63UDP-glucose:glycoprotein glucosyltransferase activity
GO:00167572.7e-08transferase activity, transferring glycosyl groups
KEGG pathwaytca:6578040.0 
 K11718 (HUGT)maps-> Protein processing in endoplasmic reticulum
InterPro domain[6-1824] IPR0094480UDP-glucose:Glycoprotein Glucosyltransferase
[1528-1746] IPR0024952.7e-08Glycosyl transferase, family 8
Orthology groupMCL11445 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204233-TA
ATGAAAGCTACAATTTTGGGCATTATTTTAACTGTAGTAATTTCAATCTTTGGTAATGTTGTTGCCAATGGTGACTTACCGAAACGAGAGGAAAGAAAGTCTAAAGGAGTTACTACATTTATTAGCGCCAAATGGGAAGCGACGCCAATTGTGCTAGAACTGGCGGAATATTTGTCTGCTGAAAGTTCTGATTTATTTTGGTCTTATTTTGATGGCATTATTTCACTTAAATCAAGCTTAGAGTCTTTGGAGACAGATAAACAAGTTTATGATGCTTGCATTGGAGTAGCAAGTACATTATTAGCTCCGGCACAGCTCCGTATGGCTAAGCTAGCCTTGTCCATGCATTTGACTTCACCAGCAGTCCGCATGTTTGATCAGATTGCTACACAAAACGGTGCAAAAGAGTTGCCCTGTGAAACATTTGTGGCAATTGCATCAAGAAAAGTTTGTGATAATGATATCCTTAGGGACATTCTTAAATCTACAGTCAAATTTGATCCAGAGGAGCATAGAATTGAAACATACCTATTAGACCACTCATATCCCAGCAGTGACAATAGAAGCCTCACAGCTATTCTATATGGAGAGCTGGGAAACTCTGACTTTTCAGCCAAACATAAAATATTATCTGGCTACGCTGATAAAGGTGTTATTAACTATGTGGTCAGATGGAACATAAAATCTAGAGGCAAGCCAAAACTTCGTCTGTCTGGATATGGAATTGAATTGCAATTGAAGAGTACAGAATACAAAAGTCAAGATGATACCACTCCTAAGGAGACTGTAGATGATGCAGGAGTGCCCTCAGAAGAAGAAGACGAAAATGATCCCCAGAACCAAATAGATGGATTCAATTTTGGAAGACTAAAGAATTTATTTCCGGCACTTCGCACACCTCTCGAGCGTTTCCGAAGACATCTCTCTGAAATGAGTGAAGAAATAGAGCCCCTTAAAGTATGGCAGATGCAAGCTCTGAGTATGCAGGCCGCTGCTGCTGTGATGGATGCACACGACGCGGGCGGAGATGAGGCTCTTAAAGTGTTGATATCTCTAGCACAGAACTTCCCCATGCAGACTAAATCGTTGATCCATGTGAATGTGCCCCGATCCTTCCGCGATGAAGTCCTGTACAATCAAGACGTTTGGTCGTCATCTCTAGGGCTCCGGCCTGCGGAACCCTTGTTGCTCGTATCCGGGGCTCAGTACGATGCTGACGAGGTCGACCTTATGGCCCTCTTAGCAGCGCTCAGAGAAGACATAGGACCTATGAATACTCTGCATGCTTTGGGTCTGAACAGGAAGCTCATCAACAAGCTTCTATCACTTGAACTCGGTGAGTCTTTCACTTGGGAAGAGTATGGCTTAGACATCCGTGACACAGCCATCACCTGGCTCAACGATCTAGAGACAGACGATAGATACAGACGATGGCCATCTTCATACATGGAACTCCTACGACCCACATATCCTGGTATGCTGAGGAACTTAAGAAGAAATATATATAATTACGTGATAGTGATAGACCCAACATCACCGTCGTCCGCGCCCCCTTTAAAGCTGGGTGAGACATTACTGAAACATGCTACGCCTGTACGAGTGGGCTTGGTACTGGCACCGGGACGCGACTCCGCTCTGGGCACCGCACTAAGAAGCGCCTTCAACTATGTAGCACAGGAGAGGAATTCTAACAAGGAGGCCTTCTATTTCCTTACACAGGTTCTCAATTCTCTTCAAGAAGATGCTCTGAGTGTGGATCATATAAAAAAGTATCTGAAAAAGTATGCCAGTTCGAGCGCAAATCTCGATGAAATCATTTCAGAGGAATCTGAATTCAACTTCGGACACCAACTGGCTGAGGAGTTCGTGTCGAAGCTGGGAACTAATAAATTCCCTCAAGTGATAGTGAATGGCGTTCCTCTGTACGATGAGGGCTCTGGTGCGTTGTCTTCGGTGGAACTGCTCGAGGAGGCGCTAGTGACGGCACTGTCGCGTCACACGGCGCGTCTACAGCGAGCCGTGTTTAGAGGGAACCTCGCAGACTCCGACGACGCCGTAGAGTATCTCATGAAGCAGCCGCATATTGTGTCCAGGTTAAACCGTCGCGTTCTTGCATCTGAACCGTCTCAGTACTTAGACTTGTCTGGTGTATCCTCATCGAGAGATTTATTTTCCGAAGACAAAATTCATCGTCTCCTGCATTTAACTGGACGCGACGCTCTAGCAACAGCTTTACCTATCTTTAAATATTTCACAAAACCCGGGAAGTCTGAGAAGATAACACAAACTCTTTGGATAGCGGGAGATCTCAATAAGAAGGAATCTAGAGAATTGTTAAGAAATGCTCTCACGTTTATGAGGGAATCGGGTGGAATTCGTGTTGCATTTATACCCAATGTCGACGGTTCCGGCGAAGATCAATCGTTTAATAAAGTAGTTCTTGCCGCTCTGACGAGTTTGGAACCCGCAGAGGCTACCAAATATGTAGTTCAACTCTTAGAGGACGAAGGATGCCATGAGAGAAAAGATTGCGAAATTCTGCCTGAGTTGGTACCGGCGTTGAACAAGTACGAGTGGGTGTTGAAGGCGTCCCGCGTGTTATGTGCTCGGAGCCTCAAGCTGCGTGGCTCTGAGCGAGCAGTCATACACAACGCCAGAGTTATAGGACCCTTCAACAAAGGAGAGAGCTTCTCCCTAGAAGACTTCGCACTGCTTGAGAGGTACAGTAACCAAGTGTATGGAGACAAGCTATCCGAATTGTTACACCAGAACAAGAAGCTGTCAAATAATGTTTTGGACGATGACGATGACATCACTGATATAAGCACAGATAACTATTTGAAGGTTATATCAGTGCTTGCTTCGCGTAGTCCCCGTGTGCGCACGCCCTTACCGAGCGGATTACGAACGGATCATTCTGTTATAGAACTACCTCCTTTGTATGAGGACGAAGCGGCCGTTGAAATAGTAGCCGTGGTGGACCCGGCGTCGGCGGCCGCTCAGCGCCTAGCTCCGCTGCTGCTGGTGTTGCGACGCGTTGTCAACTGTCGCTTACAATTGTTCCTCAACCCGCAGGACAAGAATTCTGACATGCCGCTTAAGAGTTTTTACCGCTACGTGTTGGAGCCGGAGCTACAATTCAATAGCGCGGGTGTGCAGACGGGCGGTGCGATCGCGCGTTTCTCCCGTTTGCCGCACGCTCCTCTTCTATCGCTGGAGCTGCGTACGCCGCCTAATTGGCTGGTCGAGTGCGTGAAGTCTGTATACGACTTGGATAATATACGCCTGGCCGATGTCGAGTCACTCGTTCACAGGTTGGGACATTATTATTATCCTGAGTTGGTACCGGCGTTGAACAAGTACGAGTGGGTGTTGAAGGCGTCCCGCGTGTTATGTGCTCGGAGCCTCAAGCTGCGTGGCTCTGAGCGAGCAGTCATACACAACGCCAGAGTTATAGGACCCTTCAACAAAGGAGAGAGCTTCTCCCTAGAAGACTTCGCACTGCTTGAGAGGTACAGTAACCAAGTGTATGGAGACAAGCTATCCGAATTGTTACACCAGAACAAGAAGCTGTCAAATAATGTTTTGGACGATGACGATGACATCACTGATATAAGCACAGATAACTATTTGAAGGTTATATCAGTGCTTGCTTCGCGTAGTCCCCGTGTGCGCACGCCCTTACCGAGCGGATTACGAACGGATCATTCTGTTATAGAACTACCTCCTTTGTATGAGGACGAAGCGGCCGTTGAAATAGTAGCCGTGGTGGACCCGGCGTCGGCGGCCGCTCAGCGCCTAGCTCCGCTGCTGCTGGTGTTGCGACGCGTTGTCAACTGTCGCTTACAGTTGTTCCTCAACCCACAGGACAAGAATTCTGACATGCCGCTTAAGAGTTTTTACCGCTACGTGTTGGAGCCGGAGCTACAATTCAATAGCGCGGGTGCGCAGACGGGCGGTGCGATCGCGCGTTTCTCCCGCTTGCCGCACGCTCCTCTTCTATCGCTGGAGCTGCGCACGCCACCCAATTGGCTGGTCGAGTGCGTGAAGTCTGTATACGACTTGGATAATATACGCCTGGCCGATGTCGAGTCACTCGTTCACAGTGAGTTCGAGTTGGAATACCTGCTTGTGGAAGGTCACGCGTGGGATACGTCTCTGGGCACGCCGCCTCGCGGGTTACAACTCGTGCTGGGCACGAGACACCGACCAGACACAGTTGACACCATCGTGATGGCCAACCTCGGCTACTTCCAGCTCAAGGCCAACCCCGGTGCCTGGACGTTGCGTCTCAGACCCGGCCGCTCTGACGATATTTACGAGATTGTCGGGCACGAAAACACTGACACCCCAGCCGGCAGTAAAGACATCCAGGTCCTGATGAGTTCATTCCGGAGTCAAGTTATTAAATTGAGGGTCACTAAGAAGGCGGATAAACAACACCTTGATCTTTTAGCTGAAAATGACGAAAAGAACGCTGGTGGGATATGGAATTCTATTGCAAGTTCGTTCGGAGGTGGCGAAGAACAAGAAGCGCAAGACGAGACTATCAACGTGTTCTCAGTAGCATCCGGTCACTTGTACGAACGTTTTCTACGTATTATGATGCTGTCTGTACTAAAGAACACTAAGTCACCCGTGAAGTTCTGGTTCTTAAAGAACTATCTCAGCCCCTCACTTAAGGACATCCTTCCATACATGGCGCAAGAGTACGGGTTCCAGTACGAGCTGGTACAGTACCAGTGGCCTCGCTGGCTGCAGCGGCAGCGTGACAGACAGCGGACCATCTGGGGGTACAAGATACTGTTCCTCGACGTGTTATTCCCATTGGACGTCAAGAAGATCATCTTTGTTGATGCTGATCAGATTGTTCGAGCTGATCTAAAGGAACTAGTAGATTTGGATCTAGGCGGAGCTCCCTATGGATACACCCCGTTCTGTGACAGTAGAAAAGAAATGGAAGGATTCAGGTTCTGGAAGCAAGGCTACTGGCGGAATCATCTCCAAGGTCGGAGTTATCACATCAGTGCACTGTACGTGGTGGATCTGAAGCGTTTCAGACGAATCGCTGCCGGCGACCGACTGAGGGGACAGTACCAGGCGCTCAGCCAGGACCCTAACAGTTTGTCAAATCTAGATCAAGATCTTCCCAACAATATGATTCACCAGGTGGCTATAAAGTCTCTGCCCCAGGAATGGTTGTGGTGTGAGACCTGGTGCGATAATGAATCCAAGAAATACGCCAAGACCATTGATTTGTGCAACAACCCTATGACGAAGGAGGCCAAGTTGTCAGCAGCTATGCGCATCGTGCCTGAGTGGAGCGACTATGACAACGAGCTGAGAGCATTGCACGCCCGCGTCAGGCAGGGACACTACCAGGACGACACCGAACAGGAAATCGAGACTCATGAACATGAACAAGTCAGCAAAGAAGATAAAACTGATAAAGCACAGGAACACACTGAGTTATGA

Protein sequence:

>DPOGS204233-PA
MKATILGIILTVVISIFGNVVANGDLPKREERKSKGVTTFISAKWEATPIVLELAEYLSAESSDLFWSYFDGIISLKSSLESLETDKQVYDACIGVASTLLAPAQLRMAKLALSMHLTSPAVRMFDQIATQNGAKELPCETFVAIASRKVCDNDILRDILKSTVKFDPEEHRIETYLLDHSYPSSDNRSLTAILYGELGNSDFSAKHKILSGYADKGVINYVVRWNIKSRGKPKLRLSGYGIELQLKSTEYKSQDDTTPKETVDDAGVPSEEEDENDPQNQIDGFNFGRLKNLFPALRTPLERFRRHLSEMSEEIEPLKVWQMQALSMQAAAAVMDAHDAGGDEALKVLISLAQNFPMQTKSLIHVNVPRSFRDEVLYNQDVWSSSLGLRPAEPLLLVSGAQYDADEVDLMALLAALREDIGPMNTLHALGLNRKLINKLLSLELGESFTWEEYGLDIRDTAITWLNDLETDDRYRRWPSSYMELLRPTYPGMLRNLRRNIYNYVIVIDPTSPSSAPPLKLGETLLKHATPVRVGLVLAPGRDSALGTALRSAFNYVAQERNSNKEAFYFLTQVLNSLQEDALSVDHIKKYLKKYASSSANLDEIISEESEFNFGHQLAEEFVSKLGTNKFPQVIVNGVPLYDEGSGALSSVELLEEALVTALSRHTARLQRAVFRGNLADSDDAVEYLMKQPHIVSRLNRRVLASEPSQYLDLSGVSSSRDLFSEDKIHRLLHLTGRDALATALPIFKYFTKPGKSEKITQTLWIAGDLNKKESRELLRNALTFMRESGGIRVAFIPNVDGSGEDQSFNKVVLAALTSLEPAEATKYVVQLLEDEGCHERKDCEILPELVPALNKYEWVLKASRVLCARSLKLRGSERAVIHNARVIGPFNKGESFSLEDFALLERYSNQVYGDKLSELLHQNKKLSNNVLDDDDDITDISTDNYLKVISVLASRSPRVRTPLPSGLRTDHSVIELPPLYEDEAAVEIVAVVDPASAAAQRLAPLLLVLRRVVNCRLQLFLNPQDKNSDMPLKSFYRYVLEPELQFNSAGVQTGGAIARFSRLPHAPLLSLELRTPPNWLVECVKSVYDLDNIRLADVESLVHRLGHYYYPELVPALNKYEWVLKASRVLCARSLKLRGSERAVIHNARVIGPFNKGESFSLEDFALLERYSNQVYGDKLSELLHQNKKLSNNVLDDDDDITDISTDNYLKVISVLASRSPRVRTPLPSGLRTDHSVIELPPLYEDEAAVEIVAVVDPASAAAQRLAPLLLVLRRVVNCRLQLFLNPQDKNSDMPLKSFYRYVLEPELQFNSAGAQTGGAIARFSRLPHAPLLSLELRTPPNWLVECVKSVYDLDNIRLADVESLVHSEFELEYLLVEGHAWDTSLGTPPRGLQLVLGTRHRPDTVDTIVMANLGYFQLKANPGAWTLRLRPGRSDDIYEIVGHENTDTPAGSKDIQVLMSSFRSQVIKLRVTKKADKQHLDLLAENDEKNAGGIWNSIASSFGGGEEQEAQDETINVFSVASGHLYERFLRIMMLSVLKNTKSPVKFWFLKNYLSPSLKDILPYMAQEYGFQYELVQYQWPRWLQRQRDRQRTIWGYKILFLDVLFPLDVKKIIFVDADQIVRADLKELVDLDLGGAPYGYTPFCDSRKEMEGFRFWKQGYWRNHLQGRSYHISALYVVDLKRFRRIAAGDRLRGQYQALSQDPNSLSNLDQDLPNNMIHQVAIKSLPQEWLWCETWCDNESKKYAKTIDLCNNPMTKEAKLSAAMRIVPEWSDYDNELRALHARVRQGHYQDDTEQEIETHEHEQVSKEDKTDKAQEHTEL-