Monarch geneset OGS2.0

DPOGS207186
TranscriptDPOGS207186-TA2550 bp
ProteinDPOGS207186-PA849 aa
Genomic positionDPSCF300001 + 5208029-5216323
RNAseq coverage247x (Rank: top 42%)
Annotation
HeliconiusHMEL0174800.062.72% 
BombyxBGIBMGA011033-TA0.062.26% 
Drosophilasxc-PC0.062.45% 
EBI UniRef50UniRef50_Q7KJA90.062.45%O-glycosyltransferase n=27 Tax=Bilateria RepID=Q7KJA9_DROME
NCBI RefSeqXP_001846338.10.060.89%o-linked N-acetylglucosamine transferase, ogt [Culex quinquefasciatus]
NCBI nr blastpgi|1700369790.060.89%o-linked N-acetylglucosamine transferase, ogt [Culex quinquefasciatus]
NCBI nr blastxgi|1700369790.060.89%o-linked N-acetylglucosamine transferase, ogt [Culex quinquefasciatus]
Group
Gene OntologyGO:00054881.5e-73binding
GO:00055151.1e-08protein binding
KEGG pathway 
InterPro domain[666-667] IPR0119901.5e-73Tetratricopeptide-like helical
[111-144] IPR0014401.1e-08Tetratricopeptide TPR-1
[111-144] IPR0197342.4e-07Tetratricopeptide repeat
Orthology groupMCL11196 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207186-TA
ATGACGCGTCTTCAGTGCGCTCGCGCCGTTAGTTGCGACGCTAGCATCCGAGGATATGTCCGTGCTATGGATTACCAATGTGTGGTTGTGTATACCGTCATCAATATCTCACCACCACGAATGTCCCAAGCAGCATCTATCACGAACGGATGCGTGGATTGGTTCAGAAATAAAATTAAAAGGGCCATTGCCGTATATTTTCATTGTCTAAAATTGACTCCAAACAATGGTATTATACACGGAAACCTGGCTTGCCTCTACTATAAACAAGGCTTCATAGATTTAGCTATTGATACTTACCGACAAGCTATTGAACTGCACCCTAACTTTCCCGATGCGTATTGCAATCTTGCTAATGCACTAAAAGAAAAAGGTCTTGTTGAAGAAGCTGAAGAATGTTATAACAAAGCTTTGTATCTCTGCCCGTCACACGTCGATACTTTAAATAATCTTGGAAATGTTAAGAGAGAACAAGGAAAAATTGAAGAAGCCACTAGACTATATATGAGAGCGTTACAAGTATTTCCTCATTTTGCAGCTACTCATAGTAATTTAGCATCATTATTACAGCAACAAGGTAAGTTTCAAGATGCTCTGTATCATTACGCACAAGCTATAAATATTCAACCAAAATTCGCTGATGCTTATAGTAATATGGGGAACACTTTAAGGGAGATGCAAGATACAAGTGGTGCTCTAAGATGTTTTAAAAAAGCAATTGAAATAAATCCTCTTTTTGCTGATGCTCATTGCAATTTAGCGAGTATTTATAAAGATATGGGAAATATTTGTGAGGCAATAACATCCTATAACAATGCTCTAAGAATTAAATCAGATTTTCCTGACGCTTATTCTAATTTGGCACATTGTTTACAAATAATTTGTAACTGGGAATGCTATCAAGAAAGAATGCATAAACTTGTTTCAATTGTAGAGAATCAGTTGTTAACTAGTGATAAATTATGCTCCGTACACCCACATCACACTATTCTGTACCCATTGTCAAATGTAGCAAGGAAGGAAATAGCTGCTCGACATGCAGCTTTATATTTGGAAAAAGTAAATATGCTTACTTCTACCACCTTTAGGCACACCAAAAAGAGAAAAGGCCGCCTCCGTATTGGTTATGTTAGTAGTGACTTTGGAAATCACCCTACTTCTCATTTAATGCAGTCAATACCAGGACTCCACAATCGATTAAATGTAGAAATTTTTTGCTATGCTTTAAATGTAGATGATAAAACAACATTCCGTAATAAAATTGTAAGTGAATGTGACAATTTCACTGACTTGTCATCAATTAAGTCTAATATTGAAGCTGCTGCCAAAATAAATAGTGATGATATTAATATATTAATAAATATGAATGGTTATACGAAGGGCGCTAGAAATGAAATATTTGCCTTAAAACCAGCACCAATACAAGTTTTATGGCTTGGTTACCCCGGAACCAGCGGGGCAGGGTACATAGATTATATCATAAGTGACGAGATTTCAAGCCCTCTTTCTATGTCAGATGATTTTACTGAAAAGTTTGCCTATATGCCTTATACGTATTTTGTTGGTGATCATAAGGAAATGTTTCCTCATTTAAAAAACCGGTACAGCCTAAGAAAATACGACGAAAGACCCATTAGTGAAAATTTTGCCGTTATAAACTGTGCTGATACTATAAACTTAAATGATTATTTTGAGGTAAAATATAACAAACAAGTTTTGTCATATAAAAATCTAGAACCGATAGAAGTTACTGAGTATGAAGTAGACATTCCTATATATGCTATCGAATCAACAATAAGTTGTAAACAAGAACAGGTATGCGTAAATAATATATTTATAGATAATGGTTTATCTTTAAGATTATCAAAGAAAAAAATTGCGAGCGGAGAAGAAAGGTACGATAACATAATTTTAACAACTAGAAGACAATACAATTTACCAGAAGACGCTGTGGTGTTTTGTAATTTTAATCAACTTTATAAAACAGACCCAAAAGCTTTAGAAATGTGGATAAATATATTGAATAATGTCCCAAACAGTGTACTTTGGCTTTTAGCTTTTCCAGCAGCTGGGGAATCCAATCTGCGCCATTTTGCACAGATACGAGGACTTTCACCGGATCGTATTATATTTTCAAAAATAGCACCTAAAGAAGAACACGTCCGAAGGGGGCAAATATCAGATGTTTGCTTAGATACCCCTTTATGTAATGGCCATACAACTACAATGGATATTCTTTGGACTGGAACTCCCGTGGTCACTTTACCGGGAAAAACATTAGCCTCAAGAGTAGCCTCATCTCAGCTGACTGCATTAAAATGTACTGAACTTATAGCAAAAAGTGAGAAGAATTATGAAGAAATAGCCACAAAACTAGGCATGGACGCAGAATATCGTAGATATATAAGAGCCAAAGTATCAAATGCTCGAATAACAAGCACATTGTTTGATTGTAAACACTATGCTATGGCAATGGAGGATCTTTATAATAAGATGTGGCAGCTATACGAGGATGGGAAGGAGCCAAATCACGTTTACGCTTTAAAGTAA

Protein sequence:

>DPOGS207186-PA
MTRLQCARAVSCDASIRGYVRAMDYQCVVVYTVINISPPRMSQAASITNGCVDWFRNKIKRAIAVYFHCLKLTPNNGIIHGNLACLYYKQGFIDLAIDTYRQAIELHPNFPDAYCNLANALKEKGLVEEAEECYNKALYLCPSHVDTLNNLGNVKREQGKIEEATRLYMRALQVFPHFAATHSNLASLLQQQGKFQDALYHYAQAINIQPKFADAYSNMGNTLREMQDTSGALRCFKKAIEINPLFADAHCNLASIYKDMGNICEAITSYNNALRIKSDFPDAYSNLAHCLQIICNWECYQERMHKLVSIVENQLLTSDKLCSVHPHHTILYPLSNVARKEIAARHAALYLEKVNMLTSTTFRHTKKRKGRLRIGYVSSDFGNHPTSHLMQSIPGLHNRLNVEIFCYALNVDDKTTFRNKIVSECDNFTDLSSIKSNIEAAAKINSDDINILINMNGYTKGARNEIFALKPAPIQVLWLGYPGTSGAGYIDYIISDEISSPLSMSDDFTEKFAYMPYTYFVGDHKEMFPHLKNRYSLRKYDERPISENFAVINCADTINLNDYFEVKYNKQVLSYKNLEPIEVTEYEVDIPIYAIESTISCKQEQVCVNNIFIDNGLSLRLSKKKIASGEERYDNIILTTRRQYNLPEDAVVFCNFNQLYKTDPKALEMWINILNNVPNSVLWLLAFPAAGESNLRHFAQIRGLSPDRIIFSKIAPKEEHVRRGQISDVCLDTPLCNGHTTTMDILWTGTPVVTLPGKTLASRVASSQLTALKCTELIAKSEKNYEEIATKLGMDAEYRRYIRAKVSNARITSTLFDCKHYAMAMEDLYNKMWQLYEDGKEPNHVYALK-