Monarch geneset OGS2.0

DPOGS211665
TranscriptDPOGS211665-TA4311 bp
ProteinDPOGS211665-PA1436 aa
Genomic positionDPSCF300151 - 214097-235146
RNAseq coverage200x (Rank: top 47%)
Annotation
HeliconiusHMEL0158900.064.69% 
BombyxBGIBMGA001365-TA2e-17361.86% 
DrosophilaGalNAc-T1-PB1e-11552.61% 
EBI UniRef50UniRef50_Q6WV202e-11352.61%Polypeptide N-acetylgalactosaminyltransferase 1 n=18 Tax=Diptera RepID=GALT1_DROME
NCBI RefSeqXP_002092358.12e-11452.61%GE11714 [Drosophila yakuba]
NCBI nr blastpgi|3071739634e-11452.26%Polypeptide N-acetylgalactosaminyltransferase 1 [Camponotus floridanus]
NCBI nr blastxgi|3071739631e-11052.26%Polypeptide N-acetylgalactosaminyltransferase 1 [Camponotus floridanus]
Group
KEGG pathwaydya:Dyak_GE117145e-114 
 K00710 (GALNT)maps-> O-Glycan biosynthesis
InterPro domain[633-778] IPR0089978.1e-24Ricin B-related lectin
[125-294] IPR0011737e-19Glycosyl transferase, family 2
[653-776] IPR0007721e-16Ricin B lectin
Orthology groupMCL15437 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211665-TA
ATGAAATTGCCGCTTCGTAAGTGCTATCATGCGGCAAAAGTGGGTTTATTTCTGTTAATAGTGGTCACAATTTTGGCTCTATATGAACAATGGAGAGGAGGAAAACGCTCAGCCCGCGCTGAAACCTATGACTCGGAAGACAAATATGAGAAAGAGATTTTGGAAGACGAAGCGCGTATTATTCCTGGCCTAGGTGAAGGCGGCGTCGCAGCATATTTGACTGGTGAAGCGAAACGCCTCGGGGAGGAATCCGAGAAGAAACTCGCTATAAATGTTTATTTGAGTGACCGTATCGCCTACAATAGAACTTTAAAAGACTATCGTAATCCGGCATGCCAAAGAGTCATTTATGATGCTGAGCTGCCGTCAGCTTCGGTGATATTAATATTCCACAACGAACCCTACTCTGTGGTCATCCGCACTATATGGAGCGTCGTGAACAGCGTGAGACGGACTAATCCTTGGTTTGCGAAGGCTAACGATATTGACCGAGAGTCCAGCCGGACTATGACAATGGGCTACCCAGGTCAAGACCCTTCCTCTCCATTCGTTTATCTCAAGGAGATCATTCTGGTGGATGACAAATCCACCCTTCCCGAGCTGAAGGGCAAACTGAGTCACTATGTGAGGACTCGTCTACCACCTGACCTGATACGGATATTGAGACTGCCGCAGCGGGTGGGTCTTACGAGGGCACGCTTAGCCGGAAGTGAGGTGGCCGGTGGTGACGTCATGGTATTCCTTGACTCACACACGGAACCAGTGACGGACTGGCTGAGGCCCATCCTGCAGAGGATAAAAAACGACACGACCGTGGTCGTGACCCCCATCATCGATGTCATAGAGCAGGATACCTTTGAATATAAGACGGGGAATCGCAATGACTTCGAGGTGAATCATGTTCCTGGTGACACGTCTACACTAGAAGGGGTAGATGTAGGACGAACTGAAGTCGTGGGTCTGACCCGGGCGCGGCTAGCTGGGGCTAGATATGCCACGGGCGATATCCTAGTGTTCCTAGACTCGCACTGCGAGGTGCAAGAGGATTGGCTGAGGCCTCTCTTACAAAGGATCCGGGACTTCCCCCACGCGGTGGTCGTCCCCATCATTGACGTCATCGAGTCCTCCAACTTCTACTACAGCGTCCAAGACCCCGTGATATTCCAGGGTCTTATCCTGGCTCGTATATCCGGTGCCCGTATCGCCCGCGGGGATGTGTTGATCTTCCTCGACTCCCACTGCGAGGTGAACGTCGACTGGCTCAGACCCTTGCTGCAGAGGATCTCACACAAACGTGACGCAGTCCTGACACCGCTCATCGACGTCATCGACCAGTCCACCTTCGAACTGGAGGCGGCCCAGCAGTTTCAGGTAGGTGGTTTTACCTTCATGGGACACTTCACATGGATCGAGGTTCCAGAGAGGGAGAAAAGGAGACGCGGATCAGACATAGCGCCGACGTGGTCACCAACTATGGCGGGAGGTCTGTTCGCCATCAACCGACAGTACTACTGGGAGCTGGGAGCGTATGATGAGCAGATGGCTGGGTGGGGGGGCGAGAACCTGGAGATGTCGTTCCGGATATGGCAGTGTGGTGGCACGCTGGAGACGGTGCCGTGTTCTCGCGTGGGTCACGTGTTCCGAGCCTTCCATCCTTATGGGCTGCCAGCTCACACAGACACACACGGTACACACAGACTCATGTTATACACCGCCTCCGCGGCTTACAACACACTGGGTACAGGCATCAACACGGCTCGCATGGCCGAGGTGTGGATGGACGAGTACGCTGAGCTGTTCTACCTGAACCGACCCGACCTCAGGAAAAGTCCCAAGATCGGTGACGTCACGCACCGTAAGATCCTCCGGGAAAAGCTGAAGTGTAAGAGCTTCCAGTGGTACCTGGACAACATCTACAAGGAGAAGTTCGTGCCTGTCAGGGATGTCTTTGGATACGGGAGGTTCATGAATCCGTCCTCGGCGATGTGTCTCGACACTCTTCAAAGGGAAGGTGAAGCGACAGCCTTAGGTCTGTATCCGTGTCACAGTCGCCTGGAGCCCACGCAGCATCTAGCGCTGTCCCTCGCCGGGGAACTCCGGGACGAAGAGAAGTGCGCCGAAGTTCAAAGTCCAGTGGGTTCCAACGAGAACGTCAGCAGGAGAGTGTTGATGGTCACCTGTCACGGAAAACATCGAGGCCAGCACTGGCGATATCTGCCGACACAACAGATCCAGCATACGGAGAGCGGCCTCTGCCTCCACAGTACAGGCATCTCGGGGTCTGACGCTCTGGTGATGCGATGCAGAGCTGGCGGCGCGCAGGTGTGGGTCATCGATTACAGCGAGATCAATGATTTTAGAATGAACGGTGGTTTTACCTTCATGGGACACTTCACATGGATCGAGGTTCCAGAGAGGGAGAAAAGGAGACGCGGATCAGACATAGCGCCGACGTGGTCACCGACTATGGCGGGTGGTCTGTTCGCCATCAACCGACAGTACTACTGGGAGCTGGGAGCGTATGATGAGCAGATGGCTGGGTGGGGGGGCGAGAACCTGGAGATGTCGTTCCGGATATGGCAGTGTGGTGGCACGCTGGAGACGGTGCCGTGTTCTCGCGTGGGTCACGTGTTCCGAGCCTTCCATCCTTATGGGCTGCCAGCTCACACAGACACACACGGTACACACAGACTCACGTTATACACCGCCTCCACAGCTCACAACACACTGGGTACAGGCATCAACACGGCTCGCATGGCCGAGGTGTGGATGGACGAGTACGCTGAGCTGTTCTACCTGAACCGACCCGACCTCAGGAAAAGTCCCAAGATCGGTGACGTCACGCACCGTAAGATCCTCCGGGAAAAGCTGAAGTGTAAGAGCTTCCAGTGGTACCTGGACAACATCTACAAGGAGAAGTTCGTGCCTGTCAGGGATGTCTTTGGATACGGGAGAAGTCCAGTGGGTTCCAACGAGAACGTCAGCAGGAGAGTGTTGATGGTCACCTGTCACGGAAAACATCGAGGCCAGCACTGGCGATATCTGCCGACACAACAGATCCAGCATACGGAGAGCGGCCTCTGCCTGCACAGTACAGGCATCTCGGGGTCTGACGCTCTGGTGATGCGATGCAGAGCTGGCGGCGCGCAGGTGTGGGTCATCGATTACAGCGAGATCAATGATTTTAGAATGAACGATAACGAGGTGCCGAGTGAGCAAGAAATGAAACTAAAAAAGCTGAGAGGCCAGCGCCGTATCTCGCGATCCCTCCTGTCGTACGAGGACACGACCACCAGGCGACACCACAGGAAGAAGAAGCACAAGAAGAAGAACAAGTTCATCCTCAAGCTGGTGAGGAACGAGACGGAACACCTGGAGGTGGACGTGTACTGCAAACACGCGCGGCTCTACCCCAACAACAGCTTCGTGCGGGACCTGGTCACCATCCTCAACGACAAAGACATCAAGGTCATCAACAACGGACGTGTCTTCGAACGGAACAAGGTGTCTGATGTCGTGAACCGGGAGCACTCCGCCGTGACACTCACCACCGACACCGTCAACACCCCCGACACCCCCGATACCCGCGACACTCCCGACACCTTCAACCCCTACACCGCCGACACCCCGTCGGACTACCCTCAGACGAAACCTCTGAAGAAACGGAAGCGGAAGAAGCACCGACCGGTGGCCGCGAGCCCGGAGCTCGGAGACGCCGCCAGCAACAAGGAGGAGGAGCAGGCTGGAGACCAGGACGACCTGAACAGGCCCGGCTCCAAGAAGATCATCATAGAAGACTTCGTGGAGATCAAACCCATGGGGGCCTTGGCCAAGCACCTGAAGAAACGGAAACGGGCGAAAAAAACCCGGAGCACGACCGGCGCGCCGCGGGTGGGCGACTGGGAGGACTCCATGAGGATGGAGAGTTCAGCTGACGACTCGCCCATGGTGGAGGCAGACGGCCGACCCGACACAGGGCGACGCGCCCCCAGCAGCTCGCGGGCACCTCGGCGCTCGGACCTGCTGCCTCAGAACAGAGCGCTCAACGCACAGGCACAGGAACCGGTGAAGCTCATCATGAAGAGCAACGTCACCCTCAAGCTGGGAGACGAGTTCTTCACCTGGACCAAGAGGGGCCGCACCGCGGACGATGTGGCCGACATCCTCGGCGAGCTCATTATACCGGACACGGCGAAAATCAGTCTGGAGAACGAGACGAAGGAGAAGGAAACGGAAGGATTTAAGGGGAACAAGGAGGAGTCGGGGAGGAGGAGAGACTCTAGCTCCTCGGAGAGTGGACAGGCAGACTGA

Protein sequence:

>DPOGS211665-PA
MKLPLRKCYHAAKVGLFLLIVVTILALYEQWRGGKRSARAETYDSEDKYEKEILEDEARIIPGLGEGGVAAYLTGEAKRLGEESEKKLAINVYLSDRIAYNRTLKDYRNPACQRVIYDAELPSASVILIFHNEPYSVVIRTIWSVVNSVRRTNPWFAKANDIDRESSRTMTMGYPGQDPSSPFVYLKEIILVDDKSTLPELKGKLSHYVRTRLPPDLIRILRLPQRVGLTRARLAGSEVAGGDVMVFLDSHTEPVTDWLRPILQRIKNDTTVVVTPIIDVIEQDTFEYKTGNRNDFEVNHVPGDTSTLEGVDVGRTEVVGLTRARLAGARYATGDILVFLDSHCEVQEDWLRPLLQRIRDFPHAVVVPIIDVIESSNFYYSVQDPVIFQGLILARISGARIARGDVLIFLDSHCEVNVDWLRPLLQRISHKRDAVLTPLIDVIDQSTFELEAAQQFQVGGFTFMGHFTWIEVPEREKRRRGSDIAPTWSPTMAGGLFAINRQYYWELGAYDEQMAGWGGENLEMSFRIWQCGGTLETVPCSRVGHVFRAFHPYGLPAHTDTHGTHRLMLYTASAAYNTLGTGINTARMAEVWMDEYAELFYLNRPDLRKSPKIGDVTHRKILREKLKCKSFQWYLDNIYKEKFVPVRDVFGYGRFMNPSSAMCLDTLQREGEATALGLYPCHSRLEPTQHLALSLAGELRDEEKCAEVQSPVGSNENVSRRVLMVTCHGKHRGQHWRYLPTQQIQHTESGLCLHSTGISGSDALVMRCRAGGAQVWVIDYSEINDFRMNGGFTFMGHFTWIEVPEREKRRRGSDIAPTWSPTMAGGLFAINRQYYWELGAYDEQMAGWGGENLEMSFRIWQCGGTLETVPCSRVGHVFRAFHPYGLPAHTDTHGTHRLTLYTASTAHNTLGTGINTARMAEVWMDEYAELFYLNRPDLRKSPKIGDVTHRKILREKLKCKSFQWYLDNIYKEKFVPVRDVFGYGRSPVGSNENVSRRVLMVTCHGKHRGQHWRYLPTQQIQHTESGLCLHSTGISGSDALVMRCRAGGAQVWVIDYSEINDFRMNDNEVPSEQEMKLKKLRGQRRISRSLLSYEDTTTRRHHRKKKHKKKNKFILKLVRNETEHLEVDVYCKHARLYPNNSFVRDLVTILNDKDIKVINNGRVFERNKVSDVVNREHSAVTLTTDTVNTPDTPDTRDTPDTFNPYTADTPSDYPQTKPLKKRKRKKHRPVAASPELGDAASNKEEEQAGDQDDLNRPGSKKIIIEDFVEIKPMGALAKHLKKRKRAKKTRSTTGAPRVGDWEDSMRMESSADDSPMVEADGRPDTGRRAPSSSRAPRRSDLLPQNRALNAQAQEPVKLIMKSNVTLKLGDEFFTWTKRGRTADDVADILGELIIPDTAKISLENETKEKETEGFKGNKEESGRRRDSSSSESGQAD-