Monarch geneset OGS2.0

DPOGS201438
TranscriptDPOGS201438-TA3906 bp
ProteinDPOGS201438-PA1301 aa
Genomic positionDPSCF300006 - 1138235-1150725
RNAseq coverage1896x (Rank: top 7%)
Annotation
HeliconiusHMEL0159870.051.59% 
BombyxBGIBMGA002596-TA0.044.66% 
Drosophila% 
EBI UniRef50UniRef50_E2AEK62e-14333.46%Glycogenin-1 n=1 Tax=Camponotus floridanus RepID=E2AEK6_CAMFO
NCBI RefSeqXP_624798.23e-11167.05%PREDICTED: similar to Glycogenin CG9480-PB, isoform B [Apis mellifera]
NCBI nr blastpgi|3071800539e-14333.46%Glycogenin-1 [Camponotus floridanus]
NCBI nr blastxgi|3071800533e-16132.67%Glycogenin-1 [Camponotus floridanus]
Group
Gene OntologyGO:00167571.1e-43transferase activity, transferring glycosyl groups
KEGG pathway 
InterPro domain[7-222] IPR0024951.1e-43Glycosyl transferase, family 8
Orthology groupMCL14501 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201438-TA
ATGTCAAACCGAGCATGGGTGACGCTAGCTACAAACGATTCCTATGGTCTTGGGGCTCTGGTTCTGGCGCATTCTTTGCGTCGGGCGTCCTCCTCTTACCCAGCAGTCGTGCTGATTACACCCTCGGTCACTGAGCCCATGAGGGAGCGTCTCCGTGCAGTATTCGCTGAAGTAATCCTAGTGGACGTTTTGGATTCCAAGGATGCAGCTCATCTCGCCTTGCTGCAGCGGCCGGAACTGGGCATCACATTTACCAAAATACATTGTTGGAATCTCACGCAGTATGAGAAATGTGTCTTCCTTGATGCCGATACACTGATCGTTCAGAACTGTGATGAGTTATTTGAACGCGAGGAGTTGTCCGCGGCTCCCGACGTCGGCTGGCCCGACTGTTTCAATTCGGGAGTTTTTGTATTCAAACCTTCCGCCGATACATTCAGCAAACTCGTCACATTTGCATCCGAAAGGGGCAGTTTTGATGGTGGTGACCAGGGACTCTTGAATTCCTACTTCTCGGATTGGGCCCATGGTGACATTAACAAGCATTTGCCCTTTCTGTACAATGTGACATCTGCTGCCTTTTACTCCTATATCCCAGCCTTAAAGCATTACGGCCAAAATTTAAAAATTATCCATTTCATCGGCGCCGCTAAGCCGTGGCTCCAGCATTTCAACTGGCAGTCTCGGTCAGTCGAGGCCCCCGAACATTTACGAGGTTTCTTGCAACTATGGTGGGACCTCTTTGTTGCACAAGTTCATTCACAGCTAGACACACAAATGGCTGAGGAAGTTCCTCTGGGGATTGACTTAGAAGAAGAAGAACCGAGTGAATATGATGAACCAGTACAGGATTATAGTTTCTATGAACCGACACTGGATCCCAGTTCTGAGTTTCCATGGCATCGTCCTTATGATCAGATCAAAAACACAGAATCCATTGAGCCCAGCATTGACATAGGTCAATTTCATGATCCATGGCAAATTTACAGAGGGAACATACCTCCTAGTAAAGATGATGCAAGTTGTATAAATGCAACGGAAAGTCATAGACAGTATGCCTGGGACTACATGCAACCGCAAACACAGCATTATACACCTGAGAATAGTCACAATTCTGAAAACACTTACACTCAAAATTATAATAGCGAAATATGGCAATATAATTCCGAACATAGTTCTCAACCTCAAACTACGCAACAGTTTACGACATTTACACCTTCAATAAGTTCTCAATGGGAAGAGAGTCAATGCAACATAAATGTTCACGATCAACACTATCACACACCGATCCAAGAAATAATTGTTCACCATGATCATTACCCAAGTCACAGTAATCAACAAAGTTCCCCTGAATCTCAAAATCAGCCTGGTGATCATCAAGGTCATAACAACCACCAAAGTCACACCCATCACCAACATATCGAACATCATCAAAACAATTATCAGAATTACAATCAATCTCATCACCATGAAAATGATCAGAACCAAAACGATTTCACTCAGCAGCGTTTTGAAAGTCATGTACATGAACAAAATCAAAATTATAATTATCAACATTCATCAGAACATCATCACGAGTCACAATCCTATCACGACCCTGGTTTTGAGCAATCACATCAAAGAGCCGATTATTATCAAGACAAACACGATTCACAGTCTCTTTTTCATAACCATTCCCATAGCAATATCACAGAAAATAACAAAAATGTTAACAATGATGAAAGATTCAATAACAGTTATATGAAAAATGTTGAAATAAATTATTCGCAGTTTAAAAAACAAACTCAACCACAAATATACACTGTCATGATGGATCATGAAAGGTTGCATAATGTTCGTAAATTACATTCAAATCTAAATGGCTGTGAAGCAGAATACTATAGTAATACTTTTGAAGATATCCCTAGGCATCCGTACGATGGATTTTATCTAAGACATAGAACTACTATAGATTCTCGGGGGCGAAAAATCTGTATTCATGAAATACCTTTATCCCCTCCTTCGCCAACACCGTCACTCGAGTCGTCACTTGAAAGCGATGATGAAAATGAAATATTCAAAGATATTAACTATGACAGGCTGAACGGTGAGGAAAGTCAAACTGGTGTAGCAGGTAACCTGGCTAAAGTAGTGCCGGGTGAACCGCAGCAACAAGAGGCAGTCGATGAGCTTACGAGGCGACAGGGCTGGGAAGCCGGCAATATTGATTATATGGGTGCTGACTCCTTTGACAATATCTGGGCGAAAATATCGCAAACACTAAGTCAACCACCAAGCTCTCCGCCCCGACAACCTTCTCCATCTAATGACCAGTCTGTTCAACCTAGCGAAGATCGTGCTGTCGCAATAGAAGAAGTAAAAGAGGCCGTGGTAGCACCGGTTGAATCAAAACCAGAAGAACCAGTCAAAGGTAGTATACCGTCAGACGCATCCTCTGAAACACCCGTTGCAGTAGAGGCACCGGTCATGGCATCCGAAGTAGATGCCTCTACTGCTTCTACTGAAACTGTAGAAAATGTTGTGTCACTGGACGCTGCAGAAACTGTCGCGCCTACTGAGACAAGTGAAAGCGTTCCCCCTCCAGAAGCACCTGCATGTGTTTCCCTTCCCGTGGCATCTGATTGTGTTTCCTCGCCTGAAGTTCCAGCAACTGTTGATGCAACAGAACCTGTTGTACAGATTGAAGCTGAAGCAACAGAAAGTGTTTCCCCTCCCGAAGCATCAAAACCTGTTGCACAGATTGAAGCAACAGAAAGTGTTTCCCCTCCCGAAGCATCAGAGCATGTTGCACAGATTGAAGCAACAGAAAGTGTTTCCCCTCCCGAAGCATCAGAGCATGTTGCACAGATTGAAGCAACAGAAAGTGTTTCCCCTCCCGAAGCATCAGAGCATGTTGCACAGATTGAAGCAACAGAAAGTGTTTCCCCTCCCGAAGCATCAGAGCATGTTGCACAGATTGAAGCAACCTGTGTTTCCTCACCCCAACCATCGGAAAATATTCCAATAGTTGAATCAAAAGAAACTGTGGCACCACCTGTCGCAGACGTAAACGTTGCTTCGACTGAAGTCTTAGAAAATGTTTCTGCGCCTGCAACACCTGTTGCTCCGGCTGAATCGATCGATAGTTTGACTGCTCCTGAACCACCTGCACCTGCAACTGAAAATAAAGATGATGTTGTCCCTGAAACTTCACCAGCAGCAGCGTCAATTGAAGAGCCTGTCGTAGATGTCCCAGTTCCAGCGAGTGAAGCCGCCGAATCTCCAGATTCTCCTTTAAATAAATCATTGGCAACCAATGAACAATCCTCAGACAAAAAGGAAGTGGCCTCTGACAGTCCTCCTCTGGCCAATACCCCTTCCAAAGAAGAAATACCTAGTCCGCCAGCTGCCAAAACGGAGGAGCGTCGCAAGCCGTTGGGGAAACTGTCGCTGCCGCCCGCGGCTTGTGACACGCTGCCAACACCTGACAGCGAGCTAGAGGACGCGGCCTCGCTTGCACACGCCATCATCGCCGGTGAACTGCGCACGCCTACTGTCACTTCCCCCTCACCTCCCGTCATATCTTCTTCACCTCAAACACAACCCTCACAAACACAAGCCCGCAGTCTATCCATCGACCAGCCCGAAGCACCAACTCCTCCCCTTGATTCCCCCCTATCATTATCTCAGATCGGCGTCAAATCAAAACCCACCATCGCATCTCAAATAGAAACCTCGGTTTCTAAGACCGAATCGGCCCCGACTTCCGAGGTGTCCGAAGCACCTAAACCGAAGTCAGACGCTCCCAAAAAGAAAATAGTGAAGAAAGTGGTGAAGAAGGTGGAAAAGGAAGGCGGTGCCAGTGGTGACGCGCCAGTCCCCGTCCCGCCGCCGCGGAAAAAGGAAAAGAAACCCAAGGAGAAATAA

Protein sequence:

>DPOGS201438-PA
MSNRAWVTLATNDSYGLGALVLAHSLRRASSSYPAVVLITPSVTEPMRERLRAVFAEVILVDVLDSKDAAHLALLQRPELGITFTKIHCWNLTQYEKCVFLDADTLIVQNCDELFEREELSAAPDVGWPDCFNSGVFVFKPSADTFSKLVTFASERGSFDGGDQGLLNSYFSDWAHGDINKHLPFLYNVTSAAFYSYIPALKHYGQNLKIIHFIGAAKPWLQHFNWQSRSVEAPEHLRGFLQLWWDLFVAQVHSQLDTQMAEEVPLGIDLEEEEPSEYDEPVQDYSFYEPTLDPSSEFPWHRPYDQIKNTESIEPSIDIGQFHDPWQIYRGNIPPSKDDASCINATESHRQYAWDYMQPQTQHYTPENSHNSENTYTQNYNSEIWQYNSEHSSQPQTTQQFTTFTPSISSQWEESQCNINVHDQHYHTPIQEIIVHHDHYPSHSNQQSSPESQNQPGDHQGHNNHQSHTHHQHIEHHQNNYQNYNQSHHHENDQNQNDFTQQRFESHVHEQNQNYNYQHSSEHHHESQSYHDPGFEQSHQRADYYQDKHDSQSLFHNHSHSNITENNKNVNNDERFNNSYMKNVEINYSQFKKQTQPQIYTVMMDHERLHNVRKLHSNLNGCEAEYYSNTFEDIPRHPYDGFYLRHRTTIDSRGRKICIHEIPLSPPSPTPSLESSLESDDENEIFKDINYDRLNGEESQTGVAGNLAKVVPGEPQQQEAVDELTRRQGWEAGNIDYMGADSFDNIWAKISQTLSQPPSSPPRQPSPSNDQSVQPSEDRAVAIEEVKEAVVAPVESKPEEPVKGSIPSDASSETPVAVEAPVMASEVDASTASTETVENVVSLDAAETVAPTETSESVPPPEAPACVSLPVASDCVSSPEVPATVDATEPVVQIEAEATESVSPPEASKPVAQIEATESVSPPEASEHVAQIEATESVSPPEASEHVAQIEATESVSPPEASEHVAQIEATESVSPPEASEHVAQIEATCVSSPQPSENIPIVESKETVAPPVADVNVASTEVLENVSAPATPVAPAESIDSLTAPEPPAPATENKDDVVPETSPAAASIEEPVVDVPVPASEAAESPDSPLNKSLATNEQSSDKKEVASDSPPLANTPSKEEIPSPPAAKTEERRKPLGKLSLPPAACDTLPTPDSELEDAASLAHAIIAGELRTPTVTSPSPPVISSSPQTQPSQTQARSLSIDQPEAPTPPLDSPLSLSQIGVKSKPTIASQIETSVSKTESAPTSEVSEAPKPKSDAPKKKIVKKVVKKVEKEGGASGDAPVPVPPPRKKEKKPKEK-