Monarch geneset OGS2.0

DPOGS213562
TranscriptDPOGS213562-TA2244 bp
ProteinDPOGS213562-PA747 aa
Genomic positionDPSCF300033 - 46316-50655
RNAseq coverage313x (Rank: top 36%)
Annotation
HeliconiusHMEL0108670.089.83% 
BombyxBGIBMGA011842-TA0.084.37% 
DrosophilaSu(z)12-PB3e-15144.36% 
EBI UniRef50UniRef50_E2BG050.055.46%Polycomb protein Suz12 n=10 Tax=Coelomata RepID=E2BG05_HARSA
NCBI RefSeqXP_392695.20.058.45%PREDICTED: similar to Polycomb protein Suz12 (Suppressor of zeste 12 protein homolog) [Apis mellifera]
NCBI nr blastpgi|3838628360.055.53%PREDICTED: polycomb protein suz12-B-like [Megachile rotundata]
NCBI nr blastxgi|3838628360.055.85%PREDICTED: polycomb protein suz12-B-like [Megachile rotundata]
Group
KEGG pathway 
InterPro domain[498-633] IPR0191354.8e-52Polycomb protein, VEFS-Box
Orthology groupMCL14072 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213562-TA
ATGCCCCCGAAGAAGCGTGAAAAAGAAAGCGAAACAAATAAAAACTCTAAAATAGATCATTTGCAAGCGGATCATGAACTATTCTTACAAGCTTTCGAAAAACCAACACAAATATACCGATTTTTGAGAACACGTAATATGTTATCGCCCATATTTTTAAATCGGACGCTGTCTTATATGAAACGTAGGATGTCTAGAAGCAATAAAAGCCGTATCGGCTTCAAAGTGGACTCCTTGCTTGAAAAGATAACATTAAAAAAGAGCACTGAATTACAGCTTAACAGTCTGGGTGGTTATATGACTTTAACGTTTTTGGGTTTTTACGATAAAAGTTTAGAAGAACCTAGTGATTATCAAGTGAAAGTTGAAACATTACTCTTAAAGATATGTCACAAGAAAAGGAAAGAAAGCTCATCAGCTATAGTGGAAGTTTCTGTTGGCAGTTGTTCAGTTCCCCTCAACCCTTCGACTTCTGAGCCTCCAGCTATGGCCTCTGCTGTGAGCATATCAAGTGACACTTTCAGCCCTTCACATGGACCCAATGTAAAATCCTACATGCTCATGTTAAGGGTAACTGTCACAAAGGCTTCCATGAGCAATGGTGCAAGTTCAACAGAAATAACTAATGGAGATGATGAACCTCTAACAAAACGTCTCAAATCATCATCCGATATAAATTCAAGCACAGATAATAAATGGAGCAAACTTTATGGCAGTGAGTTAATAGTGTATGACAAACATAACAGATGTCTTCTAACTAACGGGGAGTATGATCTAGTGTTGCATGATGTAACCCCCGGCGGCCGGAGCGCTACCATTGCCAGATCTCCTCACAAAGCTCTGATGGCCCAATGGGAAACTATACCTAATGAAAATGACCTACAAACAGAGACAAATCCTTTCGATATATTCAAAGTCAGACCGTTATTGAAGTTGAAGCTCAGCTGGAGTCAGGAACCCACAAACGGTTTAGTGAACCGGCCGAAACTCTACCAGCAGGACAGTAATGGTGTGAAGAAAGATAACACTGCCAAAGAGAAAATTTCCACCCCGAGGTGCTCCACTAGTGAAATAAAAAATGGAGAGAAATCCAAGCAGGAGTCTACAGATTCATCCAAACGTCAACAGATCATTTATCAGTTCCTATACAACAACAACTCCCGCCAACAGACCGAGGCCTGCGATGACTTACATTGTCCATGGTGCTCCCTCGACTGCGGAGCGCTATACTCCCTGCTCAAACATCTCAAGCTGTGCCACTCAAGATTTAATTTTACTTATTTTCCAATACCAGGTGGCGCTCGTATCGATGTGTCTATCAACGAGTTATATGATGGATCGTACACGGGCTCCCCTCACGACCTGATAGCGGCTCAAGGTCGGTCCAGGGGCGGAGGCCCACGCGCGGGGCCTACTCGTAGGGCTTCGCTGACACATCTACTAGTGTTGAGACGACGAAGACACAGGCATAGTCTGGCAGAATTCCTAGAGCTGGATGATAATGATGTGGACGCACAGAGACCCTACCTAACAGGACATAATAGATTATACCACCACACAATAACCTGCCTGCCGGTGTACCCTAATGAGCTGGACATCGACTCCGAGAGCGAAACGGATCCGCTCTGGCTCCAACAGAAGACCATGATGATGATAGACGAGTTCACCGACGTCAACGAGGGAGAGAAGGAGCTCATGAAGATGTGGAACCTTCACGTCATGAAGTACAACTACGTCGGCGACTGCCAGATACCGCTCGCTTGCCAGATGTTCCTACAGATGAAGGGCAAGGAACTGCTCGAGAAGAATCTATACAGGAATTTCATACTCCACATGTGTTCATTGCACGATTTCGGACTTCTGAGTCCGGTGGCGCTCTATCAAACCGTACAAATGTTGAACCAAATGTTGGCGGACAGCGCCGACGCCAAAGAGAAGATGAGAGAGTCGCTGAGGGCGCAGAGGGACCACTGGAACGCCGTAGGGAAATTCCAGCAACCTGTGATAATAGAACAAAAACCTAACAACGCCACTGTCAAGCTGAACAACGTCGAGGCGTCGCCGTCCGTCAGAAGAAAAACGTCGAATTTACAGAACGCCAACAGAATGGGCAGCGCTAGCTCGAACTTCAACAAATCATCTAGTCCCGGACCCGCCAACGGCGAAACCAAAAGGAAAATGTCTTCCGGAAGCATTCAGAGTAGAAAGAGGTCTTCTATATCCGAAAGAAAGAGCTCGGTATAG

Protein sequence:

>DPOGS213562-PA
MPPKKREKESETNKNSKIDHLQADHELFLQAFEKPTQIYRFLRTRNMLSPIFLNRTLSYMKRRMSRSNKSRIGFKVDSLLEKITLKKSTELQLNSLGGYMTLTFLGFYDKSLEEPSDYQVKVETLLLKICHKKRKESSSAIVEVSVGSCSVPLNPSTSEPPAMASAVSISSDTFSPSHGPNVKSYMLMLRVTVTKASMSNGASSTEITNGDDEPLTKRLKSSSDINSSTDNKWSKLYGSELIVYDKHNRCLLTNGEYDLVLHDVTPGGRSATIARSPHKALMAQWETIPNENDLQTETNPFDIFKVRPLLKLKLSWSQEPTNGLVNRPKLYQQDSNGVKKDNTAKEKISTPRCSTSEIKNGEKSKQESTDSSKRQQIIYQFLYNNNSRQQTEACDDLHCPWCSLDCGALYSLLKHLKLCHSRFNFTYFPIPGGARIDVSINELYDGSYTGSPHDLIAAQGRSRGGGPRAGPTRRASLTHLLVLRRRRHRHSLAEFLELDDNDVDAQRPYLTGHNRLYHHTITCLPVYPNELDIDSESETDPLWLQQKTMMMIDEFTDVNEGEKELMKMWNLHVMKYNYVGDCQIPLACQMFLQMKGKELLEKNLYRNFILHMCSLHDFGLLSPVALYQTVQMLNQMLADSADAKEKMRESLRAQRDHWNAVGKFQQPVIIEQKPNNATVKLNNVEASPSVRRKTSNLQNANRMGSASSNFNKSSSPGPANGETKRKMSSGSIQSRKRSSISERKSSV-