Monarch geneset OGS2.0

DPOGS212569
Transcript: DPOGS212569-TA, 2169 bp
Protein: DPOGS212569-PA, 722 aa
Genomic position: DPSCF300075 + 89025-97320
RNAseq coverage: 68x (Rank: top 67%)
Annotation
Heliconius: HMEL008802, E-value 0.0, identity 66.97%
Bombyx: BGIBMGA002076-TA, E-value 0.0, identity 60.97%
Drosophila: Smyd4-PA, E-value 5e-62, identity 35.77%
EBI UniRef50: UniRef50_UPI00021A807F, E-value 2e-119, identity 37.26%, UPI00021A807F related cluster n=3 Tax=unknown RepID=UPI00021A807F
NCBI RefSeq: XP_002428677.1, E-value 1e-110, identity 33.38%, set and mynd domain-containing protein, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|383849683, E-value 1e-125, identity 37.43%, PREDICTED: SET and MYND domain-containing protein 4-like [Megachile rotundata]
NCBI nr blastx: gi|340723180, E-value 2e-124, identity 37.26%, PREDICTED: SET and MYND domain-containing protein 4-like [Bombus terrestris]
Group
Gene Ontology: GO:0005488, E-value 3e-09, binding
KEGG pathway:
InterPro domain: [658-686] IPR011990, E-value 3e-09, Tetratricopeptide-like helical
Orthology group: MCL16673, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212569-TA
ATGTGCAGTGATGTGACGCTGTGTGCGAACAATAAGGGTTTTTTTAAGAACTTCGCAGAAGAAATGGTTTCATTGGCAGAAATTGATGGATGGTTGGATGACTTTGAACTTATTGAGGACAAAGAAAAAGTCCTAGCAGTAAGGAATAATCAAAAAATTATGGAACCTATCAATGAGTTATTATCTAGAATTCAACCCTTATTCCGAGGTAAGGACGCAAGAGTATCGCATGAAAAGAAAACTGCAGCTCTGATAGCCTTAAAAAATGGAGATCTCGTTAAAAGCCTTTCTCTAGCCAACCAAGCCGTGCTTCGAGCTCCAATGACTGGTACCGATGAAATAATAGATAGTGGTATAACACTCGCTTTAAGCCTCTGGGTTCGTTCGGAAGTTTTGTTAAGTCTAAACCGTCCGAAACCAGCTTTAGAAGACCTTAAGCTAGCTTTGAAAGAACGGCTTCCAGCTCGTATGAGAGCAGATTATTATTGGAGGATGGGCCACTGTTACAAGGGTACTGGTGAAACAACACGAGCTAAGGTTTCATATGAATTAGCTAGCCGGTTATTGGGGGATAAAAAAGAAGCAAAAATTCAACTAGCTAATGATATCGAATCATTAAAACACTCTACACAATCCGAAAGTCCTTCCAAACTGAAAGAACCTCAACTCACAAGCGGTGCAACGTTAAATTTACCAGCTCTTTCAAAATTATTAAAAATTACTGAAGATAACGAAAAGGGCCGTTACGCAGTTGCTAATGCTCCAGTAAAAACCGGTGACATAGTTTTAGTTGAAAGTCCCTACGCTGCTTGTTTACTCGCTGATTGCCATGGCTCTCACTGCCTTCATTGCTTTGTAAGATTAGAAGATTTTGAGGACTCGGCTCCAATATGGTGTCCCAATTGCTCAGGAGTAGCATTTTGTTCGATACAATGTCGAGATGCTGCAATTTCCACATATCATTTATACGAGTGCCCGTTTTTTAACCTATTTATTGGTTCCGGAATGTCGGTACTTAGCCACATTGCTCTCCGTATGGTAACCCAAGCCGGACTGGACACAAGTCTTTCAATACATTCGAAGTTTTTAAGCAATGAAGTTAAGACTATACAGAGTCCGGTATTAAACGATGTTGAAGGAGAAAAAAAAAAGTTTAAGATAAAAAGTAGAAAAGAGAGATTGAACAGAACAAGAAAAGGTATGAACATTATCGAAAATAAAACTTCCGATACACAAGAAATTGAACCACAAATTAAAAATGAGACGAGTTACAATGAAAAGATAGAAATGGCAGCTGAGCAAATTTATTCACTGCTGGCTCATTCACGACAAAGGAAGGGAGCAGATTACCTAAAGCGTATAATTATGGGCATGTTTCTAACGGAATGTTTGAAGAAAACCGATTTTTTTAAAAATTGTGAAAAAGAAAATATAACAAGAGCTGAAATATCAATTTGCGAATTGATAGTTCGTAACTTGCAATTATTACAATTTAATGCCCACGAGATATATGAAACAGTGCGTGGAGAACATCAATTTAGAGGATCTAAACCAGTCTACATAGGCGTAGGAATTTATCCTACAGGAGCCTTATTTAATCACGAATGTTATCCCGCAGTGGCACGATATTTCTATGGTAAAAAAATGTCATACCGCGCGATACGACCTCTTGAACCAGGAGAGATTGCCGCTGAGAACTATGGACCGCATTTTTTGATGCGCACGCTTAAGGAACGCCAAAGGATGCTGACGTGTCGATACTGGTTCAGATGTCAATGTATAGCCTGCGTTGAGGATTGGCCGACTCTCAAAGAAACTGAATCTAAATCACCAATATACTTGAGGTGTCTCAATAAGAAGTGCCACGGAAAAATTAAAGTTATCAAAAATCCAACAAACTTGAAGTGCCCGAAATGTTCTATGGCCTTTAATAAGACTTCTTTGAAAGAATGTTTAAACGAGGTTGACATAGTTCTCTCGCAGTACGAGGCAGGTGCGAA
GCTAATGGAACAGCAGCGGCCCCAAGATGCTATCGAAATATTCTCAAAAGCCATTGATTGCTTTTATGACTTTGCAATGCCTCCACATCGAGAAACACATATAGCACAAGAATCGCTAAGGTCGTGTTATGCTACATTTGGAAACACCCATATTTTAAAAGAAAAATGA

Protein sequence:

>DPOGS212569-PA
MCSDVTLCANNKGFFKNFAEEMVSLAEIDGWLDDFELIEDKEKVLAVRNNQKIMEPINELLSRIQPLFRGKDARVSHEKKTAALIALKNGDLVKSLSLANQAVLRAPMTGTDEIIDSGITLALSLWVRSEVLLSLNRPKPALEDLKLALKERLPARMRADYYWRMGHCYKGTGETTRAKVSYELASRLLGDKKEAKIQLANDIESLKHSTQSESPSKLKEPQLTSGATLNLPALSKLLKITEDNEKGRYAVANAPVKTGDIVLVESPYAACLLADCHGSHCLHCFVRLEDFEDSAPIWCPNCSGVAFCSIQCRDAAISTYHLYECPFFNLFIGSGMSVLSHIALRMVTQAGLDTSLSIHSKFLSNEVKTIQSPVLNDVEGEKKKFKIKSRKERLNRTRKGMNIIENKTSDTQEIEPQIKNETSYNEKIEMAAEQIYSLLAHSRQRKGADYLKRIIMGMFLTECLKKTDFFKNCEKENITRAEISICELIVRNLQLLQFNAHEIYETVRGEHQFRGSKPVYIGVGIYPTGALFNHECYPAVARYFYGKKMSYRAIRPLEPGEIAAENYGPHFLMRTLKERQRMLTCRYWFRCQCIACVEDWPTLKETESKSPIYLRCLNKKCHGKIKVIKNPTNLKCPKCSMAFNKTSLKECLNEVDIVLSQYEAGAKLMEQQRPQDAIEIFSKAIDCFYDFAMPPHRETHIAQESLRSCYATFGNTHILKEK-