Monarch geneset OGS2.0

DPOGS201317
TranscriptDPOGS201317-TA2334 bp
ProteinDPOGS201317-PA777 aa
Genomic positionDPSCF300176 + 298478-303699
RNAseq coverage495x (Rank: top 25%)
Annotation
HeliconiusHMEL0172508e-8156.20% 
BombyxBGIBMGA003106-TA5e-9774.80% 
DrosophilaSet2-PA5e-3437.61% 
EBI UniRef50UniRef50_E9IEB74e-6529.23%Putative uncharacterized protein (Fragment) n=1 Tax=Solenopsis invicta RepID=E9IEB7_SOLIN
NCBI RefSeqXP_001606723.18e-6630.96%PREDICTED: similar to huntingtin interacting protein [Nasonia vitripennis]
NCBI nr blastpgi|3407268971e-6730.12%PREDICTED: hypothetical protein LOC100652142 [Bombus terrestris]
NCBI nr blastxgi|3227999454e-8130.80%hypothetical protein SINV_04653 [Solenopsis invicta]
Group
Gene OntologyGO:00063556.3e-10regulation of transcription, DNA-dependent
GO:00056946.3e-10chromosome
GO:00349686.3e-10histone lysine methylation
GO:00180246.3e-10histone-lysine N-methyltransferase activity
GO:00055157e-07protein binding
KEGG pathwaynvi:1001231152e-65 
 K11423 (SETD2, SET2)maps-> Lysine degradation
InterPro domain[725-771] IPR0132576.3e-10SRI, Set2 Rpb1 interacting
[526-570] IPR0012027e-07WW/Rsp5/WWP
[667-744] IPR0071431.8e-06Vacuolar protein sorting-associated, VPS28
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201317-TA
ATCAACCAAGAGGAAGAAGACGTAACAACCTCAAAGACGGGCACTTCAGAATCCTCTGAGGACAAGCATGCTGTGAACGCCACAGACCAGGCCCGGCCGAGAGTACGGAAACCCAAGAGGGACAGGAAGTATAAACCAAAGTCTGGAGACTTAGTACAAGACGCTGATATCGAGGAGGACCTAGAAGCGCTCGGTCGTACTGGCATTAAGAATCAGGCTCACACGCTGCGACTGTCACGTACGGTGGTCCGCGCTAAAACACGACGCGCACAGACGGCACTGCTGCGACTACTGAGAGACGCGCACCTCCCGTGCCGCAGACTGTTCCTCGACTACCGCGGTCTCAGGCTGCTGGCGCCCTGGTGTAACGACGCTCCTTTGAACTTCAAATTGGAAATGCTCCAAACTGTTGATCGTCTACCGATCACCAATAAGACGATGGTTCAAGAGAGTCGTTTCTTCACAATCGTGGAGCGATGGCTCAGTTCCTCGGACGCTCCTCCTGAACCCGTCTTTATTGATGAGGCCACTGGCTTGCCAGTAGAAATCCCGAGGCCAACTCCAGATGAACCCGTCACACCAGAAAAGACAAAAGAAAATGCAGCAATGTCTGAAAAAGTTAAAGATCTCTGTAACCAAATGCTTGAAAGGTGGTCCAGTTTAAAGGAGGTTTTCAAGATACCAAAGAAAGAGCGCATACAGCAAATGAAAGAACACGAGAGGCAAGCGAACGTCGAGAGGCGAGCGGCTGATTCGAGCGGTACGAGAGATCGAGATAGAGAACGGGATAGGAGGGACGAGAGGGATAGAGACAGAGATAGAAGAGATGAGCGAGATAGAGAGAGGGACAGGGACCGAGACCGCGAGAGAGAGAGGGAGAGAGACAGGGACAGAGACAGGGAGAGGGAACGGGAGAGAGAACGAGAGAGAGACAGATATAGGGACAGAGACCGGGATAGAGATAGAGACGACCGCGACAGAAGAAAACGGAGAAATAGTCCGGAGGGAGGACGGCGCAGTATAAAACTGAGCGAGCGAGTGTTAGCGGCCGTGCCTCCTATAAGTAAAGAGGAGCGGCGACGCGCGTTCGCGGAAGCTGCGGCCGCCGCCGACGAGAGCAGGAGGCAGAGGGAGAGGGAGCACGCGCAGGCCTGGCACTACAGGCACTGGCCGCAAGAGGCGTTCGCACAGATGTTTCCCCAGGGCATGATGGGTAACGGACAGAATATGATGGGCGGTGGACCGAATATGATGAACGGCGCACCGAACATGTTGGGGGGAGGTCAGAACATGATGGGTGGGCCTAACATTATGGGGCAGATGGTCCCTGGGATGCTACCGGGGGTGATGCCGTCGGAGGTCTCCAATGAATGGATCGGTCCCAACGGCGAATTCCAAGGCCCGCCCGGGTTTTGCCAGCCTTTCCCTGGACCACAATCGTTTTGTCTGCCTCAGTCTAATATGATGGGTTTACCCGGTTTCGGCATGGGTGGTTTTATGTTCGGTCAGCAACTGCCGGGCGCCTTCCCGCCCCCTCAACCTGCTCTACAACAACCACAGCAACCTCAGACCATCACAGATAATATAACGGAGTCGGGGGGCGAGCCTCCTCTGCCGTCGATGTGGCGGAGCGCCGTGGATGGTCGAGGTCGCCGGTACTACTACCACGTGAAGCTGAGACAACCACAGTGGCTGCCACCGCCGCCACCACCACAAGAGGAAAGTTCGTCTGAGGAGGAAGTGGAGCCCATGACCGCCATGGAGTCCGCTGTGATCGGGCGGCCGGTCAAAGGGAAACTCGTTGAGGGGGTCAACGGTATATACGAAGTTATCAAAGAGGATCCCCAGAACGGCCTCATTCCAGATCACGCCTTACTCAACATGAAGCCGCGCAAGAGAAGGCCCGGGCTGGTCACTGAGAGGCCTATCAGTCCGAGAACCGAAGAGGACAAGCTGGCCGGACGTATGGAGGTGAAGAGATACAAGCAAACCAAAGAGAAGTTACGGAGACGAAGAGAGAAGTTGCTGCAGAAGGTGAAGATGTTGACTGACAGGAGACGGAAAGATATGAAGTTAGATTGTCCGGCAGCATTAGAGAGGATAAGGGAGAACAAACCAAACCTGATTAAAGATGACAAAGGGAACACTAACAAATATATCGCTGAAATTGTATCGTTGACGCACTTCGTGATGCTGAAAGAGTTGAAGCACTGTCGGTCGGTGGACGAGCTGGAGGTGACGGATTCCGTCCGCACCAAGGCCAAGCTGTTCGTCAAGAGATATATGATGAAGTTCGGGCCCGTTTACAAGAGACCGCCCGAGGAGGCCGACTAG

Protein sequence:

>DPOGS201317-PA
INQEEEDVTTSKTGTSESSEDKHAVNATDQARPRVRKPKRDRKYKPKSGDLVQDADIEEDLEALGRTGIKNQAHTLRLSRTVVRAKTRRAQTALLRLLRDAHLPCRRLFLDYRGLRLLAPWCNDAPLNFKLEMLQTVDRLPITNKTMVQESRFFTIVERWLSSSDAPPEPVFIDEATGLPVEIPRPTPDEPVTPEKTKENAAMSEKVKDLCNQMLERWSSLKEVFKIPKKERIQQMKEHERQANVERRAADSSGTRDRDRERDRRDERDRDRDRRDERDRERDRDRDRERERERDRDRDRERERERERERDRYRDRDRDRDRDDRDRRKRRNSPEGGRRSIKLSERVLAAVPPISKEERRRAFAEAAAAADESRRQREREHAQAWHYRHWPQEAFAQMFPQGMMGNGQNMMGGGPNMMNGAPNMLGGGQNMMGGPNIMGQMVPGMLPGVMPSEVSNEWIGPNGEFQGPPGFCQPFPGPQSFCLPQSNMMGLPGFGMGGFMFGQQLPGAFPPPQPALQQPQQPQTITDNITESGGEPPLPSMWRSAVDGRGRRYYYHVKLRQPQWLPPPPPPQEESSSEEEVEPMTAMESAVIGRPVKGKLVEGVNGIYEVIKEDPQNGLIPDHALLNMKPRKRRPGLVTERPISPRTEEDKLAGRMEVKRYKQTKEKLRRRREKLLQKVKMLTDRRRKDMKLDCPAALERIRENKPNLIKDDKGNTNKYIAEIVSLTHFVMLKELKHCRSVDELEVTDSVRTKAKLFVKRYMMKFGPVYKRPPEEAD-