Monarch geneset OGS2.0

DPOGS203219
Transcript        DPOGS203219-TA  2910 bp
Protein           DPOGS203219-PA  969 aa
Genomic position  DPSCF300035 + 943753-950097
RNAseq coverage   805x (Rank: top 16%)
Annotation
Heliconius        HMEL006496        0.0    84.95%
Bombyx            BGIBMGA011500-TA  0.0    85.21%
Drosophila        Su(var)3-9-PA     4e-67  36.87%
EBI UniRef50      UniRef50_Q9N6T9   0.0    74.62%  Putative heterochromatin protein (Su(Var)3-9) (Fragment) n=3 Tax=Obtectomera RepID=Q9N6T9_9NEOP
NCBI RefSeq       XP_975868.1       0.0    63.90%  PREDICTED: similar to heterochromatin protein isoform 2 [Tribolium castaneum]
NCBI nr blastp    gi|91077470       0.0    63.90%  PREDICTED: similar to heterochromatin protein isoform 2 [Tribolium castaneum]
NCBI nr blastx    gi|270002141      0.0    63.69%  hypothetical protein TcasGA2_TC001103 [Tribolium castaneum]
Group
Gene Ontology     GO:0005515  7.9e-31  protein binding
                  GO:0005525  2.9e-27  GTP binding
                  GO:0003924  2.9e-27  GTPase activity
                  GO:0005634  2.1e-21  nucleus
                  GO:0008270  2.1e-21  zinc ion binding
                  GO:0034968  2.1e-21  histone lysine methylation
                  GO:0018024  2.1e-21  histone-lysine N-methyltransferase activity
KEGG pathway      dpo:Dpse_GA19622  0.0
                  K11419 (SUV39H, CLR4)  maps-> Lysine degradation
InterPro domain   [747-861]  IPR009000  4.9e-42  Translation elongation/initiation factor/Ribosomal, beta-barrel
                  [867-958]  IPR015256  3.2e-36  Translation initiation factor 2, gamma subunit, C-terminal
                  [862-960]  IPR009001  1.8e-34  Translation elongation factor EF1A/initiation factor IF2gamma, C-terminal
                  [393-524]  IPR001214  7.9e-31  SET domain
                  [626-743]  IPR000795  2.9e-27  Protein synthesis factor, GTP-binding
                  [295-385]  IPR007728  2.1e-21  Pre-SET domain
                  [131-198]  IPR016197  1.1e-16  Chromo domain-like
                  [139-189]  IPR023780  2.2e-13  Chromo domain
                  [138-191]  IPR000953  4.6e-11  Chromo domain/shadow
                  [775-857]  IPR004161  4.3e-10  Translation elongation factor EFTu/EF1A, domain 2
                  [289-377]  IPR003606  1.8e-09  Pre-SET zinc-binding sub-group
Orthology group   MCL14615 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species
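
The InterPro hits in the annotation above are listed by e-value, which makes the domain order along the protein hard to see. A minimal Python sketch, assuming the bracketed numbers are 1-based amino-acid coordinates on DPOGS203219-PA, reorders them by start position and checks that none falls outside the 969 aa protein. The names `hits`, `sort_hits` and `out_of_range` are illustrative and not part of the database.

```python
# InterPro hits copied from the record: (start, end, accession, name).
# Assumption: coordinates are 1-based residue positions on the protein.
PROTEIN_LENGTH = 969  # aa, from the Protein field

hits = [
    (747, 861, "IPR009000", "Translation elongation/initiation factor/Ribosomal, beta-barrel"),
    (867, 958, "IPR015256", "Translation initiation factor 2, gamma subunit, C-terminal"),
    (862, 960, "IPR009001", "Translation elongation factor EF1A/initiation factor IF2gamma, C-terminal"),
    (393, 524, "IPR001214", "SET domain"),
    (626, 743, "IPR000795", "Protein synthesis factor, GTP-binding"),
    (295, 385, "IPR007728", "Pre-SET domain"),
    (131, 198, "IPR016197", "Chromo domain-like"),
    (139, 189, "IPR023780", "Chromo domain"),
    (138, 191, "IPR000953", "Chromo domain/shadow"),
    (775, 857, "IPR004161", "Translation elongation factor EFTu/EF1A, domain 2"),
    (289, 377, "IPR003606", "Pre-SET zinc-binding sub-group"),
]


def sort_hits(hits):
    """Return the hits ordered by start, then end coordinate."""
    return sorted(hits, key=lambda h: (h[0], h[1]))


def out_of_range(hits, length):
    """Return hits whose coordinates do not fit within a protein of the given length."""
    return [h for h in hits if h[0] < 1 or h[1] > length or h[0] > h[1]]


ordered = sort_hits(hits)
for start, end, acc, name in ordered:
    print(f"{start:>4}-{end:<4} {acc}  {name}")
```

Read in this order, the chromo, Pre-SET and SET hits occupy roughly residues 131-524, and the translation-factor hits occupy roughly residues 626-960.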

Nucleotide sequence:

>DPOGS203219-TA
ATGACTTCGAACGAAGGAAGAGGACAAGCAAACTTGCATCAGCAAGATTTATCTAAATTAGACGTCACAAAACTTTCTGCTTTGTCTCCCGAAGTTATATCGAGACAGGCCACTATCAACATAGGTACTATTGGTCATGTAGCTCATGGAAAGTCTACAGTGGTTAAAGCCATTTCTGGAGTTCAGACTGTCCGATTCAAGAATGAATTAGAAAGGAACATTACTATAAAATTAGAGCGCCTGTCCGATTCAGTTATAAGAATGTATCGTGAACGAGCTGATGAAAGGAAAGAAAGGATATTCTTTGAAAAAATGAGAAAAAATAAAAAAAGGTCATTGCCACCCGATCCAGACGTGACTCCTGAAAAACCACCAAAAAAGAAACAAAAGAAAAATAAACAAGAAAATGAGGAATTTATTATTGAAAGGATATGTGGGTTTAAGTTTCAATCAGGAAAAGAATTGTTTCATATAAAATGGAAAGACTATAATGAGAGTGAAGCTACTTGGGAACCAGCGGAGAATCTAATTAATTGCCCAGAAATATTACATGAATTTTTGAGTAAGGAAGAATTGAAGCATGCTGATAAGATTGAAAAACTAAAAGAAGAAATATCTTTTGGTAATCTACTCGAAGATGAATACCTCATTCAAAGGTTGGATGAAGTAGAGGATTCGGAACTTACTAAATTAAAAAATGATCTCATTGTCAAACTTCTTACCATGATCTGTCTGAAACAAAGTGATGAACATTATGCTTCTCAACTGGTGCAGGATACAAGAAAAATTTATCAATTATATGTCTTAACAAGGAAACGCTATCAACAACTTATGGCTCTGAAGAATTGGGAAGATTATCTCAATCAAGTGGATATATGCAAGAAATTAACCGTTGAAAATGATGTTGACTTAATCGGTCCACCAGAAAATTTTACATATATCAATCATTCAATTCCCGCCGCCGGCGTAACTATTCCGGATGAACCTCCCATCGGCTGTGAATGTGAATCATGTAATTGTCGCTCAAAGTCCTGTTGTGGAATGCAGGCTGGTTTATTTCCTTACACAGTCAAAAGAAGACTTCGTGTAGCGCCCGGAACACCAGTCTACGAGTGCAATAAAGCCTGCAAATGCTCATCAGATTGTAACAATCGCGTTGTACAAAGGGGACGTAATACTAAACTAACCATATTTCGTACATCCAACGGATGCGGATGGGGAGTCAGGACTGAACAGAAAATCTACCAGGGACAATTTTTATGCCAATATGTTGGAGAAGTCATTACTTTCGAAGAGGCAGAGAAACGTGGACGCGAATATGATGCTAATGGTTTAACTTATCTTTTCGATTTGGACTTTAACTCGGTTGAAAATCCGTATGTGGTGGATGCATGTAACCTAGGAAATGTAACTCACTTTATAAATCATTCTTGTGATCCTAATCTGGGGGTGTGGGCTGTATGGGCTGATTGTCTAGATCCAAATTTGCCGATGTTGGCTTTGTTTGCAACTCGTGATATTGAAGCGGGGGAGGAAATTTGTTTTGATTATTTACAAAAATCATTAGAAAACGAGGAAGAAACGAATACTTCTGTTGAAAATGTTGAAGAAGGCGATTCAAATTTACCCGATGCTGCAGAAGCTAGTACTGCTGTATCTCCCGTGTCGCCCGTTAAAACGAGATTTGAAATCCAACAACAGAATAGAGCAATGCTAAGAAATCTCACTGAGTGCAAATGTGGTTATGCGAATGCGAAAATCTACAAATGCGACAACCCTAAATGCCCTCGTCCCACGAGCTTTATATCAGGCGGTTCGTCCAAGGATGACAGCTTTCCGTGTCTGAGACCGGCATGTACTGGTCGCTTCCAACTCGTTCGCCACGTTAGCTTTGTGGACTGTCCCGGTCACGACATCCTTATGGCCACCATGCTTAACGGCGCAGCGGTCATGGACGCCGCCCTACTGCTTATTGCGGGTAACGAGTCCTGCCCCCAGCC
TCAAACCAGTGAGCACTTGGCCGCTATAGAGATTATGAAACTGAAGCACATACTTATACTTCAGAATAAAATTGATTTGGTAAAGGAAGGTCAAGCTAAGGAACAGCACGAGCAAATCGTTAAATTCGTTCAAGGAACCGTGGCGGAAGGTGCACCGATCATACCGATATCGGCTCAGTTGAAGTACAATATTGAGGTATTATGCGAGTACATAACAAAAAAGATACCTGTGCCGTTACGCGACTTTACTTCCCCGCCCAGAATGATCGTGATTCGGTCTTTCGATGTGAACAAGCCGGGATGTGAGGTCGATGATCTACGTGGAGGAGTCGCTGGCGGGTCCATACTACAGGGTGTGCTAACCGTCGGTATGGAAATTGAGGTTCGTCCAGGTCTAGTGAGTAAGGATGCGGACGGTAAGCTGACTTGTCGTCCGATATTTTCTCGTATCGTATCACTATTCGCTGAACAGAACGAGTTACAATACGCTGTTCCCGGTGGACTCATCGGTGTTGGAACTAAGATTGAGCCGACGTTGTGTCGAGCTGACCGTCTTGTTGGACAGGTACTTGGCGCAGTGGGAGCCCTACCTGGTATATTTGTCAAGCTTGAAGTGTCATACTATCTTCTGAAACGTCTTCTCGGTGTGCGTACGGAGGGCGACAAAAAGGCTGCCAAAGTACAGAAATTGGCAAAGAATGAGGCGTTATTGGTTAACATTGGATCTCTCAGTACTGGTGGGAGAGTTATCGCTACAAAGGCTGATTTGGCGAAAATAGCTCTTACGAGTCCCGTTTGCACCGAAATTGGAGAAAAAGTCGCACTCAGTAGAAGAGTTGAGAACCATTGGAGGTTAATCGGTTGGGGTCAGATACAAGGAGGAGAAACCATTGAGCCCGGAAAGAACTAA

Protein sequence:

>DPOGS203219-PA
MTSNEGRGQANLHQQDLSKLDVTKLSALSPEVISRQATINIGTIGHVAHGKSTVVKAISGVQTVRFKNELERNITIKLERLSDSVIRMYRERADERKERIFFEKMRKNKKRSLPPDPDVTPEKPPKKKQKKNKQENEEFIIERICGFKFQSGKELFHIKWKDYNESEATWEPAENLINCPEILHEFLSKEELKHADKIEKLKEEISFGNLLEDEYLIQRLDEVEDSELTKLKNDLIVKLLTMICLKQSDEHYASQLVQDTRKIYQLYVLTRKRYQQLMALKNWEDYLNQVDICKKLTVENDVDLIGPPENFTYINHSIPAAGVTIPDEPPIGCECESCNCRSKSCCGMQAGLFPYTVKRRLRVAPGTPVYECNKACKCSSDCNNRVVQRGRNTKLTIFRTSNGCGWGVRTEQKIYQGQFLCQYVGEVITFEEAEKRGREYDANGLTYLFDLDFNSVENPYVVDACNLGNVTHFINHSCDPNLGVWAVWADCLDPNLPMLALFATRDIEAGEEICFDYLQKSLENEEETNTSVENVEEGDSNLPDAAEASTAVSPVSPVKTRFEIQQQNRAMLRNLTECKCGYANAKIYKCDNPKCPRPTSFISGGSSKDDSFPCLRPACTGRFQLVRHVSFVDCPGHDILMATMLNGAAVMDAALLLIAGNESCPQPQTSEHLAAIEIMKLKHILILQNKIDLVKEGQAKEQHEQIVKFVQGTVAEGAPIIPISAQLKYNIEVLCEYITKKIPVPLRDFTSPPRMIVIRSFDVNKPGCEVDDLRGGVAGGSILQGVLTVGMEIEVRPGLVSKDADGKLTCRPIFSRIVSLFAEQNELQYAVPGGLIGVGTKIEPTLCRADRLVGQVLGAVGALPGIFVKLEVSYYLLKRLLGVRTEGDKKAAKVQKLAKNEALLVNIGSLSTGGRVIATKADLAKIALTSPVCTEIGEKVALSRRVENHWRLIGWGQIQGGETIEPGKN-
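
The lengths reported in the record can be cross-checked with simple arithmetic. The sketch below assumes the transcript is coding sequence only, that the trailing `-` in the protein sequence marks a single stop codon, and that the genomic coordinates are 1-based and inclusive. None of these conventions is stated in the record, and the function names are illustrative.

```python
TRANSCRIPT_BP = 2910                        # from the Transcript field
PROTEIN_AA = 969                            # from the Protein field
GENOMIC_START, GENOMIC_END = 943753, 950097  # from the Genomic position field


def cds_length(protein_aa: int, stop_codons: int = 1) -> int:
    """Nucleotides needed to encode the protein plus its stop codon(s)."""
    return 3 * (protein_aa + stop_codons)


def genomic_span(start: int, end: int) -> int:
    """Length of an inclusive, 1-based genomic interval (assumed convention)."""
    return end - start + 1


print(cds_length(PROTEIN_AA))                    # 2910
print(genomic_span(GENOMIC_START, GENOMIC_END))  # 6345
```

Under these assumptions the 969 aa protein plus one stop codon accounts for exactly the 2910 bp transcript, while the gene spans 6345 bp on the scaffold, a difference that would be consistent with intronic sequence.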