Monarch geneset OGS2.0

DPOGS211967
Transcript: DPOGS211967-TA, 2055 bp
Protein: DPOGS211967-PA, 684 aa
Genomic position: DPSCF300011 + 1149897-1152030
RNAseq coverage: 517x (Rank: top 24%)
Annotation
Heliconius: HMEL017733, 0.0, 73.69%
Bombyx: BGIBMGA000919-TA, 0.0, 70.18%
Drosophila: Hmt4-20-PA, 8e-107, 51.28%
EBI UniRef50: UniRef50_D2A161, 2e-140, 59.82%, Putative uncharacterized protein GLEAN_08337 n=1 Tax=Tribolium castaneum RepID=D2A161_TRICA
NCBI RefSeq: XP_974904.1, 4e-141, 59.82%, PREDICTED: similar to Suv4-20 CG13363-PA [Tribolium castaneum]
NCBI nr blastp: gi|91081535, 9e-140, 59.82%, PREDICTED: similar to Suv4-20 CG13363-PA [Tribolium castaneum]
NCBI nr blastx: gi|91081535, 2e-153, 45.76%, PREDICTED: similar to Suv4-20 CG13363-PA [Tribolium castaneum]
Group
Gene Ontology: GO:0005515, 2.2e-12, protein binding
KEGG pathway: tca:663776, 1e-140
 K11429 (SUV420H) maps-> Lysine degradation
InterPro domain: [125-241] IPR001214, 2.2e-12, SET domain
Orthology group: MCL16646, Insect specific

Nucleotide sequence:

>DPOGS211967-TA
ATGGTTGTTGGATCCGCGAGTTATCCAGGGAGGCAAGTGCTCCACAAAATGCAGCCCATGGGAATGACCCCGCGCGAGTTGTCGGAGTACGACGACCTGGCGACGGCTCTCATAATCGACCCTTATCTCGGAATAACCACTCACAAAATGAACATCAGGTACAGACCGTTGAAAACGAACAAAGAGGAACTGAAGAATATCATTAAAGAATTCCTCCAGACTCAGGACTACAACAAGGCTTACTCCAAACTGGCTAATGGTGAATGGATGCCGAGACACTTCAGCAAAAATAAACACCAGCAAAACAAACTAAGAGAACACATTTACCGTTATCTTAGAATTTTTGACAAGAAGGCTGGTTTTGTTATAGAACCTTGTTATAGATATTCATTAGAAGGAAGAATTGGAGCTAAAATTTCAACTACAAAAAAATTTTTTAAGCATGAGAGAATCGACTTCCTAGTGGGTTGTATCGCTGAGATGACCGAGGAAGAGGAGAAGCAACTTCTTCATCCAGGGAAAAATGACTTTTCAGTTATGTATAGTTGTAGAAAAAATTGTGCCCAGCTGTGGTTGGGACCCGCTGCTTATATAAATCATGACTGCAGGCCCACCTGTACCTTTGAAGCAACAGACCGGGGAAAGGCATTTGTACGAGTGTTGAGGGATATAGAAGTTGGGGAAGAGATAACCTGCTTCTACGGAGAAGACTTCTTTGGCAATGGGAATTGTTACTGTGAATGTGAGACATGTGAGAGACGAGGGAAGGGGGCATTTTCAGTACAGAATGCTCACAATGACGAGCAGGCCACGAGGTACAGGTTTAGAGAAACTGATAATAGAATAAATAGGACTAAAGCAAAGCAAATTCAAAAACCTGTGAACAGTAAAAACTCTGAAAAGTCAATAGCGCCCGCAGGACAAGTCTCTACTATAGTGTCTCCATTGAGTATGAAGGAAATGAAGCAGAAGGGTTTGACGAAGTATGATGCGGAGTTGTTAATAGCACAAGGTTGCATAGCGGACATCATGGACGGGAACGGTAAGAAGAACGCGCAAGGGAGCCGGGAGTCATCGGCGTCCAGCCGCGGGGAGCGTCTCCGGCGGAGAGCGGACAGCAGTCGTCCTGTCAACGGCACCTCCACACACACACCCAGCGCCAGTCAGGCTGCTCAGTCGCGATGTTGTAGCGTCACAAGCTCGTGTAGCAGTCGCGACTCGCATTCTGGAATAGTTTTAAGAAGTCACAGGCGTCTCACCGAGTCCAGTGTCCCCGCTGTCTGTTCGAAAGTAAGGAATTCTTCGAAAGCCACAACCGAAACCAAAATTTGCCGAAATCACAGCACGCCAAAAACAGAACCAGATTTACCAGAGCCGGAAGTGGATACACAGCCGGTGAAAATGGAACCTCCGCAGGAACACGAGGAGGTCACGCACGACCCGGAAACACACATAGAAACTAGGAGTGGCGATGAAGCTGCAATCACGGGCGAGGAATGCCACAAGAATGAATCTCCGCTGCAAGCAACAGAGACACCTCCCCGAGGAAATGTAAAGACGGACACCGAGGACACAGAGCCCGCGGAGAAGCGGCTCGTAGAGAGCAAGTGTGTGGGCGAAGACGCGTGCATCAGCGAGAGTTGTGACTTTAGAGAAAACGTGAACCCGAGCGAGGCGAAGGAGAGTAAGGCAGTGAAACAGAAAGCGGATGTGGCAAGGAGCGACGAGAACAAGAAGGAGTGCGAGGGCCAATGGCTGGCGGACAAGTCCAACTGCGGCGGAGAGTGTCCCTGCACCCCGCCCAGGAGGGGCCTGAAGTTGACGCTCAGGGTGAAGAGGAGCCCCGTGGTGGAGGAAGAGGTGCCCGAGTACGAGGTGCTGCGGCTGGAGGGCGTCGACCCCGACACGGCCCGCCGCCTCAAGAAGAGACGCCGCTCCAAGGAACGACGGAAACACAGCCCCGTCCGCCCGCTGCCTCCCATGAAGCGACTCAGGTTGAT
CTTCGGCAACGAGAGCCGCACCATCGACCTCCCGCCCGCCCTCACGGCAGACTGA

Protein sequence:

>DPOGS211967-PA
MVVGSASYPGRQVLHKMQPMGMTPRELSEYDDLATALIIDPYLGITTHKMNIRYRPLKTNKEELKNIIKEFLQTQDYNKAYSKLANGEWMPRHFSKNKHQQNKLREHIYRYLRIFDKKAGFVIEPCYRYSLEGRIGAKISTTKKFFKHERIDFLVGCIAEMTEEEEKQLLHPGKNDFSVMYSCRKNCAQLWLGPAAYINHDCRPTCTFEATDRGKAFVRVLRDIEVGEEITCFYGEDFFGNGNCYCECETCERRGKGAFSVQNAHNDEQATRYRFRETDNRINRTKAKQIQKPVNSKNSEKSIAPAGQVSTIVSPLSMKEMKQKGLTKYDAELLIAQGCIADIMDGNGKKNAQGSRESSASSRGERLRRRADSSRPVNGTSTHTPSASQAAQSRCCSVTSSCSSRDSHSGIVLRSHRRLTESSVPAVCSKVRNSSKATTETKICRNHSTPKTEPDLPEPEVDTQPVKMEPPQEHEEVTHDPETHIETRSGDEAAITGEECHKNESPLQATETPPRGNVKTDTEDTEPAEKRLVESKCVGEDACISESCDFRENVNPSEAKESKAVKQKADVARSDENKKECEGQWLADKSNCGGECPCTPPRRGLKLTLRVKRSPVVEEEVPEYEVLRLEGVDPDTARRLKKRRRSKERRKHSPVRPLPPMKRLRLIFGNESRTIDLPPALTAD-