Monarch geneset OGS2.0

DPOGS208774
TranscriptDPOGS208774-TA3615 bp
ProteinDPOGS208774-PA1204 aa
Genomic positionDPSCF300036 - 866833-875792
RNAseq coverage339x (Rank: top 34%)
Annotation
HeliconiusHMEL0154240.065.73% 
BombyxBGIBMGA007949-TA0.046.32% 
DrosophilaG9a-PA3e-8334.69% 
EBI UniRef50UniRef50_E0VU861e-9938.58%Histone-lysine N-methyltransferase, H3 lysine-9 specific, putative n=1 Tax=Pediculus humanus corporis RepID=E0VU86_PEDHC
NCBI RefSeqXP_002429680.12e-10038.58%histone-lysine N-methyltransferase, H3 lysine-9 specific, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3272665122e-9930.83%PREDICTED: histone-lysine N-methyltransferase EHMT2-like [Anolis carolinensis]
NCBI nr blastxgi|2420184332e-9338.58%histone-lysine N-methyltransferase, H3 lysine-9 specific, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00055152.8e-28protein binding
GO:00056342.1e-18nucleus
GO:00082702.1e-18zinc ion binding
GO:00349682.1e-18histone lysine methylation
GO:00180242.1e-18histone-lysine N-methyltransferase activity
KEGG pathwayphu:Phum_PHUM4478106e-100 
 K11420 (EHMT)maps-> Lysine degradation
InterPro domain[737-908] IPR0206836.6e-52Ankyrin repeat-containing domain
[1048-1179] IPR0012142.8e-28SET domain
[929-1026] IPR0036062.1e-18Pre-SET zinc-binding sub-group
[931-1035] IPR0077282.3e-16Pre-SET domain
Orthology groupMCL10368 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208774-TA
ATGCTTACCGACAGCGAAAACCCCGTCACCAAGCAAGATGATGATAACGACGCTTCGCCTGACAACCTTACTGTGAGGGTCGATAAAATTGAAGTCGTAAAAAAACCCGATAAAGAGGCAGATGCAGAAGATCCGAAACCACGAATAGTCCTTACATTTCGTTCAGAAAAGTCAGGTGCAAGGAGCAGCAACATGAAGATTGTATCCACAGAGGAGAAACATGAAGATATATCCCCTAGACGTTCAATCAGGCGACGCAATACTATAAACTATAATTATGTAAAGGAATCAGATGATGACAATATAAGTCACGCAGATTCAGATGATGATGAGCCTATTGAAAGTATGACACACAAACGTTCAACGAGGCGTAGAAGTAAAGATTTTTCTGATGTCATAGCCAATGCAATAGCAAGAAAAGAGAAATCTTACAATGAAAGTAGCTCAGTTCCAACTCAAAGATTGTCTCGCAGAATTAAACCAACTGCTAAGATATTGGCGAATGAGGAACTGAGGATGGGTTTAGAGTCACAAAATAATGCCCGTCTTGGAATATCAACTGAGAAAACAACAGAAGAGGGTGTGAGGACAAGGAGATCAGCTCAAGTCAGGAATTCTGAGAGTGTAACAGAAAAAAGATCATCTAAACGTAAAATCCATGAAGATAGTACATTTGAAAGTAAAGACGGTGATAACTCAAATAAAAAGTTAATGCACATGGGAACATTGGGGCTGAAGATAGCGAAAGAGGAGGACTCCAGCGAAGCTGAGAACAGAAAAACAGAAAGTTCAGCACACGACGGGGAAGACGAGGAGATCGATGACGACACAGAGGTGATCAGTCAACTGCTGCAGGCTGATGAGGAGAGTGCCAGCGATGAAGACTTCTGTCCCGATACATCTAAGAGGAGACGATCGCGAAGAAATTGTTCACCTGCTCCTCTCCGTCGTTCGTCTCGTAAAGCCAACCTCGGTCTTTACAACTATGACGCGTATCAGTTTGACGACATCATTGATAGTGACTTCGAACCCAAGAGAAAAACTACTCGAACACATACAGAAGAGCCTCCGTCGGAAGATGAGCCGGAAACAAAAATTGCAAGTGAGGCTGTAGGCGGTGAGGAAGAGGAATCAGAACCAGCTGCTCCATCAGAGGCGGCCACCGTCGTCGCCACCTGTCTCTGTGAGGAAACTAGCAACGTATACGCAGCGCCCGCAGACCTTACAGAGCCGGTATTCTGTCAGGCTATCGAGATGGTGGAGGGTGTGCGTGTGGGGTGCTCACACCGGGCGGCCCGGGCTCCCGGGGGCGAGCTGCTGGCCCTCCGACGGCCGGGTCTGAGGGCTCCCTACTTCCTCGCCTGTAAGCTGCACGCCGCACAGCTTGCCAAACACATGTGCTGTCCAACCTGCGGCCTGTTCTGTACACAGGGTATTTTCTACCAGTGCTCCAAGGACCACCTGTTCCACGTGGAGTGTGGTATAGGCGGCGAGGCGAGACAGCGCGCGGGCTGTCCTCACTGCGGAGTGCTGTCTCATAGGTGGCAGCCTCTCAACACGGACTACGGGCGGGTCAGGATCGACATGCACTGCAGTAACAAAAGAGTCTTCCTGCCCGACCAGAGGGAACAGTGCACCCCAGCGTTCCTCGGCTTCTCTTCACTCGACCCCGCTCTACTGGACCCCGAGCCGACGTTCCCAGACGACTTACTGCCTTTGATACCCGACGTGAAGAAACTCATAGAAGCGGCGGACGACGAAGACCGCGACCACTGCACGGCGCAGAATATATACGACTTAATCATGACTGAAAACGACGCGGAACAAGTACTGACCAAGATAGTGCGTTGTGATAACATAAACGAGTGTGTGCCCGAGGCGTCCGGCGGCACCCTGGCGCACGCGGCGGCGGTCCGCGGCCGGCTGGCGCCGCTGAGTGTGCTCAGGGCGCGGGGGGCCGACCTCGACGCCGCCGACTCCTCCTGCAGGACACCGCTCATGAGGGCCATTCAAGCACTACTAGACAAAGAACATTCGGAGGAAACGGAATTCGAAGGCAACGAAGCGGAAGTGTCGGTTAAGAAAGAAGATGAAGTTGTTGCAAATGACGAAGACAAGGTAAAGACGGAGACCGAAGACAAAGAAGACGCCGAGCACGAGCTCAAGGACGGCCAAGAAGTCCCCGAGGACCCCAGCAGACCGGCCGACGACGAGCTCCTTAGTGTTATCAAATACTTAATAGCAGCTGGCTGTGACGTCAACAAACAGGGTCCGGAGGGCATGTCGGGCCTGCACATGTCGTGTCAGTACGGCGGCGCGGCCGTGTGCCTTATGTTACTGGAGGCAGGCGCTGCGGTCGACGCCAGGGACCACGGCGGGTGGACGCCGCTCGTGCGCGCCGCCGAGAACAAACACGCCGCCGTCGTCAGGTTACTGCTGGCAGCCGGGGCGGACGCCGCGTCTTGTGACAACGAAGGCAACCAGCCCATACACTGGTGTACACTGGCGGGCGACTCGCGCTGCCTCGCCATGATACTGAGGGCCGCGCCGCACGCCACCAACGCTCCTAACGCTCACACTGACACACCGCTTCACATCGCCGCTCGCGAGGGTCACTACTCGAGTGTGGTCGTGCTGCTCGCCCACGGAGCCAGGACGGATATAGAGAACTCGTCCGGAGAACTTCCGGTGGAGGTGTGCAGCGGTCCGTGCCACGAGGCCATCTCTATGAACATGCAAATGACACTCGCCGTCAAAGACACTATGACACGGGTGAAGGTCATTACGAGTGACCTGTCCAACGGCCGCGAGCCGTACCCCGTGAGTGTGGTCAACGAGGTGGACGACGCCTCGCCCGCCGCCTTCACGTACGTGTCACAGCATGTGCTCACTGAACACCTCACCATAGACAACACCATAGAGACCATGCAGGGCTGCGAGTGTGCGGGTGGGTCGTGCGACGGCGAGTGCGGCTGCTGCGTGCTGTCCGTGCGGCGTTGGTACCGCGCCGGCCGCCTGCCGCCCGCCTTCCCCCACCACGACCCGCCCGTCATGTTCGAGTGTAACTACACGTGCGGCTGTAACATGAAACGGTGCACAAACCGCGTGGTGGGTCGGATGGAGAGCGCGGGGTCGCTGAACACCCCGGTGCAGGTGTTCAGGACCAGGACGCGCGGCTGGGGACTGAGGGTGCTGACCAGGGTGAGCCGGGGGGAGCTGCTGGCCCTGTACCGGGGGGAACTCGTCACCAGCGAGCGAGCCGACGCGCGGACCGACGATCAGTACATGTTCGCCTTGGACCTGAAGCCCGACCTACTGGAGCAATGCAGTGACAAGACGCTGCTGTGTGTGGACGCGTGTCGCTTCGGTAGCGCGGCTCGGTTCATGAACCACAGCTGCCGGCCGTCCGCGGCGCCCGTGAGGGTGTTCACCTCGGGCCGCGATCTGCGCCTGCCGCACGTCGCCTTCTTCGCTCTCAGAGACCTCGCGCCCGGCGACGAGCTCACTTTCGACTACGGAGACAAATTTTGGTCAGTGAAGTCGAAATGGATGAAATGCGAGTGCGAGTCGCCCGACTGCAGATACCCGACCAAGATGGAGGAGGCTGATACATAG

Protein sequence:

>DPOGS208774-PA
MLTDSENPVTKQDDDNDASPDNLTVRVDKIEVVKKPDKEADAEDPKPRIVLTFRSEKSGARSSNMKIVSTEEKHEDISPRRSIRRRNTINYNYVKESDDDNISHADSDDDEPIESMTHKRSTRRRSKDFSDVIANAIARKEKSYNESSSVPTQRLSRRIKPTAKILANEELRMGLESQNNARLGISTEKTTEEGVRTRRSAQVRNSESVTEKRSSKRKIHEDSTFESKDGDNSNKKLMHMGTLGLKIAKEEDSSEAENRKTESSAHDGEDEEIDDDTEVISQLLQADEESASDEDFCPDTSKRRRSRRNCSPAPLRRSSRKANLGLYNYDAYQFDDIIDSDFEPKRKTTRTHTEEPPSEDEPETKIASEAVGGEEEESEPAAPSEAATVVATCLCEETSNVYAAPADLTEPVFCQAIEMVEGVRVGCSHRAARAPGGELLALRRPGLRAPYFLACKLHAAQLAKHMCCPTCGLFCTQGIFYQCSKDHLFHVECGIGGEARQRAGCPHCGVLSHRWQPLNTDYGRVRIDMHCSNKRVFLPDQREQCTPAFLGFSSLDPALLDPEPTFPDDLLPLIPDVKKLIEAADDEDRDHCTAQNIYDLIMTENDAEQVLTKIVRCDNINECVPEASGGTLAHAAAVRGRLAPLSVLRARGADLDAADSSCRTPLMRAIQALLDKEHSEETEFEGNEAEVSVKKEDEVVANDEDKVKTETEDKEDAEHELKDGQEVPEDPSRPADDELLSVIKYLIAAGCDVNKQGPEGMSGLHMSCQYGGAAVCLMLLEAGAAVDARDHGGWTPLVRAAENKHAAVVRLLLAAGADAASCDNEGNQPIHWCTLAGDSRCLAMILRAAPHATNAPNAHTDTPLHIAAREGHYSSVVVLLAHGARTDIENSSGELPVEVCSGPCHEAISMNMQMTLAVKDTMTRVKVITSDLSNGREPYPVSVVNEVDDASPAAFTYVSQHVLTEHLTIDNTIETMQGCECAGGSCDGECGCCVLSVRRWYRAGRLPPAFPHHDPPVMFECNYTCGCNMKRCTNRVVGRMESAGSLNTPVQVFRTRTRGWGLRVLTRVSRGELLALYRGELVTSERADARTDDQYMFALDLKPDLLEQCSDKTLLCVDACRFGSAARFMNHSCRPSAAPVRVFTSGRDLRLPHVAFFALRDLAPGDELTFDYGDKFWSVKSKWMKCECESPDCRYPTKMEEADT-