Monarch geneset OGS2.0

DPOGS201680
TranscriptDPOGS201680-TA4092 bp
ProteinDPOGS201680-PA1363 aa
Genomic positionDPSCF300103 + 507888-513733
RNAseq coverage348x (Rank: top 34%)
Annotation
HeliconiusHMEL0120700.070.09% 
BombyxBGIBMGA005382-TA0.079.25% 
DrosophilaKdm2-PB3e-0855.81% 
EBI UniRef50UniRef50_Q5W7N60.080.07%Cytosine-specific methyltransferase n=2 Tax=Obtectomera RepID=Q5W7N6_BOMMO
NCBI RefSeqNP_001036980.10.080.07%DNA cytosine-5 methyltransferase [Bombyx mori]
NCBI nr blastpgi|1129834300.080.07%DNA cytosine-5 methyltransferase [Bombyx mori]
NCBI nr blastxgi|1129834300.073.96%DNA cytosine-5 methyltransferase [Bombyx mori]
Group
Gene OntologyGO:00056340nucleus
GO:00038860DNA (cytosine-5-)-methyltransferase activity
GO:00901160C-5 methylation of cytosine
GO:00036771.6e-51DNA binding
GO:00063061.6e-51DNA methylation
GO:00082706.1e-14zinc ion binding
KEGG pathwaynvi:1001220290.0 
 K00558 (E2.1.1.37, DNMT, dcm)maps-> Cysteine and methionine metabolism
InterPro domain[1-1364] IPR0171980DNA (cytosine-5)-methyltransferase 1, eukaryote
[162-1361] IPR0015250C-5 cytosine methyltransferase
[162-283] IPR0227024.5e-26DNA (cytosine-5)-methyltransferase 1, replication foci domain
[389-432] IPR0028576.1e-14Zinc finger, CXXC-type
[503-631] IPR0010252e-12Bromo adjacent homology (BAH) domain
Orthology groupMCL11989 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201680-TA
ATGGACAATGCTACACATTCAACAACTCGAAAGTCCGACAGAAAAGCTTCAACAAGAGACGGTCAACTAAAAATTACAAGTATGTTTGCAAAAAAGAGAAGTAGAAGTCCAATTGAAAGTGCTGAGAAAGATGATACGAAAAAGCTAAAGATTAACACAGAGTCCCATGAACATGAAGTCTTTAATGAAAAATCAAAAGTAGTAAATGGAATAAATAGTAAATCAGACGACGTCAATTCTCAAGAAAGTGAAGAAACTTCACCAAACTTGGTTTCTATGAAATTAAACAACGAGAGAAGTCTTGTAGATGAAGATGAAAACCATAATATTGAAGCTAAAACTGTTCCTGAGATTATTAATACTATGAATGGTTGTAACCAAAACGGGGATGAAATGACTAGAGACGGCCTTGAAAACCAGGTCCAACAAAATGCTATAGTAGAACAAGATCCACCAGAACCAAAACCAACTGCTAAAATTCCAGATCAACATGGCCATTTATGTCCCATAGATGGAGGTCTTATTGAGAGTGATGTAAGGATATACATGTCGGGCTATCTTAAATCTATTTGTTCTGATTCACCTGATATTGATGAAGAATCTATAGCAGTCAAGGATGTCGGTCCCATCATAGAATGGTTTATTCATGGATTCGACGGTGGTTCGAGAAATTGTATTACACTGTCGACGGAATTTGGTGAATACAATCTACTCAAACCGAGCGCTGAATATACTCCCTTGATGGATAATTTGTATGAAAAAATATGGCTCAGTAAAGTAGTCGTTGAATATTTAGAAGAATACCACTATCTTCAGCCAACTTATGAGGATTTGTTAGAAGTTATAAGGGAACATTCCATACCAGATTTGGAGGACAAGAGGATGACCGAAGAAATGCTCCACAAACATGCTCAATTTGTTTGTGACCAAGTTGTTAGTTTAGAAGCCGATGAAGACAATGAACCCTTGATAACCCTTCCCTGTATGAGAGAGTTAATAAAATTAATGGGCATTAAATTCGGCAAGAGGAAAGTTCGCGCGAAAATCGACTACAAGAAGATTGACAAAAAAGCTTGGACCAAAGCAACAACGACTCCGCTCGTTCAGAAGACATTCGAGCATTTCTTTGCAAACCAATTAGATAAGACGAATCATGAGCTGGTGTTGAGAAGGAAACGATGTGGGGTTTGCGAAGCTTGCCAATTACCTGACTGTGGAGAGTGCAATGCTTGTAGGGCCATGTTAAAATTTGGTGGTCACGGCCGCACCAAAAAGGCGTGCGTCAGACGATTGTGTCCCAACATGGCGGTTCAACAGGCTGAAGATTCGGAGATAGAAGACGAAGAAGAATACCAACAGATGGCTGAAAAGCGACATCTCGATAAAATCGATGACGCTCTACCCGTTAAATTAACTGGCGGAAGTAATAAGATCATTAGATGGATCGGCGACCCTGTTAAGGCCGACGCTACTAAAGTTTACTACGAGAAAGTTGAAATTGACGGATCAGAACTGTCGCTAGGGGACTTCGTTATGGTCGAAACGTCACAATCGAATATCCCCGCGTTAGTAGCCAGAGTCACGTATATGTGGAAGGAGAGTATTAATCCTAAGTCGGGTTATTTCCATGCTGAAGTTTTCATTAGATCGTCTGACACCGTGTTGGGAGAGGTCGGTGACCCGCGAGAAGTGTTCTTGGGCGATAGATGCTGCCATGGCGCCCCTTTATCGTCTATATTGAGAAAAGCGTTCGTCGAAAAGAAAGAAACACCGGCTGATTGGTTCAAGCTCGGCGGGAAGGAAGTGGTCGACCACTTCTTTGAAGATGACGGCAAAACTTACTTTTATCAGAAATACTATGAAAGGTTCACGGCACGATTCGAAGATCTTCCGAACGATCCAGAATGTCCTAACGCATTACGAAAACACAGATTCTGTCCATCCTGTGAACGGAAGACGAGACGGGATGCTCGCGATATACCAAAAATATCTGGAAAACTAACAGAGAAGTCTGAAATTGTTAAAGAAGCTAACAGATTTGAATGGACGACTATCAGGTGGCGGGACCACGATTACAAGAAGGGCTGCGGAGTGTTCTTAAAACCTGGAACATTTAGATTCAAAAACTCCATGATCAATAGCAGTAATGGCATTAATAGGGTTAAGTTAGACAAAGTCGACGAGGATATATATCCAGAGTATTATAGGAAGACTGATAATTATTTGCGAGGCTCGAATATAGACACCGGCGAGCCGTTCTGCGTCGGTTATATAGCAGCGGTGACGGCGGCTAGCGAGGGGCCGCTGGTTATTCCACAGGATATCTACATTAAAGTCAACGTGATGTATCGGCCAGAGAACACCAACAACAGATTTCCGCATCACGAGGACGTCAATGTCGTGTATTGGAGCGACGAAATCAAGGAGATATCGTTTTCAGCCGTCGTAGGACCTTGTAATATATGTTATGTAGACAACATACCACAGCAAGATCACATCTACGACTGGTTAGAGAAGGACCCAAGTAGAGTATACTTCCGTATGGCATTTAACAAGAAATCCGGTCAAGTAGAGGATGTTCCGCAGCACGTTAAATATGTCGGTAGGGGTGATAAGGGTAAAGATAAAGGTAAAGGGAAAGGGAAGTCGAGCAAAGGCGCACAATCTACAGTCACGGTGAAAGTCGATGAAGTTAAGGTCAGGCCTTTAAGGACTTTGGACGTGTTCGCGGGTTGCGGCGGTTTATCTGAAGGCCTTCATCGATCAGGTGTCGCCGAGTGTCGTTGGGCCGTCGAAAATCTAGAAGCGGCCGCTCATGCTTATTCCATCAATAATAAAAACTGCATCGTGTTCAACGAAGATTGCAACGCCTTGCTGAAGGACGCAATGGATGGGGCGACTCACAGTGCGGGGGGATTGAGAATTCCGATGCAAGGCGAAGTGGAACTGCTCTGCGGTGGACCGCCGTGTCAAGGCTTCTCAGGGATGAACAGATTTAACTCGAGAGAATATTCCAACTTCAAAAACTCTTTAGTTGCATCGTATCTGTCGTTCTGCGATTTTTACAGGCCTAAATACTTCATCTTAGAGAATGTTAGGAATTTCGTCGCCTTCAAGAAGGGCATGGTTTTGAAATTGACTCTCAGAGCGTTGTTGGATATGGGATACCAATGCACGTTCGGTATCCTTCAGGCTGGGAATTATGGGGTACCGCAGACTCGTAGAAGACTCATTATACTAGCCGCGGCGCCGGGCTACAAGCTTCCTTTATATCCGGAACCCACGCACGTTTTCAGCAGGCGAGCTTGCTCATTAACAACCACCATAGACGGGAAGCGTTTCGTCACTAACATACAATGGGACGAATCCGCGCCGAGACGGACTTGCACCATCCAGGACGCTATGAGCGATCTACCGCAGATATGTAACGGTGCGAATAGAATAGAAATCGATTACGGCTGTATGCCAGAAACTTACTTCCAGAGACTTATTAGGAGCAGAGATGAGAGCGCCAAACTGCGGGATCACATATGTAAGAACATGGCGCCGCTTATACAGGCACGTATGAGTAGAATACCAACTACGCCGGGCTCTGATTGGAGAGATTTGCCAAATATATCCGTTGCACTATCTGATGGTACCAAATGCAAGGTGTTGCAATATCGTTACGACGACATCAAAAACGGTCGTTCCACCAGCGGTGCACTCCGCGGAGTCTGCGCCTGTTCCGCCGGTGGAGTGTGTTCCGTAGCCGACAAGCAAGAAAACACGCTCATACCGTGGTGTCTACCGCATACAGCCAACAGACATAACAATTGGGCCGGACTCTATGGGCGTATATCCTGGGACGGCTACTTCAGTACAACTGTGACGGACCCCGAGCCGATGGGCAAGCAAGGCCGCGTGCTCCACCCCGAGCAAAACCGCGTCGTTTCTGTTCGCGAGTGCGCTCGCTCGCAGGGATTCCCCGACACTTACCTATTCGCCGGCTCCATACAGGACAAACATCGACAGGTTGGCAACGCGGTGCCGCCACCTTTAGGAGCGGCTTTGGGCAGAGAAATCAAGAAAGCGTTGAGTGCCTTATCTTGA

Protein sequence:

>DPOGS201680-PA
MDNATHSTTRKSDRKASTRDGQLKITSMFAKKRSRSPIESAEKDDTKKLKINTESHEHEVFNEKSKVVNGINSKSDDVNSQESEETSPNLVSMKLNNERSLVDEDENHNIEAKTVPEIINTMNGCNQNGDEMTRDGLENQVQQNAIVEQDPPEPKPTAKIPDQHGHLCPIDGGLIESDVRIYMSGYLKSICSDSPDIDEESIAVKDVGPIIEWFIHGFDGGSRNCITLSTEFGEYNLLKPSAEYTPLMDNLYEKIWLSKVVVEYLEEYHYLQPTYEDLLEVIREHSIPDLEDKRMTEEMLHKHAQFVCDQVVSLEADEDNEPLITLPCMRELIKLMGIKFGKRKVRAKIDYKKIDKKAWTKATTTPLVQKTFEHFFANQLDKTNHELVLRRKRCGVCEACQLPDCGECNACRAMLKFGGHGRTKKACVRRLCPNMAVQQAEDSEIEDEEEYQQMAEKRHLDKIDDALPVKLTGGSNKIIRWIGDPVKADATKVYYEKVEIDGSELSLGDFVMVETSQSNIPALVARVTYMWKESINPKSGYFHAEVFIRSSDTVLGEVGDPREVFLGDRCCHGAPLSSILRKAFVEKKETPADWFKLGGKEVVDHFFEDDGKTYFYQKYYERFTARFEDLPNDPECPNALRKHRFCPSCERKTRRDARDIPKISGKLTEKSEIVKEANRFEWTTIRWRDHDYKKGCGVFLKPGTFRFKNSMINSSNGINRVKLDKVDEDIYPEYYRKTDNYLRGSNIDTGEPFCVGYIAAVTAASEGPLVIPQDIYIKVNVMYRPENTNNRFPHHEDVNVVYWSDEIKEISFSAVVGPCNICYVDNIPQQDHIYDWLEKDPSRVYFRMAFNKKSGQVEDVPQHVKYVGRGDKGKDKGKGKGKSSKGAQSTVTVKVDEVKVRPLRTLDVFAGCGGLSEGLHRSGVAECRWAVENLEAAAHAYSINNKNCIVFNEDCNALLKDAMDGATHSAGGLRIPMQGEVELLCGGPPCQGFSGMNRFNSREYSNFKNSLVASYLSFCDFYRPKYFILENVRNFVAFKKGMVLKLTLRALLDMGYQCTFGILQAGNYGVPQTRRRLIILAAAPGYKLPLYPEPTHVFSRRACSLTTTIDGKRFVTNIQWDESAPRRTCTIQDAMSDLPQICNGANRIEIDYGCMPETYFQRLIRSRDESAKLRDHICKNMAPLIQARMSRIPTTPGSDWRDLPNISVALSDGTKCKVLQYRYDDIKNGRSTSGALRGVCACSAGGVCSVADKQENTLIPWCLPHTANRHNNWAGLYGRISWDGYFSTTVTDPEPMGKQGRVLHPEQNRVVSVRECARSQGFPDTYLFAGSIQDKHRQVGNAVPPPLGAALGREIKKALSALS-