Monarch geneset OGS2.0

DPOGS200066
TranscriptDPOGS200066-TA5358 bp
ProteinDPOGS200066-PA1785 aa
Genomic positionDPSCF300044 - 847039-863246
RNAseq coverage448x (Rank: top 27%)
Annotation
HeliconiusHMEL0067550.076.83% 
BombyxBGIBMGA012543-TA0.060.43% 
Drosophilasba-PD1e-2334.91% 
EBI UniRef50UniRef50_E2A6472e-15635.24%Methyl-CpG-binding domain protein 5 n=3 Tax=Formicidae RepID=E2A647_CAMFO
NCBI RefSeqXP_392332.35e-15033.90%PREDICTED: similar to six-banded CG13598-PB, isoform B [Apis mellifera]
NCBI nr blastpgi|3071847929e-15635.24%Methyl-CpG-binding domain protein 5 [Camponotus floridanus]
NCBI nr blastxgi|3320223172e-17531.90%Methyl-CpG-binding domain protein 5 [Acromyrmex echinatior]
Group
KEGG pathwayptr:4581749e-06 
 K00558 (E2.1.1.37, DNMT, dcm)maps-> Cysteine and methionine metabolism
InterPro domain[1651-1714] IPR0003131.3e-06PWWP
Orthology groupMCL17879 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200066-TA
ATGTGTTCAGAAATAGAAAATGAAAATAGGAAAGGTGACTGGAAGATGAGCGATAGGAGAAGAATTATCCGTAGAAGGAAAAAGAGAAGTAGTCGTGAAAAAGATCCTCCGCATGTATCGAGTAAGCCTTACCAAGTGGCTGCTGGCGCGGAGCTGACAAAGCTTTGCAATCATAAGCGGAAACTTCTGGCCTCCCTGCAGTCGAGGGTCCAAAGTCCAGTAACTCCGCCACCGGCCATCGATCAGAAGAAAGCAGCGACCGCTGAAACTACTAAGAAGAAAATGAAGAAGCGTTCCGGTTTCATACCTAACATCAGCGTATCCCAAATGATGGTGCAAAGGGATAGACCCCTCAACGAGTTGAAGGCTGATAATGATCAGAAGCGGACTGCGTCCCCGGGAGCCATGTCCCAACGTCCCGGCAATATGCAGTCTTGCTTGCCGTATCAACAGCAAAACATGATGAATATGGTACAGGATGAGGTTAAACAAGAAACCCTTGTACAAAGAAGTTCACATTATCTGACTGTTAGCAGTGGATGGATAACACAACATCCTGACAGTATGGAATCTGGACACAAGCAGAATCAAATTATCGCAGGACCTGGAAATAATTCAGTGCATGTTGGCCTTCCAGTGCTAGGCACTAATGGGCAAGTTATTGGAGTGAATACCTCATATAACAAACACGTTACGAATATCAAGAATACTGTTAACCTCACACCGCAAAATGTAATGGAACATGAGCGTAAAGAAGATAAAGACATAAATCCCGGTAAACAATCATCGAAAGAAGACGACTCGCAAATATTCTACGGTTTGAATCAGCCCCTAACACCGGAGGTGATGCAAAAGATTAATGAACAGCAGCAAATATTTATACAAAGAAACCAGAAACAAATCGAAGTAATGAAGGGCTACAAAACCCAAATGTCAGATAATGTTACTCAAAACAAACCTCATAACCAAAATCAGTATGTCCCTCAGAACCAGTTAGTTATTCAAAATGGAACGGTAGTAAAAACAAATTCTGCAAAGACACCACCGTGGCAAGTGAAGAGGGTAGATAATACATCGAATGCCTTAATATGTCCCAGCCCTAAACAGCTAAAAATTGAGAATTCTGAATATGAAGATTCTAATCCCAATCCCATGGAGGATTCTTCGCCATCGGGTTCGTTTGCGAACAATGACCAGATGATGAACCCGGCTATGTGTTCCCGAGTTCCTCCTCTACCACAACATTACAACATGATGGGACAGCAATGGCCAAATGTTGATAATAAAAAGAAAGCTAGATGTAACAGCAAAACTTCTAAGAAGAAAGCCGCTGCCAATGACCCTAAGAATATGAATTTTGGTAACAATGGATTGAGACATATGCAAGACGTCAAAGGCAAGCCTTGTGAAGATATAATGAGAAATAACGTTCCTTCGTTCATGGAAGATCCTAGTGGTTATCTAGCACAACAAACAGTACTTTTGAATAACACTATATCGAGACAGGTAGGTGTGAATGTTCCATATGAAGCTAACCAGTTTGAGAACTCCGGAAATTCATATAACACTTTGAATCCAACGGACAAGAAAAATCTCCCGAAGCAAAGTGAACCCATGAATGTGTTTAGAAATAATGTTGTAGCTAATTCAACACCATCCCCAAACAACGCTCCTGATAGTAGTCTTATACCAGAGAAAGTCGTTGAAAATAATCTACAGTCATGTTGCAAGGGCTGCAATGTTTCCTACAAAGGCAACTGTTCCGATACTTCGGATAACCCGATGCGCTCAAAGTTATTAAAACAACATACTAATGTTTATGGTCTGGAAATGGACAGTGGGGTGGCTACAACGTCTAGTCCTAAACATGGATTCGAAGATAATCCAGTTACATCATCAACCTATATAGATCGTAGTATGATGTTAAATGATAATTGCGGTCCGATACAAGCCGGTATTGTTAGCACTAGCAATGTGTCACCGAGTGAGACTCTGCAACAACCGGAGCCTTCACCTACATTGTCTAATAGCTCCCGTAATACGGACACTCCTCATAGTAGTGGCAGCAACTCAATGCAAAACTGTACATTCCCTGTTCCTAGTCCAGCCTATTCCAATCCAGGTTCCAGCAGAGATAACTTTAATAGTAATCCTTCTCCTCCAGGCAGTAATCAGCCATATCCGTACAATACCGTAGGGCCACCCAATACAGAATCGGTATCCGGAAATCAGGTTATGCATTTCATATCAAATCACAATATCCACAAAAATCTAATGGAAGTGAATAACATATCAATGGTTGGTGTCAAACCAGGAATGAGGAGCCAAGAATATAGAATGGATGGCAAGAAACGTATTGAAAATACAATGCCCGGTTACTGTCCGCCCCCACATCTAGGCGGGGGTCCGGCACATTCCCTTATACAAACTTGTTACATACAAACAATTGTTACGACTATGGCTAGCGGTTTTTCCGTTACAAGGGATACTGTAACTTCAGTTCTAGCGGGAAAAGCTAATACTGCGACTACCTCTATAAATGCATCACAAGCAAATTTCATAAGAACTCCGCCGCCTCCAAGTGTCAATTTGGCGACGACCTATGCTATACCGAATAATCAGCCCGATCCCTTTATTAACTCTGTTAATTTGCCAACAAGTTATCCGCTGCACATCGCCGGTACAACCGCGCAAAATATGATATCAAAATCGCCTTTGGAGATGGTACAAAATGTTATAAGCAGTTTACCTTCCAAGCCGGAGGCGAGCAATTGCCAATCGACCATAGCGCAGAGAAGATCGTGTTCCAACTCATCCGGTCAGATACTTATTTCATCGACCGGTCAAATAATTGTGTCAAACAATCAAATGCCGCCACCTCCACCTAAAAATACAACCACGATGTCACCAATTGGTTCTAACGCTATAACAAATGTGACCACTTCAGTTACTCAAGTCGTACCAGCTGTTGGCAATGTTCAACCGATTGTCAATCAACCGACTGTCGTGGTAAATGCATTGCAAACTCCTTTCGTTATTCAACCTCCCATGATGGCTGTTGAGGGGCAAGTCGTCCAACCGAATACCGTCCTCCCTCAAATAGTAACTGGAGGCATCGTGTCTGGACCCAACGAAATTTCACGGCAAATTGATATGAAGAATGGACAAAACTTCGTCCAAGGTGTGGCCATGCTTTCTCCTGAAAGTTTAAAGAAGAGAAGTAAAAAGAAGAAAAATCAAACGGCGAATATAACGAACGTTCTGCAAATCACAGCACCGCAACAGAACCCAAATAATATCATGGTACACTCATCGCCACAACATAATTCTAGTCCACAGTTTTCGCCACGCGGTTTTCAATTGTCGCCAACAAACAATATATCGCCGACGCCGATGTTGCAAGCACTAACTATAGTACCCGGGAAGTCCGGTACTCCAGCACATATTGTAATGAACGGCCAAGGGAATTCAAATAACTTTGGATCTCAACAGATAATCACAAACACTTCACCATCACAACAAATTAACTTATTACAACCAGTCAATCTAATCAATAACGCCAGTAATGTGATGTCAAATTTCCCGGCGTTCCAGCAATTTATTCTGCCAAATCTCGGCGGTATGGTAATGACAGCTGATGGCACAGCGATCATTCAAGACAATTCAACAGGAATGCCTATGCAATTACAATTGCAAACGGTAAACGGACAAAACGTTTTAACGCCTGTGCAAAATCCTGGATTGTTTACCGCCGGGAATAACAGCGGTGTCGTTATAAGAGCACAAAACCAGCAAGGAAAAATAATACAATCGCAACACAGCCCTGGTGCTCAATTTTTGTCGCCGAACAGTCAAGTTATGGTCAACAGTCCCAACTTCAACGGACAGCTCAGCCCTTTACTTGCGAATCTTAGTCCAACTAACGTCGCGTTCAATAGTTCGCAAGTGCGAGCGGGGAACGTCCAAACACAGGAGTTCATCCAGACGAATCAAATGGGCCAAACATTAATGGTTCCGTTGTCCCCGAAACCAGTTCCCATATCCGCTAGTAACAGAAACTCCACTTTCGTCCAGAACACGACTATAGTTCAACAACAGACAACACTAGTTTCCAACTCTCATAATACTTCGATGAACATGAACAATACCTCTCGATTGAGCATCGATCCAAATGTGCTTCTTACTCAGAAGGTTGGGAAGAGGTTTCCTCAAGAAAATCAGGAAATAAAGGACAAGATCCTACCGTTTGATGTGAAGGAAGATAAAGAGGATAGCGAAGGTTACGCCCGTTTAATGCAGGAGTTGGGTGGTGTAAGACATTCCGTTTCGACTCAAACTTTGGGTCAAAGAGGGGGTAAATCCCCTAACACAGCCGGGGGTTCCCCACCAGATACTACGACACACAGTCCGTTGGGAGCCTTAGATTCCGCTCCAAGCCCAAGATTATATTCCGCAGCAACTCACGCTACCTACGCCGATACCACAACCAAATCACCCGAACCCGCGGATGTTCATTCTTCGGCGATGGTGCAATGTGTGTCCAGCAGTGAACAAGACATGGCTGAGTCAAGGGAACGAGCGTGGCCCGTGAGGAACGAAGAAACAGACTCACACAATCTAATGTCAAACATACAGCTTATGAGGTTAAACCGCCACAACTCTGTGGAGTCGAGCGAGATGTACCACAGCATGGCTGCGAACCAGCAAATGATGCAAATGCCGTACGAAAGACCTCACAGCTACAAACGGAAAAACGACTTCGGGGACGGCTCATCGTATAGGAACGACAAAATTATGAGGACCAATTATAACATGGTCCACATGCCCAAAGAATACGACATGAACCAGGACAAACAGAGGAGATTCTCGCCCGGCAGTTCGGAGAGCATGCAGAAAGTATTCGATAGACATTTAGGCTCGGAAGACGATACAGAGGATAATTACCGGAGATTCGAAGTCGGCGATCTGGTATGGGGGCCGGTTAAAGGTTATGTCTCGTGGCCGGGCAAGTTGGTGTCGCGTGTGTCGGACACCAGCTGGAGTGTCCGGTGGTTCGGTGGCAGGCCTACCAGCGAAGTCGAACACTCCAGACTGTTGACACTCTGCGAGGGTCTGGAGGCGCATCACGCGGCACGGATGAGACACAGGAAGAGCAGAAAACTAAATACGCTATTAGAGAACGCCATACAGGAGGCAATGGCTGACTTAGATAAGAAGACGGAAAATAACGACGCTGATGATTTACAAGAGGATGTATCTGATGTAACAGACACTACTAAAAGGAAAACACCCAAACTTAGGAAACACAAGAAAAACCCATCCAAAGTCGACGGAACCCGTTTGAGGAGCTCGCGATGA

Protein sequence:

>DPOGS200066-PA
MCSEIENENRKGDWKMSDRRRIIRRRKKRSSREKDPPHVSSKPYQVAAGAELTKLCNHKRKLLASLQSRVQSPVTPPPAIDQKKAATAETTKKKMKKRSGFIPNISVSQMMVQRDRPLNELKADNDQKRTASPGAMSQRPGNMQSCLPYQQQNMMNMVQDEVKQETLVQRSSHYLTVSSGWITQHPDSMESGHKQNQIIAGPGNNSVHVGLPVLGTNGQVIGVNTSYNKHVTNIKNTVNLTPQNVMEHERKEDKDINPGKQSSKEDDSQIFYGLNQPLTPEVMQKINEQQQIFIQRNQKQIEVMKGYKTQMSDNVTQNKPHNQNQYVPQNQLVIQNGTVVKTNSAKTPPWQVKRVDNTSNALICPSPKQLKIENSEYEDSNPNPMEDSSPSGSFANNDQMMNPAMCSRVPPLPQHYNMMGQQWPNVDNKKKARCNSKTSKKKAAANDPKNMNFGNNGLRHMQDVKGKPCEDIMRNNVPSFMEDPSGYLAQQTVLLNNTISRQVGVNVPYEANQFENSGNSYNTLNPTDKKNLPKQSEPMNVFRNNVVANSTPSPNNAPDSSLIPEKVVENNLQSCCKGCNVSYKGNCSDTSDNPMRSKLLKQHTNVYGLEMDSGVATTSSPKHGFEDNPVTSSTYIDRSMMLNDNCGPIQAGIVSTSNVSPSETLQQPEPSPTLSNSSRNTDTPHSSGSNSMQNCTFPVPSPAYSNPGSSRDNFNSNPSPPGSNQPYPYNTVGPPNTESVSGNQVMHFISNHNIHKNLMEVNNISMVGVKPGMRSQEYRMDGKKRIENTMPGYCPPPHLGGGPAHSLIQTCYIQTIVTTMASGFSVTRDTVTSVLAGKANTATTSINASQANFIRTPPPPSVNLATTYAIPNNQPDPFINSVNLPTSYPLHIAGTTAQNMISKSPLEMVQNVISSLPSKPEASNCQSTIAQRRSCSNSSGQILISSTGQIIVSNNQMPPPPPKNTTTMSPIGSNAITNVTTSVTQVVPAVGNVQPIVNQPTVVVNALQTPFVIQPPMMAVEGQVVQPNTVLPQIVTGGIVSGPNEISRQIDMKNGQNFVQGVAMLSPESLKKRSKKKKNQTANITNVLQITAPQQNPNNIMVHSSPQHNSSPQFSPRGFQLSPTNNISPTPMLQALTIVPGKSGTPAHIVMNGQGNSNNFGSQQIITNTSPSQQINLLQPVNLINNASNVMSNFPAFQQFILPNLGGMVMTADGTAIIQDNSTGMPMQLQLQTVNGQNVLTPVQNPGLFTAGNNSGVVIRAQNQQGKIIQSQHSPGAQFLSPNSQVMVNSPNFNGQLSPLLANLSPTNVAFNSSQVRAGNVQTQEFIQTNQMGQTLMVPLSPKPVPISASNRNSTFVQNTTIVQQQTTLVSNSHNTSMNMNNTSRLSIDPNVLLTQKVGKRFPQENQEIKDKILPFDVKEDKEDSEGYARLMQELGGVRHSVSTQTLGQRGGKSPNTAGGSPPDTTTHSPLGALDSAPSPRLYSAATHATYADTTTKSPEPADVHSSAMVQCVSSSEQDMAESRERAWPVRNEETDSHNLMSNIQLMRLNRHNSVESSEMYHSMAANQQMMQMPYERPHSYKRKNDFGDGSSYRNDKIMRTNYNMVHMPKEYDMNQDKQRRFSPGSSESMQKVFDRHLGSEDDTEDNYRRFEVGDLVWGPVKGYVSWPGKLVSRVSDTSWSVRWFGGRPTSEVEHSRLLTLCEGLEAHHAARMRHRKSRKLNTLLENAIQEAMADLDKKTENNDADDLQEDVSDVTDTTKRKTPKLRKHKKNPSKVDGTRLRSSR-