Monarch geneset OGS2.0

DPOGS209930
TranscriptDPOGS209930-TA6675 bp
ProteinDPOGS209930-PA2224 aa
Genomic positionDPSCF300180 + 101309-111976
RNAseq coverage843x (Rank: top 15%)
Annotation
HeliconiusHMEL0167510.078.90% 
BombyxBGIBMGA004310-TA0.065.92% 
DrosophilaCG9007-PA4e-3740.27% 
EBI UniRef50UniRef50_F4W8D63e-8943.02%Histone-lysine N-methyltransferase MLL5 n=3 Tax=Formicidae RepID=F4W8D6_ACREC
NCBI RefSeqXP_001847142.12e-8840.40%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|3287848251e-8943.60%PREDICTED: hypothetical protein LOC100577280 [Apis mellifera]
NCBI nr blastxgi|3838514532e-9829.11%PREDICTED: uncharacterized protein LOC100875107 [Megachile rotundata]
Group
Gene OntologyGO:00055152.4e-18protein binding
GO:00082706.5e-07zinc ion binding
KEGG pathway 
InterPro domain[947-1077] IPR0012142.4e-18SET domain
[721-796] IPR0110111.9e-15Zinc finger, FYVE/PHD-type
[723-782] IPR0130834.2e-12Zinc finger, RING/FYVE/PHD-type
[734-779] IPR0197872.8e-09Zinc finger, PHD-finger
[734-778] IPR0019656.5e-07Zinc finger, PHD-type
Orthology groupMCL22065 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209930-TA
ATGTCGGCGACATCAGAATATGAACCATCGGTCCGGCAAGGTTACGGGCCGACCACCTCCAGTGATAACATTTATGAGCATGTGTTGGATCACAAATTAATTGATACAATGGATATATCTAGTGTAAGAACCCCAGTACTGAGGGCCGTCACTCTGTCTCCCACCCAGGATCCCGAACCCGATTCCACTAACCCTGAAAGCATAATACAATCTATACATAAAGATAAAAAAGATATTGAATATCACATTACTAAGAGGTGTGAAACTGAAATGTCATTAGACCCCCTTCCTAGAATTGTAAGCATTGAAGAAACTACTTATGATAATGTCGGACATATAGCTTACCATAGCATACAAGAAACAGACGAAGTAGGCAATGGTAATTATGTAGAGAACGCTACACTAATACCTGCCCCTCTGAATGCTGCTATTGTCCAAAATGTGACAAAACTCCCCCAAAACTTTACAATTAATGTAGATGCAGCAGTTGGCAATATTACGGCCATACAGAATGTGTCTCAAGATGTAACGGGTAACCAAACTCTATGGCTGCCAACGATCCTCGCTACAAGTGCTACTACGGATAGTAAAAGTGAAGAGGAGACGCCGCCGGGCGTGTCGCAAATAATCATTACAAGTGAGAGCTATGTTAATGATGTCAGTCACACCGGACGGAGAACTAATATCATAACAGAAACAAACTATATAAATCGACCGAAGACATCTAAGGTTCAGATCTTAAGTAATATATCGCTACCTAAAAATTCTAACTACTCCCAACAATATATAGCCCAAGGCAAGGAGGTGAACGCTCCTGTATACGGAACCCAGAATCATGTGTATAAACTGAATACAAATGTTGTATCTCAGAAACATTTAAACAGCACTATGCTAAGCACAAAACCTCCGAAAAGCCAATCTCTAATAGGTGCTACTCTATCTCCGAGTGTTATAAATAATAGTCCAATTAAAAACGTGCCATACGGTCACACATACTCCAAAAGCACAAACGTAAATAAGGTAAACAATGGTGCTAATGTCAACACTATGGTAACTGCAGCGACCGGTAACACTCAATGCCACATTTTATCCAGAGTTGTGTCCGGACCTAACAAAATATCTGTACATTCTGGTCGAAAATCTGTAAATACATTCAAAGGGTCTAAAAACTCAACAACCAGTCAGAAGAATCAGTCTAGGTCTATAAAAATCATTCAGCAGACTGGCAGCGCTCATAAGTCTGAAAATAAATCGTGGTCGGCGAATAACACGACGTACAAGAACCAAGGCACAAGTTACGGGGTTATAGAATCTAATACTTCCAAAGTTATACAGAAAGTTGGAAGTCCCACGCATAAAAACCAGCAGACCATGACTCTACAGAAGGTTCCTAAAGGCGAGCGTCTAGTGTTGCAGTCGCCTTGCGGGCCGGTCCTATTGTCCACGGCTCCATTGAGTAGTAGTTTACCCAAAGGCCCCCACTACGTTCAGTCAGGTTCTACTCCCAATCTCAGATATGTTCAAACCTACGGTCCCGACAACCAGCTCTCTACAGTTTCCCAAGTGTCGGCTAACCAACAGTTGACTGCTCAGATTTTACAGTCGCTATCTCAGCCCAAACTCATATTACAGAGTCCTACACAGATTCAACCAGTGAAAAATAATATAATACAACCACCCCTTGTGGAGGAACCAGCAGATATAAAACCTGTTAATCAAAAGAGGATTGTATTTGGTGACAAATCTACGCTTCTGACAGCCGATGATATCATCGGTATGGAGGAAAAGCCAAACCTGTCGGAGGAATTGCGTCGATATTCATATCAGCCGCTAGCATTTGTGATGCTGGATCATACATATGCCTTGCCAGCTCAAAAACAACCGACAGTCAGCACTCCGACAACAACAATCTCATCAGTGTCACCAGTGATCTCGTCGCCGGCGACAACAACAGCACCGATGAGTCCTATGAAGTCATCGCCGATCGCTACAGTATCCCAAGAAACTGCAGCGCCTATACCGACAAGCGCTATCGTAGCTCCGACGACGACAGCGCAGTCGTCTCTCACATACAAGTCTTCCATCCAAGATGACGACACGGCATCTGTGATATCTTCTATAGAAGGTGAGAGGCGACCTCCCGCCCCGGGTGGCAGTGACACAGAAACCGCTCCCGAAGGAGAGGAAGAGGGAAAAACGAGGTGCATCTGCGACTTCACCCACGACGACGGTTATATGATATGCTGCGATCGATGCGGCGAGTGGCAGCACGTGGATTGCATGGGCATCGACAGGAACAACATACCCGACGCATACATGTGTGAGCTCTGTCAACCGAGGACTATCGACCGGCGACACGCTAGGGCCATCCAACTGAGAAAGAGAGAGGAATTAAGCGCTCTGGGGGCATCGGATTCCGATTCGTCTGAATGTAGTCGACCGCCAGGACAGAGAAGGAAACGTCTACTGACCGTTACCACGTACACTAACACCAGCGGATCCTGTGTCACGACGTACAACTCTAATTTGCCTGTATTGCCGCCATTACCTCAACCCACAGTATCGCCTCTGCCGAAACGGGGCCCGAAACGTCCCAAGAAAGCTGAAGTTGTGAGGAAGTGTACGAAGAGAAAACTGACGGAGAAGAGAGTCAAGAGGAAGAAAGAGATGTTATTAAACAGGAGCAAGTATAATTCCACTATACCCTCGGGCCAATCACACTGGCGGGACCTGTACGAGCTGGCTATGACAAATCATTATAGTCCGGAATTAAGAGCTAAAATCATGAAATACAGCAGCAAACTCGGAAGCACGCCGAATATGGCTTCGGCTATTACAGCGCATTTATGTACTACGGTACCACATGCGGGCGGGAAAATACTCATCGCCACAAAAGAACTGAAAGAAAATACTCCTGTTATAGAATTACGAGGCAAGTACATGCTTTCAAATCAACACAGGCCTCAACTGCAAAACACTGCTCGGGCGGGAAGCCAAAAACCGGGTCCCTTTGTGTTCTTCTACAGGTTACCGAAAGATAATACACAAATATGTATCGACACTCGGACCTATGGGAACGAGGCGAGATTCGTTCGTAGATCCTGTAAACCGAATGCCGAATTGCAACACTGCATAGTTAAAGGAGCGTTGCACGTCTACTTGGTGTCCATTACTGACATACCGTCGAATACTGAAATAACCGTCGGTCATGACACGAACGGCAGCAAACAGCCTTGCGCCTGCGGGAATCCCAAGCACTGCAAGGTGAACGGGTTGAGTGTGATCGTGCCTCGCAAAAGCTTGGACTACCCTCAGAGAGAGAAAAGTAAAAGGAGCAGATGTTACAGTTCCTCGTCACCGGTCTCTCCGCCGCCAGCTCCGGTTTTGCCCACAGTGAAGGATATTCCAGCGCCATTTTCACCAAAATGTGAGAAGAAGTCTCCTCTCAAATATGAGTATCCACCGATGTCCCCTGTGAAAGAACCGCCTGTATTGTCGTTAGATTATAATCCGGCACCGTTGAAGTTTGACCTGTTCCAAGACTTCAAACAGGAAAATCAAGAGTTCGATCTCACCGCCAAAGAAGAGATGAAGTCTGATTTGGACGAGCCAGATTTTATCAAAATCGAGCCTAAAGTTGAAGAGCCTTTACTTGAACCGGAAGAGGAACCCGAACCGGAAACGATTCCAGTTCCGGACGAACCGGAGCCTGAACCTGTTACAGAGGAAAGCCTGGAAACCAAAGAGGAAATGCTTCTGTCGCCACCGCCGCCTCCACCGCCGCCCGAAGAACCAGAACCAGAACCGGAGCCGGAACCAGAACCAGAACTGGATCCTGAACCGGAACCGGAACCAGAAGTTGATAGAAAAGACGAGGAACAGGCGACAGAATTTAAAAGCGAAGCTGAAAGACCTGTGACCAGGGAATTGGCAGCTAAGTCAGCATGTCACGACAGGTCGGCACGGTCTAGCCGCACCACGTGCGTCAACCAAGACTCGCTGGACTACAAGACCGACGACTCACAGGACAAGCTCCCCACCAAGACCAGTAAAGATAAAGACAAACGGAAAATGACTCGGGAAGAACGCAAGATGGAAGCGATAATGAAGGCTTTTGAGAGAATGGAAAAGGCACAGCAGCGGAAGCAGGAAGTGAAAGAAAGACAGAAGAGAAGAGAGTCAGACCCACATCCTCACACTGAGAAAGAGGAAGAAGAGGACTTGAACTGCCTCTCCAAGAAGAGAAAGAAACGTAAAGGACGCGCCCGGACAGCGTCTCAGTCGAATAGACGCAGGTTGAACTCCGCTGATAGTGATATGGTGACGTCAGGAGACGAGGCACAGGCTATGTCGCCCAGAGCACCGTCCAGGCAGGACCAGTCTCTCCCGTCCGCAGACACAGATAGGCCACACGAAGCCGCAGAGGATCTCGGACTGAGTCCCGCGTGTCTGCTGGTGGAGGCGGCCGTGGGTTCGGTGGAATCGGCCTTCAAACTTCCAAAGACGAAGAAGACTATGGCGACTGAATGGGTCGGCAGGTCTCCCGAGAGGACGCCTTCGCCGTACCAATCGCCTTACAGACCGGCCTTAGTGTCAGCGCCCTCGCTGGAGAGTCTCGTCCGAGTGGCGTCCACAATGATAGGAGACCTCAGTAGGACCCCTGACTACCAGGACGACGAGCATCACTCGCCGCCGCGGACACCGGGCAGAGACAGGAACAGACCTCCGAAGAAGGCGAGGCGGATAACGAGGAGCAACCCCGCGGAAGTCACCGAAGTGCTTCCGGTGCAACATAGTGCCAAGAAGAGATGGCTGCGGCAAGCCATCAGCGAGGAAAGCGACTCCCCGAACGTCGAATCGCCGCCAAACGAAATGGTCACACCCTTGAAAAAGAGGCGCATGGCGAGGGAATCTCTCTCCTGCGAACAGAACCCGATCGTGCCTTGCAACGATGAGACGTCGCCGATATTGACATCGGAGGACTCCCCAGTAAAGGACGACTCGCTATGTTTAGCGCGTCAGTACAAGAGGAACATCATGGACATGTACAGTCGGGACAGGACGCGCTCGGACAGCGGGCAGGGGTCGGACGACCAGTGTAACATAGACCACGACGTCCTCAACGTCAACATCAAGGGAGCCCACGACACCGAACACATCAGGCGAATCATAGGAGTCCCCACGCCCGAGGACGAGGCGCCGCCCGAGATCACCGGCTCGAGCTCACCGGTCAACAACAACAACAACCTGGAGCTGGAGGAGAGTCTGTACGATAAAGTCGTACCCATGGACATAGACACCACCGTCATCACGCAGACCAAGATAGAGTCCAACGACCTGCTGGCGGACATGAAGGGCATCGAGAGCCCCAACGACAAGGCGAACAGCTCCCAGTCCGCAGACACCAGCGGCAACTCCTCGCCGCAGAGAGACGAGATGGACGACATACAGAAAAAGATACACTCCTTCCACACCGAGAACATCATGATACTGAAGAGCAGGAACAAGAAACCGCCCAAGGAAAAGAGGAAGAAGGTCAACCTGAACTTCGACCTCAACATGGTGGACGACCGGGTCAGCATACAGTTGCGGACCGACGACGACTCCAGCTCCAAGTCCTCGGAGCTCAACGGAGACGGCCACGACGACCCTGACGACAAGCTGCACGTGGCGCCAGAGAACATCCCGCTGCCGCCCGTGGAGTCCATCCCGCTGCCCGCGCCAGAGACCATCCCGTTGCCCGAGGAACCCGCCCCGCTGGCGCTCATCTCCTCCCCCGAGACCATCCCGCTGCCCGAGGAGACCATGAAGCCTCTCAAACCCGTCAACGCCATCCCCCACACGGAGAGAGCCGACCGGAGAGAAAAGGAGCCCGCCGACACGATGTCCTTCTCCACCAGGTTCAATTCCACCGGCCTGTTCTCGGGGATATTCAGCAACATCGCGCACAAGTATAGAGTGGACGGCTGCATCAACGAAAACATACCCAACATGTCGATCATTAAGAGTGCTATAGACAGGACTACAAGTTTAGATAATAGTTTGTTCGACAAGGAGAGTGCGGGTCAGGTGGACGAGCTGAAGAGTGTACAGGAGATCCTCACCCGCGTTAACAATATGGACTCTAATAACAGTGTGTTGTTGTCGGGGGTGCTTCGGGGCTCGGGGGTGGCGGGCGGCGCGGCAGGGAGCGGCTCCACCCCCCTCCGATACGGAGCGAGGGTCAACGACCCGCGCCTGCATCCGCCACCGCAGGACAAACCGAAACCTGTCAGGAGAAAGCTGTCTATATCAGAGTACCGCCTGCGACATAAATGTGTGGAGGGGAGCGAGTGGGGCGGTGGGGAGGCCGGGGAGGAGGAGGGCTCGCGCTCCTCCGCCTCGCTCTCCCCGCAGAGACTGGACGCGGCCGCGGCGCTGGCGGCCGACGACCTGGAGCAGAGGCTGCACAGGGACCTGGCCGCGCACCAACCCAAGGGCGTGTTCGACGCTCAGCCGACGGCGTCTGAACGACAGAGAGAGAACCTCAGCTCGCGGCTGCGAAGAGAGTTCGGTCTGGCGCTGCCCGAGGAGGAACACCACGAACACGTTAATAAGTCGAGCGTGTGTGATGTAACAACGAATAGCCAGCCCAGTGATGATTTCTATATTCGCTCGTTTTCTATAGAGGCCTAA

Protein sequence:

>DPOGS209930-PA
MSATSEYEPSVRQGYGPTTSSDNIYEHVLDHKLIDTMDISSVRTPVLRAVTLSPTQDPEPDSTNPESIIQSIHKDKKDIEYHITKRCETEMSLDPLPRIVSIEETTYDNVGHIAYHSIQETDEVGNGNYVENATLIPAPLNAAIVQNVTKLPQNFTINVDAAVGNITAIQNVSQDVTGNQTLWLPTILATSATTDSKSEEETPPGVSQIIITSESYVNDVSHTGRRTNIITETNYINRPKTSKVQILSNISLPKNSNYSQQYIAQGKEVNAPVYGTQNHVYKLNTNVVSQKHLNSTMLSTKPPKSQSLIGATLSPSVINNSPIKNVPYGHTYSKSTNVNKVNNGANVNTMVTAATGNTQCHILSRVVSGPNKISVHSGRKSVNTFKGSKNSTTSQKNQSRSIKIIQQTGSAHKSENKSWSANNTTYKNQGTSYGVIESNTSKVIQKVGSPTHKNQQTMTLQKVPKGERLVLQSPCGPVLLSTAPLSSSLPKGPHYVQSGSTPNLRYVQTYGPDNQLSTVSQVSANQQLTAQILQSLSQPKLILQSPTQIQPVKNNIIQPPLVEEPADIKPVNQKRIVFGDKSTLLTADDIIGMEEKPNLSEELRRYSYQPLAFVMLDHTYALPAQKQPTVSTPTTTISSVSPVISSPATTTAPMSPMKSSPIATVSQETAAPIPTSAIVAPTTTAQSSLTYKSSIQDDDTASVISSIEGERRPPAPGGSDTETAPEGEEEGKTRCICDFTHDDGYMICCDRCGEWQHVDCMGIDRNNIPDAYMCELCQPRTIDRRHARAIQLRKREELSALGASDSDSSECSRPPGQRRKRLLTVTTYTNTSGSCVTTYNSNLPVLPPLPQPTVSPLPKRGPKRPKKAEVVRKCTKRKLTEKRVKRKKEMLLNRSKYNSTIPSGQSHWRDLYELAMTNHYSPELRAKIMKYSSKLGSTPNMASAITAHLCTTVPHAGGKILIATKELKENTPVIELRGKYMLSNQHRPQLQNTARAGSQKPGPFVFFYRLPKDNTQICIDTRTYGNEARFVRRSCKPNAELQHCIVKGALHVYLVSITDIPSNTEITVGHDTNGSKQPCACGNPKHCKVNGLSVIVPRKSLDYPQREKSKRSRCYSSSSPVSPPPAPVLPTVKDIPAPFSPKCEKKSPLKYEYPPMSPVKEPPVLSLDYNPAPLKFDLFQDFKQENQEFDLTAKEEMKSDLDEPDFIKIEPKVEEPLLEPEEEPEPETIPVPDEPEPEPVTEESLETKEEMLLSPPPPPPPPEEPEPEPEPEPEPELDPEPEPEPEVDRKDEEQATEFKSEAERPVTRELAAKSACHDRSARSSRTTCVNQDSLDYKTDDSQDKLPTKTSKDKDKRKMTREERKMEAIMKAFERMEKAQQRKQEVKERQKRRESDPHPHTEKEEEEDLNCLSKKRKKRKGRARTASQSNRRRLNSADSDMVTSGDEAQAMSPRAPSRQDQSLPSADTDRPHEAAEDLGLSPACLLVEAAVGSVESAFKLPKTKKTMATEWVGRSPERTPSPYQSPYRPALVSAPSLESLVRVASTMIGDLSRTPDYQDDEHHSPPRTPGRDRNRPPKKARRITRSNPAEVTEVLPVQHSAKKRWLRQAISEESDSPNVESPPNEMVTPLKKRRMARESLSCEQNPIVPCNDETSPILTSEDSPVKDDSLCLARQYKRNIMDMYSRDRTRSDSGQGSDDQCNIDHDVLNVNIKGAHDTEHIRRIIGVPTPEDEAPPEITGSSSPVNNNNNLELEESLYDKVVPMDIDTTVITQTKIESNDLLADMKGIESPNDKANSSQSADTSGNSSPQRDEMDDIQKKIHSFHTENIMILKSRNKKPPKEKRKKVNLNFDLNMVDDRVSIQLRTDDDSSSKSSELNGDGHDDPDDKLHVAPENIPLPPVESIPLPAPETIPLPEEPAPLALISSPETIPLPEETMKPLKPVNAIPHTERADRREKEPADTMSFSTRFNSTGLFSGIFSNIAHKYRVDGCINENIPNMSIIKSAIDRTTSLDNSLFDKESAGQVDELKSVQEILTRVNNMDSNNSVLLSGVLRGSGVAGGAAGSGSTPLRYGARVNDPRLHPPPQDKPKPVRRKLSISEYRLRHKCVEGSEWGGGEAGEEEGSRSSASLSPQRLDAAAALAADDLEQRLHRDLAAHQPKGVFDAQPTASERQRENLSSRLRREFGLALPEEEHHEHVNKSSVCDVTTNSQPSDDFYIRSFSIEA-