Monarch geneset OGS2.0

DPOGS201750
Transcript        DPOGS201750-TA  3492 bp
Protein           DPOGS201750-PA  1163 aa
Genomic position  DPSCF300279 - 79160-87982
RNAseq coverage   598x (Rank: top 21%)
Annotation
Heliconius      HMEL006708        0.0     77.83%
Bombyx          BGIBMGA002656-TA  0.0     71.85%
Drosophila      gpp-PD            1e-117  76.23%
EBI UniRef50    UniRef50_D6W734   0.0     55.69%  Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6W734_TRICA
NCBI RefSeq     XP_001813884.1    0.0     55.69%  PREDICTED: similar to histone h3 methyltransferase [Tribolium castaneum]
NCBI nr blastp  gi|189233748      0.0     55.69%  PREDICTED: similar to histone h3 methyltransferase [Tribolium castaneum]
NCBI nr blastx  gi|242019952      0.0     48.49%  histone H3 methyltransferase, putative [Pediculus humanus corporis]
Group
Gene Ontology    GO:0018024           2.3e-79  histone-lysine N-methyltransferase activity
KEGG pathway     tca:660658           0.0
                 K11427 (DOT1L, DOT1) maps-> Lysine degradation
InterPro domain  [33-235] IPR013110   2.3e-79  Histone methylation DOT1
Orthology group  MCL14478             Single-copy universal gene

Nucleotide sequence:

>DPOGS201750-TA
ATGGACAATGTCAAGGAAAAAGGAACTTCACTGCCAGCTCAGCGTTTAAACAAATATCCCTCACGTGGTCTTCTAAAACATATTTTACAACAAACCTACAATCAAGCCGTCTCAGATCCTGACAAACTTAACCAATATGAACCGTTCTCCCCTGAGGTATACGGAGAAACGTCGTACGAACTAGTCTGTCAGATGATAGATCAGATAAATATATCAGCTGAGGATGTGTTCGTTGACTTGGGATCGGGCGTGGGGCAGGTGGTGCTGCAGATGGCCGCCGCTACACCGTGCCGCATCTGTTTCGGTGTCGAGAAAGCTGAAGTGCCTAGCAAATATGCTGAGAGTATGGATTTACATTTTAGAATGTGGATGAGATGGTATGGGAAAAAGTATGGAGAATATAAATTAATAAAAGGCGATTTCCTGATGGATGAGCATAGAGAGAAAATTAATTCCGCCACCATTGTGTTCGTCAATAATTTTGCATTCGGTCCCCATGTGGACCATCAATTGAAAGAGAGGTTTGCCGACCTCAAGGATGGAGCTAAGATTGTCTCGTCTAAGAGCTTCTGCCCTCTCAACTTTAGGATAACGGATAGAAATCTGAGCGACATTGGGACAATCATGCACGTTAGCGAAATGTCGCCGCTCAAAGGCTCCGTGTCTTGGACCGGCAAACCAGTATCATACTATTTACATATAATAGATAGGACCAAATTGGAGAGATACTTTCAAAGACTGAAAAATCCGAAACTCAAGAACTGCCAGGACGAGGCCGAGGGAAAGAGGAACGTGAGCGCGCCTCCAACAAGACAACCCACACCGGACCTGTTGAACGGCAACAGCAACCACAGCACCGGCTCGCTACCAGAGAGACGAAGGAAGGTCGCCCGTCCGAGACCCATCAGGGGCGAAGCCGGACGGACTCGAGCGCGTGTGGCGGCGGCGGCCAAGAGGCGCGTCGTGAGACGCTCCAGCGACGAGAGTGAGGAAGAGTCCAGCGGCACCGACGAGCCGCCGGGACCCGAGCCCGCGCCGAGAGAGTGGGGGGCGCCCTGGGCCTCCTCGCCGCACTCCAATCGGAACCGGCGCACAACTAACAAACGCAGCGCTAGCGCCGGCGGTGCTCGTCGTCGTGTCGCTCGCGCCAAGCGTCGCCGCGCCGCGCCCGCTGCCATCGCCGGCTTGGATCTGCTCCACTCCACCACCCTGGCGTCCACGCTGCACGCGGGCGCGGTGACGGCGCCGCCCCCGGGCTGCGTGGAGCAGCGTCTGTCGGCGCTGGGCGTGCAGCTGCAGCCGGCAGACCGTCTGCACTCGGAGCTGGACATACCCCGTGCTCCTCACGCGCCCTACTCCCTGCAACTGCTGTTGGACATGTTCCGCGATCAGTACCTCGCCTTCATCCGCCGCATGAACACACCGGACTACGCACTCGAGATTCGCGCGCAAATCGACAAAGAGAAGGAAAGAAATCAAAAACTGAAGTCTCGCGCATCTCAGTTGGATAAACAGATCAACGTATTAATTAGCGACAGCGTGGCTTTGTTGAAGGCACGGATGAGCGAGTTGGGGATACACGCGAATAATACGGTCGACCTGCTGGCCAAGGCGAAAGAGATTGTGGGGCGACACAAAGAGTTACAGAGCAAAGCAAGCAAATTACAGGCTCAGGTGAATAATATAGAGGCTGAACAAGCGATGCTTGTGAAACAACGAACTTTTGAAATTACAGAGAAGTATAGGCAGCTGGGACATATACCCCCAGATGTCGAGATCACGCAGAGTATAGCTCACGATTGTATATTGAAGGAGATCTCGGCGACGTTGGCGCACAGGAAACGTCTCCATGCACAAGTTGGACACTTGCAGAACGAGATCATTCAGATGGAACGAGCCAGCGAACAACAGAAAGTGGCGGCTCCTACTGTACCCGTGGCGACCGTCAAACCTCACACTGTCAATTCAAAACCAAGAAAGTCGAGGGAACATCGGTCACG
TTCGCAGGAATGGCCGGATGTGCCGGATGTGGGGAAGATCGAAGAACAAAATCCAGAGATCCTGGCCCAAAAGATATTAGAAACTGGCAGACAGATTGAGGCTGGGAAGATTGCTAAACCGAATGTTATTGTAAATGGGTATATAAGAGATCCTGAAAGACATGTTGATCATAGGCAGGGTCGGGCGGTCGCTCGAGCCGCGCCGCCGTTGGCGCGACACTCGCCCGTCAAGTCTCAGAAACCGCTCAACGTCGTGGCCAAGGTACAGGAGTCACCGAAGGTCATCAACTTCGAAGACCGACTCAAGAGCATCATCACATCCGTCCTCAACGAGGACCAGGAACAGAGGAAGGCGTCGCGGCTAGAGCCTCGTCAAGCAGTAGCGAATCAAGCGTACGCTAACGGGTACGTCCGACCCGTAGCGGTGACGTCCAGCACGCTCGGCGCGTACGCAGCGAGGCCGGCCGTCTCCGCCGCGCCGCCAGCCGTGTCGCCATACGCGCGTGGCCTGCGGGATGCGAGGGACGTGCGGGACGCGAGGGACGCTCGTGACGTACGGGACGTTCGTGACACACGGGACGCGAGGGACGTACGGGACATGCGGCGGGAACGCTTCGGCTTCGATCGTCGGGAGCCGCGGCCGCACCCGCATCACGAAGCACGGCCGCCGGACATGCGTCATCATCACTCCGCGCAGCCTGATTACACACAGGTGTCTCCGGCGAAATTGGCACTGCGTCGGCATCTATCACAAGAGAGGCTGGCGGCGCCGGGGGCTCGCACCATCGGCGACCTCGTCAACGGAGAGATCGAGCGCACGCTAGAGATATCAAATCAGAGCATCATCAACGCCGCCGTCCACATGAGCGCGCGCTACCACGAACCTGCGGCCGCGCACCAGCCGCTTGAGGGTCTAGCCGCCTGTTTACAAGCACGCGTTCTAGCGTCGGAGTACTGGCGCGGGCGGAACGGCCCCGCGCCCGCCGGCGAGCGGCAGGAGGAGGCCGGCCGGCGGCGCTCGCCCGCGCCCGACCCTCACTCCAACACCTCCACGCCACTCGTCGACGAGCCGCCGGAGCCCCGGCCGGCCGCGCCGGATGCCGAGGCCGGCGAGGAGGTCGAGGAGAGCAAGTGGCAAGATCGGATAGCGTTTCGCTTCGATCAGATCATATCGTTCGCTTCCACCGCCATGGACGACAAGCGGCGGCGGTCCGACGAGGCGTGTAACACCTCACCCGACTCCGGCATAGGTCACGGCGAAGCGGCCCGCGGCGCCGCAGACGGCGTCACGACCGCGCCGGCGGGAGAGACGGGCGGGGGAGACGCCACGGGCGCGCCGCCCGCGGCCGAGCCGGCGCCGGTCCGCCGCTCGCCGTCCCCGCCGGCGCCGCACCACTTCAAAAAGCGTTTCTTCCGCCGCGAACGATGGGGCGCGTGGTCGTCGGGCACGCCGCCGCCGGCCGCCGTGGAGTGGGAGCGTCCGCCGATGTGA

Protein sequence:

>DPOGS201750-PA
MDNVKEKGTSLPAQRLNKYPSRGLLKHILQQTYNQAVSDPDKLNQYEPFSPEVYGETSYELVCQMIDQINISAEDVFVDLGSGVGQVVLQMAAATPCRICFGVEKAEVPSKYAESMDLHFRMWMRWYGKKYGEYKLIKGDFLMDEHREKINSATIVFVNNFAFGPHVDHQLKERFADLKDGAKIVSSKSFCPLNFRITDRNLSDIGTIMHVSEMSPLKGSVSWTGKPVSYYLHIIDRTKLERYFQRLKNPKLKNCQDEAEGKRNVSAPPTRQPTPDLLNGNSNHSTGSLPERRRKVARPRPIRGEAGRTRARVAAAAKRRVVRRSSDESEEESSGTDEPPGPEPAPREWGAPWASSPHSNRNRRTTNKRSASAGGARRRVARAKRRRAAPAAIAGLDLLHSTTLASTLHAGAVTAPPPGCVEQRLSALGVQLQPADRLHSELDIPRAPHAPYSLQLLLDMFRDQYLAFIRRMNTPDYALEIRAQIDKEKERNQKLKSRASQLDKQINVLISDSVALLKARMSELGIHANNTVDLLAKAKEIVGRHKELQSKASKLQAQVNNIEAEQAMLVKQRTFEITEKYRQLGHIPPDVEITQSIAHDCILKEISATLAHRKRLHAQVGHLQNEIIQMERASEQQKVAAPTVPVATVKPHTVNSKPRKSREHRSRSQEWPDVPDVGKIEEQNPEILAQKILETGRQIEAGKIAKPNVIVNGYIRDPERHVDHRQGRAVARAAPPLARHSPVKSQKPLNVVAKVQESPKVINFEDRLKSIITSVLNEDQEQRKASRLEPRQAVANQAYANGYVRPVAVTSSTLGAYAARPAVSAAPPAVSPYARGLRDARDVRDARDARDVRDVRDTRDARDVRDMRRERFGFDRREPRPHPHHEARPPDMRHHHSAQPDYTQVSPAKLALRRHLSQERLAAPGARTIGDLVNGEIERTLEISNQSIINAAVHMSARYHEPAAAHQPLEGLAACLQARVLASEYWRGRNGPAPAGERQEEAGRRRSPAPDPHSNTSTPLVDEPPEPRPAAPDAEAGEEVEESKWQDRIAFRFDQIISFASTAMDDKRRRSDEACNTSPDSGIGHGEAARGAADGVTTAPAGETGGGDATGAPPAAEPAPVRRSPSPPAPHHFKKRFFRRERWGAWSSGTPPPAAVEWERPPM-
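
Consistency check (illustrative sketch):

The transcript and protein lengths given in this record agree if the 3492 bp count includes the terminal stop codon: (1163 + 1) x 3 = 3492. The protein sequence writes that stop as a trailing "-". The Python sketch below shows one way to check this relationship. It assumes the standard nuclear genetic code and a coding sequence made up only of A, C, G and T. The function names `translate` and `read_fasta_sequence` are hypothetical helpers written for this example and are not part of any database tooling. Only the first ten and the last five codons of DPOGS201750-TA are used here, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def read_fasta_sequence(text):
    """Join all non-header, non-blank lines of a single-record FASTA string."""
    return "".join(
        line.strip()
        for line in text.splitlines()
        if line.strip() and not line.startswith(">")
    )


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` ("-" by default, matching the
    convention used in the protein sequence of this record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on non-ACGT bases
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 nt and last 15 nt of DPOGS201750-TA, copied from the record.
first_codons = "ATGGACAATGTCAAGGAAAAAGGAACTTCA"
last_codons = "CGTCCGCCGATGTGA"

print(translate(first_codons))  # MDNVKEKGTS
print(translate(last_codons))   # RPPM-

# Length bookkeeping: 1163 residues plus one stop codon account for 3492 nt.
print((1163 + 1) * 3 == 3492)   # True
```

To check the full record, the complete nucleotide FASTA entry can be passed through `read_fasta_sequence` and then `translate`, and the result compared with the DPOGS201750-PA sequence.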