Monarch geneset OGS2.0

DPOGS210338
TranscriptDPOGS210338-TA4410 bp
ProteinDPOGS210338-PA1469 aa
Genomic positionDPSCF300025 - 390268-400714
RNAseq coverage428x (Rank: top 29%)
Annotation
HeliconiusHMEL0072550.073.16% 
BombyxBGIBMGA011979-TA0.068.50% 
DrosophilaSmg1-PA5e-1122.33% 
EBI UniRef50UniRef50_E2C5450.035.36%Serine/threonine-protein kinase SMG1 n=3 Tax=Formicidae RepID=E2C545_HARSA
NCBI RefSeqXP_001122895.11e-18034.93%PREDICTED: similar to PI-3-kinase-related kinase SMG-1 [Apis mellifera]
NCBI nr blastpgi|3071946940.035.36%Serine/threonine-protein kinase SMG1 [Harpegnathos saltator]
NCBI nr blastxgi|3071946940.035.25%Serine/threonine-protein kinase SMG1 [Harpegnathos saltator]
Group
Gene OntologyGO:00167722.5e-06transferase activity, transferring phosphorus-containing groups
GO:00167733e-05phosphotransferase activity, alcohol group as acceptor
KEGG pathway 
InterPro domain[126-173] IPR0110092.5e-06Protein kinase-like domain
Orthology groupMCL18911 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210338-TA
ATGGAACTTCACTTGCATCTGACTGCTGTTTCTCGTGCTTTGGACTACAAATGGGGAACACCTGTGGACCTGGAAACTTCCCAGGACATGAGGACGTGTTCTCCGGCCGTCATAGATTTACCAACGACCACGTTCTCCTCCGAAGAATATTTAAGGAGCATAAGAAGGAAAACATCTGAATGTGCACCAAGAAAACCGAACCTCCTAAGAAAGGACACGTTCGCTAATATTTCAAGTTCGAGATTATCCTGTGAAGAAATTATCTGTGACGTTTTGGATCAAACCATTTACTTCTCGGACGACACTGTAGATTCAAAGTATGATTTAGATAGCTCTGGATTTATCGACAATTACACTAAAAATAATCTTCGGAACAAAGACATCGAATACTTTATGAAAGAAAAAACTAAGGGTATATTCCGCCTGGCGTGTGAGCACGTAGTCCGCACCATGCGTCGCGGCCGGGAGACCCTCCTCACACTGCTGGAGGCGTTCGTGTACGACCCTCTGGTGGAGTGGGGTGGTGGGCGTCGTAAGCGCGGAGCCCGCCACGTGCGGGCAGCCAAGGCCATGCTTGCGGTTCGCGTGAGGGAATTGAAACACTCCGCCACCGATATCACAAATCAGCTGTTGGCCGCACTACCTGAAGTAAAGCACTGTGCGGATAAGTGGTTAGAAGAAAACGAACAGTTACATACTGTGGAGGAAAAACTTGACATGTGTCATAAACAGATGGCCCTCATCAAAGAGGTCGAGGCGTGCGGCAGCAACCTCAGCGACCATCCTCTGTATGCCATCTCACAGAAATATACGTCACATAAACAGGCCAAAAACGCGGTCGAGGACTCGATGAAAGCTCTCGTAAAAATCCTCAACGACTTCGATACTCAGATAGAAAGTTTTGCAAACACTAATGACATATTGAACGGGCCGCAGTTGATGACCTGGGTGCAAGAGTTCTCGGGGCCTCATTACGACGACGAGAAACCCATATTCGAACCCATAAAAGAGTTTTTAACCAACGCCGGAAAAGGTTCCATGATATCACAATTCGAACAGGCGGAAGTAGAGTTGAACCAGACCATGCAGCAGACAAACCTACTGGTGAGGTCCTGCCTCGAGCTGTTGACACAGTACGTGGCCGTCTCTCAGTACTATCCACAGAGTCGGACGGAATACCACCGCTTAGTGATGTTCCGCAAGTACCTGGCGACAGCTTTAGAAAGTAAATCCCCGGAGGTCTGTCGGGAGGTGGCCGGCCAAGTCACGGCTCTGGTGGAGTCGGACAACGTCACCGGTGACTCGCAGCAAATTATCGCTTACAACTATCGTTTACACCAAGTCAACGCGGAGGCGAACGCGTACCTCAACAAATGCCACGAAAGATTGCAACTAGAGGGCGGGCCCGACGCCATCGCTCTGGCACAAGACAGCTACATGGAAGCGAAAAATAATATAAACAACTGGCTGCGAACGGAAGAGGGCGCCGGAGCGGCTCTGGAGAACGCCGCTATCGGGATGCTGTGTAATTTGAACAGGAGATATCTAATGTTGGAGAACGGCGCACAGAGCGCAGGAGACTGCCTACTCGACCTAACGTCCAGGGACGGGGAATGGTTCCTGGACGACATGAACTCGCTGTCGATGCAGGCCATCGAGTTGCTGACTCTGCTGCCCTTGCCTTCGACCACGGCGGACGACGCCGCCCTGTCAGTGGCCGTGGAATGTGTCAAGAATGCTAATCTTCTAATAGCTGACTTGGTTCAACTGAACTACAACTTTAGTACAATCATTTTACCAGAGGCCGTTAAGAAAGTGCACTCCGAGGATCCCTCGACCTTGCACATGATAAACGAACTGAACGCGCTCATCACCAACACGCCCGTGACCCTCAACGACCTGCTCACACAACTCGAAATGCACTTCCGCTATCTGGTGATGGACATGGAGTCCCCAGCTGCCGGTGCACAAATACTAGCGGCCGAGCTCCGTAGTCGCTACGAGGAGCTCTTGTCTCCGTCGGAGGGCGAGGCCCCGGGACAAACCCCAGGCCGCATGTTGCTGATGGGGTTCAGCGGACTGTTCGCGGCCGTGGAGCTCCGAGCGAGGGAGCTGGCCGACCACCTCGCCGCCCCGCTGCAGCCCGCCTGGAGGAAGATCGACCAGATCAATGACGCCATGCACATGTCGGCTGCCATGCAGAATCCCGCGCTGCGCTCCGTGCTGGAGGATATCTTCACGGTGCGGCGCGTGCAGACCGTGGCGGAGGTGTTCTCTCTGTGCCTGCAGCTCGCCCGCGCCTTCCGCGGCTCTCCCCCGCCCGCCGCGCCTCCTCCGCCCCCCGCCGCCTCACAGCCCCTCCTGGACGACGCCGCGCTGTGTAAACCAGTCAGACGTTTTACGGCGGAGTACGTGTCCCGCTGTTTCCTGGGCGTCCACTCCCTGTCTCTGGGCCGCGCGCTGTGCCTCCTGCTGCGGCGGGCGCGGCTCGATCTGCGCGCCGAGGTCGAGCAGAAGGAGATCGGCGCCTCCTGGAGCGTGTCCCTGGAGTCTCTGGTGGAGAAGGCCTGCGGGTCCGTGTCGAGTCGCGCGGGGTCCCTCGCCGGAGCCTTGCAGGCGTCCCGGGCCCGCACGCTCCGCGCCGCCGCCGCCGTGCGCGCCCTGGACCGTGCCCGCGCCTCGGCTCGCGCCGCCCGTCTGCGCGCCGCCGCCCACGCCAGTCTACACGCTGAAATCGTGTCCGGAGCCCCCGAACCAGGCCCGGCGTTGTTCCGCGCGGCCCGGGACCTCACCGCCGCCCGGGGACGACTAACTGACGCCTTGGAGAAGGCACAGGCTCTCCTCACACCCGCACATCAGAGTAAAGTGGGGCGCGGGGCGAACCCCGACCTGCTGGGGGCGGTGGTGGCGCTAGAGTCGGGCTGGGCGGCGCGGGCCGGGAGGGCGCATGCTCTGTGTCGCGCAGGGGCCGTGCTGGCGCCACACGCGCGCTCCGCCTCCGCCCTGGCCGCGGCCAGGCCCGCGAGACACGCCAGGGCCCTGCGACACGCCCGCACGCTCAGGACCGCGCTCGCGCACTGGGAGAAGGCGTGTACGCTCGCTCAGAAGTACTCGCTGCACGTGTCGCCCGTCGAGGAGTCGCTCATGGAGATGCTGCACCCCGAAGGAAACATCGACACACACTGGGTGGGAGACGTGTCGGCGCTGGTCCAGGATACGCTAGGATCCCGGGCGTCGGCCGCCAGCTCAGCGCGTGCCCGCCTGAGCGTCGCCAGCGACGCCCTCAGCGCCGCCGCCGCCCGCCTCAGGGACGCAGCCGCCGCCCGGGAACATCTGCTGCACGACCTGGCCGGGCCGCTGCACGCGCTCGCCCCCTACAACGAGGACATAAAACGGAGACAGCAATCGAGGAAAGTTGGTATTTTGAATAAACACGAGCAGCGGGGCCTGTGGTCAGCATTTAGTTTTGGTGACAATCCGTCCCAGGAGTTCCTGTCTCTGTGGCGCGCGGTGTCCGAGCGCCTGGCGGCAGTCTGCGGCCTCCTAGACCTTGACCTGGACCTGGAGCGCGTCGCTCGGACAGCTGCAGAAACACACGCACTGCTCACAGACCTGCCGCTGCTGCTGGATATGATGCTCCAATTGCCCGGAAACACGGACACGAGCCGGCGACTCACGCGGCAGGCGGCCGTCGGCAGACCGCCCGCCAGGCACGGTCACGAGCAGCGGAGTGCGACCGGCGCCGGCGTGTGGCGGCGCGTCAGACTCAAACTGGAGAAACGAATGACGCCGCAGGAACAGGTACTCGCGCACTCCACAGGAGATGTGTACCCTTGTGGAGTACATTATATCGGAAGCGACCAGCGCGGACAACCTCTGCCTCATGTACGAGGGCTGGATGGCGTGGGTGTGAGTCCCGGGCGGTGCTCCGCCCTTCGCCGTGTGCGGACGGGCGCTCCCGTGCCGTTCGGCCCGCGGGCGCCCTCCCGAGAGCCCCACGAGCCGGACCGAAGCCCCCGGGCTCGGCCCCGACGACGACGTCAGCTACGGAAGACACCCCGCACGGGGAACTCTCCGTATCACGATTCGATGTTAGATTTAAAGTATCGCGACTCCGACGACGACTCGCGCGTTCTCCAGGAGCCGCCGAGGCCCAGGCTCCCTCCACCAGCACAGGGACACGCTCCCGGCCGAGCGACGTCGACTCACGGATCGTCTTCATTGAAATATTCATATATATGTATCTCAGCTACCGAGTATGTTGTAAGGTTACCGTGTTTGGACCCCAGCTTGTTTTCTCCACGCGACCATCGCTCTCCCCCGCTCCCTCGGGCTGCCGCGATCCTGACCTGTCACCGCGGACTCCCGGGCCTCGCCCTGGCCGGCCGCGGACAGGGGATCATGCGATAA

Protein sequence:

>DPOGS210338-PA
MELHLHLTAVSRALDYKWGTPVDLETSQDMRTCSPAVIDLPTTTFSSEEYLRSIRRKTSECAPRKPNLLRKDTFANISSSRLSCEEIICDVLDQTIYFSDDTVDSKYDLDSSGFIDNYTKNNLRNKDIEYFMKEKTKGIFRLACEHVVRTMRRGRETLLTLLEAFVYDPLVEWGGGRRKRGARHVRAAKAMLAVRVRELKHSATDITNQLLAALPEVKHCADKWLEENEQLHTVEEKLDMCHKQMALIKEVEACGSNLSDHPLYAISQKYTSHKQAKNAVEDSMKALVKILNDFDTQIESFANTNDILNGPQLMTWVQEFSGPHYDDEKPIFEPIKEFLTNAGKGSMISQFEQAEVELNQTMQQTNLLVRSCLELLTQYVAVSQYYPQSRTEYHRLVMFRKYLATALESKSPEVCREVAGQVTALVESDNVTGDSQQIIAYNYRLHQVNAEANAYLNKCHERLQLEGGPDAIALAQDSYMEAKNNINNWLRTEEGAGAALENAAIGMLCNLNRRYLMLENGAQSAGDCLLDLTSRDGEWFLDDMNSLSMQAIELLTLLPLPSTTADDAALSVAVECVKNANLLIADLVQLNYNFSTIILPEAVKKVHSEDPSTLHMINELNALITNTPVTLNDLLTQLEMHFRYLVMDMESPAAGAQILAAELRSRYEELLSPSEGEAPGQTPGRMLLMGFSGLFAAVELRARELADHLAAPLQPAWRKIDQINDAMHMSAAMQNPALRSVLEDIFTVRRVQTVAEVFSLCLQLARAFRGSPPPAAPPPPPAASQPLLDDAALCKPVRRFTAEYVSRCFLGVHSLSLGRALCLLLRRARLDLRAEVEQKEIGASWSVSLESLVEKACGSVSSRAGSLAGALQASRARTLRAAAAVRALDRARASARAARLRAAAHASLHAEIVSGAPEPGPALFRAARDLTAARGRLTDALEKAQALLTPAHQSKVGRGANPDLLGAVVALESGWAARAGRAHALCRAGAVLAPHARSASALAAARPARHARALRHARTLRTALAHWEKACTLAQKYSLHVSPVEESLMEMLHPEGNIDTHWVGDVSALVQDTLGSRASAASSARARLSVASDALSAAAARLRDAAAAREHLLHDLAGPLHALAPYNEDIKRRQQSRKVGILNKHEQRGLWSAFSFGDNPSQEFLSLWRAVSERLAAVCGLLDLDLDLERVARTAAETHALLTDLPLLLDMMLQLPGNTDTSRRLTRQAAVGRPPARHGHEQRSATGAGVWRRVRLKLEKRMTPQEQVLAHSTGDVYPCGVHYIGSDQRGQPLPHVRGLDGVGVSPGRCSALRRVRTGAPVPFGPRAPSREPHEPDRSPRARPRRRRQLRKTPRTGNSPYHDSMLDLKYRDSDDDSRVLQEPPRPRLPPPAQGHAPGRATSTHGSSSLKYSYICISATEYVVRLPCLDPSLFSPRDHRSPPLPRAAAILTCHRGLPGLALAGRGQGIMR-