Monarch geneset OGS2.0

DPOGS205511
TranscriptDPOGS205511-TA3042 bp
ProteinDPOGS205511-PA1013 aa
Genomic positionDPSCF300056 - 209288-232823
RNAseq coverage1473x (Rank: top 9%)
Annotation
HeliconiusHMEL0112960.080.24% 
BombyxBGIBMGA011088-TA2e-12139.51% 
DrosophilaJIL-1-PB2e-13342.93% 
EBI UniRef50UniRef50_UPI000224668E0.053.97%UPI000224668E related cluster n=3 Tax=unknown RepID=UPI000224668E
NCBI RefSeqXP_395099.30.057.74%PREDICTED: similar to ribosomal protein S6 kinase, 90kDa, polypeptide 5 isoform a [Apis mellifera]
NCBI nr blastpgi|3287786500.057.74%PREDICTED: ribosomal protein S6 kinase alpha-5 [Apis mellifera]
NCBI nr blastxgi|910931500.060.43%PREDICTED: similar to ribosomal protein S6 kinase, 90kDa, polypeptide 5 [Tribolium castaneum]
Group
Gene OntologyGO:00167723.6e-78transferase activity, transferring phosphorus-containing groups
GO:00055246.4e-77ATP binding
GO:00046746.4e-77protein serine/threonine kinase activity
GO:00064686.4e-77protein phosphorylation
GO:00046724.7e-57protein kinase activity
GO:00047135.1e-09protein tyrosine kinase activity
KEGG pathwayame:4116300.0 
 K04445 (MSK)maps-> MAPK signaling pathway
    Neurotrophin signaling pathway
InterPro domain[1-265] IPR0110093.6e-78Protein kinase-like domain
[342-609] IPR0022906.4e-77Serine/threonine-protein kinase domain
[347-600] IPR0174424.7e-57Serine/threonine-protein kinase-like domain
[239-299] IPR0009611.5e-14AGC-kinase, C-terminal
[259-301] IPR0178923.9e-09Protein kinase, C-terminal
[342-601] IPR0206355.1e-09Tyrosine-protein kinase, catalytic domain
Orthology groupMCL10897 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205511-TA
ATGAAGGTGCTGAAGAAGGCCAGCATAGTGCAGAAACTCAAGACGGCGGAGCACACCAGGACGGAGAGACAGGTCTTGGAGGCTGTGAGGGCCTGTCCCTTCCTGGTGACCCTACATTACGCCTTCCAGACAGATGCCAAGCTACATCTCATACTCGACTACGTGGCCGGCGGGGAGCTGTTCACACACCTGTACCAGCGAGAACACTTCCACGAAAACGAAGTCAGGATATACATCGCCGAGATCATACTGGCGTTGGAACAGCTGCACAAGCTGGGCATCATCTACCGCGACATCAAGCTGGAGAACATCCTGCTTGATGCGGAAGGTCACATCGTGTTGACGGACTTCGGGCTGTCCAAGGAGTTCTGCGGCGGAGAGAGCCGCGCCTACAGCTTCTGCGGCACCATAGAGTACATGGCCCCGGAGGTCGTGAGGAGCGGCTCCCAGGGACACGACATAGCTGTGGACTGGTGGTCTGTGGGAGTATTAACGTACGAACTGCTGACGGGTGCTTCGCCCTTCACAGTCGAGGGTGAGAAGAACACTCAGCAGGAGATCACCAAGAGAATTGTGAGATGCAGTTACCCCGTGCCCAACGACGTCAGCCCCGCTGTTCAGGACTTCATCAAGAAACTTTTAGTAAAGGACCCTCGTCGTCGTCTAGGAGGCGGTGATGATGATGCAGAGGAATTGAAACGACACCCCTTCTTCCAGGACCTGGACTGGGAGGCGGTGTCCCGGCGCGAGGTGGCCGCCCCGTTCGTGCCGCAGCTGTCGCACGCAGCTGACACGTGTAACTTCGCCGACGAGTTCACCAGGATGCCGCCCACCGACTCGCCGGCACAGGCGCCCAAACACCACGACAAACTGTTCCTAGGTTACTCATATGTGGCACCCAGCATCCTGTTCTCTGAGAACATCATCTCGGATCAGATCTGGCTTCAGGCGACCGGACAGAAGCACGAGAAGCTCAAGGGATGGATCGCTAAGGACTCGCCGTTCTTCCAGAAGTATTCTGTTGACTTGAGCACTCCTCTGCTGGGTGACGGCTCATATGCTGTTTGTAGGAAGTGTATACACAGACAGACGGGCAAGGAATATGCTGTTAAGATAATATCGTCACAGAAGAAGGACGTGAAGCAAGAGATAGACCTGCTGAAGACCTGTCAGGGCTGTCCTTACATCATACAGCTGCATGAGGTGTTCCACGACACTGCGTTCACCTACATAGTGACGGAGCTGGCGATGGGCGGTGAGCTATCCTCGGTGCTGGGCGCGGTCAGCGAGCGGGTGGCGAGGAGGCTCATCGTGCAGCTGTCGCTGGCCGTTAGACACATGCACGCCAGGAGTGTCGTGCACAGGGACCTCAAGCCTGAGAACATCCTGCTCAGCAGCACCCGGCTGCACGAAGCTAAGGTGAAGGTGGTGGACTTCGGGTTCGCGAGGCGCCTCCCCGACTGTGACGACCGGCAGAGGATGATGACGCCCTGCTTCAGCCTGCCGTACGCGGCGCCAGAGGTCGTGTCGTGCGCGAGGGGCGCGGCCGCCGGCTACGGGCCGGGGTGCGACCTGTGGAGCTTGGGAGTTATATTTTACTGCCTGGTGTCTGGGCGGGCGCCCTTCTCTCCGGGGGGCCGGGAGCCGGTCGCCGCGCTCGTACAGAGGATCAGGGCGGGGACCTTCACTATGGACGGTCCTGTTTGGGACAACATATCCAATGACAGTAAACGTCTGATCGCCGGCCTCCTGGCCGTGGAACCGGCGGACAGGCTGACCATCACGCAGGTGTTGCAGGAGCTGGGAGTTAACTGTGACGACGGGAGCGGGTTCAAATTACAGGACATGACCAAAGCCGATCTCTACAAACGTCGCAGCAAGAACAAGCAGCGGGGCTACAGTGACGGCGACCAGGGAGCGACGGATGCCGACAACGACCACGACACCTCCATCACGGACACGCTCGACACGCTGCACCGCATCAACAAGAACTCCATAGAACACGACGTGGCGCTCATCAAAGCTAACACCGCGCACACGCCCGACGCCGGGGACCACCATCTCACGGAGGACAACTACGAAGTCTCTCCGCTCAAACCCGACATGACCAAAGCCGATCTCTACAAGCGTCGCAGCAAGAACAAGCAGCGGGGCTACAGTGACGGCGACCAGGGAGCGACGGATGCCGACAACGACCACGACACCTCCATCACGGACACGCTCGACACGCTGCACCGCATCAACAAGAACTCCATAGAACACGACGTGGCGCTCATCAAAGCTAACACCGCGCACACGCCCGACGCCGGGGACCACCATCTCACGGAGGACAACTACGAAGTCTCTCCGCTCAAACCCGTGGAGAAGACCTACTCCAAGTCTAGATCCAAACTAGACGAATATATTTATATAGAGAGCAGCCAGGGACTCGAGGACACGGAAGTAGCGTTCCCGAAAACCACGCGCAAGAAGGAGTCTGCATCGCCCCTACCAGCGAAGAGAAGAAAAATAGACACCAGGTCCGCCAAGAAGACAGACTCGACCGCCGAGACACAGAAGAGAGGCAGAGGGAGGCCCAAGAAGAACGTAGAGGAAGTTAAAAACACGGAGAAGAAAACTAGAAATACCAAAGAGTCCATAGAAAACGACAAAACTACACTCACGAGAGTCACGCGGAAGAGAAAGTACGAGGAGATCGCGAAACCGGTGTTAAGGGAGAAGGGGCAGAACGGGAGACAGTCTAAGAACAGTGACGGGAAGGTGGAGAACAGAAAGATAGCGAGGCCCAAGAGGAATGTAGAAGTAAGAGTAGAACTGGACAACGTGAGGATGACCAGGTCCAGGAGGAGGAGGCTGGAGGTGAGTCTGTCGCCCTCCGAGGTCAAGACTGTCATACCCGCCTTCTCCTTCGAGTCGGACAGGAGAGTCAACTCCGTAGAGAGCAACAGGACAGCCGGCGCTAAGAACAAGAAAGCCAAAGGCAAAGCGAAAAGACAAGCCAAGCCCAAACGATCCACACGAGCCCGCGCCGCCAGGAGGTGA

Protein sequence:

>DPOGS205511-PA
MKVLKKASIVQKLKTAEHTRTERQVLEAVRACPFLVTLHYAFQTDAKLHLILDYVAGGELFTHLYQREHFHENEVRIYIAEIILALEQLHKLGIIYRDIKLENILLDAEGHIVLTDFGLSKEFCGGESRAYSFCGTIEYMAPEVVRSGSQGHDIAVDWWSVGVLTYELLTGASPFTVEGEKNTQQEITKRIVRCSYPVPNDVSPAVQDFIKKLLVKDPRRRLGGGDDDAEELKRHPFFQDLDWEAVSRREVAAPFVPQLSHAADTCNFADEFTRMPPTDSPAQAPKHHDKLFLGYSYVAPSILFSENIISDQIWLQATGQKHEKLKGWIAKDSPFFQKYSVDLSTPLLGDGSYAVCRKCIHRQTGKEYAVKIISSQKKDVKQEIDLLKTCQGCPYIIQLHEVFHDTAFTYIVTELAMGGELSSVLGAVSERVARRLIVQLSLAVRHMHARSVVHRDLKPENILLSSTRLHEAKVKVVDFGFARRLPDCDDRQRMMTPCFSLPYAAPEVVSCARGAAAGYGPGCDLWSLGVIFYCLVSGRAPFSPGGREPVAALVQRIRAGTFTMDGPVWDNISNDSKRLIAGLLAVEPADRLTITQVLQELGVNCDDGSGFKLQDMTKADLYKRRSKNKQRGYSDGDQGATDADNDHDTSITDTLDTLHRINKNSIEHDVALIKANTAHTPDAGDHHLTEDNYEVSPLKPDMTKADLYKRRSKNKQRGYSDGDQGATDADNDHDTSITDTLDTLHRINKNSIEHDVALIKANTAHTPDAGDHHLTEDNYEVSPLKPVEKTYSKSRSKLDEYIYIESSQGLEDTEVAFPKTTRKKESASPLPAKRRKIDTRSAKKTDSTAETQKRGRGRPKKNVEEVKNTEKKTRNTKESIENDKTTLTRVTRKRKYEEIAKPVLREKGQNGRQSKNSDGKVENRKIARPKRNVEVRVELDNVRMTRSRRRRLEVSLSPSEVKTVIPAFSFESDRRVNSVESNRTAGAKNKKAKGKAKRQAKPKRSTRARAARR-