Monarch geneset OGS2.0

DPOGS203905
Transcript: DPOGS203905-TA (2763 bp)
Protein: DPOGS203905-PA (920 aa)
Genomic position: DPSCF300005 - 1089668-1093780
RNAseq coverage: 550x (Rank: top 23%)
Annotation
Heliconius: HMEL017933; 0.0; 70.23%
Bombyx: BGIBMGA002014-TA; 0.0; 74.26%
Drosophila: CG7028-PA; 0.0; 51.52%
EBI UniRef50: UniRef50_E2BM25; 0.0; 48.75%; Serine/threonine-protein kinase PRP4-like protein n=10 Tax=Eumetazoa RepID=E2BM25_HARSA
NCBI RefSeq: XP_002092970.1; 0.0; 51.16%; GE21040 [Drosophila yakuba]
NCBI nr blastp: gi|332016541; 0.0; 48.57%; Serine/threonine-protein kinase PRP4-like protein [Acromyrmex echinatior]
NCBI nr blastx: gi|242023108; 0.0; 50.21%; serine/threonine-protein kinase prp4, putative [Pediculus humanus corporis]
Group
Gene Ontology:
GO:0016772; 1e-76; transferase activity, transferring phosphorus-containing groups
GO:0005524; 2.7e-66; ATP binding
GO:0004674; 2.7e-66; protein serine/threonine kinase activity
GO:0006468; 2.7e-66; protein phosphorylation
GO:0004672; 1.9e-49; protein kinase activity
GO:0004713; 4.6e-06; protein tyrosine kinase activity
KEGG pathway 
InterPro domain:
[585-918]; IPR011009; 1e-76; Protein kinase-like domain
[599-916]; IPR002290; 2.7e-66; Serine/threonine-protein kinase domain
[603-916]; IPR017442; 1.9e-49; Serine/threonine-protein kinase-like domain
[599-916]; IPR020635; 4.6e-06; Tyrosine-protein kinase, catalytic domain
Orthology group: MCL11521; Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203905-TA
ATGGGCAAAGAAAGTAATCTTGATGTGGAAAAATCTACTAACGGTAGCGCAAAAGAAGACGAAAGTAAGAAGAAAAAGAAAACTAAGAAACACAAAAAACGTAAAAGACCAGTGGATGAGGACACTGAAAAAGGCTATAAAAAAAGAAAGAAAGCCAAAAAGGAAAAACAAAAAAGTCCGAAGCCAGAGAGTGAAACATCACTTTCTGATGTAGAGGAGTTAATTAAAAAATCCAAAAGATGTAAGCCTGCAAACAAAGAATCTTCATTGGAAATTGCCTCGGCTGATAGTGCGAAAAATACTAAAACGAAAAATGGCAATACGAATGCTGATCAGTCGCGTAGTAGTAGCAGAAATACAAAGAAAGTTGACGACACTTTAGAAATTTTATCTTCTACTTCTGAAGCCGAAAACTATCAAGACGACTGTGCTAGCCCAGAACTTACTATAATTGAAGATGATCTTAATCTTGAAGAATTGATGAAACAAAAAGAAATGTTGCAAGCACGTTTAACTGCCTATAATTCAGACAAAAGCGATGAAGAAACTCGTCTTAAGAAAAACACAAATGATGATGTGATCTGTGTCGATGAAGTATCGAAAGAACCCGTTTCCAAAAAGAAAAAAGATTCCTCTTCTGAACACAAAAAAACTAAAAGTAAAACCACTAAGCAGAGACCATCGGAGCTTGAAAAAATTAAGTTAAAACGTCAAGAGGAAGATTTAAGAGAAATAATCAATAGAGAAAGTCGAAAAGAGATGGAAAAGAGATTAGAAGAAAGAGAATATCGTGAAAAGAAGGAACGTGACCGACGTAAAGCCGAATCAGATAGAGAACGAGAACGTGAGCGGGAAAGAGAGAGAGAAAGAGAAAAGGAGCGAGAAAAAGAAAGGGAACGAGAAAGAGATCGTGAACGTGATAGAGAAAGGGAACGGGAAAGAGAAAAAGAAAGGGATAGAGAAAAGGATAGAATAGATCGTGAGCGTGAAAGAGAAAGGGAAAGGAAAAGGGAGAGAGAAAGAGAGAGAGAACGTCGGTCTTCTGAATTTAGGCGAAGGGAACGAGATTCGATTCGTAGTCGAAGGTCCAGGGACCGTTCTCCAATTTCAAGATATGAAAGAAGACATTCAGACCGTGATAGAAAGTCACGTGACCGTAGTAGGGAAAGATATAGAGGTCGGTCGCGAGACAAAAATTATAAAACTGATCATAGGGAGAAACGCAGAAATTCTACAAATGAAGTAACTCAAATAGTACCCAGCTCTAGTGATGAAGAGCTTAACATATCAATCAACACTGACGATGAAGAAGAAACTGAAGAACAAATCATTGAAAGGAGAAGGAAGCAAAGAGAACAGTTACTTAAGAGACTAGAAAGCAAAAACAAGGCAAACTCAACATCAGATCAAAATCTAGATACCAAAACATCTTCAACTGATACAAATCAAAAGTCATCGGTAGCTGTGCCAGAAAAGATTACCAAAGACACACTCCCCGGAAATGTGAATGTTCTCAATAAAATGGAAGCGGAGGTAAAAAATGAAAAAAGTAACACAAGTTCTTTGATAAGGAAGATTGAAAATGAGTCCAAGACAAGTATAAAATTGGAGAAATCCACGAAAGGGAGTGAGTGGGATATGTTTGCAGATCAAGATAATTTTGACAGTGTCGATACACCTACAGCAGGAAAACCAAGGAGTAAAAATACTATGGAAAATCCATCACTTACTGATAACTGGGATGACGCCGAAGGTTACTATAGAGTACGCATTGGAGAAACACTGGATAACAGATATACAGTTTATGGGTACACAGGGCAGGGGGTCTTTTCTAATGTTGTAAGAGCAAGAGATCAAGCTCGCGGTAATACTGATGTTGCAGTCAAAATAATCAGGAATAATGAGATCATGCACAAAACTGGTTTGCGGGAATTGGAAATTCTGAAACGATTAAATGACTCAGATCCAGAGGACAAATTTCATTGCTTACGTTTATTTCG
CCATTTTTTCCATAAACGTCATTTGTGCATGGTAATGGAACCTCTTTCAATGAATCTGAGAGAAGTGCTCAAAAAATATGGAAAAAATAGTGGTATTCATATCAAAGCTGTTCGGAGCTATACCCAACAAATGTTGTTGGCATTAAAACTATTGAAAAAAACTGGAATATTACATGCTGATATCAAGCCTGACAATATTTTGGTGAATGAGAGTAAGTTAATATTAAAGCTGTGTGATTTCGGTGCTGCGTCACATGTATCGGACAACGAAATCACGCCATATCTGGTTTCGAGATTCTATCGTTCTCCAGAAATCATTCTTGGAGTCCCCTATGACCACGGTATAGATATGTGGTCGACCGCTTGTACCATATATGAGTTGTCTACTGGGAAAATTATGTTCAGTGGCAAATCAAATAATGAAATGTTAAAATATTTTATGGATTTGAAAGGGAAGATACCAAATAAGATTATTAAAAAGGGTCATTTTAAGGAGCAACATTTTGATAGCAATTGTAATTTTTTGTATCATGAACTGGACAAGATTACAGAAAGGGAAAAAGTGGTTATAATGTCAAGCATAAAACCTACAAGAGATTTACAAACTGAGTTGGCTCCTCCTCACCACCGACTGCCAGTTCCAGAAGCGAAGAAAATAACTCAACTTAAAGACCTACTTGAAAGGATGTTAATGCTAGACCCATCAAAACGGGCTTCTGTCAATCATTGTCTTGCTCATCCTTTTATTCAAGAAAAAATATAA

Protein sequence:

>DPOGS203905-PA
MGKESNLDVEKSTNGSAKEDESKKKKKTKKHKKRKRPVDEDTEKGYKKRKKAKKEKQKSPKPESETSLSDVEELIKKSKRCKPANKESSLEIASADSAKNTKTKNGNTNADQSRSSSRNTKKVDDTLEILSSTSEAENYQDDCASPELTIIEDDLNLEELMKQKEMLQARLTAYNSDKSDEETRLKKNTNDDVICVDEVSKEPVSKKKKDSSSEHKKTKSKTTKQRPSELEKIKLKRQEEDLREIINRESRKEMEKRLEEREYREKKERDRRKAESDREREREREREREREKEREKERERERDRERDREREREREKERDREKDRIDRERERERERKRERERERERRSSEFRRRERDSIRSRRSRDRSPISRYERRHSDRDRKSRDRSRERYRGRSRDKNYKTDHREKRRNSTNEVTQIVPSSSDEELNISINTDDEEETEEQIIERRRKQREQLLKRLESKNKANSTSDQNLDTKTSSTDTNQKSSVAVPEKITKDTLPGNVNVLNKMEAEVKNEKSNTSSLIRKIENESKTSIKLEKSTKGSEWDMFADQDNFDSVDTPTAGKPRSKNTMENPSLTDNWDDAEGYYRVRIGETLDNRYTVYGYTGQGVFSNVVRARDQARGNTDVAVKIIRNNEIMHKTGLRELEILKRLNDSDPEDKFHCLRLFRHFFHKRHLCMVMEPLSMNLREVLKKYGKNSGIHIKAVRSYTQQMLLALKLLKKTGILHADIKPDNILVNESKLILKLCDFGAASHVSDNEITPYLVSRFYRSPEIILGVPYDHGIDMWSTACTIYELSTGKIMFSGKSNNEMLKYFMDLKGKIPNKIIKKGHFKEQHFDSNCNFLYHELDKITEREKVVIMSSIKPTRDLQTELAPPHHRLPVPEAKKITQLKDLLERMLMLDPSKRASVNHCLAHPFIQEKI-
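
The figures in this record can be cross-checked against each other. The sketch below is illustrative only and not part of the database entry: it copies the transcript length, protein length, genomic coordinates and InterPro domain ranges from the record, and the helper names (expected_cds_length, genomic_span, domains_within_protein) are hypothetical. It assumes the transcript is pure coding sequence ending in one stop codon (the nucleotide sequence ends in TAA and the protein ends in "-"), and that the genomic coordinates are 1-based and inclusive, which the record does not state.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 2763
PROTEIN_AA = 920
GENOMIC_START, GENOMIC_END = 1089668, 1093780
INTERPRO_DOMAINS = {
    "IPR011009": (585, 918),
    "IPR002290": (599, 916),
    "IPR017442": (603, 916),
    "IPR020635": (599, 916),
}


def expected_cds_length(protein_aa: int) -> int:
    """Coding length in bp: three bases per residue plus one stop codon."""
    return 3 * (protein_aa + 1)


def genomic_span(start: int, end: int) -> int:
    """Span in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def domains_within_protein(domains: dict, protein_aa: int) -> bool:
    """True if every (start, end) range lies inside residues 1..protein_aa."""
    return all(1 <= s <= e <= protein_aa for s, e in domains.values())


if __name__ == "__main__":
    print("CDS length matches transcript:",
          expected_cds_length(PROTEIN_AA) == TRANSCRIPT_BP)
    print("Genomic span (bp):", genomic_span(GENOMIC_START, GENOMIC_END))
    print("Domains inside protein:",
          domains_within_protein(INTERPRO_DOMAINS, PROTEIN_AA))
```

Under those assumptions the genomic span is longer than the transcript, which would be consistent with introns, although the record itself does not list the exon structure.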