Monarch geneset OGS2.0

DPOGS208294
TranscriptDPOGS208294-TA4887 bp
ProteinDPOGS208294-PA1628 aa
Genomic positionDPSCF300079 + 481080-496147
RNAseq coverage1208x (Rank: top 10%)
Annotation
HeliconiusHMEL0083690.075.48% 
BombyxBGIBMGA006461-TA0.075.66% 
DrosophilaDoa-PP2e-16574.30% 
EBI UniRef50UniRef50_E2BX100.052.44%Serine/threonine-protein kinase Doa n=6 Tax=Formicidae RepID=E2BX10_HARSA
NCBI RefSeqXP_970822.20.054.78%PREDICTED: similar to Darkener of apricot CG33553-PG [Tribolium castaneum]
NCBI nr blastpgi|1892419090.054.78%PREDICTED: similar to Darkener of apricot CG33553-PG [Tribolium castaneum]
NCBI nr blastxgi|3838624810.051.31%PREDICTED: uncharacterized protein LOC100880767 [Megachile rotundata]
Group
Gene OntologyGO:00167727.8e-82transferase activity, transferring phosphorus-containing groups
GO:00055246e-74ATP binding
GO:00046746e-74protein serine/threonine kinase activity
GO:00064686e-74protein phosphorylation
GO:00046729.4e-60protein kinase activity
KEGG pathway 
InterPro domain[1265-1627] IPR0110097.8e-82Protein kinase-like domain
[1279-1596] IPR0022906e-74Serine/threonine-protein kinase domain
[1279-1596] IPR0174429.4e-60Serine/threonine-protein kinase-like domain
Orthology groupMCL12012 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208294-TA
ATGAGCGATCTAAAAGATCTCATCACCGGCGCTCACCGACATCACACCGGCGGAGGAGACATCACCGCGACGATTCATCAGCGCGCTCCCCACCCGAACAGTACGCGCCGCGGGCGGCCCGCGCGAGCCATGGCTGCCCAACCGCCGCGCCTTCCCCCTAACAACCTGCTCTTCAGCGGACCTCCTCCATCTTTACCGCGCGTGGTTTTCTTGCCAAGTGACACCGCTGCAAATAATCCCGTTATAGAAAGCGCATGCGATGATGACCTTGACCAGAAAAACGATGCGGATGTCTCCGACGAAATAAACAACGAGACAGTGCCTGTGAATGGTGAAGATCACAATCAAAAAGTTTACAAACCATCAGACCAAGAACAGTTCTATACATTAAATAATTTAAGAAGAACTCGAAGTTTTGATGTGTTAGATATTTATACTTGTAGAGATGATAGTGTCATAGAAGGAGGTGTAATTGGCTTAAAACATCGGAGGTCTGAACCGGACTTGAATAAATATGGTCTTTTTGTTGAAACTGAAATTGAATGTGAACCAATACAACCACCTCCGCTCGTTCTAAACAATCCGTTTTATGATGCTTCATATGCTTTAATTGGTGATGATGTAAACGAAAATATTGTTATGTTGCCCGAAAATTACTTGGCCTTTGAAGATTCGTTTGATCCCACATGGAATTACAAAACAAATGTTGTGGGTGATCAGGGATTATACTATGACGGTCAACCTCTCATACCACCCAATGATCCATGGTCGATATATAGCGACAATAACAATAGTTTTGAATATTATACACCCCAATTTGAAATACCTACGGAAGATGGAGTACAATATATGCGCCTCAGTGACTTAGATTACGCCATACCTCAGTATATGAGCCTTCCGTTAGTCGAAAAAAAGATTCATAGTGAACCGTGCTCAGATTGTTGTAGTCAAATGGAGAAAGAACAGGAAATAAGGAAAACTAAAGTGGATGAAGACGCAGAAAAAAATGAAAATGTTTGCAGTGAATTAGTAACAGAGAATATTCAACCTTTAACCGCTACTGTTGTAAATACATTAAGCTTAGATAAAACTGATAGTGTAGACTCTTTACCGAACAGAGAATCCTCATGCAGTGAATCCTTTAATGCGGATATATCTATTGATGTGACGTCCAGTTTAGCTTTCATGCCTAGTAGTAAAAGCTCCCGACGTTCCCAAGGAACTGACAATACTAGTGATAACACGTCACCATGTTCCACTGATTACCACGAAGCTTCGGCTCTTGACATAGCTCAAAGCTTAGATGAACTTTCATGTTCTGGCAGCACAGACTTTTCGCAGAGTCGAGAGGAAGTCTCTCCTATACCGGAGGATCAAACAACTAGTTGTAAACAACAAAGCAGAGTAAATAGTAACAATAACAACGAACCCAACAACATTGAAACAAATAAGCCACTGGCCAGTTTACCACTATGTAAACTACCTTCCATTCCATTACACGAGCAAATGCCACCAAAGCTCCCCCCTCAGAGGAATAATGTGATGTGTAATAATAAAAATGTACATAATAATTCATTTACAAATACAGACGCTAGTGTTGTCCAGTGTGAATCGACAATTCAAAGTGATAAAACAACTAAAGATAATAATGTCAAACATGCTGCAAAAACAACTGCTCCTAACGCAGTTAATGTAAACGACTCATCAAAATGTATGAGAGCGGCACCTCAACCGCCGTCTGTGCCGCCAGCGTGGTTGACAAAAAATACAACAGGTGGAAATAAACAGAAAGTTACCAAAGACGTGCCTCAAATACTTATAAATAATGTAGATGAGACCAAACACGACAAGAGTACTGCGTCCACACAAATAACAACTAAGTCGACTGATCCTGGAGACCCTCAGCCCTCATGCTCTTATGCACCGCCAGTACCTCCGCCTCCAGCCCAGCTGAAGCCCAAAGATGTAGAGGTCAAGTTCGTGTTATCCGTGGCTGGTCCCATTTCTGGTGGTCTTCTCCTAAGGTCACTTCTCCGAGCTCCGGCCGCCTTAGTTCACCTATTCGTTAAGACCCTGCTGCTTCCGGCCGGCCTTGTATTGCCGCCAGCGTTGCAACAACCGTCCCGCTCCGATGATTTACGAACGGCGATCATGCACATGTTTTCACAGATTGTTTTAAAAAACGTTCGCCACCACGTGCCGCTACGCCTGTCCTCGTTCTACGACCTGTATAAGAGATTCCTTAAGGATCACGGTTGCAAGTGGCACGATGTCGCTAAACTTTTATCTGTTCTGGCGGATTTACTGTTTGATGTGGACGGTGCATGTAGTAAGATAGCGGGCAAGTTCTTGGAGTGGGTCGCATTTTTTATTCGTTCCATGTGCAGCAGTGGCAACATGACCACCAGCCCTCGTCGTCGATACACACGAGCGTCCACGACCAGCGTTACTCAGCTGTTATCAGATGGATACTCCAACATCATGAATCGACTCACTCGAAGAGGACCCTCCGAGAAAAATGATCATATAATTGATACGAAGTTAACAGCTGCTCGTAATCGCTACGACGACAAATTACTATCGAACAATAATTCAGTATTAACAAATGTCCGACGTTACGAAAACAATAGAAAAATATCCCCATACAAACCGTTCACTTCCCCTATAACAGTGACTGCGAAAAAGTTTGGAGACGATCGCGGATACTCAAGTTACCTGAGCTCGCCCAAGACCCGGATAGACACATCACCTGTGCTATCGAATCCCAGTATGAGCGCTCTCACTCGCAGTGACTCATTTCGCAGGGCTTCCAAGAAAGATAATAAATCATCACCGTTCACCAAACGATATCCTCTCAAGGAAACTAATAATAACACATTAGAAAGTACAATTGCTTTAGGTAGTACACGCAGCCGTTTAGAGGATAAATACTCCTCGGTTTTAGACAAAATAGCTATTCAGAAGAAAGAGAGAGCTAAGAAAGAGAAAGAAGACCGTGACAAAACTTTAGAACCTGAACCGGCTTCTTTTTCTAGAGGGTTAATGAGAAGTTTTACGACCGCTGTATTCGGAGAGAATTCTTTTAAAAGAAATAATTATTCGCGAGAAAAAACAAGGGATAAGACGCCGTTCAGAAATACGACTGATCGTCGTTTACCGTCTAGTCATAAACAAAGCTCTAAAAACGAATTAAAAAATGGCTTCGATGCGTCTCTAAGAGATGGAAACCAATTTATGAAAGATAGGGATAGTATTTATAGAAAACACCACAGGAGATCACTGAAAGTAGAAAAAAGCAGTAGTGATAAGAGAAGTGGCAAGTTATCTTTAAGGCCGATTGATATCAGCTTACAATCTGGTACAAGAGATTTAATCTCACCCATACAACCAGAAATCAAATATAAGCCGACAAAAACGCCTGCTTCTTCCCCGGTCTGCGAAGGCCGGCAAAAACAAATATACTTCCCTTCGAGTGACGAGGACGACGATAAGACTCCGGTGGGCGACCGCGCTCTCACGGAACGAGAGACGCGAAGGAAAGAAATACAGGGACTGATCATGAAATATGCACATCTGGATGAGGTATACGCTCGGATTACTGAAAAGGAACCCAACGGAGTCACCAAAGACTTAGTGCCGAGGAAATTGGAGCCTATCGGAGTCGGCGATGTGGTAGCGTTGCCACCAGAGCTAAGGAGTCGCCATCGACCACAACGGCCGCGACACCTCATGAGGCACGCGGCCACGCCGCCTAGCTCCCGGGCACGCTCCTCCGTCAAGGACGACAAGGACGGACATCTGGTGTACTGGCCCGGATATGTCATGGGAGCGAGATACAAAATCATCGAGACGCTCGGTGAGGGAACCTTCGGGAAGGTGGTCGAAGTGAAGGATCTCGAAATGGAGCACAGAATGGCTCTGAAGATAATAAAAAATGTGGAGAAATACAGAGAGGCTGCGAAATTAGAAATAAACGTATTAGAAAAATTAGCTGACATTGACCCCGATTGTAAGAATCTGTGCGTGAAGATGCTAGACTGGTTTGAATATCACGGACACATGTGTATCGCGTTTGAAATGCTCGGACAAAGTGTATTTGACTTCCTGAAAGACAACAACTACCAGCCATATCCCCTGGAGCAGGTGCGACACATCTCCTACCAGCTGATACACAGCGTGCTGTTCCTACACGACAACAAACTCACACACACCGACCTCAAGCCCGAGAACATACTGTTCGTGGACAGCGACTACGAGGTCGTCAGTGTGTACAACACCTCCAAGAAGAAGCACGACCTCCGTCGCGTGAAGCGCAGTGACGTCCGCCTGATAGACTTCGGCAGCGCGACCTTTGACCACGAACATCACTCGACAATAGTCTCCACGAGACATTACAGGGCACCAGAGGTCATACTCGAGCTGGGTTGGTCTCAGCCGTGTGACGTGTGGTCCATCGGCTGCATCATGTTCGAGCTGCACCTGGGCATCACACTGTTCCAGACACACGACAACAGAGAACACCTCGCCATGATGGAGAGGATACTAGGACCGATACCATACAGAATGGCAAGAAAAACAAGGACGAAATATTTCTATCATGGCAAATTAGACTGGGATGAAAAGTCATCGGCGGGGAGATACGTTAGAGAGAATTGTAAACCGTTATTAAGGTATCTCCAGACTAACAGCGAGGAGCTCCGTCAGCTGTTCGAGCTGATCGGCCGCATGTTGGAGTACGAGCCCTCACAGAGGATCACGCTCAGGGAGGCGCTGCAGCATCCCTTCTTCAGCAAACTACCGCACAACCAGAGACTAGGCAATGACCGCGCGCGCTGCAACGGCGAGAGCTCGGCGTCCCGCGAGCGATCTCACTCACTGAGCCGGTGA

Protein sequence:

>DPOGS208294-PA
MSDLKDLITGAHRHHTGGGDITATIHQRAPHPNSTRRGRPARAMAAQPPRLPPNNLLFSGPPPSLPRVVFLPSDTAANNPVIESACDDDLDQKNDADVSDEINNETVPVNGEDHNQKVYKPSDQEQFYTLNNLRRTRSFDVLDIYTCRDDSVIEGGVIGLKHRRSEPDLNKYGLFVETEIECEPIQPPPLVLNNPFYDASYALIGDDVNENIVMLPENYLAFEDSFDPTWNYKTNVVGDQGLYYDGQPLIPPNDPWSIYSDNNNSFEYYTPQFEIPTEDGVQYMRLSDLDYAIPQYMSLPLVEKKIHSEPCSDCCSQMEKEQEIRKTKVDEDAEKNENVCSELVTENIQPLTATVVNTLSLDKTDSVDSLPNRESSCSESFNADISIDVTSSLAFMPSSKSSRRSQGTDNTSDNTSPCSTDYHEASALDIAQSLDELSCSGSTDFSQSREEVSPIPEDQTTSCKQQSRVNSNNNNEPNNIETNKPLASLPLCKLPSIPLHEQMPPKLPPQRNNVMCNNKNVHNNSFTNTDASVVQCESTIQSDKTTKDNNVKHAAKTTAPNAVNVNDSSKCMRAAPQPPSVPPAWLTKNTTGGNKQKVTKDVPQILINNVDETKHDKSTASTQITTKSTDPGDPQPSCSYAPPVPPPPAQLKPKDVEVKFVLSVAGPISGGLLLRSLLRAPAALVHLFVKTLLLPAGLVLPPALQQPSRSDDLRTAIMHMFSQIVLKNVRHHVPLRLSSFYDLYKRFLKDHGCKWHDVAKLLSVLADLLFDVDGACSKIAGKFLEWVAFFIRSMCSSGNMTTSPRRRYTRASTTSVTQLLSDGYSNIMNRLTRRGPSEKNDHIIDTKLTAARNRYDDKLLSNNNSVLTNVRRYENNRKISPYKPFTSPITVTAKKFGDDRGYSSYLSSPKTRIDTSPVLSNPSMSALTRSDSFRRASKKDNKSSPFTKRYPLKETNNNTLESTIALGSTRSRLEDKYSSVLDKIAIQKKERAKKEKEDRDKTLEPEPASFSRGLMRSFTTAVFGENSFKRNNYSREKTRDKTPFRNTTDRRLPSSHKQSSKNELKNGFDASLRDGNQFMKDRDSIYRKHHRRSLKVEKSSSDKRSGKLSLRPIDISLQSGTRDLISPIQPEIKYKPTKTPASSPVCEGRQKQIYFPSSDEDDDKTPVGDRALTERETRRKEIQGLIMKYAHLDEVYARITEKEPNGVTKDLVPRKLEPIGVGDVVALPPELRSRHRPQRPRHLMRHAATPPSSRARSSVKDDKDGHLVYWPGYVMGARYKIIETLGEGTFGKVVEVKDLEMEHRMALKIIKNVEKYREAAKLEINVLEKLADIDPDCKNLCVKMLDWFEYHGHMCIAFEMLGQSVFDFLKDNNYQPYPLEQVRHISYQLIHSVLFLHDNKLTHTDLKPENILFVDSDYEVVSVYNTSKKKHDLRRVKRSDVRLIDFGSATFDHEHHSTIVSTRHYRAPEVILELGWSQPCDVWSIGCIMFELHLGITLFQTHDNREHLAMMERILGPIPYRMARKTRTKYFYHGKLDWDEKSSAGRYVRENCKPLLRYLQTNSEELRQLFELIGRMLEYEPSQRITLREALQHPFFSKLPHNQRLGNDRARCNGESSASRERSHSLSR-