Monarch geneset OGS2.0

DPOGS206674
Transcript        DPOGS206674-TA  1656 bp
Protein           DPOGS206674-PA  551 aa
Genomic position  DPSCF300048 + 796191-800810
RNAseq coverage   377x (Rank: top 32%)
Annotation
Heliconius        HMEL012313        1e-110  72.19%
Bombyx            BGIBMGA008506-TA  0.0     79.72%
Drosophila        mbt-PA            6e-134  77.40%
EBI UniRef50      UniRef50_E2C8F8   2e-132  77.24%  Serine/threonine-protein kinase PAK mbt n=4 Tax=Formicidae RepID=E2C8F8_HARSA
NCBI RefSeq       XP_969620.2       0.0     64.00%  PREDICTED: similar to mushroom bodies tiny CG18582-PA [Tribolium castaneum]
NCBI nr blastp    gi|189236382      0.0     64.00%  PREDICTED: similar to mushroom bodies tiny CG18582-PA [Tribolium castaneum]
NCBI nr blastx    gi|193643513      0.0     59.90%  PREDICTED: serine/threonine-protein kinase PAK 7-like [Acyrthosiphon pisum]
Group
Gene Ontology     GO:0016772  1.3e-84  transferase activity, transferring phosphorus-containing groups
                  GO:0005524  1.7e-84  ATP binding
                  GO:0004674  1.7e-84  protein serine/threonine kinase activity
                  GO:0006468  1.7e-84  protein phosphorylation
                  GO:0004672  6.6e-66  protein kinase activity
                  GO:0004713  6.5e-18  protein tyrosine kinase activity
                  GO:0005515  2.9e-12  protein binding
KEGG pathway      cfa:484513  3e-136
                  K05734 (PAK4) maps->
                      Axon guidance
                      Regulation of actin cytoskeleton
                      T cell receptor signaling pathway
                      Focal adhesion
                      ErbB signaling pathway
                      Renal cell carcinoma
InterPro domain   [269-548]  IPR011009  1.3e-84  Protein kinase-like domain
                  [283-534]  IPR002290  1.7e-84  Serine/threonine-protein kinase domain
                  [287-534]  IPR017442  6.6e-66  Serine/threonine-protein kinase-like domain
                  [283-534]  IPR020635  6.5e-18  Tyrosine-protein kinase, catalytic domain
                  [11-46]    IPR000095  2.9e-12  PAK-box/P21-Rho-binding
Orthology group   MCL11672  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206674-TA
ATGTTTTCTAAAAAAAAGAAGAAGCCTCTGATTTCACCACCAAGTAACTTTGAACACAGAGTCCACACCGGCTTTGATAAGAGTGAGGGAAAATTTGTGGGTCTGCCTCTTCAATGGGCTTCTCTTGTTGGCAATAACCAGATACTGAAGTCAACAAATCGCCCTTTACCTTTAGTCGATCCCTCAGAAATAACACCTACCGAAATATTAGATTTAAAAACTATAGTGCGTGGAGACCATAGAGCCTCTTCAGTGGCCACTGTTTCTAGACCAATATCAACTAATCCTATGACCCAGCCGATACAAAATGGCGTAGTTCTTCCAAAAACATCAAATGTTGCCAGATCAAATTCGCTAAGAAGCACAAGTCCGCCTAGAATAAGGCGAGATTTTCGCAACCACCCAAATATACCACCATCTGTACCTGAAGAGTCCCCACAAGTTCCCGCACCCCCACAATACCCAAGTTTGAGACGGGAACAAAGCAACTTCTCCATAAACAAGCCCATATCAGATTCACCCGTTGAATGGTCGCACTCACACGAAGCGACTGTAAGGAATCTTAGTGAAATCAATGACAATCAACAGGCGGCGATGACATATCAGAATATTCAGAACGCTAATGTGCCGTACCAACATAATCACCCACCACTACCTATTCCAGGACCTCACACAGTTAATGGAAACAATAATCTAGGGGACGGCTATCCACAATCACTTCGACCACAGCAACCTGGCCCCCCACATATACCATTAGGAGAATCAAAGCATGAACTGAGATTGACTCATGAACAGTTCCGAGCTGCTCTTCGCCTGGTGGTGAGTGAGGGTGATCCTCGTGAGACGCTGACGGGTTTCACTAAGATCGGTGAGGGCAGCACCGGGGTGGTGTGCGCCGCCACCGACACTCGCACCAGGAGACGGGTCGCTGTCAAGATGATGAACCTGCTCAAACAACACCGCAGGGAACTGTTGTTTAATGAAGTGGTAATAATGCGCGACTACCCTCACCCTAACATAGTAGAGATGCATGCTTCTTACCTGGTGGGTGACGTGCTATGGGTCGTCATGGAATACATGGCGGGCGCGTCTCTAACACAGATAGTTACACGCTCTAGAATGGATCCAGAACAGATTGCAACTGTGTGCAAACAGTGTCTTAAGGCACTAGCGTTTCTACACAGCCAAGGCGTTATACACCGAGATATTAAATCAGATTCAATACTGCTGACGTCGGACGGTAGAGTTAAACTATCCGATTTCGGTTTCTGTGCTCAAGTATCTCAGGAATTACCGAAACGAAAATCTCTAGTGGGGACGCCATATTGGATGTCCCCAGAAGTGGTATCCAGACTTCCGTATGGACCGGAAGTTGATATTTGGTCTTTGGGTATCATGATCATAGAGATGGTTGATGGCGAACCGCCTTTCTTCAACGAACCACCACTACAGGCTATGCGTCGTATCCGCGACATGCCGCCTCCTCGGCCCCGCGGTCTGTCGCGCTGCCCGGCCGACCTGACATCTCTCATAGAGTCAGCGTTAGTGCGTGACCCTTCCGCCCGCCACTCCGCTGCTCATCTCCTGCATCACCCGTTCCTGCGACGAGCGGGACCACCCGCCATCCTCGTGCCGCTAATGCCGCACGGCAACTGA
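
The lengths reported for this record (1656 bp transcript, 551 aa protein, genomic span 796191-800810) can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, the transcript is assumed to be a pure coding sequence ending in one stop codon, and the genomic coordinates are assumed to be 1-based and inclusive.

```python
def cds_matches_protein(transcript_bp: int, protein_aa: int, stop_codon: bool = True) -> bool:
    """Return True if a coding sequence of transcript_bp bases encodes protein_aa residues.

    Assumption: the transcript holds only coding bases, optionally plus one stop codon.
    """
    codons = protein_aa + (1 if stop_codon else 0)
    return transcript_bp == 3 * codons


def genomic_span(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values taken from the record header.
print(cds_matches_protein(1656, 551))  # True: 3 * (551 + 1) = 1656
print(genomic_span(796191, 800810))    # 4620
```

Under these assumptions the genomic span (4620 bp) is longer than the transcript (1656 bp), which would be consistent with a gene model containing introns, although the record itself does not list exon boundaries.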

Protein sequence:

>DPOGS206674-PA
MFSKKKKKPLISPPSNFEHRVHTGFDKSEGKFVGLPLQWASLVGNNQILKSTNRPLPLVDPSEITPTEILDLKTIVRGDHRASSVATVSRPISTNPMTQPIQNGVVLPKTSNVARSNSLRSTSPPRIRRDFRNHPNIPPSVPEESPQVPAPPQYPSLRREQSNFSINKPISDSPVEWSHSHEATVRNLSEINDNQQAAMTYQNIQNANVPYQHNHPPLPIPGPHTVNGNNNLGDGYPQSLRPQQPGPPHIPLGESKHELRLTHEQFRAALRLVVSEGDPRETLTGFTKIGEGSTGVVCAATDTRTRRRVAVKMMNLLKQHRRELLFNEVVIMRDYPHPNIVEMHASYLVGDVLWVVMEYMAGASLTQIVTRSRMDPEQIATVCKQCLKALAFLHSQGVIHRDIKSDSILLTSDGRVKLSDFGFCAQVSQELPKRKSLVGTPYWMSPEVVSRLPYGPEVDIWSLGIMIIEMVDGEPPFFNEPPLQAMRRIRDMPPPRPRGLSRCPADLTSLIESALVRDPSARHSAAHLLHHPFLRRAGPPAILVPLMPHGN-
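
The protein record appears to be a direct translation of the transcript, with the trailing "-" standing for the stop codon. A minimal sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1) and a hypothetical helper named `translate`; the record does not state how the translation was produced, so this is not necessarily the pipeline's own method.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol (this record uses "-");
    codons with unexpected characters become "X"; trailing partial codons are ignored.
    """
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Leading and trailing codons as read from the transcript record.
print(translate("ATG" "TTT" "TCT" "AAA" "AAA" "AAG" "AAG" "AAG" "CCT" "CTG"))  # MFSKKKKKPL
print(translate("CAC" "GGC" "AAC" "TGA"))                                      # HGN-
```

Passing the full DPOGS206674-TA sequence to `translate` should, under the same assumptions, yield a string that can be compared directly with the DPOGS206674-PA record.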