Monarch geneset OGS2.0

DPOGS200860
Transcript: DPOGS200860-TA (4080 bp)
Protein: DPOGS200860-PA (1359 aa)
Genomic position: DPSCF300071 + 273367-292525
RNAseq coverage: 98x (Rank: top 61%)
Annotation
Heliconius: HMEL012645  0.0  74.94%
Bombyx: BGIBMGA009848-TA  0.0  58.83%
Drosophila: ksr-PA  9e-108  56.94%
EBI UniRef50: UniRef50_UPI0002246AEE  0.0  45.74%  UPI0002246AEE related cluster n=1 Tax=unknown RepID=UPI0002246AEE
NCBI RefSeq: XP_001605076.1  0.0  46.79%  PREDICTED: similar to ENSANGP00000009647 [Nasonia vitripennis]
NCBI nr blastp: gi|156548452  0.0  46.79%  PREDICTED: kinase suppressor of Ras 1-like isoform 1 [Nasonia vitripennis]
NCBI nr blastx: gi|156548452  0.0  47.52%  PREDICTED: kinase suppressor of Ras 1-like isoform 1 [Nasonia vitripennis]
Group
Gene Ontology:
    GO:0016772  3.5e-50  transferase activity, transferring phosphorus-containing groups
    GO:0004672  5.1e-45  protein kinase activity
    GO:0006468  5.1e-45  protein phosphorylation
    GO:0005524  2e-21  ATP binding
    GO:0004674  2e-21  protein serine/threonine kinase activity
    GO:0004713  2e-13  protein tyrosine kinase activity
    GO:0035556  2.3e-11  intracellular signal transduction
KEGG pathway: spu:575613  4e-52
 K04365 (BRAF) maps->
    Prostate cancer
    Regulation of actin cytoskeleton
    MAPK signaling pathway
    Glioma
    Melanoma
    Pathways in cancer
    Chemokine signaling pathway
    Endometrial cancer
    Natural killer cell mediated cytotoxicity
    Insulin signaling pathway
    Neurotrophin signaling pathway
    Long-term depression
    Focal adhesion
    ErbB signaling pathway
    Colorectal cancer
    Thyroid cancer
    mTOR signaling pathway
    Progesterone-mediated oocyte maturation
    Long-term potentiation
    Renal cell carcinoma
    Pancreatic cancer
    Acute myeloid leukemia
    Vascular smooth muscle contraction
    Bladder cancer
    Non-small cell lung cancer
    Chronic myeloid leukemia
InterPro domain:
    [1047-1356]  IPR011009  3.5e-50  Protein kinase-like domain
    [1076-1333]  IPR001245  5.1e-45  Serine-threonine/tyrosine-protein kinase
    [1075-1339]  IPR002290  2e-21  Serine/threonine-protein kinase domain
    [1075-1333]  IPR020635  2e-13  Tyrosine-protein kinase, catalytic domain
    [386-434]  IPR002219  2.3e-11  Protein kinase C-like, phorbol ester/diacylglycerol binding
Orthology group: MCL11476  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS200860-TA
ATGGATGCAGATAGCGAAAACGAAAAAAAAAGAATCCGTGATGCAATTCAGATGATAGAGACCATTCAGTCGATGATAGACGTTTCAGCAGACAGGCTCGAGGGCTTGAGAACACAATGTTCAACGAGTGCGGAGCTGACGCAGCAGGAGATCCGGACACTGGAGGGAAAGCTGGTGAAGCATTTCTCGCGGCAGCTGGTGATCAAGGCCCAGTTCGTGGAGGAGATCCAACGAGAGCTTGGACATGTGCCGAGTTTGAGACAGTGGCTGAGAGTTGTGGGGCTCAGTGTTGATGCTATAGAGTCGGTGTTATCTCGTGTGTCATCTCTGGAACTGCTCCGTGATCGCTCAGAACATGAGCTGCGAGCGATGCTGTGTGGTGCCAGGGACGAGGAAGTCCGCAGGCTCTGTAGAGCAATGCAAAGACTCAGGACATATACAGAGGCTTTGGCGCGTGGCGAAGGTTCCGCGGAACTGCCCCTATATTGGGACTCGTGGGAGAGACACGTGCGAGGCTCACCCAGGGCCAGGGACGACAGGAACAACGATAATAAGAAGGGCGGTAAATCTCCCACGACGCCCATCAACAAACGCAAGCAGAACTCTCACCTGCCCGCACCACCAGCTCAACCCTCGTCGCTCACCAAGTCGAGGTCACATGAGTCCCAGCTGTCTGTGAAGCCGGACACATCAGACCACAGTGATAACAGTCAGTCTTCGGCCGCCGTGTGTGAGGTGTCGGAGTCGCCCCCGGGCTCCCCTAGAGACCCCGACCCCGACCCACCGCCTCGATACTACACCAACACGATTCCCCCTCCCGCGCCCCGCTCGCCCCGCACGCCCACCGTGAGCGGCTGTATGGCCCACGACATCGCCCACCGGTTCACCAAGACCTTCAACATGATAGCCACCTGCGACTACTGTGACAAGCAGATGCTCTTCGGCAGCGGCTTGAAGTGTAAAGAGTGCAAGTTCAAATGTCATAGTGATAACAGTCAGTCTTCGGCCGCCGTGTGTGAGGTGTCGGAGTCGCCCCCGGGCTCCCCTAGAGACCCCGACCCCGACCCACCGCCTCGATACTACACCAACACGATTCCCCCTCCCGCGCCCCGCTCGCCCCGCACGCCCACCGTGAGCGGCTGTATGGCCCACGACATCGCCCACCGGTTCACCAAGACCTTCAACATGATAGCCACCTGCGACTACTGTGACAAGCAGATGCTCTTCGGCAGCGGCTTGAAGTGTAAAGAGTGCAAGTTCAAATGTCATAGGGATTGTGAGAGTAAAGTTCCTCCATCGTGCGGTCTCCCTCCAGAGTTCGTGACTGCCTTCAAAGAAAAATTTCACAAAGACGGTGGTCTGTACCTGACGTCCGTGTCGTCAGGCCGCACGCTGATACCCTCACTGACTCCTCTCAGACGCCGCCCGCCGCCCACCGCCCATCATCATACGGTAAGGCTGTCTCACACTATGATAAATGAATCTGTATATGATATGACCTCTATATCTATTGTATGCTTAGGGCTCGCGTCTTCCACATCGGCGAGACAGGGTTTCGGTAATGCCTTAGAAGGTAACCCGGATTCATCATCGAACACGTCCTCGTGTAACAGCTCCACCCCGTCCTCCCCCGCGCTGGCCGCGCCCGCCCCCGCGCCCGCCACGCCGCAGGCACACCACCACCACCACCATATAACACTCAAACAGCAGTTCCACTTCCCTGAGATGTCGACACGGTCGGTGGGGACGCCGGTGGAGCGACACGACTCACGACACGACCAGCGACACGACTCCCGACACGACGCTTCCGACCGGTGGCCGAGGCAGAACAGTTACACTGTTGGCATCAGGACTCGAGCAGTGACACGAAGGGTGATGTGTCCAAGGAAGGCTATATGTCAGAGAGAGTTCGGTGCCAATCCCGAGGGAAAGCTGGTGAAGCATTTCTCGCGGCAGCTGGTGATCAAGGCCCAGTTCGTGGAGGAGATCCAACGAGAGCT
CGGACATGTGCCGAGTTTGAGACAGTGGCTGAGAGTTGTGGGGCTCAGTGTTGATGCTATAGAGTCGGTGTTATCTCGTGTGTCATCTCTGGAACTGCTCCGTGATCGTTCAGAACATGAGCTGCGAGCGATGCTGTGTGGTGCCAGGGACGAGGAAGTCCGCAGGCTCTGTAGAGCAATGCAAAGACTCAGGACATATACAGAGGCTTTGGCGCGTGGCGAAGGTTCCGCGGAACTGCCCCTATATTGGGACTCGTGGGAGAGACACGTGCGAGGCTCACCCAGGGCCAGGGACGACAGGAACAACGATAATAAGAAGGGCGGTAAATCTCCCACGACGCCCATCAACAAACGCAAGCAGAACTCTCACCTGCCCGCACCACCAGCTCAACCCTCGTCGCTCACCAAGTCGAGGTCACATGAGTCCCAGCTGTCTGTGAAGCCGGACACATCAGACCACAGTGATAACAGTCAGTCTTCGGCCGCCGTGTGTGAGGTGTCGGAGTCGCCCCCGGGCTCCCCTAGAGACCCCGACCCCGACCCACCGCCTCGATACTACACCAACACGATTCCCCCTCCCGCGCCCCGCTCGCCCCGCACGCCCACCGTGAGCGGCTGTATGGCCCACGACATCGCCCACCGGTTCACCAAGACCTTCAACATGATAGCCACCTGCGACTACTGTGACAAGCAGATGCTCTTCGGCAGCGGCTTGAAGTGTAAAGAGTGCAAGTTCAAATGTCATAGGGATTGTGAGAGTAAGGTCCCTCCATCGTGCGGTCTCCCTCCAGAGTTCGTGACTGCCTTCAAAGAAAAATTTCACAAAGACGGTGGTCTGTACCTGACGTCCGTGTCGTCAGGCCGCACGCTGATACCCTCACTGACTCCTCTCAGACGCCGCCCGCCGCCCACCGCCCATCATCATACGCTCCACCCCGTCCTCCCCCGCGCTGGCCGCCCCCGCCCCNNNNNNNNNNNNGCCCCCGCCCCCGCGCCCGCCACGCCGCAGGCACACCACCACCACCACCATATAACACTCAAACAGCAGTTCCACTTCCCTGTACTTGTGTGTTCAGAGATGTCGACACGGTCGGTGGGGACGCCGGTGGAGCGACACGACTCACGACACGACCAGCGACACGACTCCCGACACGACGCTTCCGACCGGTGGCCGAGGCAGAACAGTTTGTCCATGAAGGAGTGGGACATACCTTACGACGAGCTCAAGCTGTTCGAGGTGATCGGAACAGGTCGCTTCGGGACGGTCTACAGGGGCAGCTGGCACGGAGCGGTCGCCGTGAAGCTGCTGCACGTGAACGCTCTCAGCGACCACACCGCGCCTCTGGACACCTTCAAGCACGAGGTGGCGACCTTCAGGAAGACTCGCCACGAGAACCTGGTGCTGTTCATGGGCGCGTGTATGAAGCCACCTCGCCTGGCCATCGTGACGTCACTGTGTAAGGGCATGACGCTGTACACACACATCCACCTCAGGAAGGACAAGTTCACCGCCAACAAGAGCGTCATCGTCGCACAGCAGATATCACAGGGCATGGGTTACTTGCACGCTCGGGGCATCGTGCACAAGGATCTGAAGACGAAGAATATATTCTTGGAGAATGGAAAAGTCGTCATCACAGACTTCGGACTGTTCAGCGTCACCAAGCTGTGTTTTGGCAACAACGCCCGTGGACACACACACACACACACACACACACACACACACACACACACACACACACACACACACACACACACACACACATATCATAATACACGCACGTTCACATATATATTCACTGTATGGTACGAGCTGCTGTGCGGTGAATATCCCTTCAAGGGCCAGCCTCCTGAGGCGGTCATCTGGCAGGTCGGCAAGGGAGTGAAGCAGTCCTTGAACAACATGCAAGCCTCTAGAGATATCAAAGACATCCTTATGCTCTGCTGGGCTTACCGATCAAGTGAGCGGCCAGACTTTCCGCACTTGTTGTCAACTCTGG
AGAAGCTGCCAAGGAAGAGACTGGCTCGCTCGCCCTCACATCCTGTTCATCTGTCACGATCAGCTGACTCAGTGTTCTGA

Protein sequence:

>DPOGS200860-PA
MDADSENEKKRIRDAIQMIETIQSMIDVSADRLEGLRTQCSTSAELTQQEIRTLEGKLVKHFSRQLVIKAQFVEEIQRELGHVPSLRQWLRVVGLSVDAIESVLSRVSSLELLRDRSEHELRAMLCGARDEEVRRLCRAMQRLRTYTEALARGEGSAELPLYWDSWERHVRGSPRARDDRNNDNKKGGKSPTTPINKRKQNSHLPAPPAQPSSLTKSRSHESQLSVKPDTSDHSDNSQSSAAVCEVSESPPGSPRDPDPDPPPRYYTNTIPPPAPRSPRTPTVSGCMAHDIAHRFTKTFNMIATCDYCDKQMLFGSGLKCKECKFKCHSDNSQSSAAVCEVSESPPGSPRDPDPDPPPRYYTNTIPPPAPRSPRTPTVSGCMAHDIAHRFTKTFNMIATCDYCDKQMLFGSGLKCKECKFKCHRDCESKVPPSCGLPPEFVTAFKEKFHKDGGLYLTSVSSGRTLIPSLTPLRRRPPPTAHHHTVRLSHTMINESVYDMTSISIVCLGLASSTSARQGFGNALEGNPDSSSNTSSCNSSTPSSPALAAPAPAPATPQAHHHHHHITLKQQFHFPEMSTRSVGTPVERHDSRHDQRHDSRHDASDRWPRQNSYTVGIRTRAVTRRVMCPRKAICQREFGANPEGKLVKHFSRQLVIKAQFVEEIQRELGHVPSLRQWLRVVGLSVDAIESVLSRVSSLELLRDRSEHELRAMLCGARDEEVRRLCRAMQRLRTYTEALARGEGSAELPLYWDSWERHVRGSPRARDDRNNDNKKGGKSPTTPINKRKQNSHLPAPPAQPSSLTKSRSHESQLSVKPDTSDHSDNSQSSAAVCEVSESPPGSPRDPDPDPPPRYYTNTIPPPAPRSPRTPTVSGCMAHDIAHRFTKTFNMIATCDYCDKQMLFGSGLKCKECKFKCHRDCESKVPPSCGLPPEFVTAFKEKFHKDGGLYLTSVSSGRTLIPSLTPLRRRPPPTAHHHTLHPVLPRAGRPRPXXXXAPAPAPATPQAHHHHHHITLKQQFHFPVLVCSEMSTRSVGTPVERHDSRHDQRHDSRHDASDRWPRQNSLSMKEWDIPYDELKLFEVIGTGRFGTVYRGSWHGAVAVKLLHVNALSDHTAPLDTFKHEVATFRKTRHENLVLFMGACMKPPRLAIVTSLCKGMTLYTHIHLRKDKFTANKSVIVAQQISQGMGYLHARGIVHKDLKTKNIFLENGKVVITDFGLFSVTKLCFGNNARGHTHTHTHTHTHTHTHTHTHTHTHTYHNTRTFTYIFTVWYELLCGEYPFKGQPPEAVIWQVGKGVKQSLNNMQASRDIKDILMLCWAYRSSERPDFPHLLSTLEKLPRKRLARSPSHPVHLSRSADSVF-
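
The listed transcript and protein lengths fit a coding sequence that ends in a stop codon: 4080 bp / 3 = 1360 codons, that is, 1359 residues plus one stop. The sketch below shows one way to check that relationship on FASTA records like the two above. The function names `read_fasta` and `lengths_consistent` are hypothetical helpers, not part of any MonarchBase tooling, and the toy records are made up for illustration. The check assumes the transcript is pure coding sequence with no UTRs.

```python
def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped sequence lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = []
        elif header is not None:
            records[header].append(line)
    return {name: "".join(parts) for name, parts in records.items()}


def lengths_consistent(cds, protein):
    """True if the CDS is a whole number of codons and encodes
    the protein plus exactly one stop codon."""
    residues = protein.rstrip("-*")  # drop a trailing stop marker, if present
    return len(cds) % 3 == 0 and len(cds) // 3 == len(residues) + 1


# Toy records (assumption: illustrative only, not the real sequences).
toy = """>toy-TA
ATGGATGCA
GATTGA
>toy-PA
MDAD-
"""
seqs = read_fasta(toy)
print(lengths_consistent(seqs["toy-TA"], seqs["toy-PA"]))
```

Running the same check on the full records requires pasting them in as text. The helper joins wrapped sequence lines, so line breaks inside the nucleotide record do not affect the length.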