Monarch geneset OGS2.0

DPOGS203506
Transcript: DPOGS203506-TA (3678 bp)
Protein: DPOGS203506-PA (1225 aa)
Genomic position: DPSCF300055 - 607273-611891
RNAseq coverage: 64x (Rank: top 67%)
Annotation
Heliconius      HMEL006120        0.0    65.50%
Bombyx          BGIBMGA008562-TA  0.0    54.39%
Drosophila      CG15894-PB        8e-09  27.17%
EBI UniRef50    UniRef50_E2BY48   3e-29  40.33%  Putative uncharacterized protein n=3 Tax=Formicidae RepID=E2BY48_HARSA
NCBI RefSeq     XP_968049.2       2e-50  37.26%  PREDICTED: similar to rac serine/threonine kinase [Tribolium castaneum]
NCBI nr blastp  gi|189236031      5e-49  37.26%  PREDICTED: similar to rac serine/threonine kinase [Tribolium castaneum]
NCBI nr blastx  gi|194896240      4e-70  30.56%  GG19585 [Drosophila erecta]
Group
KEGG pathway: tca:656423  7e-50
 K04456 (AKT) maps ->
    Prostate cancer
    Fc epsilon RI signaling pathway
    Toll-like receptor signaling pathway
    MAPK signaling pathway
    Fc gamma R-mediated phagocytosis
    Glioma
    B cell receptor signaling pathway
    Melanoma
    Pathways in cancer
    Chemokine signaling pathway
    Adipocytokine signaling pathway
    Endometrial cancer
    Chagas disease
    Insulin signaling pathway
    Neurotrophin signaling pathway
    T cell receptor signaling pathway
    Focal adhesion
    ErbB signaling pathway
    Colorectal cancer
    mTOR signaling pathway
    Tight junction
    Progesterone-mediated oocyte maturation
    Apoptosis
    Renal cell carcinoma
    Small cell lung cancer
    Pancreatic cancer
    Acute myeloid leukemia
    Non-small cell lung cancer
    Jak-STAT signaling pathway
    Chronic myeloid leukemia
    VEGF signaling pathway
Orthology group: MCL24794 (Lepidoptera specific)

Nucleotide sequence:

>DPOGS203506-TA
ATGGCAGCACTGGCGACGGAGCTCGGCGTGCCGTCGACGCGCTTGCGCGCTGCAGCGCTCGATGTCGCGTCTCAGAAGGGAACTGCGGGGTCAGGGGTCGAGGGGCGGCCGCCTGCCGACTGCCGAGCGAGTCGTCCTATCGGCGGTTGCGGCACGCACCGCACGGCGAGATTGCCAGCTGCCTGCCTCATAACCCTGGGAGCGTATGATTGTGTCATCCACTTTCCCTCATCCACCACCGTCCTGGTTAATGTATCTGACGTCAGGAATAAATGCTTAGCAGAAGACCTCAATCAAGGGATGAACAGAGGTGAACTTACATTAAGTACACCCGATGTGAGAGAGTGCAGGAAGCAGGCGGCAGGGGCAGAGGCAGGCGGGCGCATGGAGGCTCGCGAGCGCGAGCGCCGGGCGCGCCGCGGGCCCGCGCCCATCGCCTCCTTCGACCACGACGCCTCCGATGAATCGAGTCCAGAAAATAAACAAGCGATACGATTGCCTGAAATCCGCGAAGATCCTGCAAGTTCAAGTTGCAGCGTTGCACAGTCCACAAGCACAGGAGAAGGCCTGGCCCCACCGGAACAGCGCGCCCGCTCCCTGTCATCACCAGCCGTTCATACCGCGACCGTAGTCACACCCGTAGCCTTGCAAACACCATCTGCTCCACGCATCGATATTTCTCGTGCTTCCAGTTCCAGTCATCACGACTCTCGGGACAGCTCACCGGAGCTTGCATTGTTTGCCGGAGGTGGTAGCAGTGAGGAAACAAGAGAACGTCTGGAATTAGGTTTTCGTGAAGACGGCGCCCTGGACTTGCGGTCGTCTACAGAAGAACTGGCATTCCTGGAGGGTGCTCCGGAAGCTGCAGAAGTGCGTCCACCACCTGTGGCGCAACCATCGCGCAGGCACTCGCGTAAAGATAGTCAAAGTTCTGAGGCCGCGCTTCTTGCTGTGTCCGGTCGCACTAGTCGATTGTCCAGCGTAGGTTCTCAATGCTCTGCTCATTCTGCAATTTCGGCTTTTAGTCAAATAAGTCGCGTGTCACGACTTTCTGTTGTATCAGGGACATCTCGATCACCTTCTCCGCATAAAATGCTTCTGGAAACATCATTTTGTGGTCCCAAGCCCATAGAAACGGATCCCGAAATATGCGCTGCTGCAGTGGAGGAACGGCTATTAGAAATTGCTAAATTAACAAATGAAGCCGGAGCAGCCTCATCCTCTGATCCACAACCTTTGCCAATGCCAACTATTCCTATTGCTATAGATGCACGCGATCGAAGAGAAGTTCGTACGGAAGTTACATTAGAAAATACTAGACCAGCTAGTGCACCAAAACTTACTTCACCCAGTGCTCCACCCACTACACCAGTACCAGATCTCGCCGTTTCAACTAGTTCCTTAGATGATGACAAGCGTCTAAAAGAGACAAGAAATCGAGCTAAAGTAGAAGAACACCGGGCTAGGTCAAGAGACAGACATGAGACGAACAAGCCAGAAGTCTATCGAGCGGGAAACAGATCTAAGGATATCATTAGAATCAAACTTAAACCAGACAATGAATACGACGACGAAGAGGAGGGTGAGAGTGAAGCGACTTTAGTCAGCAGTGAACCAGCTAAAAAACCAATAACCCTTGAATTAAATGATCAGTGCTCTAAACCAACTAAACCTGTAAGTCCATTAGTAACAACTCGTCGCCAACGTGATAGTAGAACGCCATCTCCAAGTGGCGTTCCCGTTTCGAGAAAGTCGTCGTTTTGTTCGCTTTTTAAATCGCGCGAGACAATTGCTTCACCAGATTCCCCTTCAGACGTATTTCGTCGTAAAAAAAGTTTAAATGAAGGTCGATCAAGAAGTAAAAGCCGTGACCGTACGACTACACCAACTTCGGCTGGAAAAATAAAAGGGTCTGTTTTATCTTTATTTAAAACACCAAGGCGTAGCGGGGCATCACCATCTCCAAGTTCACGTGATGTGTCTCCAGTTGTTCAACAGCA
GCGTCAATTTCCCCAAACCCCTCATGACAAACAACGTGGCGAAAAATTGAAATACTATGAAGATGCAAAGGATGGCATAATTCATATTCCCCTTCGCACACCCCCTGATGAAATTGAGCCCAAAAAAGGTAAAGAAGGTAGCGATGATAAAATAACGGAAAAGCCACGCCAAGTTATCCGCCCCGCTTCAGCGCCACAGCCACGAGCTCTCCCGGACCGCTCATTGGTCATGTCACCCGTGCCTTCACCCAAACCCACTCAAAGGACGGTTCTTCCAGATGGAAGCATTATAATTCCATTACATTCACCAACAGATAAAACTGCTAAAGTTATACTTCCGTTCGAAACACAAGTTAAATCCGAATCTGACATCAAATATCCCGAACAAATAGATCTGAATCATAAAGAACTTAACTTATATAATAATTTAAAACATAAAGAACTCGAATATGCAACTGAAAGAATCGATTCGATTCCTAACGAGCCCGACGGTTCACAGTTTATCTCTTCGCCACCGCCAGATATAGTTCCAGAAACTTACATCGGAGAGCGACCTAGACGTAAAGAACGAATAGTTTTCACGACACATGTGGGCAGTAAAGAACATGTATTTAGTACACAGTTTAGTATAACAAAAACACCAAGCGTAACCAGTGAAATATCAGAATCAATACAGAGTGTTCCTGAATTTGAAGAAGTCAAACAGAACGAAACTTCCCCTCAAGAAACAGATTGGAAAAGTAATGACGTAATAGATAACGACCAAAAGTATCATTCACAAAGAGAATCTTTTTGTGAAACAGGAGAAGAGAACGTGACCCGAGAATTGAGCAGAGAAGTGAGTAGAGAAGTGAGCAGAGAAGTGAGTAGAGAAGTGAGTCCTACCCCAGATACTGGGAGGGATTCTTCAGAATCCGAGACCAGTTTAGAAATAGCAGCTACACACGGTGGAAGCGAAGCGGAGAGACGAGGACTTGTAGTTCAGGAATCATTTGAAGAGTTACCGTACGTTCCTACTACGCTACCCCTAGAGCGTTCACTAGCGCTCCCTATGGTGCCGGTGCGAGAGCGAGGTGGAGTGCACGTGGCCGGAGTACAGCGGCCGCGCGCAACAAAGAGTGCGGGTCGCCAACCTGGAGCTCTGACATCTCCCGCGCCCCTGGTCGCGCCTGCTGCCGTATCCCCCGCCGCCGGAGACTCTCCCGCTGACCGTCTCTACATCAAACTTCCGCGGCGCGCACGCACGGTCTCCACAGCCTCCGCTGCTCCTCCACCGCCACCTCGCAACCTTCGCACTCGCTCCCGAAGCGGCGGCGATGCGAGCTCTATGGAAACGCGGAGCAAAACGGAGTGGATCGATTTCTCCGAGGTTCCTGAGCGTCGAAAGCAACCGAAGCGCATTCAGACACTGCCAGCGAGCGCTCGCGACACCGTGGTGTTTAGTTACGTGCCCCCCGAGCGCTGCCGCTGCGACTGTCACGCTCACGACACTGCTGACGATGAGTTGCCGCTTCTACAGGACGCCAGCCCAGCACGCGCTTCCAGTGCCGCCTCCCTGGATGACCACGACCGCCACGAGCCCTTCATCGCCGATCTCGACCTTCGTCATAGCACCTCGGACTCGGTACAGTACACTTATCATTTCAGAACCGACTCATTTCTCGTTTACCCCGAGTAG

Protein sequence:

>DPOGS203506-PA
MAALATELGVPSTRLRAAALDVASQKGTAGSGVEGRPPADCRASRPIGGCGTHRTARLPAACLITLGAYDCVIHFPSSTTVLVNVSDVRNKCLAEDLNQGMNRGELTLSTPDVRECRKQAAGAEAGGRMEARERERRARRGPAPIASFDHDASDESSPENKQAIRLPEIREDPASSSCSVAQSTSTGEGLAPPEQRARSLSSPAVHTATVVTPVALQTPSAPRIDISRASSSSHHDSRDSSPELALFAGGGSSEETRERLELGFREDGALDLRSSTEELAFLEGAPEAAEVRPPPVAQPSRRHSRKDSQSSEAALLAVSGRTSRLSSVGSQCSAHSAISAFSQISRVSRLSVVSGTSRSPSPHKMLLETSFCGPKPIETDPEICAAAVEERLLEIAKLTNEAGAASSSDPQPLPMPTIPIAIDARDRREVRTEVTLENTRPASAPKLTSPSAPPTTPVPDLAVSTSSLDDDKRLKETRNRAKVEEHRARSRDRHETNKPEVYRAGNRSKDIIRIKLKPDNEYDDEEEGESEATLVSSEPAKKPITLELNDQCSKPTKPVSPLVTTRRQRDSRTPSPSGVPVSRKSSFCSLFKSRETIASPDSPSDVFRRKKSLNEGRSRSKSRDRTTTPTSAGKIKGSVLSLFKTPRRSGASPSPSSRDVSPVVQQQRQFPQTPHDKQRGEKLKYYEDAKDGIIHIPLRTPPDEIEPKKGKEGSDDKITEKPRQVIRPASAPQPRALPDRSLVMSPVPSPKPTQRTVLPDGSIIIPLHSPTDKTAKVILPFETQVKSESDIKYPEQIDLNHKELNLYNNLKHKELEYATERIDSIPNEPDGSQFISSPPPDIVPETYIGERPRRKERIVFTTHVGSKEHVFSTQFSITKTPSVTSEISESIQSVPEFEEVKQNETSPQETDWKSNDVIDNDQKYHSQRESFCETGEENVTRELSREVSREVSREVSREVSPTPDTGRDSSESETSLEIAATHGGSEAERRGLVVQESFEELPYVPTTLPLERSLALPMVPVRERGGVHVAGVQRPRATKSAGRQPGALTSPAPLVAPAAVSPAAGDSPADRLYIKLPRRARTVSTASAAPPPPPRNLRTRSRSGGDASSMETRSKTEWIDFSEVPERRKQPKRIQTLPASARDTVVFSYVPPERCRCDCHAHDTADDELPLLQDASPARASSAASLDDHDRHEPFIADLDLRHSTSDSVQYTYHFRTDSFLVYPE-
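
The transcript and protein entries in this record can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code and that the transcript is a complete coding sequence ending in a stop codon. The names `translate` and `lengths_consistent` are illustrative helpers, not part of any database tooling. The stop codon is rendered as "-" to match the trailing character of the protein sequence above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper)."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript is exactly the protein's codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First seven codons and last four codons of DPOGS203506-TA, copied from the record.
print(translate("ATGGCAGCACTGGCGACGGAG"))  # MAALATE
print(translate("TACCCCGAGTAG"))           # YPE-
print(lengths_consistent(3678, 1225))      # True
```

Passing the full nucleotide sequence (with its line break removed) to `translate` should reproduce the protein sequence listed above, provided the stated assumptions hold.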