Monarch geneset OGS2.0

DPOGS203453
Transcript:        DPOGS203453-TA  4407 bp
Protein:           DPOGS203453-PA  1468 aa
Genomic position:  DPSCF300242 + 180239-193540
RNAseq coverage:   401x (Rank: top 30%)
Annotation
Heliconius      HMEL015026        0.0  68.98%
Bombyx          BGIBMGA011102-TA  0.0  63.07%
Drosophila      Sos-PA            0.0  51.99%
EBI UniRef50    UniRef50_E0VPV8   0.0  54.24%  Histone H2A n=10 Tax=Eumetazoa RepID=E0VPV8_PEDHC
NCBI RefSeq     XP_002428152.1    0.0  54.24%  ras GTP exchange factor, son of sevenless, putative [Pediculus humanus corporis]
NCBI nr blastp  gi|242014971      0.0  54.24%  ras GTP exchange factor, son of sevenless, putative [Pediculus humanus corporis]
NCBI nr blastx  gi|242014971      0.0  49.26%  ras GTP exchange factor, son of sevenless, putative [Pediculus humanus corporis]
Group
Gene Ontology   GO:0007264  1.5e-261  small GTPase mediated signal transduction
                GO:0005622  1.5e-261  intracellular
                GO:0005085  1.5e-261  guanyl-nucleotide exchange factor activity
                GO:0035023  1.1e-35   regulation of Rho protein signal transduction
                GO:0005089  1.1e-35   Rho guanyl-nucleotide exchange factor activity
                GO:0003677  1.2e-34   DNA binding
                GO:0051056  1.6e-32   regulation of small GTPase mediated signal transduction
                GO:0005515  1.2e-19   protein binding
KEGG pathway    phu:Phum_PHUM366140  0.0
 K03099 (SOS) maps->
    Prostate cancer
    Regulation of actin cytoskeleton
    Fc epsilon RI signaling pathway
    MAPK signaling pathway
    Gap junction
    Dorso-ventral axis formation
    Glioma
    B cell receptor signaling pathway
    Pathways in cancer
    Chemokine signaling pathway
    Endometrial cancer
    Natural killer cell mediated cytotoxicity
    Insulin signaling pathway
    Neurotrophin signaling pathway
    T cell receptor signaling pathway
    Focal adhesion
    ErbB signaling pathway
    MAPK signaling pathway - fly
    GnRH signaling pathway
    Renal cell carcinoma
    Acute myeloid leukemia
    Non-small cell lung cancer
    Jak-STAT signaling pathway
    Chronic myeloid leukemia
InterPro domain  [27-1269]   IPR008937  1.5e-261  Ras guanine nucleotide exchange factor
                 [27-1269]   IPR015759  1.5e-261  Ras GTP exchange factor, son of sevenless
                 [573-1106]  IPR023578  1.9e-113  Ras guanine nucleotide exchange factor, domain
                 [838-1080]  IPR001895  4.8e-74   Guanine-nucleotide dissociation stimulator CDC25
                 [212-434]   IPR000219  1.1e-35   Dbl homology (DH) domain
                 [9-184]     IPR009072  1.2e-34   Histone-fold
                 [600-797]   IPR000651  1.6e-32   Ras-like guanine nucleotide exchange factor, N-terminal
                 [435-549]   IPR011993  1.2e-19   Pleckstrin homology-type
                 [445-555]   IPR001849  6e-07     Pleckstrin homology domain
Orthology group  MCL11388  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203453-TA
ATGCTGCCCACAGTGGAGGAGACTCGGGGTTACGACTTCCAGGACCCGGAGAATGCGGAGAAATGGAAGGGCTTCTTCATCAGCTCTCTCCGCAAGGTGCTGGAGTCGGTCCACCCCACGCTGACGGCGGACGAGGGCGCTCTGGAGTTTGTGGAGTCGCTGTGTCTTCGTCTGCTGGGCATGCTGTGTGCGCCGCCCGCGCCTCTCAGCGTGGCGGATGGAGAGGAGAGGGTGGCCCGCAGCTTCCCCACGCCACTGGACCGTTGGGCGCTCATCGAGGCCAGGGAGGCGGCCGGCGCCAGGAGACGGAAACTACTGCTGCCACTTGAAAGGCTGCATCTGTTGCTGCAGAAGGAGGTGTTGCTCTATAAGATCGACATGTCGGTGACAACCTTCATGACCGTCATTCTGGAGTTCATCAGCACCGACATCCTGCGGCTGGCGGGGAACTTCGTGAAGAAGATTTCCCAGAAGAGCGGCTACCAGGTCATCACCTGCAGTGACATCAAGACTGCCATGTGCGCTGATAAGGTCCTGATCGACATGTTCTACCAAGACTCGGAGCTGGCCAGCATCGCAGCTCTACCGGCGTTCAACACAAGCGAAAGGGGAAGGAGAGCGTCGCTGTCGTACGGGGATCTCGTGCGGGACCTGCTGGCTGACGAGCGGAACTTCCTTAGGGACCTCAACCTCATGATACGGGTGTTCAAGGAGGAGCTAGAGAAGATCGTCGATGACAATAAGGTGATATCTCTAATATTTGGCAACATAGTGGACATATACGAGCTGACGGTCACGCTGCTGGGCAACCTCGAGGACGCCATGGAGATGTCCCAGGACACGCTCACACCATACATCGGCAGCTGTTTTGAAGAACTGGCGGAGGTGGAGGAGTTCCGGGCGTTCGTCCGCTACGCCAACATCGTCACCAGGAGGGAGTCCCGGGACGCACTCGCCGCGCTCGTTGATGATCCACAGCTGGGCGAGCGCCTGGAGACGGCTGGCCACGGGTTCCGCCTGGCCGTCAAGTACTGCCTGCCGCGGCTGCTGCTGTCTCCGGTGGCGCACGTGTTCGTGTATCACTCGTACGTGCTGGCCATGCTGCCCCTGGCGCCCGCTAGCGACGACCGGGAGAGCTTCAAACAGGTCGAATGCAATCTACATCCAATAGAGAAATTACTGACAAGAGCCTTGGGGAACGGACCGCAGCTGGACGGCGCGATGAGGTCGGCGTCTCGAGCTCGCCGGAAGATGGCCATCGACAAGTGTAACGAGCTGGCCAGGCTGGTCGACAACTGGGACGCCAGGGACGTGCCGCAGTGCTGCAACGAGTTCATTAGAGAGGACACGCTCACCAAGCTGGGTCCGGGGAAGCGCGTCGCTGAGCGGAGAGCTTTTTTATTCGACGGACTCCTGTTGCTCTGCAAACCCGTCACCAGCCTGGTGACTGTGAACAGCGGCGTGACGGCGGGCCCGCCCCAGCTGAAGCTGAAGGAGAAGCTGCACATACGGAAACTGGACATAGTGGACCGCCCCGACGGAGAAGAGGGTCGCAATCTAATGGAGCTGTGTCCGCGCGTGGGCCCGCCCGTGGTGCTGGCGGCCTCGTCTCCCGCCGAGAAGAGATGCTGGATGAGCGACCTCGTCATACTCAACACCAAACCCATGCTGGACAGGAGTCTGGACAGTATTCTCCTGGACCTGGAGCGTCGTCACCCTCTCCGCCTGCCGAGCCCCTCCTTGTACCGGTTCGCGGAGCCCGACGGACCTCACAATATACTGCTGGAGCACGGACACGGTCCCGCGCCCCTCATCAAGGGCGCGACGCTGCTGAAGCTGGTGGAGCGCCTCACGTACCACGTGCACGCCGACCTCAACCTGGTGCGGACCTTCCTCACCACCTACCGCTCCTTCTGTTCACCCTCGGAGCTGCTCGCTTTACTCATCGAGCGCTTCGACATACCCGAACCGCACCTCGTGTACGACGCGCCTCGACCCGT
CCCAAGGATCAAGATCCCCAGGAAGGACATCAGTCCCAACGCTTCGCTGATGGACCTCGACCTCGACATCGGTAACATTATACTATACAGAGTAGACCTTGTAGAGACGTGGCTGAGAGCGGATTGCGTGGAGATCTCGGCCACCAGTGACGCGGAGAAGCTGAGCAAAAACACGGCCAGGGAGGACTGGAAGAGATACAGGAAGGAGTTCCAACAACCCGTGAAGTTCAGGGTGATAAACGTCCTCCGACACTGGGTGGATCAACACTTCTATGACTTCGAGCGCGAGCCGGAGCTGCTCGCCAAGCTGAAGAGCTTCCTGGAGGCCGTGGACGGCAAACCCATGAGGAAGTGGGTGCAGAGCGTGCTCAAGACCGTTCAGAGGAAGAGCGGCCTACAGTCGGACAACGAGTCCTTGTGTAGCGTGTCGTCGGGCGTGTCGTACGTGTTCGACAGACTGCCGCCGGCGCCGCTCAGGCACGTGGCGGAGCCCGACAGGCACGACTGGCATCCGCTGGCGCTGCACCCGCTGGAGGTGGCCCGCCAACTCACGCTCCTCGAGTTCCAGCTATACAGACAGGTGAAGCCGTCGGAGCTGGTGGGGGCGGCGTGGACCAAGAAAGACAAGGAGAAGTCCAGCCCCAACCTGTTCAGGATCAGCAAAAACACCACCAACTTCACTCGCTGGATAGAGAAGTGGATAGTGGAGAGTGAGAACGTGGAAGAGCGCGCGGGCGTGCTCAGCTGGTGTCTGGAGCTGGCCGTGGCGCTCAGCGACCTCAACAACTTCAACGGCGTGTTCGCTGTGGTCGCCGCCTGCGAGTCCGCCTCCGTCTACAGGCTCAAGTACACCTTCCAGATGCTGCCGCCCCGCCTCCTGCGCGCTCTAGACCAGTTCCGTGAGCTGAGCTCGGACCACTTCCGCCTGTACCAGGAGAGACTGCGGAGCATCAACCCGCCCTGCGTGCCCTTCGTGGGGGTCTACCTCACCAAGATACTACACATCGAGGAGGGGAATCCCGATTTCCTCTCCAACACCGAGCTCATAAACTTCTCGAAGCGTCGTATGGTGGCGGAGATAACCGGCGAGATCCAGCAGTACCAGAACCAGCCCTACTGTCTGACGCTGGAGCCCAGGACCAGGGCTTTCCTGGAGAACCTGGATCCGTTCCCCGGGATGGATGATAACGAAGTTACCAACTACCTGTATGGAAAGAGCAAGGAGATCGAACCCAAGGGAGCCGTCAAACAGACGCACAAGTTTCCCCGCCGCTACCCCGAGCTGTCTCTGAAGCCGGTGAAGGTGTCGCGCCGCAGACACGACACGTCCTCCACCACGCTCTCCTCCACCAACTCGCTCGTCAGTCTGGACGGAGCGTTCAGCGTGTCCCAGCTGTCGCCCACCAGCAGCGTCTGGGACGGAGCCAGCGTCGCCTCCCTGCCGCTCCACGCCCACGACCAGGGGAGCAGGGAGGAGCGCTCGGTGACCAGTCCGCGGCTGGAGAGAGCCAGCTCCAGCTCACTGGCGCAGCTCGGGCAGCTCAGTCTGCTGGACAAGCTGTTCGACAAGAGCAAGTCCGCCGCCACGCTGCACAGGAACAGCGGCTCCACCAAGGACGACCCGCCCTCGCCGCGCGACAACCACTCACCCAAGCGTCGGCCGGCGCTGTCTCCCCGCGGCGCGCTCGTGGAGGCGGACATGTACCAGAACAGAAGACGCACGGACACACAGGAGCGTGTGCCGCCGCTGCTGCCGCGCGGGGGGGCCGACCCCGAGGCGCCGCCCGTACTGCCGCGGAGACCGCCCTCGCCGCACACGGCGTCGACGGAGCCGCCCCCGGAGCCCCCCGACCGCCCGCTGCCGAGCCCCAAGCACCACGACCTGTTCAGACACGGGGAACTGGCCGCGTGCACGTTGATCAGAAACAGCTATAAGATAAATACATTGATCTGTTACAAACGACCAATAACCCCTGACCCGCAGGAGTCCCGCGCTGTCC
CCGGCCGCCCCGCCCGCGCCCAGCCCCATGATGGTGGACAGGTGAGACACACACCCTGTATACAGTGCGAGATATATAATGAACACTCCTTGGCCCGAGCGGGACCAGACCCTGTGACCTGGCGGCCTTCAGCTTCCCCTCGGCCACGTCTCCCGGACACATCCCCGGTGAGTGATCCTGTTACTACACACCCCTCACCCCCTCACACCTCACCCTTGATATTAACGGAGCATGTGTCGCCCCCAGCGGAGACCGTCCCCCCGCCCCTGCCGCCGCGGAGAAGAAGAGACTCCGCCGCGCCCCACACGCCGCACCCGGTTAGCTCCAAAACTCCCGCCGAAGCCGGCCGCGGCGGCCGTCGCCCCGCGGACCTAGGACGCCAACGGACACGCAAGCTGCCAAACTAA

Protein sequence:

>DPOGS203453-PA
MLPTVEETRGYDFQDPENAEKWKGFFISSLRKVLESVHPTLTADEGALEFVESLCLRLLGMLCAPPAPLSVADGEERVARSFPTPLDRWALIEAREAAGARRRKLLLPLERLHLLLQKEVLLYKIDMSVTTFMTVILEFISTDILRLAGNFVKKISQKSGYQVITCSDIKTAMCADKVLIDMFYQDSELASIAALPAFNTSERGRRASLSYGDLVRDLLADERNFLRDLNLMIRVFKEELEKIVDDNKVISLIFGNIVDIYELTVTLLGNLEDAMEMSQDTLTPYIGSCFEELAEVEEFRAFVRYANIVTRRESRDALAALVDDPQLGERLETAGHGFRLAVKYCLPRLLLSPVAHVFVYHSYVLAMLPLAPASDDRESFKQVECNLHPIEKLLTRALGNGPQLDGAMRSASRARRKMAIDKCNELARLVDNWDARDVPQCCNEFIREDTLTKLGPGKRVAERRAFLFDGLLLLCKPVTSLVTVNSGVTAGPPQLKLKEKLHIRKLDIVDRPDGEEGRNLMELCPRVGPPVVLAASSPAEKRCWMSDLVILNTKPMLDRSLDSILLDLERRHPLRLPSPSLYRFAEPDGPHNILLEHGHGPAPLIKGATLLKLVERLTYHVHADLNLVRTFLTTYRSFCSPSELLALLIERFDIPEPHLVYDAPRPVPRIKIPRKDISPNASLMDLDLDIGNIILYRVDLVETWLRADCVEISATSDAEKLSKNTAREDWKRYRKEFQQPVKFRVINVLRHWVDQHFYDFEREPELLAKLKSFLEAVDGKPMRKWVQSVLKTVQRKSGLQSDNESLCSVSSGVSYVFDRLPPAPLRHVAEPDRHDWHPLALHPLEVARQLTLLEFQLYRQVKPSELVGAAWTKKDKEKSSPNLFRISKNTTNFTRWIEKWIVESENVEERAGVLSWCLELAVALSDLNNFNGVFAVVAACESASVYRLKYTFQMLPPRLLRALDQFRELSSDHFRLYQERLRSINPPCVPFVGVYLTKILHIEEGNPDFLSNTELINFSKRRMVAEITGEIQQYQNQPYCLTLEPRTRAFLENLDPFPGMDDNEVTNYLYGKSKEIEPKGAVKQTHKFPRRYPELSLKPVKVSRRRHDTSSTTLSSTNSLVSLDGAFSVSQLSPTSSVWDGASVASLPLHAHDQGSREERSVTSPRLERASSSSLAQLGQLSLLDKLFDKSKSAATLHRNSGSTKDDPPSPRDNHSPKRRPALSPRGALVEADMYQNRRRTDTQERVPPLLPRGGADPEAPPVLPRRPPSPHTASTEPPPEPPDRPLPSPKHHDLFRHGELAACTLIRNSYKINTLICYKRPITPDPQESRAVPGRPARAQPHDGGQVRHTPCIQCEIYNEHSLARAGPDPVTWRPSASPRPRLPDTSPVSDPVTTHPSPPHTSPLILTEHVSPPAETVPPPLPPRRRRDSAAPHTPHPVSSKTPAEAGRGGRRPADLGRQRTRKLPN-
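
Consistency check:

The transcript and protein lengths listed in this record (4407 bp and 1468 aa) can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the database record. It assumes the transcript consists of coding sequence only (no UTRs) and ends with a single stop codon, and it assumes the genomic coordinates are 1-based and inclusive. The function names are hypothetical.

```python
def expected_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Length in bp of a coding sequence for a protein of `protein_aa` residues.

    Assumes three nucleotides per codon and, optionally, one stop codon.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return codons * 3


def genomic_span(start: int, end: int) -> int:
    """Length in bp of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 4407
protein_aa = 1468
locus_start, locus_end = 180239, 193540

# True if the transcript is exactly the CDS plus one stop codon.
lengths_agree = expected_cds_length(protein_aa) == transcript_bp
print("transcript length matches protein length:", lengths_agree)

# The locus is longer than the transcript, which would be expected if the gene has introns.
print("genomic span (bp):", genomic_span(locus_start, locus_end))
```

Under these assumptions the stated lengths agree: 1468 codons plus one stop codon give 4407 bp.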