Monarch geneset OGS2.0

DPOGS208777
TranscriptDPOGS208777-TA2517 bp
ProteinDPOGS208777-PA838 aa
Genomic positionDPSCF300036 - 831581-841733
RNAseq coverage131x (Rank: top 56%)
Annotation
HeliconiusHMEL0154280.075.34% 
BombyxBGIBMGA007636-TA0.083.40% 
Drosophilavav-PC5e-16545.81% 
EBI UniRef50UniRef50_E2A8N90.050.48%Protein vav n=6 Tax=Endopterygota RepID=E2A8N9_CAMFO
NCBI RefSeqXP_396932.20.050.18%PREDICTED: similar to vav CG7893-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3800302410.052.30%PREDICTED: protein vav-like [Apis florea]
NCBI nr blastxgi|3800302410.052.06%PREDICTED: protein vav-like [Apis florea]
Group
Gene OntologyGO:00056229.9e-54intracellular
GO:00350239.9e-54regulation of Rho protein signal transduction
GO:00050899.9e-54Rho guanyl-nucleotide exchange factor activity
GO:00055153.4e-30protein binding
GO:00355561.8e-09intracellular signal transduction
KEGG pathwayame:4134880.0 
 K05730 (VAV)maps-> Regulation of actin cytoskeleton
    Fc epsilon RI signaling pathway
    Fc gamma R-mediated phagocytosis
    B cell receptor signaling pathway
    Chemokine signaling pathway
    Natural killer cell mediated cytotoxicity
    Leukocyte transendothelial migration
    T cell receptor signaling pathway
    Focal adhesion
InterPro domain[236-457] IPR0002199.9e-54Dbl homology (DH) domain
[6-134] IPR0017153.4e-30Calponin homology domain
[467-598] IPR0119934.9e-25Pleckstrin homology-type
[661-769] IPR0009802e-21SH2 motif
[26-98] IPR0226138.3e-14Calmodulin-regulated spectrin-associated protein, CH domain
[476-582] IPR0018493e-12Pleckstrin homology domain
[766-832] IPR0014528.7e-12Src homology-3 domain
[592-635] IPR0022191.8e-09Protein kinase C-like, phorbol ester/diacylglycerol binding
Orthology groupMCL10528 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208777-TA
ATGGCTGCCGGTGGAGAGGATCTTTGGCGCGAGTGCGCAACTTGGCTCACTCGATGTGGCTTGTTACGGCCTGATCATAAAGCAAATTGGGAGACGAGTACTATCCATGATCTTGCTTATACTTTACGAGATGGTGTGCTGCTTTGTAATTTGTTAAACACGCTGTACCCGGGATGTATCGACATGAAAGATGTTAATCAACGACCCCAGATGGCACAGTTCCTATGTATGAGAAACATTAAAGTCTTCCTGAGGACATGCCACGAAGTGTTTGAGCTTCGGGAGACGGATCTCTTTGATCCTTCGATGTTATTCGACCTCTCCGATTTTCACAGGGTGTTATGTACCTTGGCCAAGCTCAGTCAATGTCCTAAAGCCTTGGCTAGAAATGTTCAGCCGTTCTCAGCGAGACGGACTCAGTCGGAGGAGGACATCTACAAGGACCTTCAGTCGGTCGCGAACATGCCGTGCGAGACGCCCTCGTACGAGGTGGTGGAGGAGGTGGAGACGAGGGAGGTACCGTGGATACAGTTCACTATACCGAGCGTGGCCTGCGAGGATCGCGTCGAGGAGATATACGAAGACCTCTGCTATGTCAATGGCGGAACGGGCCGTGGGGTGGGGGAGTACGCCAGCTACTGCGCCCGCCTTCACGACGAGGAGATCTACCACGACCTGTGCGTGGTGACAGGGGCAAGGGGCGCGCCCGCCGACAAACACAACATCGCTTTCGCTACACTAGCGACGTCGTCGCACAGCTTAGAGAAGAGAGATTACGTTATACGGGAACTGGTCGACACTGAGTGTAACTATGTGGATGTGCTCAGTAAGATTATCAAATACTTCTTGAGGCCGCTGACGCCGTATCTGAAGCCCCAGGATATGCAAGTCATATTTTTCGGTGTCAAGGAGTTGCACGATATACACAACGGTCTACTCCGCCAGCTAAGGCTCGCGACCGACAATTGTGTACCTGGCAGCGGAGCGCCGCGACTAGCGGACGTGTTCCTGGCGTGGCGGGAGCGGCTCCTGCTGTATGGAGACTATTGCTCAAATCTCACAAACGCCCAGGACACGTTGAAAGCGCTCGATGCAAGGGATTCCACATTCAGTAAGCAGCTACTGAAATGTCAAAAGGAGCACAGTGACGGTCGTATCCAGCTGCGCGACATCCTGTCGGTTCCCATGCAGCGTGTGCTCAAGTATCATCTGCTGCTGGACAAGCTGGTGCACGAGACCCAGCCGAACCACGAGGAGTTCCGCGGCTTGGAGCGCGCCAAGGAAGCCATGGTGGACGTGGCTCAGTATATCAACGAGGTCAAGAGGGACAGCGAGGTGCTCGTGTTGCTGGCTAAGCTGCAGGAGAGTATAGTGGACTGGGACCGGTCGGGCGCGGAGGGCGGGTCGCTGGCAGCTTACGGCCGCCTGCTGCTGGACGGAGAGCTGAAGGTGAAGGCTCACGAGGACCAGAAGATGAGGATGAGATACGTGTTCGTGTTCGACAAGTACATGCTGCTGTGCAAGCCCGTCAAGGAGAACCAGTACTCGTACCGGAAGGGCATCAAGCTGGCCGAGTACCGTGTGGAGGAGGGCGGGCCGCGGCGCTCGCTGAGGGCGGACGCTCGCTGGACCGCGCACTTCTTCCTCGTCAGGCGGACCAAGGACGCCACCTACACGCTTTACGCCAAGACGGAGGACTGTCGCCGGAAATGGCTGAAGGCCATCAACGACGCCATAGAGAATCTGGAGCCGCCCGGCTGTCGGCGGACCAACCACAAGTTCGTTCTTCACACCTTCGAGAAGCCGGCCACGTGTCATCACTGCTCCAAGTTCCTCAAGGGCAGGATATTTCAGGGCTACCTGTGTTCCGTGTGTGGAGCGTGCGCCCACAAGGAGTGCATCGCTCTGTCGGGCCGCTGCGGGGGAGGGAGCGCCCCCGCGCCGCCTCTGCCACCACACGCACTACAACCGGACACCGCGCTGCACTACTACATGTGGTACGTGGGCGAGATGAGCCGGGAGACGGCCACGTCCCGCCTGGAGCGCCGTGTGGACGGCACCTTCCTGCTGCGGGTGAGGCCGCGCGCCGCGCTCCACGACACCCAGTACGCGCTCTCACTCAAAACCGACAACACGGTGAAACACATGCGAGTGTGTCTGAAGCCCATCGACTCGGTGCCTCACTACTACCTGTCCGAGTCCAGGTTCTTCAGGAGCGTGGTCGAGCTGATATCCTACTACGAGAAGACGAGCTTGTCGGAGAACTTTGTCGGATTAAACTCGAACTTGCGATGGCCCTTCCGCCGCGTGGTGGCGACTGTCATCCACGACTTCCGACCGCTAGAGGCATCCCAGTTGGCGCTCCGCCCCGGCGCGAAGGTGTTAGTGCTCAGCAAGGAAGGCGACGGCCGGGGCTGGTGGAAAGGCCGCACGCTGGATAAGGGTGACCACCGCTCGGGCTACTTCCCCAAGGAGTGCGTCCGCGAGGAGCCCGAGTGTATCGGAGCGTTAGACTGA

Protein sequence:

>DPOGS208777-PA
MAAGGEDLWRECATWLTRCGLLRPDHKANWETSTIHDLAYTLRDGVLLCNLLNTLYPGCIDMKDVNQRPQMAQFLCMRNIKVFLRTCHEVFELRETDLFDPSMLFDLSDFHRVLCTLAKLSQCPKALARNVQPFSARRTQSEEDIYKDLQSVANMPCETPSYEVVEEVETREVPWIQFTIPSVACEDRVEEIYEDLCYVNGGTGRGVGEYASYCARLHDEEIYHDLCVVTGARGAPADKHNIAFATLATSSHSLEKRDYVIRELVDTECNYVDVLSKIIKYFLRPLTPYLKPQDMQVIFFGVKELHDIHNGLLRQLRLATDNCVPGSGAPRLADVFLAWRERLLLYGDYCSNLTNAQDTLKALDARDSTFSKQLLKCQKEHSDGRIQLRDILSVPMQRVLKYHLLLDKLVHETQPNHEEFRGLERAKEAMVDVAQYINEVKRDSEVLVLLAKLQESIVDWDRSGAEGGSLAAYGRLLLDGELKVKAHEDQKMRMRYVFVFDKYMLLCKPVKENQYSYRKGIKLAEYRVEEGGPRRSLRADARWTAHFFLVRRTKDATYTLYAKTEDCRRKWLKAINDAIENLEPPGCRRTNHKFVLHTFEKPATCHHCSKFLKGRIFQGYLCSVCGACAHKECIALSGRCGGGSAPAPPLPPHALQPDTALHYYMWYVGEMSRETATSRLERRVDGTFLLRVRPRAALHDTQYALSLKTDNTVKHMRVCLKPIDSVPHYYLSESRFFRSVVELISYYEKTSLSENFVGLNSNLRWPFRRVVATVIHDFRPLEASQLALRPGAKVLVLSKEGDGRGWWKGRTLDKGDHRSGYFPKECVREEPECIGALD-