Monarch geneset OGS2.0

DPOGS209716
TranscriptDPOGS209716-TA951 bp
ProteinDPOGS209716-PA316 aa
Genomic positionDPSCF300105 - 321805-326483
RNAseq coverage1561x (Rank: top 8%)
Annotation
HeliconiusHMEL0113522e-7970.67% 
BombyxBGIBMGA008927-TA8e-16894.58% 
DrosophilaArc-p34-PA1e-13777.97% 
EBI UniRef50UniRef50_O151445e-12872.88%Actin-related protein 2/3 complex subunit 2 n=120 Tax=Opisthokonta RepID=ARPC2_HUMAN
NCBI RefSeqXP_002003511.13e-13878.31%GI22182 [Drosophila mojavensis]
NCBI nr blastpgi|2897401693e-13778.31%actin-related protein ARP2/3 complex subunit ARPC2 [Glossina morsitans morsitans]
NCBI nr blastxgi|1947589982e-13278.31%GF15118 [Drosophila ananassae]
Group
Gene OntologyGO:00308333.2e-188regulation of actin filament polymerization
GO:00058563.2e-188cytoskeleton
KEGG pathwaydmo:Dmoj_GI221828e-138 
 K05758 (ARPC2)maps-> Shigellosis
    Pathogenic Escherichia coli infection
    Regulation of actin cytoskeleton
    Bacterial invasion of epithelial cells
    Fc gamma R-mediated phagocytosis
InterPro domain[1-295] IPR0071883.2e-188Arp2/3 complex, 34kDa subunit p34-Arc
Orthology groupMCL13848 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209716-TA
ATGATCTTGCTGGAGATCAATAATAGAATTATAGAAGAAACTCTTACAGTTAAATATAAAAATGCACTGGCGCGCTTAAAACCTGAATCCATAGATGTGACTCTTGCAGACTTTGATGGAGTGCTGTTTCATATATCTAATGTCAATGGGGATAAAACCAAAGTTAGGGTTAGTATATCATTGAAGTTCTATAAACAGTTGGAGGAACATGGGGCTGATGAACTCCTCAAAAGGGTCTATGGTCCACTGCTTACTGAACCAGAGTCAGGTTACAATGTATCGGTGCTGTTGGATATGGAGAACATTCCTGACGACTGGGAGCTGATGGTGAAGAAGGTCGGCCTGCTGAAGAGGAACTGCTTTGCGTCAGTGTTCGAGAGATACTTCAGATTACAGGAGGATGGGGACGTGGGTCACAAGAGAGCGGTCATTAACTATCGACAGGATGAGACTCTTTATGTGGAAGCTCAGGAGGACCGGGTGACTGTTGTATTTTCGACTGTGTTCCGTCATGAGGATGATATTGTCATCGGGAAGGTCTTCATGCAGGAACTCAAAGAAGGAAGGAGAGCTTCTCACACCGCGCCACAGGTTCTATTTTCACACAAAGAGCCACCATTAGAGCTGTTGGATACTGATGCTAAAGTAGGCGAAAATATAAGTTATGTAACATTTGTGCTGTTCCCTAGACACACGTGTGCGGCGGCGCGTGACAACACCATCGACCTGCTGCATATGTTCCGCGACTACCTTCACTACCACATCAAGTGTTCCAAGGTGTACGTTCACTCCCGTATGCGCGCCAAGGCCGGTGATCTGTTGAAGGTGCTGAACCGCGCTCGGCCGCAGTCCACCGGCCGTCCCACGGAGCGGAAGACCATCACGTTATGGGAGAACGTTCGTGAGGCGAGATTGAGTGTTGCGGGTGGGCGGCTCGTAGACGTCTCGTAG

Protein sequence:

>DPOGS209716-PA
MILLEINNRIIEETLTVKYKNALARLKPESIDVTLADFDGVLFHISNVNGDKTKVRVSISLKFYKQLEEHGADELLKRVYGPLLTEPESGYNVSVLLDMENIPDDWELMVKKVGLLKRNCFASVFERYFRLQEDGDVGHKRAVINYRQDETLYVEAQEDRVTVVFSTVFRHEDDIVIGKVFMQELKEGRRASHTAPQVLFSHKEPPLELLDTDAKVGENISYVTFVLFPRHTCAAARDNTIDLLHMFRDYLHYHIKCSKVYVHSRMRAKAGDLLKVLNRARPQSTGRPTERKTITLWENVREARLSVAGGRLVDVS-