Monarch geneset OGS2.0

DPOGS212379
TranscriptDPOGS212379-TA1119 bp
ProteinDPOGS212379-PA372 aa
Genomic positionDPSCF300019 + 342801-344361
RNAseq coverage989x (Rank: top 13%)
Annotation
HeliconiusHMEL0039450.090.59% 
BombyxBGIBMGA004645-TA0.087.40% 
DrosophilaSop2-PA4e-15268.04% 
EBI UniRef50UniRef50_O971826e-15068.04%Actin related complex p41 subunit n=26 Tax=Opisthokonta RepID=O97182_DROME
NCBI RefSeqNP_001037211.10.086.86%suppressor of profilin 2 [Bombyx mori]
NCBI nr blastpgi|3802939350.087.94%Arp2/3-P40 [Spodoptera frugiperda]
NCBI nr blastxgi|3802939350.087.94%Arp2/3-P40 [Spodoptera frugiperda]
Group
Gene OntologyGO:00308331.2e-243regulation of actin filament polymerization
GO:00058561.2e-243cytoskeleton
GO:00037791.2e-243actin binding
GO:00055159.2e-49protein binding
KEGG pathwayaag:AaeL_AAEL0075461e-156 
 K05757 (ARPC1A_B)maps-> Shigellosis
    Pathogenic Escherichia coli infection
    Regulation of actin cytoskeleton
    Bacterial invasion of epithelial cells
    Fc gamma R-mediated phagocytosis
InterPro domain[1-361] IPR0173831.2e-243Actin-related protein 2/3 complex, subunit 1
[12-366] IPR0159439.2e-49WD40/YVTN repeat-like-containing domain
[1-369] IPR0110462.2e-47WD40 repeat-like-containing domain
[45-82] IPR0197817.6e-06WD40 repeat, subgroup
Orthology groupMCL13479 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212379-TA
ATGGCACAAACGCTTACACTAGGTGACTCATGCGCCCCGATAACTTGTCATGCGTGGAATAAGGAAAGAAATCAAATCGCTTTTTCTCCCAATAACAATGAAGTGCATATTTACCAAAAGGAGGGAAATGAATGGAAGCAAACTAATAATCTGGTGGAACATGATATGAGGGTCATGGGAATAGACTGGGCTCCGAATACTAACAGGATTGTAACATGCTCTGTGGACCGTAATGCTTATGTCTGGACCCAAAGTGAAGATGGGAAGTGGACTACTACCCTGGTATTGTTGCGTATAAACAGAGCAGCAACATGTGTTAAATGGTCTCCGATGGAAAACAAATTTGCAGTTGGCTCTGGAGCTCGCCTGATATCCATTTGCTATTTTGAAAAGGAAAACAATTGGTGGGTGTCTAAACATATTAAAAAACCAATTCGCTCAACTGTAACCACTTTAGATTGGCATCCAAATAATATTCTATTGGTTGCTGGCTCGACTGATTTTAAAGTAAGAGTTTTTTCCGCGTATATTAAGGATATTGAGGATCAACCTGGACCTAATGTCTGGGGGTCTAAGTTGCCTTTAGGTCAATTATTGGCTGAGTTTCCATCTGCTGGAAATGGATGGGTGCATAGTGTTTCCTTTTCTGCAGATGGCAACAAAGTTGCTTGGGTGGGCCATGACAGCTCTATAAATGTAGCCGACGCTTCTAATGGAAGGGCTGTTATCAAGCATAGGACTGAATATCTACCATTCCTAGGATGCATTTGGATCTCAAATAATTCTTTGGTAGTGGCAGGCCATAGCTGCATTCCATTACTCTACTGCCATGACGGAGGGGAGATTAAGTTTGTTGCAAAGTTGGACAATACTCAGAGGAAAGAGTCTGGTGGTTTATCTGCTATGAAGAAGTTCCAGTCGTTGGATAGACAAGCTCGTATTGAAACCAATGATACCTACTTGGACTCCATACATCAAAATGCTATAACCAGTATTAATTTGTACGAGGGTACCAAGGCTAATGCTGTCAAGTTCAGTACTTCTGGTCTAGACGGTCAGCTGGTCGTTTGGGATTTGGTCTCCCTCGAGAAGTCCATTGAATCCCTGAAGATTTTCTAA

Protein sequence:

>DPOGS212379-PA
MAQTLTLGDSCAPITCHAWNKERNQIAFSPNNNEVHIYQKEGNEWKQTNNLVEHDMRVMGIDWAPNTNRIVTCSVDRNAYVWTQSEDGKWTTTLVLLRINRAATCVKWSPMENKFAVGSGARLISICYFEKENNWWVSKHIKKPIRSTVTTLDWHPNNILLVAGSTDFKVRVFSAYIKDIEDQPGPNVWGSKLPLGQLLAEFPSAGNGWVHSVSFSADGNKVAWVGHDSSINVADASNGRAVIKHRTEYLPFLGCIWISNNSLVVAGHSCIPLLYCHDGGEIKFVAKLDNTQRKESGGLSAMKKFQSLDRQARIETNDTYLDSIHQNAITSINLYEGTKANAVKFSTSGLDGQLVVWDLVSLEKSIESLKIF-