Monarch geneset OGS2.0

DPOGS215022
Transcript: DPOGS215022-TA (1122 bp)
Protein: DPOGS215022-PA (373 aa)
Genomic position: DPSCF300256 + 272852-276284
RNAseq coverage: 988x (Rank: top 13%)
Annotation
Source          Best hit            E-value  Identity  Description
Heliconius      HMEL008719          0.0      93.01%
Bombyx          BGIBMGA012151-TA    0.0      91.67%
Drosophila      Arp87C-PA           0.0      78.76%
EBI UniRef50    UniRef50_P61163     0.0      81.18%    Alpha-centractin n=89 Tax=Eukaryota RepID=ACTZ_HUMAN
NCBI RefSeq     NP_001040336.1      0.0      91.67%    ARP1 actin-related protein 1-like protein A [Bombyx mori]
NCBI nr blastp  gi|114053021        0.0      91.67%    ARP1 actin-related protein 1-like protein A [Bombyx mori]
NCBI nr blastx  gi|114053021        0.0      91.67%    ARP1 actin-related protein 1-like protein A [Bombyx mori]
Group
KEGG pathway: ppp:PHYPADRAFT_225906 (5e-123)
 K05692 (ACTB_G1) maps->
    Pathogenic Escherichia coli infection
    Regulation of actin cytoskeleton
    Viral myocarditis
    Bacterial invasion of epithelial cells
    Tight junction
    Adherens junction
    Arrhythmogenic right ventricular cardiomyopathy (ARVC)
    Phototransduction - fly
    Vibrio cholerae infection
    Dilated cardiomyopathy
    Shigellosis
    Leukocyte transendothelial migration
    Hypertrophic cardiomyopathy (HCM)
    Phagosome
    Focal adhesion
InterPro domain: [1-373] IPR004000, 7.4e-237, Actin-like
Orthology group: MCL11238, Multiple-copy universal gene

Nucleotide sequence:

>DPOGS215022-TA
ATGGACCTGATTGTGAACCAACCTGTCGTTATAGACAATGGTTCTGGTGTCATTAAAGCCGGTTTCGCCGGCGATCAGATACCGAAATGCAGATTTCCGAACTATATAGGTCGTCCAAAACATGTGCGTGTTATGGCGGGTGCTTTGGAGGGGGATTTGTTTGTGGGACCGCAGGCCGAGGAACACAGAGGACTGCTCACCATCAGATATCCCATGGAGCATGGGATTGTGACCGACTGGAATGACATGGAAAAAGTCTGGACTTACATATACACTAAGGACCAACTATCAACATGCCCCGAGGAGCACCCTGTGTTGTTGACGGAGGCTCCGATCAACCCTCGTCGTAACAGAGAGAAGACGGCCGAAGTGTTCTTCGAAACGTTCTCTGTACCAGCGCTGTTCCTATCAATGCAGGCGGTTCTCAGCTTATACGCGACCGGGCGGACCACAGGCGTGGTGCTGGACTCGGGGGACGGGGTCACGCACTCGGTGCCAATATACGAGGGGTTCGCGATGCCCCACAGCATCATGAGGGTGGACGTGGCTGGCAGGGACGTCACCAAATACTTGAGGTTGCTTCTTCGTAAGGAAGGTGTTAACTTGGAGACGTCGGCCGAGCTGGAGATCGTGAAGGCGATCAAGGAGCGCGCCTGCTACCTGTCCCCGAACCCACTCAAGGAAGAGACTCTGGACCCGGAGAAGGCGCAGTACTGTCTCCCAGACGGGACACAGCTGGAGATCGGTCCAGCTCGTTTTCGAGCTCCAGAAGTACTGTTCCGTCCAGATCTCATAGGAGCGGAGTGTGAGGGTCTGCACGAGGTGCTGATGTTCGCTATCCAGAAGTCCGACATGGACCTGCGGAAGGTGCTGCACCAGAACATCGTTCTGTCCGGAGGATCCACGCTGCTGAGGGGCTTCGGAGACAGACTGCTGGCGGAGATCAGGAGACTCGCGCCCAAAGACATGAAGATCAGGATCTCAGCTCCTCAAGAGCGGCTGTACTCCACCTGGATAGGCGGTTCCATTCTGGCGTCCTTGGACACCTTCAGGAAGATGTGGGTCTCCAAGAGAGAGTACGAGGAGGAGGGACACCGCGCCGTGCACCGGAAGACCTTCTAG

Protein sequence:

>DPOGS215022-PA
MDLIVNQPVVIDNGSGVIKAGFAGDQIPKCRFPNYIGRPKHVRVMAGALEGDLFVGPQAEEHRGLLTIRYPMEHGIVTDWNDMEKVWTYIYTKDQLSTCPEEHPVLLTEAPINPRRNREKTAEVFFETFSVPALFLSMQAVLSLYATGRTTGVVLDSGDGVTHSVPIYEGFAMPHSIMRVDVAGRDVTKYLRLLLRKEGVNLETSAELEIVKAIKERACYLSPNPLKEETLDPEKAQYCLPDGTQLEIGPARFRAPEVLFRPDLIGAECEGLHEVLMFAIQKSDMDLRKVLHQNIVLSGGSTLLRGFGDRLLAEIRRLAPKDMKIRISAPQERLYSTWIGGSILASLDTFRKMWVSKREYEEEGHRAVHRKTF-
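
The transcript and protein lengths in this record are consistent with each other: 373 codons plus one stop codon give 3 x 374 = 1122 bp. The sketch below shows one way to check that relationship by translating a coding sequence with the standard genetic code. It is an illustration only, not the pipeline used to build the geneset. The names `CODON_TABLE` and `translate` are hypothetical, and the sketch assumes the standard nuclear code applies and that the trailing `-` in the protein sequence marks the stop codon.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` (assumed here to match the
    '-' that ends the protein sequence in this record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on ambiguous bases such as N
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


if __name__ == "__main__":
    # First ten codons of DPOGS215022-TA, copied from the record above.
    print(translate("ATGGACCTGATTGTGAACCAACCTGTCGTT"))
```

To check the whole record, pass the full DPOGS215022-TA sequence to `translate` and compare the result with the DPOGS215022-PA sequence, including the final `-`.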