Monarch geneset OGS2.0

DPOGS200328
TranscriptDPOGS200328-TA1131 bp
ProteinDPOGS200328-PA376 aa
Genomic positionDPSCF300026 + 277364-278586
RNAseq coverage7994x (Rank: top 2%)
Annotation
HeliconiusHMEL0029150.0100.00% 
BombyxBGIBMGA013945-TA0.096.81% 
DrosophilaAct5C-PB0.0100.00% 
EBI UniRef50UniRef50_P632670.094.15%Actin, gamma-enteric smooth muscle n=867 Tax=root RepID=ACTH_HUMAN
NCBI RefSeqXP_002064370.10.0100.00%GK20124 [Drosophila willistoni]
NCBI nr blastpgi|175308050.0100.00%actin 5C, isoform B [Drosophila melanogaster]
NCBI nr blastxgi|175308050.0100.00%actin 5C, isoform B [Drosophila melanogaster]
Group
KEGG pathwayapi:1001458220.0 
 K05692 (ACTB_G1)maps-> Pathogenic Escherichia coli infection
    Regulation of actin cytoskeleton
    Viral myocarditis
    Bacterial invasion of epithelial cells
    Tight junction
    Adherens junction
    Arrhythmogenic right ventricular cardiomyopathy (ARVC)
    Phototransduction - fly
    Vibrio cholerae infection
    Dilated cardiomyopathy
    Shigellosis
    Leukocyte transendothelial migration
    Hypertrophic cardiomyopathy (HCM)
    Phagosome
    Focal adhesion
InterPro domain[2-376] IPR0040000Actin-like
Orthology groupMCL10033 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200328-TA
ATGTGCGACGAAGAAGTAGCCGCGTTGGTAGTGGACAATGGCTCCGGTATGTGCAAGGCCGGGTTCGCGGGAGACGATGCCCCTCGTGCCGTCTTCCCGTCGATCGTGGGCCGCCCTCGTCACCAGGGCGTGATGGTCGGTATGGGTCAGAAGGACTCGTACGTCGGAGACGAGGCCCAGAGCAAGAGAGGTATCCTCACCCTGAAGTACCCCATCGAGCACGGCATCGTCACCAACTGGGATGACATGGAGAAGATCTGGCACCACACCTTCTACAACGAACTGCGTGTGGCCCCCGAAGAGCACCCCGTCCTGCTCACAGAGGCTCCCCTCAACCCTAAAGCCAACAGAGAAAAGATGACCCAGATCATGTTTGAGACCTTCAACACGCCCGCCATGTACGTCGCCATCCAGGCCGTACTCTCACTGTACGCCTCCGGTCGTACCACCGGTATCGTGCTGGACTCCGGAGACGGTGTCTCCCACACAGTACCCATCTACGAGGGATACGCCCTGCCTCACGCCATCCTGCGTCTGGACTTGGCGGGCCGTGACCTCACCGACTACCTGATGAAGATCCTCACTGAACGTGGATACTCTTTCACCACCACGGCCGAGAGAGAAATCGTTCGTGATATCAAGGAGAAGCTTTGCTACGTCGCTCTCGACTTCGAGCAGGAAATGGCCACCGCTGCCTCCAGCAGCTCCCTCGAGAAGTCTTACGAGCTTCCCGACGGACAGGTCATCACCATCGGAAACGAACGATTCCGTTGCCCTGAGGCTCTCTTCCAACCCTCATTCCTGGGTATGGAAGCATGCGGCATCCACGAAACTACTTACAACTCCATCATGAAGTGTGATGTGGACATCCGTAAGGACTTGTATGCCAACACAGTACTGTCGGGTGGTACCACCATGTACCCAGGCATCGCCGACCGTATGCAGAAGGAAATCACAGCTCTGGCACCTTCAACCATGAAGATCAAGATCATCGCTCCACCGGAGAGGAAATACTCCGTATGGATCGGAGGTTCCATCCTCGCGTCTCTGTCGACATTCCAACAGATGTGGATCTCGAAACAGGAGTACGACGAGTCCGGCCCCTCAATCGTGCACAGGAAGTGCTTCTAA

Protein sequence:

>DPOGS200328-PA
MCDEEVAALVVDNGSGMCKAGFAGDDAPRAVFPSIVGRPRHQGVMVGMGQKDSYVGDEAQSKRGILTLKYPIEHGIVTNWDDMEKIWHHTFYNELRVAPEEHPVLLTEAPLNPKANREKMTQIMFETFNTPAMYVAIQAVLSLYASGRTTGIVLDSGDGVSHTVPIYEGYALPHAILRLDLAGRDLTDYLMKILTERGYSFTTTAEREIVRDIKEKLCYVALDFEQEMATAASSSSLEKSYELPDGQVITIGNERFRCPEALFQPSFLGMEACGIHETTYNSIMKCDVDIRKDLYANTVLSGGTTMYPGIADRMQKEITALAPSTMKIKIIAPPERKYSVWIGGSILASLSTFQQMWISKQEYDESGPSIVHRKCF-