Monarch geneset OGS2.0

DPOGS200327
Transcript: DPOGS200327-TA  1131 bp
Protein: DPOGS200327-PA  376 aa
Genomic position: DPSCF300026 + 270682-271904
RNAseq coverage: 1001x (Rank: top 13%)
Annotation
Heliconius: HMEL002914  0.0  100.00%
Bombyx: BGIBMGA013945-TA  0.0  96.54%
Drosophila: Act5C-PB  0.0  99.73%
EBI UniRef50: UniRef50_P63267  0.0  94.15%  Actin, gamma-enteric smooth muscle n=867 Tax=root RepID=ACTH_HUMAN
NCBI RefSeq: NP_001119726.1  0.0  99.73%  actin, cytoplasmic A3 [Bombyx mori]
NCBI nr blastp: gi|3182902  0.0  100.00%  cytoplasmic actin A3b [Helicoverpa zea]
NCBI nr blastx: gi|3182902  0.0  100.00%  cytoplasmic actin A3b [Helicoverpa zea]
Group
KEGG pathway: dan:Dana_GF13827  0.0
 K05692 (ACTB_G1) maps-> Pathogenic Escherichia coli infection
    Regulation of actin cytoskeleton
    Viral myocarditis
    Bacterial invasion of epithelial cells
    Tight junction
    Adherens junction
    Arrhythmogenic right ventricular cardiomyopathy (ARVC)
    Phototransduction - fly
    Vibrio cholerae infection
    Dilated cardiomyopathy
    Shigellosis
    Leukocyte transendothelial migration
    Hypertrophic cardiomyopathy (HCM)
    Phagosome
    Focal adhesion
InterPro domain: [2-376] IPR004000  0  Actin-like
Orthology group: MCL10033  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200327-TA
ATGTGCGACGAAGAAGTAGCCGCGTTGGTAGTGGACAATGGCTCCGGTATGTGCAAGGCCGGGTTCGCGGGAGACGATGCCCCTCGTGCCGTCTTCCCGTCGATCGTGGGCCGCCCTCGTCACCAGGGCGTGATGGTCGGTATGGGTCAGAAGGACTCGTACGTCGGAGACGAGGCCCAGAGCAAGAGAGGTATCCTCACCCTGAAGTACCCCATCGAGCACGGCATCGTCACCAACTGGGATGACATGGAGAAGATCTGGCACCACACCTTCTACAACGAACTGCGTGTGGCCCCCGAAGAGCACCCCGTCCTGCTCACAGAGGCTCCCCTCAACCCTAAAGCCAACAGAGAAAAGATGACCCAGATCATGTTTGAGACCTTCAACACGCCCGCCATGTACGTCGCCATCCAGGCCGTACTCTCACTGTACGCCTCCGGTCGTACCACCGGTATCGTGCTGGACTCCGGAGACGGTGTCTCCCACACAGTACCCATCTACGAGGGATACGCCCTGCCTCACGCCATCCTGCGTCTGGACTTGGCGGGCCGTGACCTCACCGACTACCTGATGAAGATCCTCACTGAACGTGGATACTCTTTCACCACCACGGCCGAGAGAGAAATCGTTCGTGATATCAAGGAGAAGCTTTGCTACGTCGCTCTCGACTTCGAGCAGGAAATGGCCACCGCTGCCTCCAGCAGCTCCCTCGAGAAGTCTTACGAGCTTCCCGACGGACAGGTCATCACCATCGGAAACGAACGATTCCGTTGCCCTGAGGCTCTCTTCCAACCCTCATTCCTGGGTATGGAAGCCAATGGCATCCACGAAACCACTTACAACTCCATCATGAAGTGTGATGTGGACATCCGTAAGGACTTGTATGCCAACACAGTACTGTCGGGTGGTACCACCATGTACCCAGGCATCGCCGACCGTATGCAGAAGGAAATCACAGCTCTGGCACCTTCAACCATGAAGATCAAGATCATCGCTCCACCGGAGAGGAAATACTCCGTATGGATCGGAGGTTCCATCCTCGCGTCTCTGTCGACATTCCAACAGATGTGGATCTCGAAACAGGAGTACGACGAGTCCGGCCCCTCAATCGTGCACAGGAAGTGCTTCTAA

Protein sequence:

>DPOGS200327-PA
MCDEEVAALVVDNGSGMCKAGFAGDDAPRAVFPSIVGRPRHQGVMVGMGQKDSYVGDEAQSKRGILTLKYPIEHGIVTNWDDMEKIWHHTFYNELRVAPEEHPVLLTEAPLNPKANREKMTQIMFETFNTPAMYVAIQAVLSLYASGRTTGIVLDSGDGVSHTVPIYEGYALPHAILRLDLAGRDLTDYLMKILTERGYSFTTTAEREIVRDIKEKLCYVALDFEQEMATAASSSSLEKSYELPDGQVITIGNERFRCPEALFQPSFLGMEANGIHETTYNSIMKCDVDIRKDLYANTVLSGGTTMYPGIADRMQKEITALAPSTMKIKIIAPPERKYSVWIGGSILASLSTFQQMWISKQEYDESGPSIVHRKCF-
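
Length check:

The lengths listed in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions not stated in the record: that the transcript sequence is coding sequence only (it starts with ATG and ends with the stop codon TAA), and that the genomic coordinates are 1-based and inclusive. The function names are hypothetical helpers, not part of any database tool.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 1131
PROTEIN_AA = 376
GENOMIC_START, GENOMIC_END = 270682, 271904


def codon_count(cds_bp: int) -> int:
    """Return the number of whole codons in a coding sequence of cds_bp bases."""
    if cds_bp % 3 != 0:
        raise ValueError(f"{cds_bp} bp is not a multiple of 3")
    return cds_bp // 3


def genomic_span(start: int, end: int) -> int:
    """Return the length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


n_codons = codon_count(TRANSCRIPT_BP)
# One codon is the stop codon, so the protein should be one residue shorter.
print(n_codons, n_codons - 1 == PROTEIN_AA)      # 377 True
print(genomic_span(GENOMIC_START, GENOMIC_END))  # 1223
```

Under these assumptions, 1131 bp gives 377 codons, that is 376 amino acids plus the stop codon, which matches the listed protein length. The genomic interval spans 1223 bp, which is longer than the transcript.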