Monarch geneset OGS2.0

DPOGS203389
TranscriptDPOGS203389-TA1125 bp
ProteinDPOGS203389-PA374 aa
Genomic positionDPSCF300003 + 624761-626371
RNAseq coverage55x (Rank: top 69%)
Annotation
HeliconiusHMEL0120443e-11259.57% 
BombyxBGIBMGA002082-TA1e-11851.60% 
DrosophilaAct87E-PA4e-9845.04% 
EBI UniRef50UniRef50_P627362e-9444.41%Actin, aortic smooth muscle n=733 Tax=root RepID=ACTA_HUMAN
NCBI RefSeqNP_999634.11e-9845.48%actin related protein 1 [Strongylocentrotus purpuratus]
NCBI nr blastpgi|866108916e-9845.38%actin [Ciona intestinalis]
NCBI nr blastxgi|866108911e-9345.09%actin [Ciona intestinalis]
Group
KEGG pathwaycin:4456843e-98 
 K05692 (ACTB_G1)maps-> Pathogenic Escherichia coli infection
    Regulation of actin cytoskeleton
    Viral myocarditis
    Bacterial invasion of epithelial cells
    Tight junction
    Adherens junction
    Arrhythmogenic right ventricular cardiomyopathy (ARVC)
    Phototransduction - fly
    Vibrio cholerae infection
    Dilated cardiomyopathy
    Shigellosis
    Leukocyte transendothelial migration
    Hypertrophic cardiomyopathy (HCM)
    Phagosome
    Focal adhesion
InterPro domain[1-374] IPR0040001.4e-145Actin-like
Orthology groupMCL26542 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203389-TA
ATGTCCGCTGATAGGATCGCGGTCGTTTTTGATTCTGGGTCTCAATCGACGAAAGCCGGTTTCGCTGGAGAATCTTCGCCAAGATCTGTCATTAAAACCCTGGTAGGACGATTTCGCCATGAAGGTCTATTGGACGGACTTCCAGATATATTTTTCGGGAATGAAGCTGTCAAGAAGAGAGGGTTTTGTGTATCGACGTGGCCAGTAAAAGATGGAATGATAGATGATTGGAATGAAGTGGAAAAATTTTGGCATCATATCTTTTACAAAGAACTGCATGTTGCTCCCGAGGAGGCTCAACTTTTAATGTCAATTCATCCACTTACGCCACATAAAGACAAGGAAAAAATGGCCGAGATACTTTTTGAGAGTCTCTCTATCCACGACTTATACTTGGCGATATCATCGGCGCTGGCATTACATGCGAATGGGAGGACATCAGGACTGGTCTGGGAAAATGGGCATTCGTGTTCTTATGTATCACCAGTTTTTGAGGGATTCCCCCTTAAACATGCCACTATAGAGTCAGAAATTAATGGGAGTTCTCTGACAAAAAGACTCCAAAAGCTGTTAAACGAGATCGGCTATTCTTTCACCACAAGTGTGGAAATAGACATACTGGAAGATATAAAGGCAAAGCTATGTTATGTGGCCATGGACTATGAGAATGAAGTACAGACTGTCAATCGTTTAAAAGATAGCACCCACTATGAGTTGCCTGATGGACAGCATGTTTTGCTTTGCGAGGAAAGGTTCAAGTGTCCGGAAATGCTATTTCAACCAAAAACGTGCGGGATGAATTCCTTTAACATAGTGGATAATATTTGTTCTAGCATATCCAAATGCGATTTGGAATATAAAACTTTATTTTACGATAACATAGTACTTTCTGGCGGCTCAAGCTTGTTCAGAGGGCTCTCCGAGCGTTTGAGTGTTGAACTATCCAGGCGAGTATCAGATATGCCTGGCATAAAAGCAAATGTTAGTTCAATCCCATCCAGACATTATTCGTCATGGTTAGGTGGATCTATCTTGGCGTCGCTGAAATCCTTACACGGGTTTTGGATGACAAAACAGGAATATGACGACAACGGACCCGAAAGGGTACATTATAAATTTTTTTAA

Protein sequence:

>DPOGS203389-PA
MSADRIAVVFDSGSQSTKAGFAGESSPRSVIKTLVGRFRHEGLLDGLPDIFFGNEAVKKRGFCVSTWPVKDGMIDDWNEVEKFWHHIFYKELHVAPEEAQLLMSIHPLTPHKDKEKMAEILFESLSIHDLYLAISSALALHANGRTSGLVWENGHSCSYVSPVFEGFPLKHATIESEINGSSLTKRLQKLLNEIGYSFTTSVEIDILEDIKAKLCYVAMDYENEVQTVNRLKDSTHYELPDGQHVLLCEERFKCPEMLFQPKTCGMNSFNIVDNICSSISKCDLEYKTLFYDNIVLSGGSSLFRGLSERLSVELSRRVSDMPGIKANVSSIPSRHYSSWLGGSILASLKSLHGFWMTKQEYDDNGPERVHYKFF-