Monarch geneset OGS2.0

DPOGS209047
Transcript: DPOGS209047-TA (1143 bp)
Protein: DPOGS209047-PA (380 aa)
Genomic position: DPSCF300102 + 39545-42917
RNAseq coverage: 821x (Rank: top 16%)
Annotation
Heliconius: HMEL006086, 4e-106, 66.45%
Bombyx: BGIBMGA009999-TA, 4e-120, 56.54%
Drosophila: Arp11-PA, 4e-72, 40.32%
EBI UniRef50: UniRef50_E2BHA6, 3e-72, 40.55%, Actin-related protein 10 n=7 Tax=Formicidae RepID=E2BHA6_HARSA
NCBI RefSeq: NP_001177675.1, 1e-75, 39.95%, actin related protein 11 [Nasonia vitripennis]
NCBI nr blastp: gi|380015827, 2e-78, 42.12%, PREDICTED: actin-related protein 10-like [Apis florea]
NCBI nr blastx: gi|380015827, 7e-75, 42.12%, PREDICTED: actin-related protein 10-like [Apis florea]
Group
KEGG pathway: afv:AFLA_089910, 6e-23
 K05692 (ACTB_G1) maps to:
    Pathogenic Escherichia coli infection
    Regulation of actin cytoskeleton
    Viral myocarditis
    Bacterial invasion of epithelial cells
    Tight junction
    Adherens junction
    Arrhythmogenic right ventricular cardiomyopathy (ARVC)
    Phototransduction - fly
    Vibrio cholerae infection
    Dilated cardiomyopathy
    Shigellosis
    Leukocyte transendothelial migration
    Hypertrophic cardiomyopathy (HCM)
    Phagosome
    Focal adhesion
InterPro domain: [2-367] IPR004000, 3.2e-98, Actin-like
Orthology group: MCL13916 (Single-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209047-TA
ATGGCCCTTTACGAAGGGATCGCCCTTATTCAAGAAAAGCAAGCAGTGGTTTTGGATCTTGGTCATGACTATACCAAATTCGGTTTTACGGGAGAGGCGGCGCCACGATGTATCATTCCCTCAAGTTTTTGGAGCCCAGATGAAAAGTGTTATAAGAAAGTGTATGATTACAAAGATGAAGATGAACTGTATGAAAACCTTGTACATATGCTGCATCTGCTGTATTTCAGACATGTTCTAGTGAATCCTAAGGAGCGTAAAGTAGTAGTAGTGGAATCTCTCCTAACTCCTACACTCTTTAGAGAGACTCTAGCCAAAGTGCTGTTCATGCATTATGAGGTGTCGGGTGTGATGTGGGCGGACAGCGCTCGACTGTGTGCCCTCACACTGGGGACAAACATCACACTTGTTGTTAACATTGGAGCACTTGAAGCTGAGGTTGTGGCTGTGGTCCACGGCAGTCCCGTCATCCACGCCATTCAGAGCGGCCCGCTGGGAGCTCGCGCCGTGTCCACGGAGCTCGCCCGTCTCCTCGACGAACAGTACGGAGCTCCACTCCAACTGTCGGAACACGTCCTCGAGGACATCCGCGTGCGCGCCTGTTTCGTGCCCGGCAGAGCACGTGCTCTCACCCTGGACGACCCGAGCACGGTGGGGGCGCGCGGTGTCTCCGTCACCGGACCGGATCGAGTCCTCACCGTGGGCGGGAAAGCGCGCGAGCGAGCTGCAGAGACGCTGTTCCTTAGAAACAACGAGTTGGCGTCGCTGCCCGATCTTGTGCTCAAATGTATCCTCCAGTGTCCGATCGACGTACGGCGCGAGCTGGCCGCGAACATTCTGGTGACGGGCGGCGGCGCGGCGCTCACCGGGCTCAAGGCGCGCTTGGCGGCGGAGCTGAGACACCTCGTCACCCTGACGCCCTACAGTGAGTCGCTGCACGGACTCCAGTTCTCGTTCCACTCTGCGCCGTCTCCGGACAGCACGGTGTCGTGGATGGGCGGGGCGCTGGCGGGCGCAGCGGACAGCGGCTCTCGGGCGCTCCTCAAAGACGTGTACTCCCGGTCGCGGCGCCTGAGGGACTGGCCGTGCCTGCTACACACCACGCCGCCCGACCATCATCACTGGGCGGAGCTTCAATGCTGA

Protein sequence:

>DPOGS209047-PA
MALYEGIALIQEKQAVVLDLGHDYTKFGFTGEAAPRCIIPSSFWSPDEKCYKKVYDYKDEDELYENLVHMLHLLYFRHVLVNPKERKVVVVESLLTPTLFRETLAKVLFMHYEVSGVMWADSARLCALTLGTNITLVVNIGALEAEVVAVVHGSPVIHAIQSGPLGARAVSTELARLLDEQYGAPLQLSEHVLEDIRVRACFVPGRARALTLDDPSTVGARGVSVTGPDRVLTVGGKARERAAETLFLRNNELASLPDLVLKCILQCPIDVRRELAANILVTGGGAALTGLKARLAAELRHLVTLTPYSESLHGLQFSFHSAPSPDSTVSWMGGALAGAADSGSRALLKDVYSRSRRLRDWPCLLHTTPPDHHHWAELQC-
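
The transcript and protein lengths listed in this record agree with each other: 380 residues plus one stop codon make 381 codons, or 1143 bp. A minimal sketch of that check follows, assuming the transcript entry holds only the coding sequence with no UTRs. The helper name `expected_cds_length` is hypothetical and not part of the geneset tooling.

```python
def expected_cds_length(protein_aa: int, has_stop: bool = True) -> int:
    """Return the coding-sequence length in bp for a protein of protein_aa residues.

    Assumes every residue is encoded by one codon (3 bp) and, when has_stop
    is True, that the sequence ends with a single stop codon.
    """
    codons = protein_aa + (1 if has_stop else 0)
    return codons * 3


# Values copied from the Transcript and Protein fields of this record.
transcript_bp = 1143
protein_aa = 380

# True when the two reported lengths agree under the CDS-only assumption.
print(expected_cds_length(protein_aa) == transcript_bp)
```

The same check can be run on other records to flag entries where the reported transcript length includes untranslated regions or the protein is truncated.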