Monarch geneset OGS2.0

DPOGS203388
TranscriptDPOGS203388-TA1110 bp
ProteinDPOGS203388-PA369 aa
Genomic positionDPSCF300003 + 598824-600072
RNAseq coverage16x (Rank: top 81%)
Annotation
HeliconiusHMEL0029141e-6837.27% 
BombyxBGIBMGA002083-TA3e-14059.89% 
DrosophilaAct57B-PA7e-6938.07% 
EBI UniRef50UniRef50_P261972e-6637.57%Actin-2 n=5139 Tax=Eukaryota RepID=ACT2_ABSGL
NCBI RefSeqXP_002112714.11e-6837.57%actin [Trichoplax adhaerens]
NCBI nr blastpgi|3288754681e-6837.57%hypothetical protein DFA_05968 [Dictyostelium fasciculatum]
NCBI nr blastxgi|3288754681e-6637.57%hypothetical protein DFA_05968 [Dictyostelium fasciculatum]
Group
KEGG pathwaytad:TRIADDRAFT_359564e-68 
 K05692 (ACTB_G1)maps-> Pathogenic Escherichia coli infection
    Regulation of actin cytoskeleton
    Viral myocarditis
    Bacterial invasion of epithelial cells
    Tight junction
    Adherens junction
    Arrhythmogenic right ventricular cardiomyopathy (ARVC)
    Phototransduction - fly
    Vibrio cholerae infection
    Dilated cardiomyopathy
    Shigellosis
    Leukocyte transendothelial migration
    Hypertrophic cardiomyopathy (HCM)
    Phagosome
    Focal adhesion
InterPro domain[1-369] IPR0040001.8e-99Actin-like
Orthology groupMCL44353 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203388-TA
ATGGCTTTTGAAAAACCAGCTGTTGTTCTTGATAACGGTAGTTACTATATTAAAGCTGGGTTTGCTTGTGACAATCATCCTGTTGCAATATTTAGATCAATGGTTGGGAGGCCAAATTTTTCACAGAGAAGTCTAGGTCAAGAATATTATGACATATTTATCGGTGACGAAGCTATTGAGAGAGTAGAAGACTTGGAATTATGTCACCCGATAGTAGAGGGAAGGATTGTCCATTGGGATAATATGGAAAAAATTTGGCATCATGTATTTTATAGAGAATTAAAAGCGGCACCTGAAGACCGAGCTGTTATTTCAGCATGTGGTCCCACAGTAGATATTACAGAAAAAATAAAGTGCTGTGAAATATTTTTTGAGACCCTTAACTCTCCGGCATTATGTATACAGCCACAGTGTTCTTTGGCAATGTACGGTTCTGGAACTACCACTGGACTATGTGTGGATATCGGACACGCCACTACAGATGTTATACCCATTTTTGAGGGAGGAATGATGAAATACGCTCACATGAAAACAAATTTGGCAGGAGTGCAAATAGCAGATTTCATTAAAAAAAGTTTATTGGATCGCAGTGGTTCGCATACAATCAAGTCCCCGAGCACTTTAGAGGACGTTATCAAAAACTGTTTATATGTAACAAGGAACTGTGCTGTCACCCGAAAGCAATATCTCAAAAAGTACACATTGCCTGGTGGTGAAGAAATAGATGTCAGTCATGAGGCTTTTATGGCATCAGAACTATTATTTCAACCGGATCTCGTTAAAGGCGAAGCTACTGATTTTTTACCGTTGCATGAAGCTGTTATTACGTCTGCTTTGAAATGTGATGATGAGTTAAGGTTAGAATTATACAACAACATAGTTCCTTGTGGTGGATTGGCTACGATTCCAGGGCTTAACGAAAGATTGGAACTAGAGATTGTTAAGCAAGTTGATAAGCCCATCGCAATACTGTCTTCACCAGAAGCATATGCTGTAGCTTGGTTAGGCGGAGCCACGTTCGCAGGGCTGGGTGATGCCCAGAAAATGTGGATATCCAAAAAACAATTTGAAGAATATGGGGAAAGAATTGTGAGAAATAAATTTTTATAA

Protein sequence:

>DPOGS203388-PA
MAFEKPAVVLDNGSYYIKAGFACDNHPVAIFRSMVGRPNFSQRSLGQEYYDIFIGDEAIERVEDLELCHPIVEGRIVHWDNMEKIWHHVFYRELKAAPEDRAVISACGPTVDITEKIKCCEIFFETLNSPALCIQPQCSLAMYGSGTTTGLCVDIGHATTDVIPIFEGGMMKYAHMKTNLAGVQIADFIKKSLLDRSGSHTIKSPSTLEDVIKNCLYVTRNCAVTRKQYLKKYTLPGGEEIDVSHEAFMASELLFQPDLVKGEATDFLPLHEAVITSALKCDDELRLELYNNIVPCGGLATIPGLNERLELEIVKQVDKPIAILSSPEAYAVAWLGGATFAGLGDAQKMWISKKQFEEYGERIVRNKFL-