Monarch geneset OGS2.0

DPOGS207501
TranscriptDPOGS207501-TA1257 bp
ProteinDPOGS207501-PA418 aa
Genomic positionDPSCF300177 - 585444-586700
RNAseq coverage603x (Rank: top 21%)
Annotation
HeliconiusHMEL0173080.098.80% 
BombyxBGIBMGA001921-TA0.096.89% 
DrosophilaArp66B-PA0.084.69% 
EBI UniRef50UniRef50_P323920.084.69%Actin-related protein 3 n=48 Tax=Eukaryota RepID=ARP3_DROME
NCBI RefSeqXP_001662409.10.088.28%actin [Aedes aegypti]
NCBI nr blastpgi|3085126730.097.76%actin [Biston betularia]
NCBI nr blastxgi|3085126730.097.76%actin [Biston betularia]
Group
Gene OntologyGO:00308337.8e-262regulation of actin filament polymerization
GO:00058567.8e-262cytoskeleton
GO:00037797.8e-262actin binding
GO:00055247.8e-262ATP binding
KEGG pathway 
InterPro domain[1-414] IPR0040007.8e-262Actin-like
[1-414] IPR0156237.8e-262Actin-related protein 3
Orthology groupMCL11381 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207501-TA
ATGGATAATAATTTACCAGCTTGTGTTATAGATGTGGGAACTGGATACACAAAATTAGGTTTCGCTGGTAACAAGGAACCACAGTTTATCATCCCGTCAGCCATAGCGATAAAAGAAACTGCTAAAGTTGGTGATCAAAGTACTAGACGTATGACTAAGGCAGTGGAGGATTTAGATTTTTTCATCGGCGATGAAGCTTTCGAAGCGACGGGATATTCAGTGAAATATCCCGTTCGTCACGGTCTAGTGGAGGACTGGGATCTTATGGAAAGGTACATTGAACAATGTATCTTCAAATATTTGCGTGCCGAACCCGAAGATCATCATTTTTTGATTACCGAACCTCCATTAAACACACCGGAAAACCGAGAATATTTGGCCGAGATAATGTTTGAGTCATTCAATGTTCCGGGACTGTATATCGCTGTCCAAGCCGTTCTCGCTCTTGCTGCTTCATGGAAGTCGCGTACATCCGCGGGACGTACGTTTACCGGAATTGTCGTGGACAGTGGTGATGGTGTAACACACATTGTGCCTGTAGCTGAAGGTTATGTTATTGGATCCTGTATCAAGCATATCCCCATAGCTGGGAGGAATATCACTTCATTTATCCAATCACTCCTACGAGAACGTGAAGTAGGAATTCCTCCTGAGCAAAGCTTAGAAACTGCAAAGGCAATTAAGGAGAGATATAGTTATATTTGCCCAGATATTGCTAAGGAATTTGCTAAATATGATTCAGATCCCGGAAAATGGATGAAAAAGTATACAGGTATAAATGCAATCACTAAAAACCCATTTTCAGTTGATGTTGGTTATGAAAGATTTCTAGGTCCTGAAATTTTCTTCCATCCCGAGTTTTCTAATGCTGATTTTACTGTACCTTTAAATGAAATGGTCGATGAAGTAATACAAAGTTGTCCAATTGATGTCCGTAGAGGATTATATGGTAATATTGTTCTGTCCGGTGGTTCAACTATGTTTAGAGACTTTGGCAGGAGACTTCAGCGAGACATCAAGCGTACAGTGGATGCTAGGTTAAAACTATCAACTATGTTGTCCGAGGGGCGTATAACGCCAAAGCCGATTGATGTCCAAGTCATTTCTCACAACATGCAAAGATATGCAGTGTGGTTTGGTGGAAGTATGTTGGGTTCAACACCAGAATTCTATCAAGTATGTCACACTAAGCAGGCATACATGGAGTATGGTCCTAGTATTTGCCGGCACAATCCAGTTTTTGGAACCATGACATAA

Protein sequence:

>DPOGS207501-PA
MDNNLPACVIDVGTGYTKLGFAGNKEPQFIIPSAIAIKETAKVGDQSTRRMTKAVEDLDFFIGDEAFEATGYSVKYPVRHGLVEDWDLMERYIEQCIFKYLRAEPEDHHFLITEPPLNTPENREYLAEIMFESFNVPGLYIAVQAVLALAASWKSRTSAGRTFTGIVVDSGDGVTHIVPVAEGYVIGSCIKHIPIAGRNITSFIQSLLREREVGIPPEQSLETAKAIKERYSYICPDIAKEFAKYDSDPGKWMKKYTGINAITKNPFSVDVGYERFLGPEIFFHPEFSNADFTVPLNEMVDEVIQSCPIDVRRGLYGNIVLSGGSTMFRDFGRRLQRDIKRTVDARLKLSTMLSEGRITPKPIDVQVISHNMQRYAVWFGGSMLGSTPEFYQVCHTKQAYMEYGPSICRHNPVFGTMT-