Monarch geneset OGS2.0

DPOGS207383
Transcript: DPOGS207383-TA, 1989 bp
Protein: DPOGS207383-PA, 662 aa
Genomic position: DPSCF300267 + 42028-46766
RNAseq coverage: 107x (Rank: top 60%)
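
The genomic position field reads as scaffold, strand, and coordinate range, and the transcript and protein lengths can be checked against each other. The sketch below is only illustrative: `Locus` and `parse_position` are hypothetical names, not part of any MonarchBase tool, and the coordinates are assumed to be 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """Hypothetical container for a 'scaffold strand start-end' position."""
    scaffold: str
    strand: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
        return self.end - self.start + 1


def parse_position(text: str) -> Locus:
    """Split a position string such as 'DPSCF300267 + 42028-46766' into its parts."""
    match = re.fullmatch(r"(\S+)\s+([+-])\s+(\d+)-(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return Locus(scaffold, strand, int(start), int(end))


locus = parse_position("DPSCF300267 + 42028-46766")
print(locus.scaffold, locus.strand, locus.span)

# A 662 aa protein plus one stop codon should account for the full 1989 bp transcript.
transcript_bp, protein_aa = 1989, 662
print(transcript_bp == (protein_aa + 1) * 3)
```

Under the inclusive-coordinate assumption the genomic span is longer than the 1989 bp transcript, which would be consistent with the gene containing intronic sequence.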
Annotation
Heliconius: HMEL012239, 0.0, 89.12%
Bombyx: BGIBMGA008880-TA, 0.0, 66.13%
Drosophila: Arp5-PA, 1e-153, 44.41%
EBI UniRef50: UniRef50_E2BG39, 0.0, 57.25%, Actin-related protein 5 n=13 Tax=Pancrustacea RepID=E2BG39_HARSA
NCBI RefSeq: XP_623919.1, 0.0, 56.78%, PREDICTED: similar to CG7940-PA [Apis mellifera]
NCBI nr blastp: gi|66546559, 0.0, 56.78%, PREDICTED: actin-related protein 5 [Apis mellifera]
NCBI nr blastx: gi|332028059, 0.0, 56.89%, Actin-related protein 5 [Acromyrmex echinatior]
Group
KEGG pathway 
InterPro domain: [23-629] IPR004000, 4.8e-182, Actin-like
Orthology group: MCL13841, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207383-TA
ATGGACGACGTTCTCGTACTTAAGGATTATAAAACTATTCCTGATATAGTTCATGAATATGCTCCATCTTTAAAGTTCGGTCATATACCATTAGTTATTGACAATGGTTCATATCAGTGTAGGGTGGGTTGGTCAATATACGACGAGCCGTATTTAACATTTAAAAATCTGATAGCTAGGCCTCGTAAAGACCGCTGTAAAAAAGACGCTGATCCTCCGGTAACACCTCCTATACAAATTGGAAATGATATAGTTAATATAGAAGCTGTAAGGTTTCAGCTGAAAACACAGTTTGATAAAAATATAGTCACCCACTTTGAAGTTCAAGAGCAAGTCTGTGATTATATCTTTTCCCACCTCGGTATTGACAGTGAAGGTCATGTTCCACATCCTATTGTTATGACGGAAGCATTTGTTACTCCCAACTACAGCCGACAGTTGATGTCAGAATTGCTTTTTGAGGCTTATGGTATACCAGCAGTGAGCTATGGAGTGGATTCGCTGTTTAGTTTCTACAGAAATGGTATCGGGGACACAGCATTAATAGTTAACTGTGGCTATCACACTATTCACTTTATACCCGTGTTAAAAGGAAAAGTTGTTGCCGAGCGCGCTAGAAGAATTAACTTAGGTGGAAGTGAAATTATATCATATATGCACAAACTCCTGCAACTTAAGTACCCTGTTCATGTAAATGCCATAACAATGTCCAGGATTGAAGAGATTCTTCACGAACACTGTTCAATAGCCTTGGACTATCAGGAGGAAATAAGGAAATGGGCAAACCCAGATTACTATGAAGCTAATGTTAAAAGAGTCCAATTGCCATTTGTACAATCATCAAGTTCATCGACCTTAACAGCCGAGCAACAAAAAGAAAGAAAGAAGGAAATGGCGCGCCGTTTGCTTGAAATAAATGCTAGAAAGAGGGAAGAACGGCTCTTAGAGGATGAAGAACAGCTGAACCAGCTCTTGGCTTTGCAGGAATTGATTGAAGATGGTGAAACTGATGAGTTTAATGAAGCGATCAAAGCATTTGATATAAAGAATTATGGTGATCTGCAGAGACAAATAGCAAATTTGAATGTGAGGATTGAGAAGAACAAACAACGTATTGCTGCAGCGGCAAATGCTGAGGAAGTGGTGGAGACCAGACCGGCTGGGCGGTACCAGCCACCCAATGATCCAGAAGCGTTCCAAGTGTGGTTAGAGGAAACCAGGGCTAAATACCGTGAAGTGTCAGCTCGTCGAGAGGCCCGCAGGGCGCGTCGCGCGGCGATGGCTAAGAGACGCACAGCCGCCGCCGCTGAACGTATGAGAGCTATCTCTAGACTCGCGGCCGCGGGCGACGACTTCGGTTATAGGGATTCAGATTGGGACGTCTATAAAAGTATAAGCCGTGAAGCCGACTCTGATTCAGAGGCGGACGGCGAAAGGTTGGTGGAGTTGGAGGAAGCGCTGAGGGAATACGAACCAGCGCAGCCCTCACAGTACCATCACCAACTACATCTCGCCATAGAACCCATCAGAGCTCCAGAATTGATGTTCCAACCGTCAATGATGGGAAACTTAGAGGCTGGATTGGCTGAAACAATGGAATACGTATTCAAACATTTCAGTGAAGAAGACCAGCTGCTATTAGCTAACAACGTGTTCCTCACCGGGGGATGTTCACAATTTCCAGGTTTAAAGGAAAGACTGGAGAGAGAACTCTTAGAAATGCGACCATTTCAATCAAGTCACAAAGTTGTAATGGCGAAAAATCCAAATCTCGACGCCTGGTATGGCGCAAGGGACTTCGCTGGTAGCAACGATTTCGAAGACTGGTGTATATCAAAGGAAGAGTATTATGAAATGGGTGGGGAATATTTGAAGGAGCACCACGCGAGTAACCGGTATTATAAGAGTCCAGCACCTCTCATTGATAACACACTGACGCCGGCGGGAGACGCTAACGTCGTCAAGGAAGAAATTGTTGTAGACTGTTAA

Protein sequence:

>DPOGS207383-PA
MDDVLVLKDYKTIPDIVHEYAPSLKFGHIPLVIDNGSYQCRVGWSIYDEPYLTFKNLIARPRKDRCKKDADPPVTPPIQIGNDIVNIEAVRFQLKTQFDKNIVTHFEVQEQVCDYIFSHLGIDSEGHVPHPIVMTEAFVTPNYSRQLMSELLFEAYGIPAVSYGVDSLFSFYRNGIGDTALIVNCGYHTIHFIPVLKGKVVAERARRINLGGSEIISYMHKLLQLKYPVHVNAITMSRIEEILHEHCSIALDYQEEIRKWANPDYYEANVKRVQLPFVQSSSSSTLTAEQQKERKKEMARRLLEINARKREERLLEDEEQLNQLLALQELIEDGETDEFNEAIKAFDIKNYGDLQRQIANLNVRIEKNKQRIAAAANAEEVVETRPAGRYQPPNDPEAFQVWLEETRAKYREVSARREARRARRAAMAKRRTAAAAERMRAISRLAAAGDDFGYRDSDWDVYKSISREADSDSEADGERLVELEEALREYEPAQPSQYHHQLHLAIEPIRAPELMFQPSMMGNLEAGLAETMEYVFKHFSEEDQLLLANNVFLTGGCSQFPGLKERLERELLEMRPFQSSHKVVMAKNPNLDAWYGARDFAGSNDFEDWCISKEEYYEMGGEYLKEHHASNRYYKSPAPLIDNTLTPAGDANVVKEEIVVDC-
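
The protein record appears to be the conceptual translation of the transcript, with the trailing `-` standing for the stop codon. A minimal sketch of that check follows, assuming the standard genetic code; `translate` and `CODON_TABLE` are illustrative names, and only the first ten and last five codons of the transcript are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First ten codons and last five codons of DPOGS207383-TA.
print(translate("ATGGACGACGTTCTCGTACTTAAGGATTAT"))  # MDDVLVLKDY
print(translate("GTTGTAGACTGTTAA"))                 # VVDC-
```

Running the same function over the full nucleotide record and comparing the result with the protein record would confirm that the two are consistent.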