Monarch geneset OGS2.0

DPOGS204919
Transcript          DPOGS204919-TA    1776 bp
Protein             DPOGS204919-PA    591 aa
Genomic position    DPSCF300340 + 98346-110526
RNAseq coverage     302x (Rank: top 37%)
Annotation
Heliconius         HMEL002310          3e-123    56.42%
Bombyx             BGIBMGA001730-TA    5e-172    63.64%
Drosophila         Gel-PH              8e-54     31.33%
EBI UniRef50       UniRef50_Q27319     7e-59     32.72%    Gelsolin, cytoplasmic n=4 Tax=Arthropoda RepID=GELS_HOMAM
NCBI RefSeq        XP_001846164.1      1e-78     36.67%    Gelsolin [Culex quinquefasciatus]
NCBI nr blastp     gi|170036627        2e-77     36.67%    Gelsolin [Culex quinquefasciatus]
NCBI nr blastx     gi|170036627        9e-78     36.64%    Gelsolin [Culex quinquefasciatus]
Group
Gene Ontology      GO:0003779    8.4e-81    actin binding
KEGG pathway       cqu:CpipJ_CPIJ004629    3e-78
                   K05768 (GSN) maps-> Regulation of actin cytoskeleton
                                       Fc gamma R-mediated phagocytosis
InterPro domain    [43-582]     IPR007122    8.4e-81    Gelsolin
                   [495-560]    IPR007123    3.5e-08    Gelsolin domain
Orthology group    MCL21167    Lepidoptera specific

Nucleotide sequence:

>DPOGS204919-TA
ATGGAACATCCGGCCTTCGAAGAAGCGGGTCAGGAACCCGGCCTCGAGATTTGGACGATCGATATAATGATCATATGTATGGTTTACATTATGGAGGAACTGGTTTTCACTGCTCTCTTCTTCACCATCAGGTACCTGGAGGGCGGCAACGATTCTGGTTTTAATGAGGTGGAAACAAATGCTGGGGCTGAAAAACGTCTGCTGAAGCTGTCAGGCTGTGAAAACATGAGAATCGAAGAGGTACCACCAGAAGCTTCATCACTAACACAAGACCACTGCTTCATCCTGGAAGTGGATCACGACATATTCGTGCTGCTGCCAGACGGCGCGAAGGCGACGCAAAGACGAAAGATCATCAGCGTCGCGAACACATTGCGTGATGATTACCACAACGGAAGAGCCAGCATCGAAATCATTGATGAATTCTCCTCCGACGATGACTATTCGGCGTTCTTCGAAGCTCTTGGGGACGGATGCAAAGATGATCTGGTTGCTGATGAAAGCTCTGATACATATACACGCTCTAGCGTTTCAGCCGTGTACTTATACAAAGTGCTACAGGGAGACGAAATAGATCTTCTGGAGATCAACAAGCCGTTCAAGCAGAGCCAATTGACGTCAGAGGACATATTCATCTTAGATACCCCATGTTCTGGAATTTACATCTGGCTCGGCAAAGACCTGGATCCAGATGTCAGGAAGACTTACAACGACATCGCCCAGCAATACTTGGATATGAAGGGCTATCCGTCTTGGGTTCCCATAACCCGGGTGTCAGAAGACATGGAGAGCAGCGTGTTCAAGCAATACTTCCATAGATGGGACACCGCCACCACCACCATCAGATCTGTCAAAGATATCGCAGCTGAGATTGATGCCGGCTACTTCTCTGGAGACGCGGATGACGTTGAAGCCGTCGCTCGCTACATCGGGAAGAGTGCTGTCGCGAGGGGTTACATGCCAGGCGGAGATGGCGTCTTCACTCTGTACAGAGCTGGAGAGGAGCCCGAAGACATAACAGACCAGGAGACGGCTAAGCTGTACACCAGCGAGGTGTACGTTGCCAAATATCAGTACAAGAATGATAATGACGAAGACCTCACCGTCGCTTACCTCTGGCTTGGTAAGGACGCGAGCAGCGACGATATACAAGCAGGGATTCATCTCTTAGATTCAATTGAAGAAGAATCTGAAGGTCCGGTTATTCACGTGAAGCTGCCTCAAGGAAAAGAGAATAAGCACTTCCTGACCTTATTTAAGGGAAATCTCATTATTCTCTGTGGTGGTAAGGATAATGAGTACAAGTGCCAGAACTTCAGTGACAGCTACGACGACGACGGGATTAGATTGTTTAAGGTTGAGGGTACGAAGCTTGGTGAAGACATGAGAGCGATGCAAGTTGAGGAGAGGTCAGATAATTTGGAGATCGACGATGTCTTCATATTGGAGACTCCAGACGTGGTGTATTTATGGAATGGAAAGGACCTGTCCGATGACGGAGCCTTCATCCTCGATACGGGAGAGGAACTCTATCTGTGGTTGGGCAAAAACACCCCACAGCGTGTGAAGCAAGCTCGCCTCAAGATTATTACGGATTATATCGAAGATGACGGCCTGGAGAGGACAGCGGAATCCGCTGTGGTGGTGACCCTCACGCAGGGCGGAGAGCCTCAAACCTTCAAAAATCTATTTACAGAATGGGACGACGAGTACTTTGAGAAGCAAACGTCGTACGAAGACATGAAGAACGAAACCAAAGCGGCAAATTCAAAATAA

Protein sequence:

>DPOGS204919-PA
MEHPAFEEAGQEPGLEIWTIDIMIICMVYIMEELVFTALFFTIRYLEGGNDSGFNEVETNAGAEKRLLKLSGCENMRIEEVPPEASSLTQDHCFILEVDHDIFVLLPDGAKATQRRKIISVANTLRDDYHNGRASIEIIDEFSSDDDYSAFFEALGDGCKDDLVADESSDTYTRSSVSAVYLYKVLQGDEIDLLEINKPFKQSQLTSEDIFILDTPCSGIYIWLGKDLDPDVRKTYNDIAQQYLDMKGYPSWVPITRVSEDMESSVFKQYFHRWDTATTTIRSVKDIAAEIDAGYFSGDADDVEAVARYIGKSAVARGYMPGGDGVFTLYRAGEEPEDITDQETAKLYTSEVYVAKYQYKNDNDEDLTVAYLWLGKDASSDDIQAGIHLLDSIEEESEGPVIHVKLPQGKENKHFLTLFKGNLIILCGGKDNEYKCQNFSDSYDDDGIRLFKVEGTKLGEDMRAMQVEERSDNLEIDDVFILETPDVVYLWNGKDLSDDGAFILDTGEELYLWLGKNTPQRVKQARLKIITDYIEDDGLERTAESAVVVTLTQGGEPQTFKNLFTEWDDEYFEKQTSYEDMKNETKAANSK-
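
Consistency check (sketch): the transcript and protein lengths listed in this record can be cross-checked with simple arithmetic. The Python sketch below is not part of the original record. It assumes the transcript is coding sequence only and includes one terminal stop codon (the nucleotide sequence ends in TAA and the protein ends in "-"), and that the genomic coordinates are 1-based and inclusive. The function names are hypothetical, chosen for illustration only.

```python
def coding_length_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """Return True if the transcript length equals the protein length
    in codons plus one stop codon (assumes a CDS-only transcript)."""
    return transcript_bp == 3 * (protein_aa + 1)


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values taken from the record above.
print(coding_length_consistent(1776, 591))  # 591 aa + stop = 592 codons = 1776 bp
print(span_length(98346, 110526))           # genomic span on DPSCF300340
```

Under these assumptions the genomic span is considerably longer than the 1776 bp transcript, which would be consistent with the gene containing introns.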