Monarch geneset OGS2.0

DPOGS215132
Transcript: DPOGS215132-TA, 1644 bp
Protein: DPOGS215132-PA, 547 aa
Genomic position: DPSCF300427 - 90241-103087
RNAseq coverage: 6742x (Rank: top 2%)
Annotation
Heliconius  HMEL022559  5e-137  52.85%
Bombyx  BGIBMGA001754-TA  7e-112  41.83%
Drosophila  Gel-PH  2e-90  42.92%
EBI UniRef50  UniRef50_Q07171  3e-88  42.92%  Gelsolin n=37 Tax=Eumetazoa RepID=GELS_DROME
NCBI RefSeq  XP_967392.1  2e-99  42.36%  PREDICTED: similar to GA10732-PA [Tribolium castaneum]
NCBI nr blastp  gi|270014632  7e-99  42.75%  hypothetical protein TcasGA2_TC004676 [Tribolium castaneum]
NCBI nr blastx  gi|270014632  3e-96  42.09%  hypothetical protein TcasGA2_TC004676 [Tribolium castaneum]
Group
Gene Ontology  GO:0003779  9.2e-164  actin binding
KEGG pathway  dgr:Dgri_GH19024  2e-93
  K05768 (GSN) maps-> Regulation of actin cytoskeleton
  Fc gamma R-mediated phagocytosis
InterPro domain  [45-537]  IPR007122  9.2e-164  Gelsolin
  [66-148]  IPR007123  3.4e-17  Gelsolin domain
Orthology group  MCL18142  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215132-TA
ATGAAGCTCCTAACGGTAGTAACAATCTTCGCGGTTGCGGTTTCAACGCAAGCAAGGACTGCGGTAAGACAGCAGTCTGTCAGTGGACACATCACATCGCTGAGCGATAAGGATGCCCGCCAGAAGGCCAGTGTCCACCCAGCATTCGCGAACGCCGGCACAAAGGCTGGACTCGAAATATGGAGAATTGAGAACTTCGATCCCGTCGCAGTGCCAGCCGCAGAACATGGAAAGTTCTACAAGGGAGATTCATATATTGTCCTAAAAACTACATCAGATAAAAAAAAGAATTTATCCTGGGATATCCATTACTGGATTGGCAGCGAGTCATCCCAGGACGAGTCCGGAGCTGCTGCTATACTGTCTGTGGGTCTGGATGATAAATTCAACGATAAGGCGATACAGCACAGAGAAGCCATGGGTTACGAGAGCCAGCAGTTCCTTGGATACTTCAAGAACGGCGCTGTACGTTACCTCGACGGAGGTCACGACTCCGGGTTCAACCACGTGGTCACAAACCCGGGAGCTGAGAAACGACTGTTCCAGGTCAAAGGAAAGAAGAATATTAGGGTTAGACAGGTGGATCCTCTCATCTCGTCGATGAACAAAGGCGACGTCTTCATACTCGATGTGGACAACAGCATCCTGGTGTACGTGGGCAGTTCAGCGAAGAATGTGGAGAAGTTGAAGGCGATTTCCATAGCCAATCAGATAAGAGATCAGGATCACAACGGACGGGGGAAAGTTGATATTATTGACCAATATTCTAGTGACGTAGATGTCGACAAATTCTTTACTTCGCTGGGTTCTGGATCCAAAAATCTGGTGCCAGATGAAAGTGCCGGTGGAGATGATCAGGTAATGGCACTGAAAACTGTATCTAAATCAAGTATAACTGTACGTTACCTCGACGGAGGTCACGACTCCGGGTTCAACCACGTGGTCACAAACCCGGGAGCTGAGAAACGACTGTTCCAGGTCAAAGGAAAGAAGAATATTAGGGTTAGACAGGTGGATCCTCTCATCTCGTCGATGAACAAAGGCGACGTCTTCATACTCGATGTGGACAACAGCATCCTGGTGTACGTGGGCAGTTCAGCGAAGAATGTGGAGAAGTTGAAGGCGATTTCCATAGCCAATCAGATCAGAGATCAGGATCACAACGGACGGGGGAAAGTTGATATTATTGACCAATATTCTAGTGACGTAGATGTCGACAAATTCTTTACTTCGCTGGGTTCTGGATCCAAAAATCTGGTGCCAGATGAAAGTGCCGGTGGAGATGATCAGACATTCGAGAAAAATGAAGAGCGATCAGTCGTTTTGTCGGAAGTGTCCGACAGTTCTGGGAAATTAAAAATAACTCCACTCACAGGACCCTACCGCCAGGACCAGCTGAAGCCACAAGACACTTACATACTTGACACTGTCAGCGGCTCAATCTATGTGTGGGTTGGGAAACAAGCTTCACCAAAGGAAAAAAGTGAAGCTATGTCCAAGGCCGAACAATATTTGAGCTCAAAGAATTATCCGTCATGGGTACACGTGGCGAGGATTCCGCAGGGTACGGAACCCGCTATATTCAAGCAATACTTCACTACGTGGCGTGACGCCGGCATGTCGCACACACGATTGGTTCGTTAG

Protein sequence:

>DPOGS215132-PA
MKLLTVVTIFAVAVSTQARTAVRQQSVSGHITSLSDKDARQKASVHPAFANAGTKAGLEIWRIENFDPVAVPAAEHGKFYKGDSYIVLKTTSDKKKNLSWDIHYWIGSESSQDESGAAAILSVGLDDKFNDKAIQHREAMGYESQQFLGYFKNGAVRYLDGGHDSGFNHVVTNPGAEKRLFQVKGKKNIRVRQVDPLISSMNKGDVFILDVDNSILVYVGSSAKNVEKLKAISIANQIRDQDHNGRGKVDIIDQYSSDVDVDKFFTSLGSGSKNLVPDESAGGDDQVMALKTVSKSSITVRYLDGGHDSGFNHVVTNPGAEKRLFQVKGKKNIRVRQVDPLISSMNKGDVFILDVDNSILVYVGSSAKNVEKLKAISIANQIRDQDHNGRGKVDIIDQYSSDVDVDKFFTSLGSGSKNLVPDESAGGDDQTFEKNEERSVVLSEVSDSSGKLKITPLTGPYRQDQLKPQDTYILDTVSGSIYVWVGKQASPKEKSEAMSKAEQYLSSKNYPSWVHVARIPQGTEPAIFKQYFTTWRDAGMSHTRLVR-
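
The transcript and protein lengths in this record are consistent with each other: 547 residues plus one stop codon gives 3 × 548 = 1644 bp. A minimal sketch of that check, together with a translation using the standard genetic code, might look like the following. The function names `translate` and `expected_cds_length` are hypothetical helpers, not part of any geneset tooling, and using `-` for the stop codon is an assumption based on how the protein sequence above ends.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` (assumed here to be "-",
    matching the protein sequence in this record). Codons containing
    characters other than A, C, G, T raise a KeyError.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_cds_length(protein_length, include_stop=True):
    """Nucleotide length implied by a protein length, optionally with a stop codon."""
    return 3 * (protein_length + (1 if include_stop else 0))


# First five codons and last three codons of DPOGS215132-TA.
print(translate("ATGAAGCTCCTAACG"))   # MKLLT
print(translate("GTTCGTTAG"))         # VR-
print(expected_cds_length(547))       # 1644
```

Passing the full DPOGS215132-TA sequence to `translate` and comparing the result with DPOGS215132-PA would extend the same check to the whole record.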