Monarch geneset OGS2.0

DPOGS204699
TranscriptDPOGS204699-TA4536 bp
ProteinDPOGS204699-PA1511 aa
Genomic positionDPSCF300170 + 540789-552239
RNAseq coverage222x (Rank: top 45%)
Annotation
HeliconiusHMEL0082230.062.52% 
BombyxBGIBMGA007478-TA1e-13277.48% 
DrosophilaSCAR-PB9e-7850.17% 
EBI UniRef50UniRef50_F4WVK71e-8560.22%Wiskott-Aldrich syndrome protein family member 3 n=10 Tax=Neoptera RepID=F4WVK7_ACREC
NCBI RefSeqXP_973241.14e-8958.54%PREDICTED: similar to actin binding protein, putative [Tribolium castaneum]
NCBI nr blastpgi|910879759e-8858.54%PREDICTED: similar to actin binding protein, putative [Tribolium castaneum]
NCBI nr blastxgi|910879751e-11047.85%PREDICTED: similar to actin binding protein, putative [Tribolium castaneum]
Group
Gene OntologyGO:00055155.1e-05protein binding
KEGG pathway 
Orthology groupMCL15072 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204699-TA
ATGCCGCTGCCGAAGCGCTGCGTCGAGCCGGTGCACGTGTCTCGCGGCACTGTCCCCGAGCGCCTGGCCGTACCCTCCGAGCTAGAGGCCGTCACCAATGGCACCCTCGCCAACACAGTCAGGCAACTATCATCCCTCTCTAAGCATGCAGAGGACATGTTCGGTGAGTTGACCCGCGAGGCTACCGGCCTGGCTGAGCGGACGAACGTGTTGCAGGCGAGAATAGACCGCCTCGCCATCAAGGTCACACAACTGGATTCAGGGGTTGAAGAAGTGACGCTCCAGGACATACAAATGCGGAAGGCGTTCCGTTCGGCTAGGAGCTTCCAACAGCAGCTGTTCTCTCGTACGAGTATGCCGTCGGCCATGCTGGCGACGTACGCGCGTTGCGATCGACCACCACCGCTTGAAAAACTGAACGAGTTCAGAGACGATGGCAGGGATGCCAGGAAGTTCTACACTGACCCGGACTACTTCTTCGAGCTGTGGCGCAGAGAGATGCTGCAGGACACCGAGCGGATCCAACATGATAGAGGGAAGAAGGTCCGGCAACCGCGTAACAACAACACTGAAGGCAGATCGGCCCGTCGTGTCCGTGTGCCACATTCTACCAGGGACAGACAGAAGGCGGATGCGGTGACACGAGGAGAACACATCATGGCTCCCAGCCAGCTCAGGAGATACAACATAGACTACCGGGAGAATAAACAGGAACAGGTTTATCAGACAACAACAGTAACCGACGGTGGTTACCGAGCCACACAGAACAGGCCACCCATAACACAACCGGCTCGACCCAACTCCATCGAAATAGAAAACCAGCAGAATAGAACCTCGAGGCTCACTGGCAATGGACATGTAATGTCTGATGAGAGTCCATACGGTGGTGTAGAGAGTATTTACGGTAGTGCTGCCGGTACACCGCGCCGGGCCAGGCCTTCGATACCTCCCCCCGCGCCCCCCGCAACGATCGGTGATTCCCCCTCGCAGCACACGCCTACCAGGGCTCAGGCCAACGGTTTCGGCTGTACCGATTACGAGACAGTTGTGAGGTCTTCTCTGCCGCCCCCGCCCCCGCCCCCGGAGTCACCGCACAGTGTGAACGGAACCTCTCCGCCGCACCGCACCGCCTTGGACAAGCATCTCGACCAGATGCACGCTGTTATCGGAATGATGTCATCAGAGGAACTCCCGCCGCGAGTCCCTTCGCCCCCATTGGCGCCTCCGGCTCCATTAGCGCCCCCAGCGCCCCCGCCGCCAGAAGAAGACCGCCCTCGGCCTCCGTCGCCGGCTTCGCTACTCCGGGGTGCTTCCGCTCTCAAACCTCCCAGGCCCTCCGCGGACCCCGCACCGGACCCGCGCTCCGACCTCCTCAAAGCTATACGTGAAGAGGACATGTTCGGTGAGTTGACCCGCGAGGCTACCGGCCTGGCTGAGCGGACGAACGTGTTGCAGGCGAGAATAGATCGCCTCGCCATCAAGGTCACACAACTGGATTCAGGGGTTGAAGAAGTGACGCTCCAGGACATACAAATGCGGAAGGCGTTCCGTTCGGCTAGGAGCTTCCAACAGCAGCTGTTCTCTCGTACGAGTATGCCGTCGGCCATGCTGGCGACGTACGCGCGTTGCGATCGACCACCACCGCTTGAAAAACTGAACGAGTTCAGAGACGATGGCAGGGATGCCAGGAAGTTCTACACTGACCCGGACTACTTCTTCGAGCTGTGGCGCAGAGAGATGCTGCAGGACACCGAGCGGATCCAACATGATAGAGGGAAGAAGGTCCGACAGCCGCGTAACAACAACACTGAAGGCAGATCGGCCCGTCGTGTCCGTGTGCCACATTCTACCAGGGACAGACAGAAGGCGGATGCGGTGACACGAGGAGAACACATCATGGCCCCCAGCCAGCTCAGGAGATACAACATAGACTACCGGGAGAATAAACAGGAACAGGTTTATCAGACAACAACAGTAACCGATCCACCAATGGCTCGGTGTGTCTTCAGGCCACCCATAACACAACCGGCTCGACCCAACTCCATCGAAATAGAAAACCAGCAGAATAGAACCTCGAGGCTCACTGGCAATGGACATGTAATGTCTGATGAGAGTCCATACGGTGGTGTAGAGAGTATTTACGGTAGTGCTGCCGGTACACCGCGCCGGGCCAGGCCTTCGATACCTCCCCCCGCGCCCCCCGCAACTATCGGGGATTCCCCCTCGCAGCACACGCCTACCAGGGCTCAGGCCAACGGTTTCGGCTGTACCGATTACGAGACAGTTGTGAGGTCTTCTCTGCCGCCCCCGCCCCCGCCCCCGGAGTCACCGCACAGTGTGAACGGAACCTCTCCGCCGCACCGCACCGCCCTGGACAAGCATCTCGACCAGATGCACGCTGTTATCGGAATGATGTCATCAGAGGAACTCCCGCCGCGAGTCCCTTCGCCCCCATTGGCGCCTCCGGCTCCATTAGCGCCCCCAGCGCCCCCGCCGCCAGAAGAAGACCGCCCTCGGCCTCCGTCGCCGGCTTCGCTACTCCGGGGTGCTTCCGCTCTCAAACCTCCCAGGCCCTCCGCGGACCCCGCACCGGACCCGCGCTCCGACCTCCTCAAAGCTATACGTGAAGGAATCAAGTTACGCAAAGTGGAGAAACGTTGCGAAGAAAACACGGGCAAATATAATACACTGAGGGGCGCGCCGCCATTGTTAGACGTGGCGTCTATTTTGGCTCGACGCGTGGCCGTTGAGCAGAGCGACACCGACTCCGACAACGAGTCGGACTCCGACGACTCCGACCACAGAGATATCGAAGCACAGAAGTCAAATTTGAAAACCGATTCCCCACCCAAGAACAGTCCACCGAAGGCGTCCGTTCCTCCGAGGAGGAGCAGTCTCGTCAAGGATTCGCCGAATCCTTCACCGCCCAATCACCCTCAGAAGGAAGCGGTCCCTGAGACCCAAAAGATTTCATCCCCACCTAATAAGAATATAATCTCATCTGTAAATAATAAAGAAACGCATCTGCCTTCATTCAAGAGTAATCTGTTACAGAGGAATGCATCATCACAACCAGTTGTCGCGGAAATAAGTTCGCCGGCGGTGAGCTCTCCGCCTATTACTTCACCACCGACACATTCTCCGGTAGTAGCGTCTCCCCCAGTCAGTAAATCAATTACATCTTCTCCTATCAAATCTGTATTATCGCCACCTACAAAACGAGACAATTTCCTATTAAATAAGAACGAAATAATGTCACCCCCGTCGTCCGCAGTGACATCGCCCCCTCCGAATATTTTCAAATTCAATAAAACCGACATGAAACCTGAGGATAGTTCGGCCTCATCGAAAATAAAGAAGGATTCAATATTAATCAAGGAGGCTCCTATTAAAATCGATAAACTGTCATTGGTAAATATGAGCAGCTATGATGCCGAGAAAACGGTTGGTAGCGATCAGAACAACTCTTCTAAAACGTCCGATGTCACTAATGGTGAAAAAGCTGCGGAATCGAATGGAGTCGGTGACAAAATAAGAAAATTCGAGAAGGCCGCCGAAGATGCGAGTCTAAGCGCAGGGAAACTGTCCCGGCCAGGGTCTGTTAGAGGAAGATCACGATCAGAGAGACTCGGATCCGACAGTAAAGAAGACTTGCCACCTCCAGCCACACCCTTCAAAGACCACGTGTTCTTCGACCTCGGTAGTGCAGATAAAACGGATAAGAGTAATACCGAACCGGAGGTCACAAAAGTTCAAAACACAGAACCCGAACGAACCTTCGAGAGGCAGCTGAGTCTGCCGAGAGGATCGGTGAATTCAGTTGCAGCTAAATGGCAACACAACGAAAGCTCGTCTATCGATGAAAAAATACAAAAACCCGAACCGCAGAGAATTATAAAGCCGGAATTCAAACCGATGCCGTCGCACGTCGACGTCACGATTCGATCTCCAAGTCTTGGCACTAAGCTCCCGGATCAATTCCCCTCTCAAGGAACTGCCTTCCGCAACGTCGCTTCCCCCGACCCTCGTCCAGTTAAAGTCAACGACGATGTGAATACTCAGAACAATTTCAAGACCTGCATATTAAAGAAGAATAAGGAAAAATATTCCGTTAGCAAGGGTTCCAGTTACTTCACGAGAGGTTCCGAGGAGATATGGCTGGTGAAGAAGAAAGACATCGAAGAAATTGAGGAGCGGGTACTCGACTCCTTCCGACGGGCGGGGGGTAATCTGTGCATGCGGAGTGAATCCGTGCGTCCGTCGTCGGATAGCGGTCAAAATTCCTTCAGCCACGCGACTATGGGTCGTACTAAGAGACAGATGTATACCAGAAGCGAATCTTTGGACCCACAATGGGCAGGAGCGAGGGCGCAAACCTTAAAGCGCCAGTCATCAGTAGCTTGTACGTGCGGCCACGATAAAAAAACTAGAGCGAAGAGTGCGGGTCCGGAATCGAATCAGCGACCTCGCTCTCGATCTCATGGCGACGAAAACAACCAAGGTCACGTGTTAGACAAGTACGAAACACTAGTGTAG

Protein sequence:

>DPOGS204699-PA
MPLPKRCVEPVHVSRGTVPERLAVPSELEAVTNGTLANTVRQLSSLSKHAEDMFGELTREATGLAERTNVLQARIDRLAIKVTQLDSGVEEVTLQDIQMRKAFRSARSFQQQLFSRTSMPSAMLATYARCDRPPPLEKLNEFRDDGRDARKFYTDPDYFFELWRREMLQDTERIQHDRGKKVRQPRNNNTEGRSARRVRVPHSTRDRQKADAVTRGEHIMAPSQLRRYNIDYRENKQEQVYQTTTVTDGGYRATQNRPPITQPARPNSIEIENQQNRTSRLTGNGHVMSDESPYGGVESIYGSAAGTPRRARPSIPPPAPPATIGDSPSQHTPTRAQANGFGCTDYETVVRSSLPPPPPPPESPHSVNGTSPPHRTALDKHLDQMHAVIGMMSSEELPPRVPSPPLAPPAPLAPPAPPPPEEDRPRPPSPASLLRGASALKPPRPSADPAPDPRSDLLKAIREEDMFGELTREATGLAERTNVLQARIDRLAIKVTQLDSGVEEVTLQDIQMRKAFRSARSFQQQLFSRTSMPSAMLATYARCDRPPPLEKLNEFRDDGRDARKFYTDPDYFFELWRREMLQDTERIQHDRGKKVRQPRNNNTEGRSARRVRVPHSTRDRQKADAVTRGEHIMAPSQLRRYNIDYRENKQEQVYQTTTVTDPPMARCVFRPPITQPARPNSIEIENQQNRTSRLTGNGHVMSDESPYGGVESIYGSAAGTPRRARPSIPPPAPPATIGDSPSQHTPTRAQANGFGCTDYETVVRSSLPPPPPPPESPHSVNGTSPPHRTALDKHLDQMHAVIGMMSSEELPPRVPSPPLAPPAPLAPPAPPPPEEDRPRPPSPASLLRGASALKPPRPSADPAPDPRSDLLKAIREGIKLRKVEKRCEENTGKYNTLRGAPPLLDVASILARRVAVEQSDTDSDNESDSDDSDHRDIEAQKSNLKTDSPPKNSPPKASVPPRRSSLVKDSPNPSPPNHPQKEAVPETQKISSPPNKNIISSVNNKETHLPSFKSNLLQRNASSQPVVAEISSPAVSSPPITSPPTHSPVVASPPVSKSITSSPIKSVLSPPTKRDNFLLNKNEIMSPPSSAVTSPPPNIFKFNKTDMKPEDSSASSKIKKDSILIKEAPIKIDKLSLVNMSSYDAEKTVGSDQNNSSKTSDVTNGEKAAESNGVGDKIRKFEKAAEDASLSAGKLSRPGSVRGRSRSERLGSDSKEDLPPPATPFKDHVFFDLGSADKTDKSNTEPEVTKVQNTEPERTFERQLSLPRGSVNSVAAKWQHNESSSIDEKIQKPEPQRIIKPEFKPMPSHVDVTIRSPSLGTKLPDQFPSQGTAFRNVASPDPRPVKVNDDVNTQNNFKTCILKKNKEKYSVSKGSSYFTRGSEEIWLVKKKDIEEIEERVLDSFRRAGGNLCMRSESVRPSSDSGQNSFSHATMGRTKRQMYTRSESLDPQWAGARAQTLKRQSSVACTCGHDKKTRAKSAGPESNQRPRSRSHGDENNQGHVLDKYETLV-