Monarch geneset OGS2.0

DPOGS209828
Transcript: DPOGS209828-TA  3588 bp
Protein: DPOGS209828-PA  1195 aa
Genomic position: DPSCF300117 + 634508-645764
RNAseq coverage: 808x (Rank: top 16%)
Annotation
Heliconius: HMEL008984  0.0  82.82%
Bombyx: BGIBMGA003044-TA  2e-132  33.46%
Drosophila: jar-PH  0.0  62.05%
EBI UniRef50: UniRef50_Q01989  0.0  62.90%  Myosin heavy chain 95F n=50 Tax=Bilateria RepID=MYS9_DROME
NCBI RefSeq: XP_001648742.1  0.0  64.43%  myosin vi [Aedes aegypti]
NCBI nr blastp: gi|157105157  0.0  64.43%  myosin vi [Aedes aegypti]
NCBI nr blastx: gi|157105157  0.0  64.54%  myosin vi [Aedes aegypti]
Group
Gene Ontology: GO:0005524  0  ATP binding
               GO:0016459  0  myosin complex
               GO:0003774  0  motor activity
KEGG pathway:
InterPro domain: [26-745]  IPR001609  0  Myosin head, motor domain
Orthology group: MCL13013  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209828-TA
ATGGGTGAAGAGGCGGATGTCCTTCCCATCGACAACAAACACACGCGCCGCCTCGTCAACTTCGATGACATACTGCCAGCTGGAGATCCGGCTGTAGACGTCGATGATAACTGCGAACTAATGTTCCTGAACGAGGCTACTCTTCTTAATAACATACTCATCAGATACAATAAGAAAAAGATATACACGTACGTAGCAAATATTCTGCTAGCAGTGAATCCGTACGAAGATATCCCGGATATGTATTCTTCGTCTACCATCAAGAAGTATCAAGGGAGATCGCTCGGTGAACTGCCGCCTCACGTCTTCGCTATAGCTGACAAAGCCTTCCGTGATATGAAAGCGTTGAAGCAATCTCAGTCCATCATAGTGTCGGGAGAATCAGGCGCCGGGAAAACGGAATCAACGAAATACATACTGAAATATCTCTGTGATTTGTGGGCGAAGGGCGCTGGGCCAGTTGAACAGAAAATATTGGACGCCAATCCGATCCTTGAAGCGTTCGGTAACGCTAAAACAACACGCAACAACAACAGTTCCCGTTTCGGTAAATTCATGGAGGTGCATTTCTCCAACAAATACCAAGTGGTGGGTGGTCACATATCGCATTATTTGTTGGAGAAATCACGGATATGCACCCAAAGCGCCGAAGAGAGGAACTATCACGTGTTCTACTTGCTGTGCGCTGGAGCACCACAGGAACTACGGTCCGCCTTGAAAATAACCAAGCCCGATGATTATTTATATCTGAAGAACGGTTGCACTCAATACTTCACGTCGCCGCAATCTGAGAAGAAAATAAATGCAAGTCAAAAGAGCAAGCAACAACAAGCCAAAGGCGGTCTCCGAGACCCCATACTGGATGACGTGGAAGATTTCCAAAGACTCTACCAGGCCCTATCCCACATAGGTTTATCAGAGTCCGAAAAGAAATCGGTGTTTTCCATAGTGGGGGCCGTATTACATCTCGGCAATATAGAGTTTGAGGAGGAGGGTGGTGCTAGAGGTGGTTGTAGAGTTACACCAGAATCTGAACACGCTCTAGCTACAGCCAGCGAACTATTGGGTGTGGATGCTGGGGAACTGAGAATGGCTCTGGTCTCTAGACTTATGCAGAGCTCTAGAGGAGGAATCAAAGGAACCGCTATCATGGTGCCCCTCAAGACGTATGAGGCTTGTAACGCTCGCGACGCCCTGGCTAAGGCTGTGTACAGTCGGTTGTTTGACTCCATAGTGAGGAGGATCAATGACAGTATACCCTCCAGCACTTCCGCTTACTATATAGGAGTGTTGGACATAGCCGGATTCGAATACTTCCAAATGAACAGTTTCGAACAGTTTTGCATAAACTACTGCAATGAGAAGCTGCAACAGTTCTTTAACGAACGGATCCTCAAGAACGAACAAGAATTATACAAGAGAGAGGGCTTGAACGTACCTGAGATAAGATTCGTCGACAATCAGGATTGTATTGATCTAATAGAGAGCAAAAACCACGGTATCTTCCACTATCTGGACGAAGAGTCCAAATTGCCGAAACCAGATTTCGGTCATTTCACCAACTCAGTCCACAAGGAACTCGGCAATCAGTCCAATTCCCGCCTGAACGCTCCTCGCGCGTCCCGCCTGAAGGCTCACCGAGCTCTTCGCGAGCACGAAGGATTCCTTCTACGACACTTCGCTGGAGCGGTTTGCTATAACACGAGTCAATTCATAGAAAAAAACAACGATGCCCTACACGCCTCGCTGGAGTTCCTGGTGCAGGAATCTAAGAACACATTCGTACAACAACTGTTTGAAAATACAGACAACAGCAACGCTAAAGGGAAACTCAACTTTATATCCGTGGGCGCTAAGTTCCAATCTCAACTGTCACAACTTATGGAGAAATTGAAAGAAAACGGCACTAACTTCGTCCGCTGCATCAAACCGAACTCGCGTATGGAGGGTGGCTCGTGCTCTGGCTCGTTAGTGCTGACACAACTTCAGTGCAGTGGAAC
TATCGCCGTTCTATCGCTCATGGAACATGGTTTTCCATCCCGGGCTCCATTCGCAGACTTACACCGCCTCTATTCTGACTACCTACCGCCGAAACTGGCAGGACTACAGCCTAAAATATTCTGCCAGGCAATAGTCCACAGCTTCGGTCTATCAGACAAAGACTACAAGTTCGGTATAACGCGCGTGTTCTTCAGACCCGGCAAGTATTCGGAGTTCGATACCATGATGAAGTCGGATCCTGAGAACCTCAAGGCTATAGTCGACTCGGTGCTGGCCTGGCTCGTCAAGTCGAGATGGAGACGTTCCATATTCTCTGTGCTCTCCATTATTAAATTGAAGAACAAAATCCTTCATCGCCGCAAATGTCTGCTGATTGTACAAAAGACGATCCGCGGTTATCTGACTCGTAAGCAACACATGCCGCGTTACAAGGGGATCGCTAAGATCAGGCTGCTAGAGAAGAACCTCGTACAGATGGACACTGTGACGTCACAACTGAAGAAGGAAAGAGACAGCGCTAAGAAGAATATTGAAAACCTGAGAAATAGTATCAAAAACGCCTGTCATACCATCAAGAGTAACGAGAGAATAACCCGACACGAAATCGACCAAATATACACGAAGCTCACTAAAGACGTGGAAGCCCAAATGGCGGCCTTGCAGAAGGCGATGGTCGATCAGAAGAACAGAGAGGAACAAGAGCGCCTTCGCAAACTGGAGGCCGAGATGCGAGCGGCCGAGGAAGCGAAACGAAGGGAACAAGAAATGCTACGGCAAGAAGAGGAACACAGACGACTGAAAGCGGAGATGGAGTCGAGGCGGAAGACGGAGGAAGCGGAGAGGAAGAGGCAGGAGGAAGCTGACCGGGCGGCCGCGGACAGGCTGAGGAGGCAGCTAGAGGAGGAAGACCGCGCCGACCAGGAGAATAGAGAGAGGTTGGAACAAGAGCGGCGCGATCATGAAATGGCCGTGCGGCTCGCCAACGAAACGGACGGCCACGTGGAGGGCTCGCCGCCACAGCTACGCAGGTCGGAACGTGTTCGTATGCAACAAGCTCTCCAAGAAAAGCAGAAATACGACCTCTCCAAGTGGAAATATTCCGAGCTGAGAGACACGATCAACACCTCGTGTGACATAGAATTGCTTGAGGCATGTCGTCATGAGTTCCATCGCCGTCTGAAGGTGTATCACGCTTGGAAGGCAAAGAACGCTCGTAAGTCGACCTTGGAACAGGAACGAGCTCCGCAGTCCATCATGGACGCCGCCAAGGCTCCCCGAGTAACCACGGGCGCCATATTGGGTGCGCGTCATCGTTACTTCCGCATCCCGTTCGCTCGCCCCGGCAGTGACGAAGCCCGCGGCTGGTGGTACGCCCACTTCGACGGGCAGTACGTGGCTCGTCAGATGGAGCTACACCCGGATAAGACGCCGGTACTGCTCCAGGCTGGGTTGGATGACATGCAGATGTGTGAACTCAGCTTAGACGAGACAGGACTCACGAGGAAACGCGGAGCCGAAATATTGGAGCACGAGTTCGAGCGAGAATGGGCCAAGAACCGCGGACCGCCCTACAAACCAGCTCGTTAA

Protein sequence:

>DPOGS209828-PA
MGEEADVLPIDNKHTRRLVNFDDILPAGDPAVDVDDNCELMFLNEATLLNNILIRYNKKKIYTYVANILLAVNPYEDIPDMYSSSTIKKYQGRSLGELPPHVFAIADKAFRDMKALKQSQSIIVSGESGAGKTESTKYILKYLCDLWAKGAGPVEQKILDANPILEAFGNAKTTRNNNSSRFGKFMEVHFSNKYQVVGGHISHYLLEKSRICTQSAEERNYHVFYLLCAGAPQELRSALKITKPDDYLYLKNGCTQYFTSPQSEKKINASQKSKQQQAKGGLRDPILDDVEDFQRLYQALSHIGLSESEKKSVFSIVGAVLHLGNIEFEEEGGARGGCRVTPESEHALATASELLGVDAGELRMALVSRLMQSSRGGIKGTAIMVPLKTYEACNARDALAKAVYSRLFDSIVRRINDSIPSSTSAYYIGVLDIAGFEYFQMNSFEQFCINYCNEKLQQFFNERILKNEQELYKREGLNVPEIRFVDNQDCIDLIESKNHGIFHYLDEESKLPKPDFGHFTNSVHKELGNQSNSRLNAPRASRLKAHRALREHEGFLLRHFAGAVCYNTSQFIEKNNDALHASLEFLVQESKNTFVQQLFENTDNSNAKGKLNFISVGAKFQSQLSQLMEKLKENGTNFVRCIKPNSRMEGGSCSGSLVLTQLQCSGTIAVLSLMEHGFPSRAPFADLHRLYSDYLPPKLAGLQPKIFCQAIVHSFGLSDKDYKFGITRVFFRPGKYSEFDTMMKSDPENLKAIVDSVLAWLVKSRWRRSIFSVLSIIKLKNKILHRRKCLLIVQKTIRGYLTRKQHMPRYKGIAKIRLLEKNLVQMDTVTSQLKKERDSAKKNIENLRNSIKNACHTIKSNERITRHEIDQIYTKLTKDVEAQMAALQKAMVDQKNREEQERLRKLEAEMRAAEEAKRREQEMLRQEEEHRRLKAEMESRRKTEEAERKRQEEADRAAADRLRRQLEEEDRADQENRERLEQERRDHEMAVRLANETDGHVEGSPPQLRRSERVRMQQALQEKQKYDLSKWKYSELRDTINTSCDIELLEACRHEFHRRLKVYHAWKAKNARKSTLEQERAPQSIMDAAKAPRVTTGAILGARHRYFRIPFARPGSDEARGWWYAHFDGQYVARQMELHPDKTPVLLQAGLDDMQMCELSLDETGLTRKRGAEILEHEFEREWAKNRGPPYKPAR-