Monarch geneset OGS2.0

DPOGS207000
Transcript: DPOGS207000-TA, 5382 bp
Protein: DPOGS207000-PA, 1793 aa
Genomic position: DPSCF300001 + 944012-1001836
RNAseq coverage: 247x (Rank: top 42%)
Annotation
Heliconius: HMEL008677, 0.0, 82.81%
Bombyx: BGIBMGA012920-TA, 1e-55, 84.73%
Drosophila: Mhcl-PB, 0.0, 42.89%
EBI UniRef50: UniRef50_E1ZXJ2, 0.0, 43.32%, Myosin-XVIIIa n=12 Tax=Formicidae RepID=E1ZXJ2_CAMFO
NCBI RefSeq: XP_001604740.1, 0.0, 44.57%, PREDICTED: similar to CG31045-PA [Nasonia vitripennis]
NCBI nr blastp: gi|345494508, 0.0, 44.80%, PREDICTED: myosin-XVIIIa isoform 1 [Nasonia vitripennis]
NCBI nr blastx: gi|345494506, 0.0, 44.38%, PREDICTED: myosin-XVIIIa isoform 2 [Nasonia vitripennis]
Group:
Gene Ontology: GO:0005524, 7.2e-73, ATP binding
               GO:0016459, 7.2e-73, myosin complex
               GO:0003774, 7.2e-73, motor activity
KEGG pathway:
InterPro domain: [121-851] IPR001609, 7.2e-73, Myosin head, motor domain
Orthology group: MCL10340 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species
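
The lengths and coordinates in the record can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on assumptions that the record does not state: that the transcript is a coding sequence ending in one stop codon, that genomic coordinates are 1-based and inclusive, and that the InterPro range is given in protein residues. The function names are hypothetical.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 5382
PROTEIN_AA = 1793
GENOMIC_START, GENOMIC_END = 944012, 1001836
DOMAIN_START, DOMAIN_END = 121, 851


def cds_length_matches(transcript_bp, protein_aa):
    """True if the transcript is exactly the protein codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


def genomic_span(start, end):
    """Length of the genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def domain_within_protein(start, end, protein_aa):
    """True if the domain range lies inside the protein (residue coordinates assumed)."""
    return 1 <= start <= end <= protein_aa


print(cds_length_matches(TRANSCRIPT_BP, PROTEIN_AA))                 # True: 3 * 1794 = 5382
print(genomic_span(GENOMIC_START, GENOMIC_END))                      # 57825
print(domain_within_protein(DOMAIN_START, DOMAIN_END, PROTEIN_AA))   # True
```

Under these assumptions the 5382 bp transcript corresponds to 1793 codons plus a stop codon, and the genomic interval (57825 bp) is roughly ten times longer than the transcript.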

Nucleotide sequence:

>DPOGS207000-TA
ATGCTCTACATGAGGATCATACTGAAGCTGGTATTTTTCGAAATCAGTGTCTCAAATTCTGAGGATAAACATGTGTTGCCATTACCAGAAATCCAAAGGAATCCACTTAACACAATAATATTATTTAATGAGTCACACCGATTGAAAAAGGCAAAGTCCGAAGAGCAGCTCGCATGCGAGAAAGAATGGCTACAAGCCGGTCAGATGTGGCTGGCCCATCGTGGCGGGTTCACAGCTGTAGTGAGAGAGGGAGATGCTGAACCTGGCCGGGCTAAAGTCAGGGTACTACAGACGGGAGAGATTATCACAGTCGATGAAGACGATCTAGAGAAGGCTAATCCCCCTCAATTAGAGCGATGCGAGGACATTGCATCGTTGCGTTGCTTGAACGAGTGCGGTGCGTTGCACGTGCTGCGTTCGCGTTATGCCGCTGCGTTGCCGCACGCTCGTGCCGGACATGCTCTACTTGTGCTTGGACCACCCCGCCGTTCTACTCCCGTTTATACAGAAAAGGTGGCGGCTATGTTCCGCAGCTGCCGCACCGACGACATGCCGCCGCACGTTTTTGCGGCAGCTCAATCAGCACACCGCGCCATGTTGGCCGCGAGAAGAGACAGGGCTATTGTGTTCCTAGGAAGGTCCGGTTCTGGTAAGACCTCTGCTATGCGTCACTGTGTTTGGTATCTGGCTACAGCATATCCAGCGCAAGGCTCCAAACTCACACCGGAGAGGCTGGAAGCTGCTCTGGATGTACTGCACATATTCGGCTCCAGTCGAAGCGCCTCCAATCTCCACGCCTCTCGTTTCGTTTCTCTTACGTCGCTGGACTTCGACGGTGCGGGTGCGTTAGTGTCAGCTTCAGTACAAGCGCTCCTGCCAGACCTCAGACCTGATCAGTCTCCACTGAGGGCGTTACACACATTGTTCCACGGCTGTGATGCTCGTCTTCGTCGCGAGCTTCTGTTGGACCAGGCTCCAGTGAACGCTCCCAATCCGTACATCAACTCCTCTGATAAGACTGACGCTCCGACCGAATTCGTTGCTTTGAAGGAGGCGTTGCAACTTTTATCAGTCACCGAACAAGAGCAATTGGCCGTGTGGAAGATTATAGCTGCTATATGTCACTTGGGATGGGCCGGAGTTGCTAGAGCTAACTCTGGTTCCGGTGTGAGGTATCAGTTCGCTAGTACAGGTTGTGCGGGTCGAGCTGGTCGTCTACTTGGCGTCACAGCTGAGGAACTCGCCAAGGCTTGCTTCGCCCCCGCCAGCCCGCCCTCACCGCAGCCGCCTACTGGATTGAGAACTCCATCGCCTTCGAACGAGAAGGAAGCGCCAGGATCGGACGCTCTCGATGCTTTCGCCACCGGACTGTATAGCGAAGTATTCAACGTTATCGTAGGATTCGTCAACAGGAGTACGTCGACATCAAGCCGCACTTCCAGCTCTTTGCTACTGCTGGATTCTCCAGGTGCTGATAACCCTATGTGTTCCGGACAACAAAGCGGCGCCGGGCTGACCAAGCTACTGTCCAATTATATGCAGGAACGTCTACAAGCCGTGTTCCACGAGGCCATGCTGGTTGCTCCACGCGAACGGTACGCGCAAGAAGGAGTCACCATAGACGACGGTAACGACGAAGATTGGCTATCAGAGTCTGTCAACCCGGGGCCGATGGTGGATTTGTTGGACAAGTCACCTCAGAACACAATAGTCAGAAGTTCTCAGGCCGATCTGAGAGATTGCGACCGGCGGGGACTTCTCTGGTTGTTGGACGAGGAATCTATGTACCCGGGTTCTTCTGACGATACTTTCTTAGAGCGCGTCATGTCCCAGTACGGCGCCCCCCATCATACACACTACCTAATAAAGAAGGCTCCTCACAATAGACAATTCATACTTCAGCACTTACAGGGCACTAATCCTGTTTTATATGATGTCTCCGGGTGGGTGAAGGCCAGCCGCGAGAATCCAGCTATGAAGAGAGCTCATACTTTGCTTCA
GGAAAGCCAAAATCCCTTGTTGTCGTCGTGTGAACGTACGACTGACGCTGGCACGCTGCGAAGAGCCGCATCCGTACGACGAGCGCTGGCTTCTGGCGATGTTGCCACCTTCTCCCGGACCTCAATCATTCGTCGTTCCCTCAACTCTGGCACCGCTGGTATGAAGCGTCGGTCAACAGCTCTCCAAGCCAAGTTCATAGCTGACGGGGTCGTGGATACCTTACGCCGCTGCGGGTCAGGCGGGATACAGTTTGTTTGCTGTCTCCTCACCAACCAACCTAACGAGACCCCCGACGATGTAAACGTACCGCTATTAAGATCCCAGTTCCGCGGCTTCCAATTACTAGACGCGGCTAGATTGTATAAACAAGGATTCCCTGAACATATGCCATTATCGGAATTCGCTAGGAGATATCGACTATTGGCAACGTCGGAGAAGGAGGAATCTGAAACATCCCAGCAGTCAACGACGTTATCGGACAGACAGATCGTGGACGAGATGCTGTTGGTGCTAGACCTAGATGTGACGAGCTACAGACTCGGACCCACTCAGTGTTGTATTACAATGTTTGTGATCCTGGCCTTTATCGGGGTCAGATTCGCCCTTCATGTGTCAAGGCAGAGCGGTCGGTCGCTGGCCGACGAGTGCTACACAAGAAAATGCACCACCGCCGCTGTGGAGCTGTCGGTGCCGGAGTGGATGCATTATCTGGCTGCTGGACTCTTCCTCCTCTGGCTGGTGTTCGGAAGACGTCGGAGCGTTTGTAGAACTCTGCGCCTGCGCACATGCACGCCGTACACCGTAACGCGCGCGCGCATCCACGCTCCGCGACCGCGATCGCCCGTGCGTCGGGCCCGCGATATGCCCGCAGTGAAAGTATTTAGTATATTTGGTGGGCAGAAGCCGGCCGAGAAGCAACCGACACCCCCCGCTCATGGGGCTATTCAAAAATTGTACAGGACATTTTCAGTCAGTCCCGAAATCTCAACGGAGCACAAACTCGGCAAAGTAACCGCGATAGCTGCCTGTATTGAGAAAAAGATTACTGAGCCTAGTATTTCTAAGAGTGACGTGCAAAATATATTGAAAAAGACTAATACTTGCGATTCATCGAACTCAACGGATTGTTCGAAATTACAAAGTAACGTGTATGAGAATGTTAATATTGTGAACTCGGAAAAACTATCATCGGTTAGTAATAATTTGTTGGAAGAAACAAAAGGATATTCATCTATTTATGAAAATATAACATTTATTGGGAAAATCGATTGTTCTGAAACAGTAATCGAACCTTTAACCTCCGAATCAAAATTAGGGAAGGCCTTGGAATCATTTGATAAAATTCTAACAGAATTTTCTTCAAGCCTTTTTTTAACTCACAATCCTAATTTCGTAGTACCCAAATTACAAAAATCTAAAACTTGTAGTATTATAGAGTCGCGGTGTATTCTGAAGAAAACAAATTCTGATCCCGAAACAGAGCGAAGAACGCGTCATCTCGTTTCGAGAAATAACTCTATAGACAAAACTACGAGTTTGTGGAATCTTGATGACATGAGAGGTAAGGAGCTAACAAGTCCTCTTACACCGCTGAATCCTTTACCTATTGACAAAATGAGTTCTGACAAATATGCAACGTATAAAGTATCATCAAAACCCACGACCCGCGCTACAACCATAAAAACAGAAGACTTGAAAAAATTGGACGTAAAAACAAGAATTCTAAAAAAGACCCTATCAAATCCTCCATCGACGCCGGTACCTGTCGTGTCAAAAACGAAACCGATTCCAAAAAAACTGGACAAGAAGTTCACTGACATCAAAAAGGATAAGCCGAGAAGTGGTCTTAATAATGCCAGAAACACTATTGATATGCCAAAATCCCAAATGCAAAAAGCAAAAAGTGTTTGGGAAATTGGTAACGAATCGATGATTTCGCCTGGCTTGGAGAGAACTAAATCAAGTACTTCTATCGCGGGTTCTCCTAGTAAGATCCCGG
TGATCAGGAGCCAAATGTCACAAAATAAATTTAGCACCACACGAGCTCTTTTTTCTCCCACTCCTGTTGATCTAACTAATGTAGACCAGCCCAGAGAGTGTGCTCAGAAAAAAAAGCCCATCCTGTCTCGAAAAACAGCTGAGAAAATCGATCAAATGAAACAAACACGACAAAAATCCCAGCTAAATTTAAAAGCTGACTCAAAGCGGCAATCCGACAAGGCCAAGAAAGATTCAAGTGAAAAGAAAGTAGCATCTCCATCACCAAAGAGTACTCTAAGAGACTACTCTGATGAATTGAATGCCGTGCGCGCCAAAATACAAGTAAAAAAAATAAACAACAAACGAGATCTCACAGTAGAAATGCCCGGTAATAACCGAAACCAAAACGACACGGACGAAATCGATTCTATGCTATCACCTGTTAAATATCTCGTAAAAAAGTTGGAGTTTAAAACAGCTCTGGAATTGAAAACTGAGACTGCCCAATTTACAAACTGTAAAGTGATACCACCGTTTCATAAAGAGGTATGCGTAGCTAATACGACATTTCATAATCATTTGAGCACACTCGTGGGTCGCCAAATTAAATGCATAGAATCCAGGGATGACATTAAAACATACAGCCAGCAAGATAAGCTGACTTTGCACAAACATTCCGAGGAGAAGATATCGGACACGCACTCTGACTGCAGCGACGACTCCGGTCACGTTTCTAATGACGCTGCTAATGATAACGACGCTGTGTTCGACAACGTCCTTGAAATGAGTCGAGCGAACAGCAGCGCTGATGAGCTGGGCTGTAAATTGTTTGATGCTCCCAAACAGTTCAGGGTAGAACTGCCGGAGAATGCCAAGAACGTCTGTCCAGTGCGGCCCGCCCGGCGCGGCGAGCGCTGCAATGAGGTCGCTAGCGGTATACGCGCCATCGCCGACACTCCGAAAGTGGACGAACAGATCATGTTCAGAGGCGGCGTGGTGGGCGATTTGGACGCCCGCCGGGACGCCGCTCTGGCCCGAGTCCTCGTTCGACTACAAGCGAGGGCTCGCGGTCTCTTGGCCAGGAGACGAGCTGAAAGACTCCGCACTCAGCACACCGCCGCCAGATGCGTTCAACGAAATGTGCGCGCCTTCCTCGCGGTTCGTGACTGGCCATGGTGGAGACTATTGGTACGCGTCACTCCGCTACTGGCTGTGCATCGCACAGAACACAGGCTCAAACAAGCACAGGAGGAGCTGGAAACTCTCCGAGTTAAACTTGAGAAGGCAGAAAATGAGCGCAGTCATTACAAGAACGAAACGGAACAACTTGAGACTAAGAAGGTTCGGCGTGACGAACGTCATTCGGGATTGCACTTCGACATCTACGAAAAACCAAAATAG

Protein sequence:

>DPOGS207000-PA
MLYMRIILKLVFFEISVSNSEDKHVLPLPEIQRNPLNTIILFNESHRLKKAKSEEQLACEKEWLQAGQMWLAHRGGFTAVVREGDAEPGRAKVRVLQTGEIITVDEDDLEKANPPQLERCEDIASLRCLNECGALHVLRSRYAAALPHARAGHALLVLGPPRRSTPVYTEKVAAMFRSCRTDDMPPHVFAAAQSAHRAMLAARRDRAIVFLGRSGSGKTSAMRHCVWYLATAYPAQGSKLTPERLEAALDVLHIFGSSRSASNLHASRFVSLTSLDFDGAGALVSASVQALLPDLRPDQSPLRALHTLFHGCDARLRRELLLDQAPVNAPNPYINSSDKTDAPTEFVALKEALQLLSVTEQEQLAVWKIIAAICHLGWAGVARANSGSGVRYQFASTGCAGRAGRLLGVTAEELAKACFAPASPPSPQPPTGLRTPSPSNEKEAPGSDALDAFATGLYSEVFNVIVGFVNRSTSTSSRTSSSLLLLDSPGADNPMCSGQQSGAGLTKLLSNYMQERLQAVFHEAMLVAPRERYAQEGVTIDDGNDEDWLSESVNPGPMVDLLDKSPQNTIVRSSQADLRDCDRRGLLWLLDEESMYPGSSDDTFLERVMSQYGAPHHTHYLIKKAPHNRQFILQHLQGTNPVLYDVSGWVKASRENPAMKRAHTLLQESQNPLLSSCERTTDAGTLRRAASVRRALASGDVATFSRTSIIRRSLNSGTAGMKRRSTALQAKFIADGVVDTLRRCGSGGIQFVCCLLTNQPNETPDDVNVPLLRSQFRGFQLLDAARLYKQGFPEHMPLSEFARRYRLLATSEKEESETSQQSTTLSDRQIVDEMLLVLDLDVTSYRLGPTQCCITMFVILAFIGVRFALHVSRQSGRSLADECYTRKCTTAAVELSVPEWMHYLAAGLFLLWLVFGRRRSVCRTLRLRTCTPYTVTRARIHAPRPRSPVRRARDMPAVKVFSIFGGQKPAEKQPTPPAHGAIQKLYRTFSVSPEISTEHKLGKVTAIAACIEKKITEPSISKSDVQNILKKTNTCDSSNSTDCSKLQSNVYENVNIVNSEKLSSVSNNLLEETKGYSSIYENITFIGKIDCSETVIEPLTSESKLGKALESFDKILTEFSSSLFLTHNPNFVVPKLQKSKTCSIIESRCILKKTNSDPETERRTRHLVSRNNSIDKTTSLWNLDDMRGKELTSPLTPLNPLPIDKMSSDKYATYKVSSKPTTRATTIKTEDLKKLDVKTRILKKTLSNPPSTPVPVVSKTKPIPKKLDKKFTDIKKDKPRSGLNNARNTIDMPKSQMQKAKSVWEIGNESMISPGLERTKSSTSIAGSPSKIPVIRSQMSQNKFSTTRALFSPTPVDLTNVDQPRECAQKKKPILSRKTAEKIDQMKQTRQKSQLNLKADSKRQSDKAKKDSSEKKVASPSPKSTLRDYSDELNAVRAKIQVKKINNKRDLTVEMPGNNRNQNDTDEIDSMLSPVKYLVKKLEFKTALELKTETAQFTNCKVIPPFHKEVCVANTTFHNHLSTLVGRQIKCIESRDDIKTYSQQDKLTLHKHSEEKISDTHSDCSDDSGHVSNDAANDNDAVFDNVLEMSRANSSADELGCKLFDAPKQFRVELPENAKNVCPVRPARRGERCNEVASGIRAIADTPKVDEQIMFRGGVVGDLDARRDAALARVLVRLQARARGLLARRRAERLRTQHTAARCVQRNVRAFLAVRDWPWWRLLVRVTPLLAVHRTEHRLKQAQEELETLRVKLEKAENERSHYKNETEQLETKKVRRDERHSGLHFDIYEKPK-
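
As a spot check that DPOGS207000-PA is the translation of DPOGS207000-TA, the first and last codons of the nucleotide sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch that assumes the standard genetic code and writes the stop codon as `-`, matching the trailing character of the protein record; `translate` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, stop_symbol="-"):
    """Translate an in-frame DNA string; any trailing partial codon is ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 and last 18 nucleotides of DPOGS207000-TA, copied from the record.
first_30 = "ATGCTCTACATGAGGATCATACTGAAGCTG"
last_18 = "TACGAAAAACCAAAATAG"

print(translate(first_30))  # MLYMRIILKL
print(translate(last_18))   # YEKPK-
```

Both fragments agree with the start (`MLYMRIILKL`) and end (`YEKPK-`) of the protein sequence above. A full check would apply the same function to the complete transcript after joining its wrapped lines.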