Monarch geneset OGS2.0

DPOGS200868
TranscriptDPOGS200868-TA2922 bp
ProteinDPOGS200868-PA973 aa
Genomic positionDPSCF300071 + 485322-499770
RNAseq coverage829x (Rank: top 16%)
Annotation
HeliconiusHMEL0126260.086.32% 
BombyxBGIBMGA009857-TA0.078.07% 
DrosophilaMyo61F-PD0.057.46% 
EBI UniRef50UniRef50_E3X0960.051.83%Putative uncharacterized protein n=5 Tax=Coelomata RepID=E3X096_ANODA
NCBI RefSeqXP_002109061.10.044.78%hypothetical protein TRIADDRAFT_49825 [Trichoplax adhaerens]
NCBI nr blastpgi|3123779420.051.83%hypothetical protein AND_10622 [Anopheles darlingi]
NCBI nr blastxgi|3838523560.061.13%PREDICTED: myosin-IB-like [Megachile rotundata]
Group
Gene OntologyGO:00055241.4e-281ATP binding
GO:00164591.4e-281myosin complex
GO:00037741.4e-281motor activity
GO:00055151.3e-05protein binding
KEGG pathway 
InterPro domain[38-635] IPR0016091.4e-281Myosin head, motor domain
[776-958] IPR0109261.8e-26Myosin tail 2
Orthology groupMCL10069 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200868-TA
ATGGGGTACAAAAGTAAAAACGAAGTGGATTGTGCACTGTTGTTTAGGATATTGAAACGTCGCGAACGCTTTCTGCGTCCGGTGTCCGTGCAAGGCGTCATGGAACACTCCCTGCAGCACCGCGAGCGAGTCGGAGTCCAGGACTTCGTGCTGCTAGAGGACTTCCGCTCTGAAGCAGCGTTCATAGACAACCTGAAGAAACGCTTCCACGAGAACATTATATACACGTACATAGGCAACGTGCTGATCTCAGTGAATCCCTACAAGAACCTTCCTATATACACGGAGGAGAAACAGAGGCTGTACTACAAGAAGGCTTTCTTTGAGGCGCCACCACATGTATTTGCTATCGCGGACAACGCCTACAGATCGCTAGTGTACGAACACAGGGAACAATGTATATTGATTTCAGGGGAATCCGGTTCAGGTAAAACCGAGGCGTCCAAGAAAGTTCTCGAATACATCGCAGCTCGCACGAATCACCTGCACAATGTGGAGACCGTTAAAGACAAACTGCTTCAGAGCAATCCGCTCCTGGAAGCGTTTGGGAACGCCAAAACAAACAGAAACGACAACTCCAGTCGCTTCGGGAAGTACATGGACATACAGTTCAACTACGAAGGTGCACCTGAAGGTGGACACATCCTGAACTATCTGCTGGAGAAGTCTAGAGTTGTGAGTCAAATGGCCGGCGAGAGGAACTTCCACATATTCTACCAACTCCTGGCCGGTGGGGACCAGGAGCTGATGAAGCAGTTGAGGCTGCAAGGAAGATCGGAAGTCTACAAATACACTACCGACCTGACGTCAGCAAGTCAGAAAATGAACGACGCTGACCAGTTCCGCGTGGTGAGAGAGGCGATGAAAGTCATCGAGATAGGAGACAGCGAACAGCGCGAGATGTTTGAGATAGTCGCCAGCGTGTTGCATCTCGGCAACGTGAAGTTCGTTCAGAACGATAAAGGCTATGCTGAGATCCTCAACCACGACGCCAACAGCCAGAACGTCGCCGAGTTCTGCATCAACTTCTGCAACGAGAAGCTGCAACAGCTGTTCATCCAGCTGACACTCAAGTCGGAACAGGAGGAGTATCTGAGGGAAGGCATCGAGTGGGAGCCCATCGAGTACTTCAACAACATCGTCATATGTGACCTCATAGAAGAGAGGCATAGAGGCATCATATCGATCCTGGACGACGAGTGTCTCCGCCCCGGGGAGGCCAACGACCTCAGCTTCTTAGAGAAACTCTCCCAGAGACTAGACGGACATAAACACTTCAAGTCACACAAGAAGGTCGACTCCAAGACCCAGAAGCTGATGGGACGAGATGAATTTTGTCTGGTGCACTACGCGGGTGAGGTGACCTACAACGTGAACACCTTCATTGAGAAGAACAACGACCTGCTGTTCAGAGACCTGCAGGGACTCATGGCGGCCAGCGGAAATAACATCGTTGGCCGGTGCTTCAAGGACATGAATCTGATGAGCAAGAAACGTCCGGAGACAGCGGTGACACAGTTCAAGGTGTCCCTCAACGAGCTGATCAAGATCCTCAGCAGCAAGGAACCTTCATACATCCGGTGCATCAAACCCAACGACTTCAAAACACCCATGCACTTCGACGACAAGCTGGTGTCTCACCAGGTGAAGTACCTGGGGCTGATGGAGAACCTCCGCGTGAGACGAGCTGGGTTCGCCTACAGGAGGCAGTACGACGCCTTCCTCGAGAGATACAAGTGCCTAAGCCCTGAGACTTGGCCCAACTACCGCGGCCCAGCGCGAGAGGGGGTCCAGAAGTTAGTGGCGGCGCTCCGATACGAGAAAGAGGAATACAGGATGGGCAACACGAAGATATTCATCCGTTTCCCGAAGACTTTGTTCGAGACCGAGGACGCGTTCCAGATCAAGAAGAACGACATCGCCACCATCATACAGAGCCGCTGGCGAGGGTACAGGCAGCGGAGGAGGTATCTGGAGATGAAGCGGGCGGCGGTCATCATACAGAAGTGGGTGAGGAGGTTCCTCGCCCAGAGACTGAGGGAGAGGAGGAGGCGGGCCGCTGACGTCATCAGGGCCTTCATCAAAGGTTTCATCACCCGCAACGGTCCAGAGACGGTGGAGAACCGACGTTTCCTCGGCATCGCGAAGGTACACTGGCTGAAGCGCCTTGCGACTCAGCTGCCCAAACACCTGCTTGACCTTTCCTGGCCGCCCTGCCCCGCCACGTGTCAGGACGCCTCCAAACAACTCCACAAGCTACACCGACTACATCTAAGCAGGAAGTACCGCCTGGCATTGTCCCCGGAAGATAAGAAGCAATTTGAATTGAAGGTGCTGGCTGAGGCCATGTTCAAGGGTAAGAAGAACAGTTACAACAGCAGCATCCCGGAGCGGTTCGTCGCGGACAGACTGTCTGAAGAACAGCGAGTATTGAGAGACACGTTCATGGCCTCGCCCGCCTGGCCGGCGCAAGAGAAACTCATTTACTCGTGCGAGGCGGTGAAGTACGACCGGCGCGGGTACAAGCCCCGGCCGCGGTCGCTGGTGGCGTCGGACGCGGCGCTGTACGTGCTGGACGCGGGCTCGCGGAAGATGTTCAAGGTGAAGCACCGCCTGCCGCTCGACAAGCTGCGAGTCGTCCTCACCAACGAGAGCGATGGACTGCTGCTGGTAAAAATACCGCAGGACCTCAAGAAGGATAAGGGCGACCTCATAATGTCCGTGACGCACTTGATCGAAGCCCTCACCATCGTCACCGACTACACCAAAAAACCGGAAATCATCGAGATAGTTGACACCAGGACCATCGCTCACAACCTGGTGAACGGTAAGCAGGGTGGTACCATCGAGGTGACGCAAGGCACGCAGCCCGCCATCCACCGCGCCAAGAGCGGCAACCTGCTAGTTGTGGCATCCCCATAG

Protein sequence:

>DPOGS200868-PA
MGYKSKNEVDCALLFRILKRRERFLRPVSVQGVMEHSLQHRERVGVQDFVLLEDFRSEAAFIDNLKKRFHENIIYTYIGNVLISVNPYKNLPIYTEEKQRLYYKKAFFEAPPHVFAIADNAYRSLVYEHREQCILISGESGSGKTEASKKVLEYIAARTNHLHNVETVKDKLLQSNPLLEAFGNAKTNRNDNSSRFGKYMDIQFNYEGAPEGGHILNYLLEKSRVVSQMAGERNFHIFYQLLAGGDQELMKQLRLQGRSEVYKYTTDLTSASQKMNDADQFRVVREAMKVIEIGDSEQREMFEIVASVLHLGNVKFVQNDKGYAEILNHDANSQNVAEFCINFCNEKLQQLFIQLTLKSEQEEYLREGIEWEPIEYFNNIVICDLIEERHRGIISILDDECLRPGEANDLSFLEKLSQRLDGHKHFKSHKKVDSKTQKLMGRDEFCLVHYAGEVTYNVNTFIEKNNDLLFRDLQGLMAASGNNIVGRCFKDMNLMSKKRPETAVTQFKVSLNELIKILSSKEPSYIRCIKPNDFKTPMHFDDKLVSHQVKYLGLMENLRVRRAGFAYRRQYDAFLERYKCLSPETWPNYRGPAREGVQKLVAALRYEKEEYRMGNTKIFIRFPKTLFETEDAFQIKKNDIATIIQSRWRGYRQRRRYLEMKRAAVIIQKWVRRFLAQRLRERRRRAADVIRAFIKGFITRNGPETVENRRFLGIAKVHWLKRLATQLPKHLLDLSWPPCPATCQDASKQLHKLHRLHLSRKYRLALSPEDKKQFELKVLAEAMFKGKKNSYNSSIPERFVADRLSEEQRVLRDTFMASPAWPAQEKLIYSCEAVKYDRRGYKPRPRSLVASDAALYVLDAGSRKMFKVKHRLPLDKLRVVLTNESDGLLLVKIPQDLKKDKGDLIMSVTHLIEALTIVTDYTKKPEIIEIVDTRTIAHNLVNGKQGGTIEVTQGTQPAIHRAKSGNLLVVASP-