Monarch geneset OGS2.0

DPOGS201190
TranscriptDPOGS201190-TA3549 bp
ProteinDPOGS201190-PA1182 aa
Genomic positionDPSCF300262 + 94359-104849
RNAseq coverage763x (Rank: top 17%)
Annotation
HeliconiusHMEL0180170.077.90% 
BombyxBGIBMGA009857-TA1e-13346.31% 
DrosophilaMyo31DF-PB0.050.90% 
EBI UniRef50UniRef50_Q239780.050.90%Myosin-IA n=42 Tax=Eumetazoa RepID=MY31D_DROME
NCBI RefSeqXP_002425047.10.055.84%myosin IA, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420085110.055.84%myosin IA, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420085110.055.84%myosin IA, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00055243.8e-289ATP binding
GO:00164593.8e-289myosin complex
GO:00037743.8e-289motor activity
KEGG pathway 
InterPro domain[18-870] IPR0016093.8e-289Myosin head, motor domain
[969-1138] IPR0109262.2e-25Myosin tail 2
Orthology groupMCL10069 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201190-TA
ATGTATTTAGCCTCAACGCCGGCTGATCTTGAACTGAGAACCGGCCTCGGAGGAGTGCGGCCGGTGACGTACCAACACAACAAGATCTACACGTATATAGGCGAAGTGCTGGTCTCTGTGAACCCTTACAAGTCTCTGGACATCTATGGCCAGCAGCACATGGCCCAGTACAGGGGCCGGGAGATGTTCGAGGTCCCTCCTCATGTGTATGCCGTAGCCGACGCCTGTCAGAGAGTGCTCAGGCAACAGGGGAGGGATACCTGTGTGCTGATATCAGGTGAGTCCGGTTCGGGGAAGACGGAGGCCTCCAAGTTCATCATGAAGTACATAGCGGCCAACACCATGCAGGTGCACAGGGAGTATATCGACAGAGTGAAAAACGTCTTGATACAATCGAATTCTATCTTAGAGACATTCGGTAACGCGAAGACGAACAGAAACGACAACTCCTCAAGGTTCGGTAAATACATGGACATACACTTCGACTACAAAGGGGATCCCATCGGCGGACACATCAGCAACTACCTGCTAGAGAAGAGCAGGGTCGTCAGTCTGCAGCCCGGGGAGAGGAACTTCCACGCCTTCTATCAGCTCTTAAGCACAAACAACCCGCAGACGAAAAAGTATGGATTGAACTCGAGTTCCGTGTACAAGATTTTGGGCAACGAGCGCGCGACGGCACAGGACTCTAAACTCTACAACGTGACCAAGAGCGCCTTTAATGCTCTGGGCTTCCCGCCGGCAGTCGTCGACGATATATGGAGCATCGTGGCCGGAGTCATCTTATTGGGTGAGTTGACATTCAGCGAGGGCGCGTCGGGCGAGGTGGTGGTGGGCGGGCCGGTGTCGTCGTGCGTCTCCGCCCTGGGAGTTTCACAGGAGTCTCTAAGGTCCGCGATGGTGGGCAGAGTGCTAGCGGCGGGCGGGGACTTGGTCAGCAAGGAACACACCCTCACTGACGCCCACTACACCCGGCTGGCGCTGGCCAAGGCCGCCTACGACAGACTGTTCAGTTGGATCGTACAACAGATCAACGCGGCGATAGAGGCGCCCTCAGCCTCGTACCGCTCCAGTGTGATCGGCGTCCTCGACATCTACGGCTTCGAGATCTTCGACACCAACAGCTTCGAACAGTTCTGCATCAACTACTGTAACGAGAAACTACAACAGCTCTTCATAGAGCTGGTGTTGAAGCAAGAGCAGGAGGAGTACTCCCGCGAGGGCATCACGTGGACGCCGGTCCCCTACTTCAACAACAGAGACATCTGTGCGCTAGTGGACGCGCCGCACGCCGGGATCATCGCCATCATGGACGAGGCCTGTCTCAACCCCACCAAGATATCCGACGGTCAGCTGTTGGAGGCGATGGACAAGCGCCTGAACTCGCACAAGCACTACACCTCGCGCCAGTTGTCTCCGCTCGACAAGAAACTCAAACACGCCGTCGACTTCCAGATCACCCACTACGCGGGTCAAGTGACCTACAACATCACCGGCTTCATGGAAAAGAACAAGGACTCGCTGTGGCAGGACCTCAAGAGACTTCTGCACCGCTCCAGCAACGCCTCCCTCGCTAAAATGTGGCCCGAGGGAGCCGTCGACATACAACAGATCAACGCGGCGATAGAGGCGCCCTCAGCCTCGTACCGCTCCAGTGTGATCGGCGTCCTCGACATCTACGGCTTCGAGATCTTCGACACCAACAGCTTCGAACAGTTCTGCATCAACTACTGTAACGAGAAACTACAACAGCTCTTCATAGAGCTGGTGTTGAAGCAAGAACAGGAGGAGTACTCCCGCGAGGGCATCACGTGGACGCCGGTCCCCTACTTCAACAACAGAGACATCTGTGCTCTAGTGGACGCGCCGCACGCCGGGATAATCGCCATCATGGACGAGGCCTGTCTCAACCCCACCAAGATATCCGACGGTCAGCTGTTGGAGGCGATGGACAAGCGCCTGAACTCGCACAAGCACTACACCTCGCGCCAGTTGTCTCCGCTCGACAAGAAACTCAAACACGCCGTCGACTTCCAGATCACCCACTACGCGGGTCAAGTGACCTACAACATCACCGGCTTCATGGAAAAGAACAAGGACTCGCTGTGGCAGGACCTCAAGAGACTTCTGCACCGCTCCAGCAACGCCTCCCTCGCTAAAATGTGGCCCGAGGGAGCCGTCGACATACAACAGACGTCCAAGCGGCCTCCGTCCGCGGCCAGCCTGTTCCGCTCGTCGATGGCGGCGTTGGTGAGCGGCCTGTCCAGCAAGGAGCCGTTCTACGTCCGCTGTGTGAAGCCCAACCCCGCGCAGGCGGCCCACCTTTGGGACGAACAGCTGGTCCGTCACCAGGTGTCGTACCTGGGCCTGGTGGAGAACGTGCGCGTGCGGCGCGCGGGGTTCGCCTCCCGCCAGCGGTACGACCGCTTCCTCAAGCGGTACAAGATGCTCTCTCAATACACGTGGCCCAACTTCCGAGGCTCCAGCAACAAGGACGCCGTCATGGTGCTGCTCAGGGACCTGCACATCACCGACGTGCAGTTCGGACACACCAAGCTCTTCATACGGAGTGCTCGTACCCTGCACGAGCTGGAGCGCGCCCGGTCCGAGCTGATCCCCTCCATCGTGGTGCTGCTCCAGAAGCTGTGGAGAGGAACCCTCGCCAGGCAGCGCTACAGGCGGATGAAGGCGGCCCTCGTCATATACAACGGATGGAAACGGTACCGCTTCAGGCGTTACATATCCGAGCTGCAGGCCATCCTCTCCCGGCACCGTAACGTGATCCCGTCGTGGCCGGCGGCACCCCGGGGGGTGGCGGTTCCCTTGCTTCAGGCGGCCTACCGTCGCTGGCGCGCCTACCTCACCCTCAAGCCCATCCCGAGGGACCAGTGGCCTCAACTCAAACTCAAGATATCCGCGGCCAGCGTGCTCAAAGGCAGGAGGGCCCAGTGGGGGGCCTCGAGGGAGTGGCGGGGGGACTACCTGGCTATTAATTCGTACAACGATAAATCATCGTCGTACCTGTCGTGCGTGTCTAGTCTGCAGCGCTCGCAGTCTTTGGGCAAGCCCCTGTTCTCGTGCCGCGTGTTCAAGTTCAACCGCTACAACAAGATGTCGGAGCGCTGCTTGCTGGTGACCGACACGTCCCTGTACAAGCTGGACGCGAGCTCCTTCAAGCCGCTAAAGAAGCCCACGCCCATCACGGAGGTTGGCGGCGTGCGTGTCATGAGCGGGGAGGCCCAGCTGGTCGTGGTGGTGGTCCCGGGCGCCAGGAACGACCTGGTGGTGGGGCTGGTGGCGCCCCCACACACCGACCTGCTGGGGGAACTGTTGGGAGTGCTCGCACATACGTACCACAGGCTGACCGGCTCCGAGCTACCCGTGGAGGTGGAGAGCGGCGCCAGCACGAGGTGTATCCTGGGAGGGAAGACGAGGGCCTTGCAGCTACCGCCGGCGACCACCAGCCCCGCCTCCCCCACCGCCACTCCCGCGCCCTTCACACACGCACACAACGTCATCACATACCACCCGGCGTCGGCGAGGGCGTAA

Protein sequence:

>DPOGS201190-PA
MYLASTPADLELRTGLGGVRPVTYQHNKIYTYIGEVLVSVNPYKSLDIYGQQHMAQYRGREMFEVPPHVYAVADACQRVLRQQGRDTCVLISGESGSGKTEASKFIMKYIAANTMQVHREYIDRVKNVLIQSNSILETFGNAKTNRNDNSSRFGKYMDIHFDYKGDPIGGHISNYLLEKSRVVSLQPGERNFHAFYQLLSTNNPQTKKYGLNSSSVYKILGNERATAQDSKLYNVTKSAFNALGFPPAVVDDIWSIVAGVILLGELTFSEGASGEVVVGGPVSSCVSALGVSQESLRSAMVGRVLAAGGDLVSKEHTLTDAHYTRLALAKAAYDRLFSWIVQQINAAIEAPSASYRSSVIGVLDIYGFEIFDTNSFEQFCINYCNEKLQQLFIELVLKQEQEEYSREGITWTPVPYFNNRDICALVDAPHAGIIAIMDEACLNPTKISDGQLLEAMDKRLNSHKHYTSRQLSPLDKKLKHAVDFQITHYAGQVTYNITGFMEKNKDSLWQDLKRLLHRSSNASLAKMWPEGAVDIQQINAAIEAPSASYRSSVIGVLDIYGFEIFDTNSFEQFCINYCNEKLQQLFIELVLKQEQEEYSREGITWTPVPYFNNRDICALVDAPHAGIIAIMDEACLNPTKISDGQLLEAMDKRLNSHKHYTSRQLSPLDKKLKHAVDFQITHYAGQVTYNITGFMEKNKDSLWQDLKRLLHRSSNASLAKMWPEGAVDIQQTSKRPPSAASLFRSSMAALVSGLSSKEPFYVRCVKPNPAQAAHLWDEQLVRHQVSYLGLVENVRVRRAGFASRQRYDRFLKRYKMLSQYTWPNFRGSSNKDAVMVLLRDLHITDVQFGHTKLFIRSARTLHELERARSELIPSIVVLLQKLWRGTLARQRYRRMKAALVIYNGWKRYRFRRYISELQAILSRHRNVIPSWPAAPRGVAVPLLQAAYRRWRAYLTLKPIPRDQWPQLKLKISAASVLKGRRAQWGASREWRGDYLAINSYNDKSSSYLSCVSSLQRSQSLGKPLFSCRVFKFNRYNKMSERCLLVTDTSLYKLDASSFKPLKKPTPITEVGGVRVMSGEAQLVVVVVPGARNDLVVGLVAPPHTDLLGELLGVLAHTYHRLTGSELPVEVESGASTRCILGGKTRALQLPPATTSPASPTATPAPFTHAHNVITYHPASARA-