Monarch geneset OGS2.0

DPOGS215054
TranscriptDPOGS215054-TA5877 bp
ProteinDPOGS215054-PA1958 aa
Genomic positionDPSCF300208 + 11472-43958
RNAseq coverage1763x (Rank: top 7%)
Annotation
HeliconiusHMEL0058020.085.71% 
BombyxBGIBMGA003044-TA5e-14139.19% 
Drosophilazip-PA0.073.07% 
EBI UniRef50UniRef50_F4X1H40.074.08%Myosin heavy chain, non-muscle n=5 Tax=Coelomata RepID=F4X1H4_ACREC
NCBI RefSeqXP_623323.20.078.71%PREDICTED: similar to zipper CG15792-PD, isoform D [Apis mellifera]
NCBI nr blastpgi|3407117210.078.53%PREDICTED: myosin heavy chain, non-muscle-like isoform 1 [Bombus terrestris]
NCBI nr blastxgi|3287904870.078.74%PREDICTED: myosin heavy chain, non-muscle [Apis mellifera]
Group
Gene OntologyGO:00055240ATP binding
GO:00164590myosin complex
GO:00037740motor activity
KEGG pathwayame:4120920.0 
 K10352 (MYH)maps-> Viral myocarditis
    Tight junction
InterPro domain[79-781] IPR0016090Myosin head, motor domain
[1243-1950] IPR0029284e-166Myosin tail
[33-69] IPR0040094.2e-07Myosin, N-terminal, SH3-like
Orthology groupMCL10056 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215054-TA
ATGGTGGATATGGATAGGAACGACCCGGAGCTCCGGTACCTGTCGGTCGATCGGAACAGTTTCAACGACCCCGCGACGCAGGCCGAATGGACTCAGAAGAGACTAGTGTGGGTGCCCCATGAGACTCATGGCTTCGTGGCCGCTGGCATCAAGGGCGAGCGGAGGGATGAAGTGGAAGTGGAGATCGCGGAGTCCGGGAAGCGGATGTTGGTGCCCATCGACGACATCCAGAAGATGAACCCGCCCAAGTTCGACAAAGTCGAGGACATGGCTGAGCTCACCTGCTTAAATGAAGCCTCCGTTCTTCACAACATCAAAGACAGATACTACTCCGGATTGATATATACATACTCTGGTCTCTTCTGCGTGGTGGTCAATCCGTACAAGAAGCTGCCGATCTACACGGAGAAGATAATGGAGCGTTATAAGGGCATCAAACGCCACGAGGTACCACCGCACGTGTTCGCAATCACCGACACAGCATACCGCTCCATGCTGCAAGATCGCGAGGACCAGTCCATCTTGTGTACCGGCGAGTCCGGTGCCGGCAAGACTGAGAACACCAAGAAAGTGATACAGTACCTCGCGTACGTCGCAGCCTCCAAGCCCAAAGGCTCGGGCGCGGGACCTTCGCCGCAGTTGATCATAGGCGAATTAGAACAACAGCTGCTTCAAGCCAATCCTATCCTCGAAGCGTTCGGTAACGCCAAGACAGTCAAGAACGACAATTCATCGCGTTTTGGTAAATTCATTAGAATAAACTTCGACGCGTCCGGTTTCATAGCTGGAGCTAATATTGAAACCTATTTACTGGAGAAGTCGAGAGCCATCAGGCAGGCGAAGGACGAACGGACGTTCCACATCTTCTATCAGCTGTTAGCGGGCGCGACCTCTCAGCAGAAGGCGGAGTACATACTAGAAGACCCCAAAAGCTATCCCTTCCTCACCAACGGCGCGCTGCCGGTGCCGGGCATCGACGACGCCGCGGAGTTCCAAGCCACCATCAAATCCATGCACATCATGGGCATGAACAACGAAGACTTCAACTCGATATTCAGAATCGTCTCAGCGGTCCTGCTGTTCGGCTCGATGCAGTTCAAACAGGAGAGGAACTCGGATCAGGCCACGTTGCCTGATAATACTGTTGCACAGAAAATTGCTCATTTGCTAGGTCTTTCTGTGACAGAAATGACAAAAGCTTTCCTCAAGCCGCGGATTAAAGTGGGTCGTGATTTCGTCACCAAAGCACAAACTAAAGAACAGGTGGAGTTCTCTGTCGAGGCTATCGGCAAGGCTTGCTACGAACGTCTCTTCAGATGGCTGGTCAACAGAATCAACAGGTCGCTGGACAGAACGAAGAGGCAGGGCGCATCGTTCATAGGGATCCTCGATATGGCCGGTTTTGAAATATTCGAACTAAACTCCTTCGAACAGCTGTGTATTAATTACACGAACGAGAAGCTGCAACAGCTCTTCAACCACACCATGTTCATCCTCGAACAGGAAGAGTACCAGAGAGAAGGCATCGAATGGAAATTCATCGACTTCGGTCTGGACTTGCAGCCGACGATCGACCTCATCGACAAACCCATGGGCATCATGGCCTTGCTCGACGAGGAGTGCTGGTTCCCCAAGGCGACGGACAAGACGTTCGTCGAGAAGCTCGTGTCGTCACACTCCGTACATCCGAAATTCATGAAAACGGACTTCAGAGGCGTCGCTGACTTCTCCATCATACACTACGCGGGGAAAGTTGATTACTCGGCCGCCAAGTGGCTCATGAAGAATATGGATCCTCTGAACGAGAACGTGGTGTCGCTGCTGCAGTCCTCGCAGGATCCGTTCGTGTGCCACATTTGGAAGGACGCCGAAATAGTCGGCATGGCGCAGCAAGCGATGACCGACACGCAATTCGGCAAGAGAATAAGGAACGGGATGTTCAGAACGGTCTCCCAACTGTACAAGGAACAGTTGACGAAGTTGATGGCAACGCTGAGGAACACGAACCCGAACTTCGTCAGGTGTATCATACCGAACCACGAGAAGAGGGCCGGCAAGATAGAAGCGCCCTTGGTCTTGGACCAGTTGCGCTGCAACGGTGTGCTCGAAGGTATTAGGATTTGCCGCCAAGGTTTCCCCAACAGGATCCCGTTCCAGGAGTTCAGACAGCGCTACGAGCTCCTCACGCCGAACATCATCCCGAAAGGCTTCATGGACGGCAAAAAGGCGTGCGAGCAAATGATAGAGGCGTTGGAACTAGACCACAACCTCTACCGCGTCGGTCAATCCAAGATCTTCTTCAGAGCTGGAGTGCTGGCGCATCTGGAGGAGGAGAGAGACTACAAGATAACCGACCTCATAGTGAACTTCCAAGCCTTCTGCAGGGGATTCCTGGCGAGAAGAAACTATCAGAAGAGATTGCAGCAGCTGAGCGCTATAAGGATCATCCAAAGAAACTGCGCCGCCTACCTCAAGCTGAGGAACTGGCAGTGGTGGAGGCTGTACATTAAAGTGAAACCTCTGCTAGAAGTGACGAAGCAGGAAGAGAAGTTGACTCAGAAGGAGGATGAGCTGCGTCTAATCCGCGAACAGCTGGATTCCCAGCTGAAGTTGTGCCAGGAGTACGAAGCCAAGTACCAGCAGGCTAGCGTCGAGAAGACCGAGCTCGCTGAGCAGTTACAGGCGGAACTTGAGCTATGCGCTGAAGCCGAGGAGTCCCGCGCTAGGGCGGCCGCCCGCAAGCAAGAGCTGGAGGAGCTGCTGCATGACCTGGAAGGAAGGATAGAGGAGGAGGAGGAGCGGTGTGTAGCGCTACAGCAGGAGAAGAAGAAGCTACAGCTGAACATACAGGATGTGATGGAACAGCTGGAGGAGGAGGAGGCAGCTCGTCAGAAGCTGCAGCTGGAGCGTGTCCAGTGCGATGCCAAGATCAAGAAGCTGGAGGAGGACCTGGCGCTCAGTGACGACACCAACCAGAAACTCCTCAAGGAGAAGAAGGTCCTAGAAGAGAGAGCTAACGATCTTTCACAAGCACTCGCCGAGGAAGAAGAGAAGGCTAAACATCTGAGTAAGTTGAAAGCGAAACAGGAGTCTGCTATTGCTGAGCTGGAGGAGAAACTATTGAAGGACCACCAATTAAGGCAAGAAGTGGACAGAACCAAGAGAAAGCTGGAGACGGAGCTCCAGGACGTTAAGGAACAACTGGTGGAGAGGAAGGCGCAGCTCGAAGAACTGCAGATTGTACTCAAGAAGCGCGAGGAGGAATTGGTGGCGGCCAACGCACGGGCCGACGAGGAAAGCTCACAAAGGAACAACGCACAGAAAGCGGCCCGGGAGGCCCAGGCGGCGCTGGACGAGGCACGGGAAGACCTGGACCATGAGCGGGCTGCCAGGAGTAAGGCGGAGAAGCTCCGGAGAGACCTCAACGAGGAACTGGAAGCTTTGAAGAGCGAACTGCTGGACTCTCTCGACACCACAGCCGCCCAGCAGGAGCTGCGGTCCAAAAGGGAGCAGGAGCTTTGGTACGATTATACAATGGCGGCTGTTAAGGAATTCAAGCAGAAAATCTTTAAATTCGGTTTACCGAAAGACGGCAGCTTCGAAGTAGAGAACGTTCAGGGTCACGACGAGGGTAACGCTTTCACGTCCACCAGAGATTACTGGCAACACATGTGCAAGGACGCGCCCGCCGAGGGCAAGCCGGACGTGTCGCTGCAGTATAAGTGTCTGGAGAAGGCCAAGCAAGTTCTGGAAGCTGAGAATGCTGACCTGGCGACCGAGCTGAAGAGCGCCAGCGCCGCACGCGCTGAGAGCGAGAGGAGGAGGAAACAGGCCGAAGCTCAGCTACAGGAAGTGGCCGCTAAACTACAGGACGTGGACAGAGCCAGGGCGGAATTGGCGGAGCGCTGTGTGCGGCTGCAGAGCGAATCGGAGCAGGCGCTGCAGCAGCTGGAAGCCGCTGAGCTGAAGGCATCCGCCGCAGCCAAACAGGCCGCCACCGCCGGCGCCAACCACGTGGAACTACAGGCCCAGCTTGAGGAGGAGACGAAACAGAAGCTGTCGCTGCAGACGAAGCTGCGAGCTGTGGAGCAACAGCTGGAACAGGCCAGGGACCAGCTGGACGAGGAGGAGGAGGCCAAGGGCAACCTGGAAAAACAGGTTGCAATGCTAACCCAACAAGTGGCCGACGCCAAGAAGAAGGCGGAAGAGGAAGCGGAAGTCGCCGCCGCCCTCGAGGAACAGAGGAAGAAGCTCAGCAAGGACGTGGAGGCACTGCACTGTCAGCTGGAGGAGAAGCAACAGGCCAATGACAAGATAGAGAAGAGTAAAAAGAAGATTCAGGAGGAGCTGGAGGACGCCAACATAGAACTGGCGGCGCAGCGAGCCAAAGTGCTGGAGCTGGAGAAGAAACAGAAGAACTTTGATAAGGTTGTTTCTGAGGAACGTGCCGTAGCCGAGAGGAACGCGGCCGAGAGGGACGCAGCGGAGAGGGAGGCTCGGGACAAGGAGACACGAGTCCTTTCACTCACCAGAGACCTGGACCAAGCCGCCGAGAAGATCGAGGAGTTGGAGCGTTCCAAGCGCCTCCTGCAGTCGGAACTGGACGAGCTGGCCAACACGCAGGGGACAGCCGACAAGAACGTGCACGAGCTGGAGAAGGCCAAGAGGGCACTAGAGTCACAGCTGGCCGAGCTAAAGGCGCAGAACGAGGAGATCGAGGACGACCTGCAGCTCACCGAGGACGCCAAGCTGCGCCTCGAGGTCAACATGCAGGCTATGAGGGCGCAGTTCGAGAGGGATCTGCAGGCTAAGGAGGAGGCCGGCGAGGAGAAGCGCCGTGGCATAGTGAAGCAGCTGAGGGAGCTGGAGGCGGAGCTGGAGGAGGAGCGGAAACAGAGAGCAGCCGCCGCCGCCAACCGGAAGAAGATGGAGGCCGACCTCAAGGACGCCGAGCAGGCGCTACATCTGGCCAACAAGGTGAAAGAGGACGCGGTAAAGCAAGCGAAGAAACTGACGGCGCAGCTCAAAGAGGCCGTGCGAGAGGCGGAGGAGGCGAGGGCCGGGAGGGAAGAGGCCGCGGCCACCTCCAGGGAGGCCGAGAGGCGCGTCAAGGCGTTAGAGGCTGAGGCGCTACAGCACGCGGAGGAGCTGGCGGCCGCAGAGAGAGCCCGCAGGCACGCAGAGGTCGAGAGAGACGACCGCGACGACGAGCTCACCGCCGCCGCCGCCAAGGCGACTCTTCTTATTGACGAGAAGAAACGTCTAGAAGCGAGAATAACAGCTCTGGAGGAAGATTTGGATGAGGAGCAGTCGAACAACGAGATTCTCAACGACAGACTCAGGAAGGCTCAGAACCAAATCGACCAGCTGACGATGGAGCTGGGCACAGAGAAGGCGGCCACACAGAAACTGGAGAGCAGCAAACTGGTGCTGGAGAGGCAGAACAAGGAGCTGAAGGCCAAGTTCGCTGAACTGGAGACCTCGGGCCGGGCCAAGACCAAGAGTATGATCACGTCCTTGGAGCTGAAGGTCCAGAACTTGGAGGAACAGCTCGAAGCGGAGTCCCGCGAGCGACTGGCCCAGCAGAAGGCCTCGAGGAAGCTGGACAAGAAGATGAAGGAACTGGCGCTACAGCTGGAGGAGGAGAGGAGGCACTCCGACCAGTACAAGGAACAGATCGAGAAGATGAACAGCCGCGTCAAGACGCTGAAGCGCGAGGTGGACGCCGCGGACGAGGAGGTGCAGAGAGAGAGGGCCGCCAAGAGGAAGGCGCAGAGAGAGCTGGACGACCTACTGGAAGCGCAGGAGACGCTCTCCAGGGAGTGCACCAACCTGCGCAACAAACTCAGACCCTCCTTGGCCGACGATAGCGACTAG

Protein sequence:

>DPOGS215054-PA
MVDMDRNDPELRYLSVDRNSFNDPATQAEWTQKRLVWVPHETHGFVAAGIKGERRDEVEVEIAESGKRMLVPIDDIQKMNPPKFDKVEDMAELTCLNEASVLHNIKDRYYSGLIYTYSGLFCVVVNPYKKLPIYTEKIMERYKGIKRHEVPPHVFAITDTAYRSMLQDREDQSILCTGESGAGKTENTKKVIQYLAYVAASKPKGSGAGPSPQLIIGELEQQLLQANPILEAFGNAKTVKNDNSSRFGKFIRINFDASGFIAGANIETYLLEKSRAIRQAKDERTFHIFYQLLAGATSQQKAEYILEDPKSYPFLTNGALPVPGIDDAAEFQATIKSMHIMGMNNEDFNSIFRIVSAVLLFGSMQFKQERNSDQATLPDNTVAQKIAHLLGLSVTEMTKAFLKPRIKVGRDFVTKAQTKEQVEFSVEAIGKACYERLFRWLVNRINRSLDRTKRQGASFIGILDMAGFEIFELNSFEQLCINYTNEKLQQLFNHTMFILEQEEYQREGIEWKFIDFGLDLQPTIDLIDKPMGIMALLDEECWFPKATDKTFVEKLVSSHSVHPKFMKTDFRGVADFSIIHYAGKVDYSAAKWLMKNMDPLNENVVSLLQSSQDPFVCHIWKDAEIVGMAQQAMTDTQFGKRIRNGMFRTVSQLYKEQLTKLMATLRNTNPNFVRCIIPNHEKRAGKIEAPLVLDQLRCNGVLEGIRICRQGFPNRIPFQEFRQRYELLTPNIIPKGFMDGKKACEQMIEALELDHNLYRVGQSKIFFRAGVLAHLEEERDYKITDLIVNFQAFCRGFLARRNYQKRLQQLSAIRIIQRNCAAYLKLRNWQWWRLYIKVKPLLEVTKQEEKLTQKEDELRLIREQLDSQLKLCQEYEAKYQQASVEKTELAEQLQAELELCAEAEESRARAAARKQELEELLHDLEGRIEEEEERCVALQQEKKKLQLNIQDVMEQLEEEEAARQKLQLERVQCDAKIKKLEEDLALSDDTNQKLLKEKKVLEERANDLSQALAEEEEKAKHLSKLKAKQESAIAELEEKLLKDHQLRQEVDRTKRKLETELQDVKEQLVERKAQLEELQIVLKKREEELVAANARADEESSQRNNAQKAAREAQAALDEAREDLDHERAARSKAEKLRRDLNEELEALKSELLDSLDTTAAQQELRSKREQELWYDYTMAAVKEFKQKIFKFGLPKDGSFEVENVQGHDEGNAFTSTRDYWQHMCKDAPAEGKPDVSLQYKCLEKAKQVLEAENADLATELKSASAARAESERRRKQAEAQLQEVAAKLQDVDRARAELAERCVRLQSESEQALQQLEAAELKASAAAKQAATAGANHVELQAQLEEETKQKLSLQTKLRAVEQQLEQARDQLDEEEEAKGNLEKQVAMLTQQVADAKKKAEEEAEVAAALEEQRKKLSKDVEALHCQLEEKQQANDKIEKSKKKIQEELEDANIELAAQRAKVLELEKKQKNFDKVVSEERAVAERNAAERDAAEREARDKETRVLSLTRDLDQAAEKIEELERSKRLLQSELDELANTQGTADKNVHELEKAKRALESQLAELKAQNEEIEDDLQLTEDAKLRLEVNMQAMRAQFERDLQAKEEAGEEKRRGIVKQLRELEAELEEERKQRAAAAANRKKMEADLKDAEQALHLANKVKEDAVKQAKKLTAQLKEAVREAEEARAGREEAAATSREAERRVKALEAEALQHAEELAAAERARRHAEVERDDRDDELTAAAAKATLLIDEKKRLEARITALEEDLDEEQSNNEILNDRLRKAQNQIDQLTMELGTEKAATQKLESSKLVLERQNKELKAKFAELETSGRAKTKSMITSLELKVQNLEEQLEAESRERLAQQKASRKLDKKMKELALQLEEERRHSDQYKEQIEKMNSRVKTLKREVDAADEEVQRERAAKRKAQRELDDLLEAQETLSRECTNLRNKLRPSLADDSD-