Monarch geneset OGS2.0

DPOGS207155
TranscriptDPOGS207155-TA1959 bp
ProteinDPOGS207155-PA652 aa
Genomic positionDPSCF300001 + 4340916-4343367
RNAseq coverage16336x (Rank: top 1%)
Annotation
HeliconiusHMEL0130400.079.79% 
BombyxBGIBMGA000613-TA0.073.47% 
DrosophilaPrm-PA5e-15575.69% 
EBI UniRef50UniRef50_P354160.055.40%Paramyosin, short form n=19 Tax=Arthropoda RepID=MYSP2_DROME
NCBI RefSeqNP_001124374.10.094.51%paramyosin [Bombyx mori]
NCBI nr blastpgi|2213275790.074.37%miniparamyosin [Bombyx mori]
NCBI nr blastxgi|2294726270.074.82%miniparamyosin [Bombyx mandarina]
Group
Gene OntologyGO:00164594.1e-112myosin complex
GO:00037744.1e-112motor activity
KEGG pathwayoaa:1000814976e-62 
 K10352 (MYH)maps-> Viral myocarditis
    Tight junction
InterPro domain[287-628] IPR0029284.1e-112Myosin tail
Orthology groupMCL11094 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207155-TA
ATGGCTCCCGTAAAGGTTCCGGAACGTCAAAAGTGGACCAAACCACCGACAACGATTTATGAAAATAATTATGGTTATGGTATTAATTTTTATCAACCAATGATTGATTACATCGTAGCTAAAAAGGAAGGGTCCGACATAAAACCCCCTCACCTGCCATGGAATAACGAAAGAGGGCTCGATAAATACCGATATGATAGACCAGTGAAAAACTATTCGGAGAGTGATATAAGAAAATTATCACATGAAGTGGCAGAAAGAGCAAAGAAAGACCTAAATACTTTTAGTGTTGGTAAAAGAACACCATTCTCAGTTATTCAAACGGCTGCTGCTGCAAATTTAACAAAACATGTAGCTATGAAAAGTGTTTCAGTAAGATCTAAAAAAATGAAAAAAGATACACTGGAAACATTAAAATATCAAAACGCTGGGGACGCAGATCTAACAAAACAACTCGAGTTATATAATAATGAGTTAAATATTGAAAGTGATTTAATAGGCAAAGCCAAACTATTTCGTGGAAAATCTGCTAAGGCAATAGCGCAGACTTTATTAAACGAGACTAATAAAGGTTTAGCTGAAGGAAAAATTAAAAAAATTAAAGTGTCTGACTTGAGTATGGTAGAAAGAGGCAAAATGTCACAAAAATTAATGTCGGAGTTTAATAAACACTCATCAAAAGCTTTTCAAAATGAAATAAAACAAATTGCAGAGGCTACTATAAAAACACCGAAAGTTTGTGTTGTTCAAATAGAAACTGAAATTCCAACAATAAATAATGACTATTTAGAAAAAATTCACGAATTAAAACAAACTATTAACCAATTTGACGCATTAAGTTCTAATTTACTTATAGACAGAAGTAAGCAAACCAGCATTGAAATCGAGCAGCTAAACGCTCGTGTAGTAGAAGCTGAGATGAAGCTCAAGACTGAAGTTACCCGCATCAAGAAGAAGTTGCAGATCCAAATTACCGAACTCGAGCTGTCTCTTGATGTTGCCAACAAAACTAACATCGATTTGCAAAAAACCATCAAGAAGCAATCCCTACAGCTCACTGAGATACAGACCCACTACGACGAGGTCCAAAGGCAGCTGCAAGTAACTCTCGACCAATACGGTGTCGCTCAGCGCAGGATTCAATCCCTTACCGGCGAGGTAGAAGAAATCCGTGGCAACTACGAACAGGCCCTTCGTGCCAAGCGCTCTGTTGAGCAATCCTTCGAGGAGGCGCAGACCCGTATCAACGAACTTACAGTGATCAATGTGAATCTGTCCAGCAGCAAGGCCAAGATTGAACAGGAGCTGGCTGCTGTTGCTGCTGACTACGATGAGATCACCAAGGAGCTTCGCATTGCTGACGAAAGATATCAACGTGTCCAGACTGAACTGAAGCACACCGTTGAACACTTGCACGAAGAACAAGAAAGGATTGTGAAGATTGAAGCTGTTAAGAAGTCTCTCGAAATTGAAGTTAAGAACTTGTCTGTGCGTTTGGAAGAAGTAGAAACCAATGCTATTGTCGGTGGAAAGCGTATCATCAGCAAACTGGAAGCCCGTATTAAGGATATGGAATTGGAAATGGACGAGGAGAAGAGGAGGCATGCCGAGACCATCAAGATTCTTCGTAAGAAGGAACGTCAGCTTAAAGAGATTCTCATCCAATGCGAAGAAGACCAGAAGAACATCGCTCTTCTTCAGGACTCTTTAGAGAAATGTTCTCAGAAAGTCAACATTTACAAGAGGCAGTTGACTGAACAAGAGGGAGTATCCCAGCAGAGTGTCACCAGAGTGCGACGATTCCAGCGTGAACTCGAAGCTGCCGAAGACCGCGCTGACACCGCTGAGAGCAACTTGTCGCTCATCCGCGCTAAGCACCGCACCTTCGTCACCACCTCTACAGTGCCCGGCTCCCAGGTTTACTTGGTGCAGGAGTCTCGCGCTCTCAGCACGGAGTGA

Protein sequence:

>DPOGS207155-PA
MAPVKVPERQKWTKPPTTIYENNYGYGINFYQPMIDYIVAKKEGSDIKPPHLPWNNERGLDKYRYDRPVKNYSESDIRKLSHEVAERAKKDLNTFSVGKRTPFSVIQTAAAANLTKHVAMKSVSVRSKKMKKDTLETLKYQNAGDADLTKQLELYNNELNIESDLIGKAKLFRGKSAKAIAQTLLNETNKGLAEGKIKKIKVSDLSMVERGKMSQKLMSEFNKHSSKAFQNEIKQIAEATIKTPKVCVVQIETEIPTINNDYLEKIHELKQTINQFDALSSNLLIDRSKQTSIEIEQLNARVVEAEMKLKTEVTRIKKKLQIQITELELSLDVANKTNIDLQKTIKKQSLQLTEIQTHYDEVQRQLQVTLDQYGVAQRRIQSLTGEVEEIRGNYEQALRAKRSVEQSFEEAQTRINELTVINVNLSSSKAKIEQELAAVAADYDEITKELRIADERYQRVQTELKHTVEHLHEEQERIVKIEAVKKSLEIEVKNLSVRLEEVETNAIVGGKRIISKLEARIKDMELEMDEEKRRHAETIKILRKKERQLKEILIQCEEDQKNIALLQDSLEKCSQKVNIYKRQLTEQEGVSQQSVTRVRRFQRELEAAEDRADTAESNLSLIRAKHRTFVTTSTVPGSQVYLVQESRALSTE-