Monarch geneset OGS2.0

DPOGS212363
TranscriptDPOGS212363-TA3108 bp
ProteinDPOGS212363-PA1035 aa
Genomic positionDPSCF300019 + 71547-84034
RNAseq coverage644x (Rank: top 20%)
Annotation
HeliconiusHMEL0053060.068.67% 
BombyxBGIBMGA007127-TA1e-7131.30% 
Drosophilad-PE0.054.81% 
EBI UniRef50UniRef50_F4X5D60.057.77%Myosin-VIIa n=14 Tax=Pancrustacea RepID=F4X5D6_ACREC
NCBI RefSeqXP_969433.10.056.74%PREDICTED: similar to myosin x [Tribolium castaneum]
NCBI nr blastpgi|910924480.056.74%PREDICTED: similar to myosin x [Tribolium castaneum]
NCBI nr blastxgi|3287846130.054.26%PREDICTED: myosin-IXb isoform 1 [Apis mellifera]
Group
Gene OntologyGO:00055248.1e-112ATP binding
GO:00164598.1e-112myosin complex
GO:00037748.1e-112motor activity
KEGG pathwaymcc:6933915e-72 
 K08834 (MYO3, DFNB30)maps-> Phototransduction - fly
InterPro domain[167-862] IPR0016098.1e-112Myosin head, motor domain
Orthology groupMCL14752 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212363-TA
ATGGCCACGCTCGGCCTGTCCAAGGTGTTCATCCTCGACAAGTACTTCACCGAGCTGCAAAAGTTCTGGGAGACCGAGAAGAAACTGCAAGACGCATCATCCTCCAACGAGGCGGTGCACCTTCAGAGACGACTGCTGAGTCTCAGCTCTGAGCTCGTCACTCTCCGGAACCACCTTCACGTGGGCGGCGCGGCGGCGGGTCCGGGAACCGCGTCGGCTGCTTGCGGGACCGGAGCCCCGGGGGCACAGAGCATCGGGGGCCGCCGGAAGCCCCCCTGGCGCTCAGCCCGCCGTGCCACCGCGGGCGCCTCTGCTGCCACCGCCAGCACCGGCGCCGCCGCTCCTGCCGCATCATGCGCCGCCGCTGCCACCAGAGGCGTCGCAGGTGCCTCGTGCCCGGCGAGCGGCGGTAGCGCCTCGGTCCCGGGGAGTGTGGCGGGTCCGGGATCCGACGTAGACGATCTGATCCACCTGCGCGGTCCGCTCACCGAGGACGCCGTGACGTCGGTGGGTCCCATCCTGGTAGCTATGAACTCGTACACTGACGTGAGTAACGCACTGACTCCGAGCGCCGCGCGCGCTCACCGGCCAGAACTGGCGAGACTGGTCCACGATGCCGTCCGTCACCAAGCAGACACCGGCTGCCCCCAGGCCATCATACTCTCCGGCGTCTCTGGATCGGGGAAGACATACGCGTCGATGGTGTTGTTGCGCCGTCTGTTCGACGTAGCGGGAGGCGGACCCGAGACGGACGCCTTCAAACATCTCGCGGCCGCCTTCACCGTCCTTCGCTCTCTCGGCACGGCCGCGACGCATGCCAACTCACACTCCAGCCGCATTGGTCACTTTATCGAAGTGCAAGTGACGGACGGCGCTTTGTACCGTACCAAGATTCACTGCTATTTCCTGGACCAGACGCGCGTGGTCCGTCCGCCGGCCGGGGAGCGGAATTATCACATCTTCTACCAGATGTTAGCCGGCCTGTCCGCCGACGAGCGCTCCCAGCTTCACCTCGATGGATATTCCGCTCACGACTTAAGATACCTATCATCTTCTCATCCTCGTCGTCCAGAACCCGAGGACGGAGCCAGGTTCCACGCATGGAAGAGCTGCCTCGGCGTGTTAGGTATTCCATTTTTGGACGTGCTGCGCGTCTTAGCCGCTGTGCTTCTGCTGGGTAACGTCCACTTCGCAGATAACGCAGAAGGAAGCGCAGAGCCAAATGGGGAGGCCGAGTTAGTAGCGGCAGGCTCTCTCCTCCCGCACCCTCGACGGCGTCTTATGCGCGGCCTCGGCACGCGCTGCGCCCGCGCCGGTCGTGTGCCCGCCCGCGCACCAGCCACCGCCGCCGCCGCTGCCGCCGCCCGCGACTCCCTCGCTAAAGCGCTGTACTGCCGCACCGTCGCCACCATCGTTCGTCGCGCCAACTCTCTCAAACGTCTCGGTTCCACTCTGGGCACCCTATCCTCGGACTCCAACGAATCTGTGATACAGGACGCGGCGTCGCGACGCGCGTCTACGGCGGGAGGCGGGGCGCGAGGCCGGGCGGGTGCTCGCTCCATGGCCGTCCTCAACGACGCCGTGCGCCACGCGAACGACGGCTTCGTCGGCATACTAGACATGTTCGGCTTCGAAGACTCCGCGCCCAGTCGTCTCGAACACCTTTGCGCCAACCTATGTGCCGAGACGATGCAGCACTTCTATAACACTCACGTGTTTAAGTCATCGGCGGAGTCTTGTCGCGAGGAGGGCGTGTCTGGTGCGCTAGAGGTAGAATATGTGGACAACGTGCCGTGCATCGACCTGGTGTCATCATTAAGAACGGGACTGCTGGCGGCGCTGGACGCGGAGTGTGCCGCTCGAAGCGAGCCCGAACATTACGTGGCCAGGATTAAGAACGCGCACAGAGGCCACCTCAGACTGGCGGAGGCGCGACCACCGCACGCGCGTCGTTTCGCAGTGCGACACTATGCAGGCGAAGTCACTTACGACGCGAGTGACTTCCTGGAAGCCAACCGGGACGCGGTACCGGACGAATTGCTCGCCGCCTTCGATACACGAACCTGCGAATTCGGATTCGCAACGCATTTATTCGGCGCCGAGCTCAAGGCGTTGGCCGCGGCGGGTGGTCCGGCTGGCGCTCAGTTCCGAGCGTCGCCGACTGCGGGCGGTGCGGCCGCCGCCGTGGTCGCCGCGTCCACACTCACACAAGACTTCCACACTCGTCTCGACAACCTGCTGCGGACGCTAGTACACGCTCGCCCTCACTTTGTGCGGTGTCTCCGCGCCAACGCCACCGAGACTCCGATGCATTTCGAACGTGCAACGGTCGCCCGACAGGTGCGGGCGCTGCAGATCCTGGAGACGGTACAGCTGATGGCGAGCGGGTATCCGCATCGGATGAGGTTCCGAGCGTTCAGCTGTCGGTACCGCGCGCTGTGGGGCCGAGGACATGCCGGCGTCGAGCGTGAGTCGGGCGCGTCCTGTGCTCGCGTTCTGGCCGCCGTGGCCGCAGCCGCCGCCCCGCCGGCGCCCGCCTCGCCCGCCGCTGTGCGATGGGCGCTCGGTAAACGACACGTTTTCTTGAGCGAGGCCATGCGTCAGGTCCTCGAACGCATGCGAAGAGCGCGCCGTCAGGCCGCCGCCGAGTCCATCCAGGCGGCGTGGCGGGCGTACAAGAGTCGCGGCTCCTACGGGACCGGCACGGCGAGCAGGACGCGCCCCGCACCGCCAGCCCGTCGCCGACCCGCTCCCATTGCCGGCACGCCGCCCCCCGAACTTCCTGACAAGTGCGATCCTCAGGTGGTCAAAAGCACCTGCTCTCTCTTCGGACTGGACCTGGAGAGGCCGCCGCCACTGCCGCCGTCGCGAGCCTACACGGTATCAAACGGAGTGAAGCTGGGGTATCCTCAGCAGCGCGTGGTGCGCGCGGACTGGGCCGAGGGCGGCGTGCGGCTCCGTGCCGGGGACTGTGTGTTAGCGCTGGGCGCGGCTCCCCGCGGGCTGGTGTCGGTGCAGACGGGCGGACGCACGCTGCCCGTACCTCACTCCGTGCTGGGTCCGCCGCGGGCCGCCCGCTCCGCGCCGCCCCCCCCTCCACCACATTAG

Protein sequence:

>DPOGS212363-PA
MATLGLSKVFILDKYFTELQKFWETEKKLQDASSSNEAVHLQRRLLSLSSELVTLRNHLHVGGAAAGPGTASAACGTGAPGAQSIGGRRKPPWRSARRATAGASAATASTGAAAPAASCAAAATRGVAGASCPASGGSASVPGSVAGPGSDVDDLIHLRGPLTEDAVTSVGPILVAMNSYTDVSNALTPSAARAHRPELARLVHDAVRHQADTGCPQAIILSGVSGSGKTYASMVLLRRLFDVAGGGPETDAFKHLAAAFTVLRSLGTAATHANSHSSRIGHFIEVQVTDGALYRTKIHCYFLDQTRVVRPPAGERNYHIFYQMLAGLSADERSQLHLDGYSAHDLRYLSSSHPRRPEPEDGARFHAWKSCLGVLGIPFLDVLRVLAAVLLLGNVHFADNAEGSAEPNGEAELVAAGSLLPHPRRRLMRGLGTRCARAGRVPARAPATAAAAAAARDSLAKALYCRTVATIVRRANSLKRLGSTLGTLSSDSNESVIQDAASRRASTAGGGARGRAGARSMAVLNDAVRHANDGFVGILDMFGFEDSAPSRLEHLCANLCAETMQHFYNTHVFKSSAESCREEGVSGALEVEYVDNVPCIDLVSSLRTGLLAALDAECAARSEPEHYVARIKNAHRGHLRLAEARPPHARRFAVRHYAGEVTYDASDFLEANRDAVPDELLAAFDTRTCEFGFATHLFGAELKALAAAGGPAGAQFRASPTAGGAAAAVVAASTLTQDFHTRLDNLLRTLVHARPHFVRCLRANATETPMHFERATVARQVRALQILETVQLMASGYPHRMRFRAFSCRYRALWGRGHAGVERESGASCARVLAAVAAAAAPPAPASPAAVRWALGKRHVFLSEAMRQVLERMRRARRQAAAESIQAAWRAYKSRGSYGTGTASRTRPAPPARRRPAPIAGTPPPELPDKCDPQVVKSTCSLFGLDLERPPPLPPSRAYTVSNGVKLGYPQQRVVRADWAEGGVRLRAGDCVLALGAAPRGLVSVQTGGRTLPVPHSVLGPPRAARSAPPPPPPH-