Monarch geneset OGS2.0

DPOGS215184
Transcript: DPOGS215184-TA; 3240 bp
Protein: DPOGS215184-PA; 1079 aa
Genomic position: DPSCF300143; 335261-344546
RNAseq coverage: 776x (Rank: top 17%)
Annotation
Heliconius: HMEL009261; 0.0; 54.53%
Bombyx: BGIBMGA008726-TA; 0.0; 78.57%
Drosophila: Klp61F-PA; 0.0; 57.12%
EBI UniRef50: UniRef50_G6DRQ6; 0.0; 98.90%; Putative uncharacterized protein n=3 Tax=Obtectomera RepID=G6DRQ6_DANPL
NCBI RefSeq: XP_002047052.1; 0.0; 54.77%; GJ13211 [Drosophila virilis]
NCBI nr blastp: gi|195376535; 0.0; 54.77%; GJ13211 [Drosophila virilis]
NCBI nr blastx: gi|194748555; 0.0; 39.26%; GF10068 [Drosophila ananassae]
Group
Gene Ontology: GO:0007018; 5.1e-180; microtubule-based movement
Gene Ontology: GO:0005524; 5.1e-180; ATP binding
Gene Ontology: GO:0003777; 5.1e-180; microtubule motor activity
KEGG pathway:
InterPro domain: [12-361] IPR001752; 5.1e-180; Kinesin, motor domain
Orthology group: MCL11527; Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215184-TA
ATGACGTCTGAAAAAGGATGCAGAAAAGAAAAAAATCAAAACATTCAAGTGTTCGTGAGGCTGAGACCTCTGAACCAAAGGGAGAGGGACATCAAGTCTCTAGGAGTGGTGGAAGTTGTTAACAACAGGGAGGTGGTTGTGCGTCAGTCACAACAGAACTCGTTAACTAAGAGGTTCACCTTTGACAGAGCATTCCCCCCTCACTCAAAACAGGTGGAGGTTTACCAAGAGGTGGTGAGTCCTCTGATTGAGGAGGTATTGGCTGGTTACAATTGTACGGTGTTTGCCTACGGTCAGACAGGCACCGGTAAGACTCACACCATGGTCGGAGAGGCAGCACAGGATGAGACCACTTGGCAGAATGATCCACTGGCTGGTATAATTCCGCGAGCCTTGAGCCAGCTGTTTGATGAGCTCCGTCTGTCCAACACTGAATATACCGTGAGGGTTTCCTACTTGGAGCTTTACAATGAAGAGCTGGTAGACCTGCTGTCAACAGAAGACGGTTCCAAGCTGCGCATCTACGAAGATGTAACCAGGAAGGGAGCTAATATAGTCAATGGTCTAGAAGAAATAACTGTGTACAATAAGAATGAGGTGTACAAGATCATGGCTCAAGGACAGGAACGGAAGAGAGTGGCTTCCACACTCATGAACGCTCAGTCCAGTCGTTCTCACACCGTGTTCACGATCGTGGTTCACATGAAGGAGAACAGTCCCGAGGGAGAGGAGCTGGTGAAGATCGGCAAGCTGAACCTCGTGGACCTGGCCGGCTCCGAGAACATCAGCAAGGCCGGCAGTGACAACCCTGCCAAGAGGGAGCGGGCCAGGGAGTGTGTTAACATCAACCAGTCCTTACTGACGCTGGGCAGGGTCATCACAGCACTTGTTGAGAGACATCCGCACATCCCATACAGGGAATCAAAGCTGACCCGCATCCTCCAGGAGTCCCTGGGAGGCAGGACGAAGACCTCGATAATAGCGACCATCTCACCTGGACACAAAGACCTGGAGGAAACTATGAGTACACTGGAATACGCCAACAGAGCGAAGAACATCCAGAACAAACCAGAAGTTAACCAGAAGCTGACCAAGAAAGCGATATTGAAGGATTACGCTGAGGAAATTGATAGATTGAAGAGAGATCTCCACGCAGCCCGGGAAAAGAATGGAGTGTATCTGGCTAACGACACGTTCGCCGAAATGACGCTCAAGCAAGAAGAACAGCGGAAGGAAATACAGGAGCTGCTGCTGAAGAAGAGGGCTATGGAGGAGGAGAGGGAGAGGATCCGCCAGGTGTTCCAGGAGTTGAGGAACCAGCTGGAAGATAAACTGAGAGACACACAGGACCAACTGGCCAAGACCAATGAGAACCTCCGGACAACGAAACAACAGTACTTGGAACAGCGACATCTGGCGAGCGTCCGCGCTAACACCGAGAAAGTACTAACCGAGCAAGCGACAAAGATTCTAGAAGCAGCGGACTGCGCCTCCGTTGATGCCATGGGGCTTCACGACGCGGTCGACCGGCGGAGGAAGCTGGAGGAAGACAACCTCGGCCTCACCACCAGCTACAGGGAACACGCCAGGAAACACCGCGACTACGCCAGGGAAGATCTCAACGACTACATGCACTCGCTGATCGGAGCGCTCAAAGAAATTACTGATCATTTAGACACCTTCACTAACATGTGCAGTCAGACACACTGTGATAATGAACAGACGCTCAATGAATTAGTCAAGAACACTATGAGTGTCATGGAAAAGATAAAGACAATGAAAGATAACTTCAGCGCTGATATTGAAGACACGGCGAAGACGGAGCTGTGTTCACTCAGAGAAAACGTCAAGTCCAGAGAAGGAGCCATTCTAGACGTACTGAAACGGCTGTCGAACGGAATTAAGACGAATTTGAAGAAGACGGAGGTCGACGACCTCGGGCTGTTACTGAGAGTTATCGAGGAACTGAGGTCGGCGTCCTCGGCGCGGGTGAGGGCGGT
TCGTGAGTCCGGCGGCCGAGTGGAGAAAGCGGTCCGGGAGTCCGGCGGCCGGCTGAGGAGCGCCGCGGGAAATGCCAGGGAACATACGGACGGACACGCGGGAGCCCTGGACCGCGCCGTACGGGACAGACTCGCTCTCTTGATGGAGGACAAGTCGAGGACGGAGAAGTATCTAGCAGAACAAATGGCTGTAGCGGAGAACGAGAGAAGAGATATGGAGACGCGACTGAAGAAGTGGGAGGAAGAGGAAGTGAAGCGGATCAAACTACACGTGGAGACAGAGAGATGTCTGCTAGAAGATAGACTGGCTGCCAGAGTCAAGGATATAGAAGAACAGAGAAGGGTCATGGAGGACAGAATGACACAGAATATGGAGTGGCTGAACGGTCTGTTGGACTCCAGCGACACGCAGAGACGAAAGGCGGAAGAGGATCTACAGGCCTTGGAAACAGAACTACAGAGCTGTATGACGAGTGTTGATGGATGTATACAGGAGATGGATCAAGAAAGCATTGGAATAATAAGCGACACGGACAGAAGGACAGAGGACGTCGTGAAGGTTGTTGAGAGAATACAAGCAGACCTACAGGGAACGGTGACGGGGACAGAGACGTGCATACAGTCGATAGAGGCTAACAGTCAAACACTCAGTGAGGTCGCGGACCTCGCCGATAAAACCGTCAGTTTGATCGAACGGAAATGTAACGAAACTGTGGATTCAATAGTTAGTGAGTTCGAAACTGCATCCTCAGAAACTATGGAAAATGTCCGTCGTGTGGGGGCGTTGTCTCACAGCTTAGCGGAGAACACGCAGGGACACCGCCACGAACAGGGCGAAGCCCTACAAGGGGTGCGGCAGCGGGCGGAGGCGCTCCAGAACGACCTGGTCAGGGAGCTGGACGCGTTGAACCAACTCGACGAGCAATGCTCGACTCCGGCCCGCCGACCTTACAACTACCCTCGCTCGCTGGCTTCCACCTCCCCGGAGTCCTTGGTGCTGGCCAGATTGACGCTGCACTCCAATTCCGTCCTGGAGGACTCCAGCGACGAGGACCTCGACTCCAACACGGAGGAGAGCATCGTCAACGGCAGCTCCGACGCCAACAACAGTCGCAAGCAGCGCGAGGAGTCAGGGAAGGAGAACGTGAACTCGAACCGCTCGATGAGTTTCAAGAAGCCGTCCAAGTTGCCAGCTCCCTCCAGCATCAAGAAGCCGCTCGTGGATAGAAACGACTACTGA

Protein sequence:

>DPOGS215184-PA
MTSEKGCRKEKNQNIQVFVRLRPLNQRERDIKSLGVVEVVNNREVVVRQSQQNSLTKRFTFDRAFPPHSKQVEVYQEVVSPLIEEVLAGYNCTVFAYGQTGTGKTHTMVGEAAQDETTWQNDPLAGIIPRALSQLFDELRLSNTEYTVRVSYLELYNEELVDLLSTEDGSKLRIYEDVTRKGANIVNGLEEITVYNKNEVYKIMAQGQERKRVASTLMNAQSSRSHTVFTIVVHMKENSPEGEELVKIGKLNLVDLAGSENISKAGSDNPAKRERARECVNINQSLLTLGRVITALVERHPHIPYRESKLTRILQESLGGRTKTSIIATISPGHKDLEETMSTLEYANRAKNIQNKPEVNQKLTKKAILKDYAEEIDRLKRDLHAAREKNGVYLANDTFAEMTLKQEEQRKEIQELLLKKRAMEEERERIRQVFQELRNQLEDKLRDTQDQLAKTNENLRTTKQQYLEQRHLASVRANTEKVLTEQATKILEAADCASVDAMGLHDAVDRRRKLEEDNLGLTTSYREHARKHRDYAREDLNDYMHSLIGALKEITDHLDTFTNMCSQTHCDNEQTLNELVKNTMSVMEKIKTMKDNFSADIEDTAKTELCSLRENVKSREGAILDVLKRLSNGIKTNLKKTEVDDLGLLLRVIEELRSASSARVRAVRESGGRVEKAVRESGGRLRSAAGNAREHTDGHAGALDRAVRDRLALLMEDKSRTEKYLAEQMAVAENERRDMETRLKKWEEEEVKRIKLHVETERCLLEDRLAARVKDIEEQRRVMEDRMTQNMEWLNGLLDSSDTQRRKAEEDLQALETELQSCMTSVDGCIQEMDQESIGIISDTDRRTEDVVKVVERIQADLQGTVTGTETCIQSIEANSQTLSEVADLADKTVSLIERKCNETVDSIVSEFETASSETMENVRRVGALSHSLAENTQGHRHEQGEALQGVRQRAEALQNDLVRELDALNQLDEQCSTPARRPYNYPRSLASTSPESLVLARLTLHSNSVLEDSSDEDLDSNTEESIVNGSSDANNSRKQREESGKENVNSNRSMSFKKPSKLPAPSSIKKPLVDRNDY-
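
The transcript and protein lengths listed in this record are consistent with each other: 3 x (1079 + 1) = 3240, that is, 1079 codons plus one stop codon. The trailing "-" on the protein sequence appears to mark that stop. As a minimal sketch, the following Python shows one way to check a transcript against its protein by translating it with the standard genetic code. The helper names `read_fasta` and `translate` are hypothetical and not part of any tool used for this geneset, and the choice of "-" as the stop symbol is an assumption made to match the sequence as displayed here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped lines."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:]
            records[name] = ""
        elif name is not None:
            records[name] += line
    return records


def translate(cds, stop="-"):
    """Translate a coding sequence; stop codons are written as `stop`.

    Codons containing characters other than T, C, A, G become 'X'.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        protein.append(stop if aa == "*" else aa)
    return "".join(protein)


if __name__ == "__main__":
    # First seven codons of DPOGS215184-TA followed by its TGA stop codon.
    fragment = "ATGACGTCTGAAAAAGGATGC" + "TGA"
    print(translate(fragment))
    # Length bookkeeping for the full record: codons plus one stop.
    print(3 * (1079 + 1))
```

To check the full record, the two FASTA entries above could be passed through `read_fasta` and the output of `translate` on DPOGS215184-TA compared with the DPOGS215184-PA sequence.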