Monarch geneset OGS2.0

DPOGS202331
TranscriptDPOGS202331-TA3234 bp
ProteinDPOGS202331-PA1077 aa
Genomic positionDPSCF300032 + 619756-631768
RNAseq coverage38x (Rank: top 73%)
Annotation
HeliconiusHMEL0032758e-16155.88% 
BombyxBGIBMGA000816-TA3e-6040.56% 
DrosophilaKlp98A-PA1e-6436.45% 
EBI UniRef50UniRef50_UPI00022CA4636e-7641.13%UPI00022CA463 related cluster n=2 Tax=unknown RepID=UPI00022CA463
NCBI RefSeqXP_624891.25e-7542.50%PREDICTED: similar to kinesin-like motor protein C20orf23 [Apis mellifera]
NCBI nr blastpgi|3838594473e-7740.81%PREDICTED: uncharacterized protein LOC100883481 [Megachile rotundata]
NCBI nr blastxgi|3838594471e-7438.72%PREDICTED: uncharacterized protein LOC100883481 [Megachile rotundata]
Group
Gene OntologyGO:00070181.9e-94microtubule-based movement
GO:00055241.9e-94ATP binding
GO:00037771.9e-94microtubule motor activity
KEGG pathway 
InterPro domain[1-350] IPR0017521.9e-94Kinesin, motor domain
Orthology groupMCL34439 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202331-TA
ATGTCAAACGTACAACTAGCCATAAGAATTCGACCATTCACGGATAAGGAGTTAAAATCTGAAAAGGACCGTGTACAGGTCGTCAGCGCGATAGATGAGACTTCTGTTGCAATAACGAATATAAAGGTGAGTCTATCAGGCGCTGGAGACAGTCGCGAGCGTGTGCGTCGTTACTTCGCAGATTTTACTTTTGATAGCGCCTGTGCAAGTTCACATCCTAATTATGCGACACAGGAAAAGGTATTTGATAGGGTTGGAAGTGAAGTCTTGTCAACGATAGCAAAAGGTCGTTCGGCCTGTGTTTTAACTTATGGCCAAAGCGCGACTGGGAAGACCCATACTATGATGGGTTCTCCAGAAGAACCTGGATTGATTCCACTGATTTGCCGAGCTTTAGCCAGACATAACCCACTGGATATAACTGTCAGTTATCTCGAGATATACAACGAGAGGGTTCACGATTTGTTGGCGACAGAGGTCCTGCCACTCAACTCTCTTCCGAGACGAAAGGGTAACGCGAGGAAGGATTTGCGGGTGAGAGAAAATCCCTCTAAAGGGACGTACGTTCAAAATCTTCGACGGGTACCAGTTCACGATTTAGAGGCTTTATTATCTGTGGTGAGTGAGGGTACGAGGCGAAGAAGGACTGCTGCCACCAAACGCAACAGCTCCTCATCAAGATCACATGCCTTGCTGGAGTTGTCCACTGCTTCTGCTACTTTACATCTTGCTGATCTTGCTGGCAGCGAAAAGTCCGGTTGGGAAGGCTGTGGTGGCAGGCAGAAGGAAGGATCTAACATTAACAAATCACTAGTCGCGCTGAGCAATGTCATATCAGCTCTAGTTAGCGGAGGCTCTAGCCGCGGCAGGTTCGTTCCCTTTCGCGATTCAGCTCTCACGTGGCTGCTTAAGGAGTGTTTTACGGGCGGAGGGAAAACTTATATTATAGCGACTGTATCACCAAGTGCAGCCTGCTACGGAGAATCAGCCTCAACCCTTCGTTGGGCGTCACAAGCTCGACAACTACCAGCGCGACCTAACGTCACTAAAACAGCTGTAACCAAATCAGAAATACAAGCGCAGTACAATCAACTTATAGCTCAACTAGAAAATCACTTCATACAATACAAACCAGAGTTAGCTTTACTGCAATACGATGATAAGCATTGGAAGTTGCAGTTAAATAAGGCACAAAACTTAGATGTTTCATATCCAGAAGCAAACATAGGTAATATAATGAATGTAGGATATTCGAAATCTGATTCTCGTAATCATGAATCGACTCAATCTTCCATCACAAGCGGAGGTTCGGATATTAATATGGAAAAAAATATGGAAGTCAGAAATGAAATTAGTAAAGAAATAGATGAACTATTTGGACCGACCCTCGAAAGAACTAAAAGCGGAAGTGACATTGAAGTTGCTGTGCCTCTAAGACATAAAAAACGTCAATTCAGATCACAAGAAGTTCTGCAAGATGAAAAATTGTCGCAACGTCTTAAAGAACATCGTTTATCTGATGCTGGTGTTAACAATGAAATTGTTACTGACGAAAAAAATTACGACAGCACCAAAACACACGTACCAATACTTTATGATAACCAACGCGCTGAGATCATAGCGTCTGTAACAGAGAGGTTGTATTCGAAGCTTAGAAAGAATGAAGATGTATCGAAATCAGAATATACACCTGATAAGAAACAGCATGAAACCAAGGTGCAAGCATTAGACGAGTTGAAAATCTGCTCCAATGCTCGACAACGTTTGATGGAGATCAGCAAAAAAGCTTTGAGGAACAAAAGGAAAATTGGTATACCAGCACACACTCAAACCCGAAAATCTGTTATTCGTGTTAAAGATCAAGGCATTGACGTCCAAACGGATCTTCAAACGTATGTTATCGGTACTCAGAACTTGACGTACTTATTTCGGCAAGACGTCAGTACTGAGACGATCACAATGACCCCGAGAGTAAAAGATATAGCGATTGGCTCTCAGCATGGTACATTATATTGCAAAGATGGATCAACAGCGACAGAACATAGAAAAATAACTTTAAAATCGTCCTCTGTGATGACGGATAACGTGTCGACATTCGACCGGTCTACACAAACACACATACAGCCGCCACCTCGAAGAAGGAGACGTTTATCAGCTTATTCAAAGTATATAAAAAAAATAGAAAGTATTAAGCCGTCTACAGACGAACATTATGCCTCACCTGTAATAAACATCAATATATCACCCATTCTTCAAGAAGATTCTGAATCGCAGTCATCTGATGAAATATCAGATAACCCATCAAGAGTTAACGAAGAAAATGAAAGGAGGAGTGCAGTGACTACCCCAGATCTATTATCAAATCACAATAGCCATATAAAGTTAGAGTGTGATAAAGTATGCGACAAAAATAACACAAAAAATATTTGTGATTGTGATTATTCGAATACAGCAAGCAATATCTCTTCAGGATCAATTAATTTTGACGATTCAGATACAGAGGATTTAATACTACCGAGGGTGACAGTAGATGCTTCAAGAAAAATAGGCCAAAAAGATTACGAGCAATTAATATTAGGTTATAATGATGATGTGTATCCGTACAATATAAAGTTGTCTCCCACAAGACAAAGGGACGAATCTAAGAGAGTTATAAGATTCAAGGACATAGATGTAACACCAGCAGAATGTTGCAGTGATGAATGGCAGAGAAAAGTCAACACGGTCACGAATGTATCAGACGAGGGGAGCGTTGATAATGATTTAGACAACAATTATTGTGAATCGATAAAGTCAAATAAAATAGAGACCGATTCATTTGTATGGAATAAGTGCGGTTCAGACACGACCGATACGAGGCCATGTTCAGAAAGAAAATCAGTTAAATCTAAAAGTATGAGGGCAAAAATGTATGATGAGTTGGACGATTTCACCAGAGCGACCTGTTTCTGCGCAGGCGCATCAAACAATGACGACGATTATAATAATTCATATAAAAATGCAAGGAAGAGCAGCTTAAAGAGTTCTAAGACCTTCGATGTTTTCGAAAGGAAAATAAAAAGCGCCTGCAACAGTTTAGAGAACTCTGTTAATAAATACGACGATTATTTAATGAATTTTAGAGAGAGGAGTAAGCAGAAATACAAAGACGCTGTTCAAAGGCGGACTCCCACTGAATATTTGCAGCATTTAATCAGATTAAGAAGGGAGGCTGTATCGTCGGACTCTTTCTGA

Protein sequence:

>DPOGS202331-PA
MSNVQLAIRIRPFTDKELKSEKDRVQVVSAIDETSVAITNIKVSLSGAGDSRERVRRYFADFTFDSACASSHPNYATQEKVFDRVGSEVLSTIAKGRSACVLTYGQSATGKTHTMMGSPEEPGLIPLICRALARHNPLDITVSYLEIYNERVHDLLATEVLPLNSLPRRKGNARKDLRVRENPSKGTYVQNLRRVPVHDLEALLSVVSEGTRRRRTAATKRNSSSSRSHALLELSTASATLHLADLAGSEKSGWEGCGGRQKEGSNINKSLVALSNVISALVSGGSSRGRFVPFRDSALTWLLKECFTGGGKTYIIATVSPSAACYGESASTLRWASQARQLPARPNVTKTAVTKSEIQAQYNQLIAQLENHFIQYKPELALLQYDDKHWKLQLNKAQNLDVSYPEANIGNIMNVGYSKSDSRNHESTQSSITSGGSDINMEKNMEVRNEISKEIDELFGPTLERTKSGSDIEVAVPLRHKKRQFRSQEVLQDEKLSQRLKEHRLSDAGVNNEIVTDEKNYDSTKTHVPILYDNQRAEIIASVTERLYSKLRKNEDVSKSEYTPDKKQHETKVQALDELKICSNARQRLMEISKKALRNKRKIGIPAHTQTRKSVIRVKDQGIDVQTDLQTYVIGTQNLTYLFRQDVSTETITMTPRVKDIAIGSQHGTLYCKDGSTATEHRKITLKSSSVMTDNVSTFDRSTQTHIQPPPRRRRRLSAYSKYIKKIESIKPSTDEHYASPVININISPILQEDSESQSSDEISDNPSRVNEENERRSAVTTPDLLSNHNSHIKLECDKVCDKNNTKNICDCDYSNTASNISSGSINFDDSDTEDLILPRVTVDASRKIGQKDYEQLILGYNDDVYPYNIKLSPTRQRDESKRVIRFKDIDVTPAECCSDEWQRKVNTVTNVSDEGSVDNDLDNNYCESIKSNKIETDSFVWNKCGSDTTDTRPCSERKSVKSKSMRAKMYDELDDFTRATCFCAGASNNDDDYNNSYKNARKSSLKSSKTFDVFERKIKSACNSLENSVNKYDDYLMNFRERSKQKYKDAVQRRTPTEYLQHLIRLRREAVSSDSF-