Monarch geneset OGS2.0

DPOGS202874
Transcript        DPOGS202874-TA  3924 bp
Protein           DPOGS202874-PA  1307 aa
Genomic position  DPSCF300490 - 6530-23544
RNAseq coverage   665x (Rank: top 19%)
Annotation
Heliconius        HMEL017963        0.0     86.31%
Bombyx            BGIBMGA005102-TA  0.0     89.20%
Drosophila        Klp3A-PA          3e-122  36.60%
EBI UniRef50      UniRef50_B4ND90   2e-158  34.45%  GK10187 n=3 Tax=Drosophila RepID=B4ND90_DROWI
NCBI RefSeq       XP_002071813.1    3e-159  34.45%  GK10187 [Drosophila willistoni]
NCBI nr blastp    gi|383855126      3e-158  38.19%  PREDICTED: chromosome-associated kinesin KIF4-like [Megachile rotundata]
NCBI nr blastx    gi|383855126      8e-176  36.79%  PREDICTED: chromosome-associated kinesin KIF4-like [Megachile rotundata]
Group
Gene Ontology     GO:0007018  5.1e-133  microtubule-based movement
                  GO:0005524  5.1e-133  ATP binding
                  GO:0003777  5.1e-133  microtubule motor activity
KEGG pathway 
InterPro domain   [1-394]  IPR001752  5.1e-133  Kinesin, motor domain
Orthology group   MCL10683  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202874-TA
ATGAGAACAGAGATTGAGCGAGGGTGTGACGAATGTATTGACGTTGTATCTGGAGAACAACAGGTGCAAATAAAAGACCTGGCCTTTACTTACAATTATGTTTTCCCTCAACATATCAACCAGCAAGAGTTTTACGACACAGCAGTTAAAGGACTTATCGGGAAACTATTTCAAGGATACAATGTTACAATACTCGCCTACGGCCAAACTGGATCAGGTAAAACCTATACAATGGGAACAAATTATTCTGGATCTGACGGTGACTCAACTAAACTAGGTGTAATCCCCCAAGCTGTAGCTGATATATTTGACTTCATCGAGACCCATGAAGACAAGTTTATATTCAAAGTGTCTGTCTCCTTCATGGAGTTGTATCAAGAGCAATGTTACGACCTGCTTTCCGGCAAGGAGAGGGGTCATAGTATTATAGAAATTAGGGAAGATATAAACAAAGGCGTCATTTTACCTGGTATAACTGAATTGCCGGTTGCATCGACTCGTGAAACTATGACGGTGTTAGAAAGAGGATCATCTGGCCGAGTGACCGGTTCCACAGCGATGAACCAGGCATCGAGTCGTAGTCATGCAGTATTCACAATAGTAATTGCTAAAGAGAGCAGAAGTGATAAGAATTTAGCAACAACATCGAAGTTCCATCTCGTGGATCTGGCTGGCTCAGAGCGTATCAAGAAGACTAAGGCCAGCGGCGAACGACTCAGAGAAGGTGTCAAGATCAACCAAGGCCTACTGGCTCTAGGGAACGTTATATCAGCCCTCGGCGACGGAACCAATAGGAGCTTCATCAGCTACAGAGACAGCAAACTCACGAGATTGAATTTAGCAACAACATCGAAGTTCCATCTCGTGGATCTGGCTGGTTCAGAGCGTATCAAGAAGACTAAGGCCAGCGGCGAACGACTCAGAGAAGGTGTCAAGATCAACCAAGGCCTACTGGCACTAGGGAACGTTATATCAGCCCTCGGCGACGGAACCAATAGGAGCTTCATCAGCTACAGAGACAGCAAGCTCACGAGATTGTTACAAGACAGCCTGGGCGGTAATTCCTTAACATTGATGGTGGCGTGCGTCAGTCCGGCCGACTACAACCTGGACGAGACCGTGTCCACGTTACGGTACGCGGACAGAGCCAGACGTATACGGAACAAGCCTGTTATTAACCAGGACGCTAAGGCTGCTGAGATTGTAAGGTTGAACAATTTGGTTAACGAGCTCAGACTGCAACTGCTCGGGAAGCTTCCCACCATAAGCGAGCAGAACAATGAACAATTGCAAGAAGAATTAGACAGGGAGAGAGCAAAGTACTCTGAGTTGCTCAAGAAACACAAACAAGTCACCGAACACTTGAACAATATGTTGATAGAAAACACGAACCTGTGCGAGAAGGCGCTCCTGGCCGAGGCCGCCAAGGATAAAATAGAACGCAAGCTGAACGAAATGACGGAGCACTGCAACCAGACCATAGAGCATCTGAACGTGACGGACACCTCGCAAGACGACGCGCAGAAGTCGACGGTAGTGGACTACTTGAAGGAGATCAAGATGAGGCTGGAAGACTTACAGTCCGTCAATCTGAAGACTAACGAGGAGCTGATCGATCACGAGATAAAGCTGTCGTTCGTTAAAGAGGACGTGGACGGGGAGAAGGCGGACGACGACGTGATGCTCAACGAGGACCAGGCCGTGCTGGAGGAAGAGAAGCGAGCCATGGGACAGGTCGCACTCAATCAAGAGCTGCAAGAGCTTAACCGCGCTATGGCCATCAAGGCATCCGTGGTTCAAGCTATACTGGCCAACAACAAGGAAATATTGGACAGTCATAACAATCTGAAAGAAAACGAAGAAAGGATATCGCAATTAGAAAAACAAAGGGATGAGCTCATGCAGCAGTTGAAGCAGTCCAAGAGCAAAGATCCCTCGATGGAAGAACGCCGTACGAAAGTTTCTACGCTGGAACAAGAGATATCAGACCTGAAGAGGAA
GTGTCAGCAGCAAGCGAACATTATAAAAACAAAGGAAAAGAACGAGGCTAAGATCGCTGCGCTGAATGCTGAGCTGCAAGCCATGAAAGCCACTAAGGTTAAAATAATCCGTCAAATGCGTGAAGAGAGCGAGAAGTTCCGCAAGTGGAAGGCTGACAACGAGCGCGCCATGCTGCGGCTCAGGAACGAGGACCGGAAGCGGGCCACGGCCATGGCGAAGATGGAGTCTCTACACGCCAAGCAACAGAACGTGCTGAAGAGGAAGATGGAGGAGGCCGTGGCCGTCAACAGGAGGCTCAAGGAAGCTCTGGATCGTCAGAAGCACACGGCCATGAAGCGGAACGCCAAAGGCAGTGTGAAGGCTGGTGCGTTACAGCAGTACATAGAGCAGGAGTTGGAGGTGCATCTCAGTATAGTGGAGGCGGAGAGGTCGCTCGAGGAACTCATGGAATATAGGGCCTGGATAACGGAACAGATTGAAAATCTCCGTAATAGTGCGGACGATGAAGCGAACAGAAAGAAAATAACGGAGCTGGAGGACGATCTCGCGCTCCGGAAGGCACAGATATCAGACCTACAGCAGAAGATACTAACAGCTGACCAGGAAAACAAATCGCGCACTCAATGGGACAACATACAGTCGATGCTTGAAGCGAAGGTAGCGCTTAAATGTCTTTTCGAGCTGCTCGTGGACGCTAAGAGGGAGCTGCAGAACCAGAGCGAGAAGGGCTACCAGGCGAGGTACGAGGAGATCAAAGAGACATACGACAGGCTGGCCGTGGAGTTCGAGACTACTAAGACTGAGTTCGAACGACAACTGGCCTCCGTGAAGTTACAGAACGAGCAGAAGCTAAGTGCACTAGTGGCGCTGCAGCGCGGCGTCGTTGGACGGGGTGAGCGAAGCGAAGCATGCCGTCATCTGCAGAATGTGATACAATACCAGCAGGACCGCCTGGAACAGCTGGAACAGGAGAACAAAAAGTTGGCCGATGAGTTAGAAGAGCTGAGGACGGCGAGCAAGAAGGCGAAGAAACGCAGCAAGAAGGATAGTTCGATGGAAGCTACAAAGAAGGTGGAATATGTAGAGCCCACCGACGAAGAGGACGACGAGGTCGAGGATCCTGACAAGGATCCTGACTGGAGAGCGACGCCTCTGTTCAAGAGAATACAGGCGCAGCGATCTCGTCTCACGATGAATTTCACTATGGACGAGAACAGAGCCATCAAGCGTTCCAACGACGGCGCGACTCACTGCACGTGCCGCGGCAGTTGCTCCACTAAGATGTGCGGATGTGTTAAATCGGAACGCGGATGCGGAACCGGATGCAGGTGTCAGGCGGAACTGTGCAAGAACAGGAGAGTGTCCTCCGACAGCGAGGATAAGGAGAATAATCCGTCCAGCACCGAGTACTCGCTAGACACGACACCGCCGTCTTCATACTTTGACAAACGATTCGAATCATCCGACGAGTTGAATCAAACGAATTGTGATGAAAAACCAGCGCCGAGTCCGCAGCAGATAGCGAGGCTGCGCCCCCAGTCCGAGAGAAGGTTGATAACTAGCGGCCGTACACAGAAATGGCTACTGGCTTTACTAATATACAACACGTGTGTCGCGCCGGTGTCCGAGTACTGCGCTAAGAGTCTCATTCAATACGGGCTAGACGTGGCCAAAGACAACGTCCTGTCTAGTATAATGAAATATATGACGCGAGAGGTTATTCTGATAGATCACCTTGACGCTACGTATAAAAAAAAAAAGAAAAGTTATTTTTTCCCACAAGACGCTACCAACGGCGAGGTTAAAAAGAAAATAACCGAAGGTGTGAAGACAGAACAAGGGATGATGAGTTTCGTCACGAATCAAGAAAGGCGCGAAGGCTTGCTTTCGTTGCTCGGAGAAATCACGGACCGGTCCCAGTAG

Protein sequence:

>DPOGS202874-PA
MRTEIERGCDECIDVVSGEQQVQIKDLAFTYNYVFPQHINQQEFYDTAVKGLIGKLFQGYNVTILAYGQTGSGKTYTMGTNYSGSDGDSTKLGVIPQAVADIFDFIETHEDKFIFKVSVSFMELYQEQCYDLLSGKERGHSIIEIREDINKGVILPGITELPVASTRETMTVLERGSSGRVTGSTAMNQASSRSHAVFTIVIAKESRSDKNLATTSKFHLVDLAGSERIKKTKASGERLREGVKINQGLLALGNVISALGDGTNRSFISYRDSKLTRLNLATTSKFHLVDLAGSERIKKTKASGERLREGVKINQGLLALGNVISALGDGTNRSFISYRDSKLTRLLQDSLGGNSLTLMVACVSPADYNLDETVSTLRYADRARRIRNKPVINQDAKAAEIVRLNNLVNELRLQLLGKLPTISEQNNEQLQEELDRERAKYSELLKKHKQVTEHLNNMLIENTNLCEKALLAEAAKDKIERKLNEMTEHCNQTIEHLNVTDTSQDDAQKSTVVDYLKEIKMRLEDLQSVNLKTNEELIDHEIKLSFVKEDVDGEKADDDVMLNEDQAVLEEEKRAMGQVALNQELQELNRAMAIKASVVQAILANNKEILDSHNNLKENEERISQLEKQRDELMQQLKQSKSKDPSMEERRTKVSTLEQEISDLKRKCQQQANIIKTKEKNEAKIAALNAELQAMKATKVKIIRQMREESEKFRKWKADNERAMLRLRNEDRKRATAMAKMESLHAKQQNVLKRKMEEAVAVNRRLKEALDRQKHTAMKRNAKGSVKAGALQQYIEQELEVHLSIVEAERSLEELMEYRAWITEQIENLRNSADDEANRKKITELEDDLALRKAQISDLQQKILTADQENKSRTQWDNIQSMLEAKVALKCLFELLVDAKRELQNQSEKGYQARYEEIKETYDRLAVEFETTKTEFERQLASVKLQNEQKLSALVALQRGVVGRGERSEACRHLQNVIQYQQDRLEQLEQENKKLADELEELRTASKKAKKRSKKDSSMEATKKVEYVEPTDEEDDEVEDPDKDPDWRATPLFKRIQAQRSRLTMNFTMDENRAIKRSNDGATHCTCRGSCSTKMCGCVKSERGCGTGCRCQAELCKNRRVSSDSEDKENNPSSTEYSLDTTPPSSYFDKRFESSDELNQTNCDEKPAPSPQQIARLRPQSERRLITSGRTQKWLLALLIYNTCVAPVSEYCAKSLIQYGLDVAKDNVLSSIMKYMTREVILIDHLDATYKKKKKSYFFPQDATNGEVKKKITEGVKTEQGMMSFVTNQERREGLLSLLGEITDRSQ-