Monarch geneset OGS2.0

DPOGS215693
Transcript: DPOGS215693-TA; 4995 bp
Protein: DPOGS215693-PA; 1664 aa
Genomic position: DPSCF300041; 473256-505478
RNAseq coverage: 393x (Rank: top 31%)
Annotation
Heliconius: HMEL022548; 0.0; 86.13%
Bombyx: BGIBMGA003594-TA; 0.0; 73.04%
Drosophila: Klp31E-PD; 3e-180; 41.50%
EBI UniRef50: UniRef50_D6W864; 0.0; 48.05%; Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6W864_TRICA
NCBI RefSeq: XP_973053.2; 0.0; 48.05%; PREDICTED: similar to kinesin family member 21 [Tribolium castaneum]
NCBI nr blastp: gi|332027375; 0.0; 38.34%; Kinesin-like protein KIF21A [Acromyrmex echinatior]
NCBI nr blastx: gi|332027375; 0.0; 38.30%; Kinesin-like protein KIF21A [Acromyrmex echinatior]
Group
Gene Ontology: GO:0007018; 6e-111; microtubule-based movement
    GO:0005524; 6e-111; ATP binding
    GO:0003777; 6e-111; microtubule motor activity
    GO:0005515; 1.8e-31; protein binding
KEGG pathway 
InterPro domain: [5-507] IPR001752; 6e-111; Kinesin, motor domain
    [1354-1663] IPR011046; 1.8e-31; WD40 repeat-like-containing domain
    [1357-1660] IPR015943; 2.7e-28; WD40/YVTN repeat-like-containing domain
Orthology group: MCL11382; Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species
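
The lengths reported in this record can be cross-checked with a little arithmetic. The Python sketch below is illustrative only: the helper names `cds_length` and `span_length` are hypothetical, and it assumes that the genomic coordinates are 1-based and inclusive, which the record does not state. Under that assumption, a 1664 aa protein plus one stop codon accounts for all 4995 bp of the transcript, which suggests the transcript entry is coding sequence only, while the genomic span is much longer and so presumably includes introns.

```python
def cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein of the given length."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 4995
protein_aa = 1664
genomic_start, genomic_end = 473256, 505478

# True if the transcript is exactly protein codons plus one stop codon.
transcript_is_cds_only = cds_length(protein_aa) == transcript_bp

# Genomic span in bp; the excess over the transcript length would be intronic.
genomic_bp = span_length(genomic_start, genomic_end)
non_transcript_bp = genomic_bp - transcript_bp
```
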

Nucleotide sequence:

>DPOGS215693-TA
ATGTCTAGCGATGAGTCCAGTGTCCGCGTTGCAGTCAGAACAATCGTGGGCGGCCGCTCAACACTTCGCTCCTCGCCCACCGACGTGGGCACATCTACGTGGGCACCGAAACTACAACCTTTACTAATCCGTCCCCAAACACCCGCGGAGGTGGTTGAGGGATGTGGTATATGCGCGCGGGCGGGCGGAGCGGCTGGGGAGGGCGGCGTGGCGCTGGGCTCCGAGCGAGCCTTCACCTTCGACTACGCCTTCGACCCCGCCAGCGACCAGCAACAGCTTTACGACACGTGCGTCAGGAAGCTGGTCGAAAACGCGCTCGATGGCTACAACGCCACTGTACTCGCGTACGGACAGGAGCTGACAGTCGGATACAATTACACACATCCAGCAGATCACCGTGAAGATGCCATTGAAAACAATAGTAGACGTCTGTTTGTTTCGTCTATCAGCGGACATCCGGCTCTCGAACATAGACCTTTATCAGTTCTACCTGTAGATACTGATGCGGGCGATTACTTCATTTACATCAGGAGACTTTATGTGCTGCATCCTGTTAGACAAAGAGCGCTGGGGCTGCGCTATGTTAATGTCTCATATACACATGCGCACTTATATGTCTGTCTGATCCCGTATTGTGGTCACGCGTCGGCCATAGCAGACGATCAGACGCCGACTGGGTCTGGTAAGACTTACACTATGGGTAGTGGATGGGAAGGCGAAGACGTTTATGAGGAGAAGAGGGGCATCATCCCCCGAGCGATACGCGATTTGTTCGCGGGGGCGGACGAGAGGGCGGAGGCAGCTCGCTCACAGGGTCAACTTCCACCAGAGTTCTCAGTACAGGCGCAGTTTATAGAGCTTTATAATGAAGATATTGTAGACTTACTGGATCCAGCAAGAGATCCTTTCGCTAAGGGTACATTAAGAATAACTGAAGACGGCGTCGGGGGAGTCCGTATAGTGGGTGCTTCTATGCGAACAGTGAGAGGAGTAAAAGAAGCACTAGCAGCGTTACGAGCTGGTGCTCTGGCTCGAACCACGGCCGCTACGAACATGAACTCGTCATCTTCGCGATCACATGCAGTGTTCACATTGCTGTTACGACAAAGACGACTAGCTCCAGACCAAGATGCTGTTGATAGAGAAAATGATGGAGATACTCCCGAACAATACGAGACGCTGACAGCCAAATTTCATTTTGTGGACTTAGCCGGCTCGGAGAGACTGAAACGGACAGGTGCGACTGGGGATAGAGCCAAAGAAGGAATTTCAATTAATTGTGGCCTTTTAGCTTTAGGAAATGTGATATCAGCTTTAGGTGATAAATCACGAAAAGTTCTACACGTACCTTACAGGGACTCCAAATTGACAAGACTCCTTCAAGATTCTCTTGGAGGGAACAGCAACACGATAATGATAGCCTGCATATCGCCCAGCGACCGCGACTTCATGGAGACGCTGAATACATTAAAGTACGCGAACAGGGCGAGGAACATCAAGAACCGCTGTGTGGTGAACCAGGACCTCACCTCACGGACGATCAGCCAACTGAGGCAGGAGGTGGCGCGGCTCCAGCTCGAGCTGGCTGAGTACAAACAGGGTAAACGAGTGGTATCAGAAAATGGAGAAGAAGGCTGGAGCGACGTGGTCCAAGAGAACGCTATACTGAACGCTGAGGTGGAATCTCTGAGACGAAGAGTCAAAGCTATGCAAGGAACCATCGAACAGTTGTCCGCGAGGAACAGCGAGCTGGTTGCTGAGAAGGCCTTAGGTACCTGGTCACCCAAGAATGGCAGTCCCGAGACCGCAGACTGCTCGTTGACAGCATTAGTGCAGGGCTACGTGAGCGAGATAGAAGATCTCCGAGCTCAACTAATGGAAGCCAATTCGTTGTATGAAGCCAGCAGACGAAGGGAGGCTCGCACTAGACATGACTCCTCGCTGATGGACGCCTCCACCATACTGGATGACGCCAAACGAGAGTTATATAAGGAAAAAGA
ACTTCTGGCTCGTAGTATGGGTGAGCTGGAGTTCCAGCGCAAGTTGTCGGAATCCGGCAGTCAACCTATAGAAGAGAGGGAGAGAGCTGAGGGGGAGAGCGCAGGAGACTCAGAGCCGTCTGGAGAGAGTGACTCCGAAGATGAAGAGGCTACGGGTCAACGTCAGCTAACGGCTCAGCTAGCGGCGCTCAGTGAGGACATAGACACCAAGGCGCGGCTCGTGGAACAGCTGGAGGCTTCCCAGAGACGGCTGGCGGCGCTCAGGACACACTACGAACAGAGACTCGACCAACTACACGCACAGATCAAGGCCACTGGGGATGAGAGGGACAAGGTGCTCGCCTCGCTCGCGAGCCAGTCGTCCCAGCCGAGCGACAAATTGAAGCGCGTCAAGGATGAGTACGAGCGTCGTATGAGCACCATGTCCAGAGAACTGAAGCGGCTGCAAGCGGCTCAGAGGGAACATTCACGACTGCAGCGGTCACAACAGCACACGGCCACGCAGATACACACGCTGAGGAACGAGCTGCAGAACCTGAAGAGGGATAAGGTGAAGCTGGTTCAACGTATGCGCGCTGAATCCAAACGTCACGCACAAGCGGAAGCGGCCCGAGCGAAGGAGGTGGCGCAGCTGAGGAAGGAATCTCGGAAAAATGCTAATCTGATAAGAAGTCTGGAGGCCGAGACCAGGCTGAAGGAGCAAGTGCTGAAACGGAAGCAGGAGGAGGTGTCGCTACTGAGGAGAGGACACAGGGACAAACTCAGCGTCCGGGCCGCTGGGAGGCTGCACGATCGCGGTCGTTCTCGAAGGCCTCGCGAGCTGTGGAGTCGTCTGGAGTGCTGGGTGTCTCGTGCGTGTGCGGCTCGCGGCACGCTGGCCGAGCTGGAGGCGGCGTTGGAGCACCAGCTGAGGGAGAGGGACCGCGCCGCCGCCGCTCCCCCCCACCCCGACAACACACACCTACTCGCCTACCTGCGGGAGGCCATCGCGGAGACACAGTCGCAGATCATGCAGATAGAGGAGGAGAGCGAGGAGTCCGAGCTGCCGGCGGTGCTGTCCGCGGCGGAGGGAGAGGCCTCGCGCTACGCCCTGGAGAGACTCGCCGCCCTCACACTCACACACGCGCACGACGCCGCCCGCAGGCTGCACGCGCTCAACGACGCCAGAGCACAGCTGGCCGACTTGGAGGAGAAATACGAGCGCGCCATGAGCGCCCTCCGCGCCACCGAGGACCAGAACCTCAACCCGTGGGGCGCGCCGCCCGCCCTGGCCGCCCTACTCGCCGCTGTGTCCTCGGGGACCTCCACGAGATCCGTCTCTCCTGTCGATAGCGCACTGTTAGAGGTGAGACCTCGCGCTGTCGGCACGCGCGAGTCCAGCACCGCGCCCACGTCACCCCCCGACGATAACAGGAGCTCGCCGTTCCAGAGAAATACCGTCCGTCGTGGGTCTGTACGTCTGCGGGACCTCGGTGTGTACGGTCGCGAGGGGGTCGGCGAGGACCCCATGTCACAGTCGATGGTGGAGCCCGAGGTGCTCCGCCCCGCGCCGCTCAGCAGGGTGCCCAGCGCTCCTGGGAGTCTCAGAGGTCTCCAGCCCGTGTCGTCCCCCCTGTCCCCGCGCCGCGCCCCCGAGAGCCCCCGCCCCGCACGCAGGCCCCCCGGACTCGCCAAGCCAGCTTCCTTGAGCATTTGTGTGCACTGTACTCTGATAGCTCGCGTCTCACCGCCGGTAGCTAGCCGCCGCCCGCCTGCACGCCGCGGCCTGGGGTCGCGCGCTCCTCCTCACCGACCTCTCGATAGTCGAGGACTGACGCCGTCCGCTGGGAACAGTAGCATACCCTCCCCTCGCATGACGCCTCCACACTACCCGACAGCTTTTTTTATCCCCTCTCCCGCCCGCAGTGAGCCGGAGGGTACTCCGCCCGCCTCCCCCGGGGCCACGCGGCGGGCCCGGGACGACGACGTGTTCCTGAGACTCACGGGCGCCGCGCCCGACC
ACGCGCCGCAGGGGACCGTCAAGGAAATCACTGTTAAGCGGGCCAGTGTGGGCGGCGGCAGCTGGCTTCAATGTACTCACGTGGCGGAGGGTCACGTGGGCGCCGCGCTGTCCCTGGCCGTGGCACAGGACGCCATCTACAGTGGAGGGGTCGACCGCACGGTCCGCGGCTGGGACCTGTGTGCGGGTGTCGAGTCGTGGCGGGCGTGGTGCGGGGGCGCCGTGGTGGGCCTGTCGTGTGTGAGCGGGGGCTCCGACCCTCGCCTCGTGCTGGCCGCGGCCGGGGCGGCCGTCAAGATCTTCGACACCAGAACCAACCAACCGACTGCTGTCGTCGGGGGCGTCCGGGGCGAACGCGTCCGTCCGCGGCGGGGAGGCGGCCGTCACTGCTCTCTCCCTGGCGTCCGCTCACACGCTGTACACGGCCGCCGGGGACAAGCTGAGGCTGGCAGGGCGTGGTGTGAGTGTGTGTACAAGACGTGGTCAGGTCACGCGGCGGCCGTGATGTGTGTGGCCCGGGAGCCCCTCCCGGGCGGGGACAGGCTGGCCACCGGCTCCAGGGACCACTGCGTGCGGGTCATCGACCTGCAGCATAACGCAGGTTCCTGGGAGGCGTGTAACCGGCGTCTCCTGGAGCCTCCTCACTACGACGGCGTCCAGGCCCTGCTGCTGCGCGGCTCGTTCCTCTACAGCGCGTCCAGGGACTCCAGCCTCAAGTGTTGGTCGCTGACCGACAACACACTCACACATAGCGTGATGAACGCCCACAAGGGCTGGGTGACGGGTGTGTGTTCCCTGGGAGGAGGCCTGGTGAGTTGTGGCCGGGACCAGGCGCTCCGTCTGTGGAGCTCCGCCCTCCGGCCCGCCGCCAGCCCCGCCACGCTGCCGGACGCTCCGCACGCTCTCGCAGCACACACAAACACACACGGACACTCCGTGTACACCGCGGGCAGCGGCGGCGAGGTGCGAGCGTGGCGCCTGGTCAACGAGCCCTGA

Protein sequence:

>DPOGS215693-PA
MSSDESSVRVAVRTIVGGRSTLRSSPTDVGTSTWAPKLQPLLIRPQTPAEVVEGCGICARAGGAAGEGGVALGSERAFTFDYAFDPASDQQQLYDTCVRKLVENALDGYNATVLAYGQELTVGYNYTHPADHREDAIENNSRRLFVSSISGHPALEHRPLSVLPVDTDAGDYFIYIRRLYVLHPVRQRALGLRYVNVSYTHAHLYVCLIPYCGHASAIADDQTPTGSGKTYTMGSGWEGEDVYEEKRGIIPRAIRDLFAGADERAEAARSQGQLPPEFSVQAQFIELYNEDIVDLLDPARDPFAKGTLRITEDGVGGVRIVGASMRTVRGVKEALAALRAGALARTTAATNMNSSSSRSHAVFTLLLRQRRLAPDQDAVDRENDGDTPEQYETLTAKFHFVDLAGSERLKRTGATGDRAKEGISINCGLLALGNVISALGDKSRKVLHVPYRDSKLTRLLQDSLGGNSNTIMIACISPSDRDFMETLNTLKYANRARNIKNRCVVNQDLTSRTISQLRQEVARLQLELAEYKQGKRVVSENGEEGWSDVVQENAILNAEVESLRRRVKAMQGTIEQLSARNSELVAEKALGTWSPKNGSPETADCSLTALVQGYVSEIEDLRAQLMEANSLYEASRRREARTRHDSSLMDASTILDDAKRELYKEKELLARSMGELEFQRKLSESGSQPIEERERAEGESAGDSEPSGESDSEDEEATGQRQLTAQLAALSEDIDTKARLVEQLEASQRRLAALRTHYEQRLDQLHAQIKATGDERDKVLASLASQSSQPSDKLKRVKDEYERRMSTMSRELKRLQAAQREHSRLQRSQQHTATQIHTLRNELQNLKRDKVKLVQRMRAESKRHAQAEAARAKEVAQLRKESRKNANLIRSLEAETRLKEQVLKRKQEEVSLLRRGHRDKLSVRAAGRLHDRGRSRRPRELWSRLECWVSRACAARGTLAELEAALEHQLRERDRAAAAPPHPDNTHLLAYLREAIAETQSQIMQIEEESEESELPAVLSAAEGEASRYALERLAALTLTHAHDAARRLHALNDARAQLADLEEKYERAMSALRATEDQNLNPWGAPPALAALLAAVSSGTSTRSVSPVDSALLEVRPRAVGTRESSTAPTSPPDDNRSSPFQRNTVRRGSVRLRDLGVYGREGVGEDPMSQSMVEPEVLRPAPLSRVPSAPGSLRGLQPVSSPLSPRRAPESPRPARRPPGLAKPASLSICVHCTLIARVSPPVASRRPPARRGLGSRAPPHRPLDSRGLTPSAGNSSIPSPRMTPPHYPTAFFIPSPARSEPEGTPPASPGATRRARDDDVFLRLTGAAPDHAPQGTVKEITVKRASVGGGSWLQCTHVAEGHVGAALSLAVAQDAIYSGGVDRTVRGWDLCAGVESWRAWCGGAVVGLSCVSGGSDPRLVLAAAGAAVKIFDTRTNQPTAVVGGVRGERVRPRRGGGRHCSLPGVRSHAVHGRRGQAEAGRAWCECVYKTWSGHAAAVMCVAREPLPGGDRLATGSRDHCVRVIDLQHNAGSWEACNRRLLEPPHYDGVQALLLRGSFLYSASRDSSLKCWSLTDNTLTHSVMNAHKGWVTGVCSLGGGLVSCGRDQALRLWSSALRPAASPATLPDAPHALAAHTNTHGHSVYTAGSGGEVRAWRLVNEP-
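
The two FASTA entries above can be compared by translating the nucleotide sequence and checking it against the protein. The following is a minimal sketch rather than the pipeline used to build this geneset: `read_fasta` and `translate` are hypothetical helpers, the standard genetic code is assumed, and the stop codon is written as `-` only because the protein entry above ends that way. `read_fasta` joins wrapped sequence lines, which matters here because the nucleotide sequence is split over several lines.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def read_fasta(text: str) -> dict:
    """Parse FASTA text into {record_id: sequence}, joining wrapped lines."""
    records = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            current = line[1:].split()[0]
            records[current] = []
        elif current is not None:
            records[current].append(line.upper())
    return {name: "".join(parts) for name, parts in records.items()}


def translate(nt: str, stop_symbol: str = "-") -> str:
    """Translate from the first base in frame 0, ending at the first stop codon.

    Codons containing characters other than T, C, A, G become 'X'.
    """
    protein = []
    usable = len(nt) - len(nt) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(nt[i:i + 3].upper(), "X")
        if aa == "*":
            protein.append(stop_symbol)
            break
        protein.append(aa)
    return "".join(protein)


# Usage on the first and last codons of the transcript shown above.
start_peptide = translate("ATGTCTAGCGATGAGTCCAGTGTCCGC")
end_peptide = translate("AACGAGCCCTGA")
```

Running `translate` on the full `DPOGS215693-TA` sequence returned by `read_fasta` and comparing the result with `DPOGS215693-PA` would confirm whether the two entries agree over their whole length.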