Monarch geneset OGS2.0

DPOGS205108
TranscriptDPOGS205108-TA3381 bp
ProteinDPOGS205108-PA1126 aa
Genomic positionDPSCF300172 - 101413-110885
RNAseq coverage62x (Rank: top 68%)
Annotation
HeliconiusHMEL0029530.086.25% 
BombyxBGIBMGA005875-TA0.080.46% 
DrosophilaCG14535-PA7e-15034.21% 
EBI UniRef50UniRef50_D1ZZG60.051.39%Putative uncharacterized protein GLEAN_08042 n=3 Tax=Tribolium castaneum RepID=D1ZZG6_TRICA
NCBI RefSeqXP_972268.20.051.44%PREDICTED: similar to CG14535 CG14535-PA [Tribolium castaneum]
NCBI nr blastpgi|1892364270.051.44%PREDICTED: similar to CG14535 CG14535-PA [Tribolium castaneum]
NCBI nr blastxgi|1892364270.052.96%PREDICTED: similar to CG14535 CG14535-PA [Tribolium castaneum]
Group
Gene OntologyGO:00070188.5e-58microtubule-based movement
GO:00055248.5e-58ATP binding
GO:00037778.5e-58microtubule motor activity
KEGG pathway 
InterPro domain[44-334] IPR0017528.5e-58Kinesin, motor domain
Orthology groupMCL15646 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205108-TA
ATGCTACGAGTGGGCGCGTCAGGAGAGGGACCCTTCACAAGCGGTACCCAGACCTTTTCTCTTGATAAACGTAAGAGACAAGTTACACTCTGTGAGACAGCCACTGCAGCAGCCGCTCCTGAAGACAGGAAAGTTGGTGTCGCAGCTCCAAAAATGTTTGCTTTCGACGCTATTTTCTCACAAGACGACCCCCAGACGGAAATATGTTCGAGCGCGCTAACTGATGTTATACACGCCGTTATAAATGGAACCGACGGCTGTCTCTTCTGCTTCGGACATGCAGGACTAGGCAAGTCGTATACAATGTTGGGACGGCCGGATTCATCTTCAACTTTGGGAGCCATACCATGCGCGATCTCGTGGCTCTTCCGAGGTATAGCGGAACAGAAACACAAGTGTGGCACACGCTTCTCAGTCAGAGTATCCGCTGTTGAACTCTGTACCAACACCAATCAAATACGTGACTTATTGGCTCCATATCACAACGACACGGAGCAATCACCTGGAGTCTATCTGCGCGACGATCCCTTGTTTGGAACCCAGTTGCAGAACCAATCGGAACTCCGAGTTCACAGCGCGGAGCGTGCTGCCTTTTATTTGGATGCAGCGCTCGGAGCACGGGTCAGGGAAGAGGGGAGGGATAGCCATCTGTTATATACTTTACATGTCTACCAGTATAGTGTGGGCGGCAAAGGCGCTGTTGCCGGAGGCCGCAGCCGTTTGCACTTGATAGACTTAGGAAACTCGGAACGTGGGAAAACTAATGGAGGAATACCACTATCCGGTTTAGGCAATATACTTCTTGCTATATTTAATGGCCAACGACATTTACCGTACAGAGACCACAATTTAACACACGTCCTAAAAGAATGTTTAGGATCCCTAACTTGTCATACAGCTATGGTAGTACACGTGTCTCCAAACGTCCAAAATTATTCTGACACTTTATCAACACTGCAGTTGGCATCAAGGATACATAGATTGAGACGGCGAAAAGTAAAATATAGCGGAAATAATAATGCAGGCTCGGGAGGAAGTTCTGGTGAAGATGCTTCAAAACCTAGTAGTAGTGAGCCAGACCCTTCCAGCAGTGATTTGTCGGCAGACACTGTAATATATGTGGGTTCATTAGACGATGCAACCGATGGTGAACATCCCCCGGTGTACATACCACACATTAATTCTAGTGACAACAGATGTTCATTTAGTAAAGCTTTAAGAGGATCTGCCGCTGAACATAAACACAAATCTTCATTATCTAAAGTAGTGGAGGAAAAAAGTCCTATACATAAATTACATGCATCGCCAAAATCATCTCCTCTAAAAAGTGCTATACCAACCTTCACACCAAAATCGTCACCAGTGCACACATCTACTCCAAAAGCTGCACCCTTGAAAAAACAAGGCTCAAAGCATAGTGAAGGTACAAGCGACGAACAATGGATAGATGGTCCTAGGATATCTAAATCGAAATTAGTTGAAGCGCGGCATATTATAAAAGAAACGCAAGTTAAAAAGAGAGAAACTTGGATTGATGGGCCAATGCAGGAACCTCAAATCCCGTTACAATTTGAATCTGTGCCAATACAACCTAATGCAGTACCACTACAAGTTGGAGTCCTTAATTCCTATGGCAATGAAAATATTGGATATGGATATATGGATAATCACAAAAAGAATATGATAAAAAAATGGGTGGAAAACCAAACATCCCAAATACACAAATCGAGGCACAACTCGCCCAGTCATAAAGCTACACCACAACATCCTCAAAAAGATTGTGTCAGATTACCGGACGATAATGATACAATACATCGAATGGAAATAAAAGATGCAACGAGACGGATACATGTTGAGGAATCAACAATCAGAACTGGATTAAAAGCTGGATCTAAAGGCAACGCAATAGAAGAAGAAAACATAGGTCCTCCAGAACAGGGGCCCGTACGAAAGATTAGTAGTGCTAGATCTTCAACAAAAAATGAACCTGACGACGACGATGATAGTGAAGAGTTGGCAGAAATACCTCCTGCATTGCCTTTAATTCAACCTAGTAGCCTAAATAGTAGAGAAGTGTCTATGGAAAGCTTAAATAATAAATATAAAGAAAGGATGAGGATGGATGATGAAATGATATCAAATCATAGCCGAGACATTGATCCACACGGAATGAGTAAAGGTCCAGATGAAGAAGACGAGGATATTCTAGAAATAATCGAGGTAGAAGAACCTCTTGAACCAGTTCCTATGCAAGATTGTTGCCTTCAAGTGACTGAAGAAGACATTGCTTATTGTATGGGATATACGGACAATCATTTAGCTGATTATGAGCACGAGGAAGGTGATGACCATCCTCTTAGAATATTAAGCCAAGAAAATTTAACTGTCGCTTCCACATTCACTGATACACTTTCAATGTATAGTGAAGTAGAAAGGCAGATTAGGACTGAAGCATATCTAAGACAAACAATCGGGTTTTACACTCAGGGTTATAGTGAAAATACACACGAAATTAACCCACACGAAAGTGTCCATTCATCAAGAACGAGATTGGAAGATCTTTTGGGTCTAACGGAACTGTACGGCTCAAGAAGAGTTTTAGACAGTAATAATACACCTCCTTCAAAAATAGCTCCTCAGTTTCAATCGTTGTCATTATGTAATGTTAGAGATACTGAAGAATATCATGGTGACGGTTCTGTTTACAGTGAACCAGCATATCGGCCAAGTGATAAAATATGTGATAGTTGCAAACGGTCTATGTCAAGGCCGGGTAGTGCTGTGGAGCAATGCGATGCATATCATGATTTAGGTGCCGCCCACAACGATCCCTATGGCCCATTCGAAAGATACAGACCAAATGACATCACCGATGCTCGAATAGCATCTTTAAGGCATCCGGATGGAGCATCCGACCCAAACTTACGAGAAGAGAACCGATTACCAGTAATCGGAGGCCCTTCAAGCACATTTCGAGAACTTAAATCCAATTTACATCTTGAAGTAGCGCCACCCATAGTTACTGTGGAACCACGTTTAGAAAAATGTGAAAAAGGTGACAAAAACGTCGCCAGAGTGGTAGCTGGTTGTAAGCCTGACGGTTACGACAGTGGTCATGAATCAACACCGAGAACAGGGAAACACAGCCCCGCTGCTACATCGAGAAGGGCAGAATCAGGTTATGACTCGGTCCCGAGAGATTCTGACGCATCTTCCTTAGACTCATACCCAACGAGACGGGCGGCTGTAGCACGAGCTCACGCAAAAAAACACACCACAGCAAAATACAAACAACACAGTGACAGATCATTCTGCTCGTGGCTAAGAAACCCCTTCACGTGCAAGTACGCGGATACGGACCCCGAAATTAGTGATTTTTAA

Protein sequence:

>DPOGS205108-PA
MLRVGASGEGPFTSGTQTFSLDKRKRQVTLCETATAAAAPEDRKVGVAAPKMFAFDAIFSQDDPQTEICSSALTDVIHAVINGTDGCLFCFGHAGLGKSYTMLGRPDSSSTLGAIPCAISWLFRGIAEQKHKCGTRFSVRVSAVELCTNTNQIRDLLAPYHNDTEQSPGVYLRDDPLFGTQLQNQSELRVHSAERAAFYLDAALGARVREEGRDSHLLYTLHVYQYSVGGKGAVAGGRSRLHLIDLGNSERGKTNGGIPLSGLGNILLAIFNGQRHLPYRDHNLTHVLKECLGSLTCHTAMVVHVSPNVQNYSDTLSTLQLASRIHRLRRRKVKYSGNNNAGSGGSSGEDASKPSSSEPDPSSSDLSADTVIYVGSLDDATDGEHPPVYIPHINSSDNRCSFSKALRGSAAEHKHKSSLSKVVEEKSPIHKLHASPKSSPLKSAIPTFTPKSSPVHTSTPKAAPLKKQGSKHSEGTSDEQWIDGPRISKSKLVEARHIIKETQVKKRETWIDGPMQEPQIPLQFESVPIQPNAVPLQVGVLNSYGNENIGYGYMDNHKKNMIKKWVENQTSQIHKSRHNSPSHKATPQHPQKDCVRLPDDNDTIHRMEIKDATRRIHVEESTIRTGLKAGSKGNAIEEENIGPPEQGPVRKISSARSSTKNEPDDDDDSEELAEIPPALPLIQPSSLNSREVSMESLNNKYKERMRMDDEMISNHSRDIDPHGMSKGPDEEDEDILEIIEVEEPLEPVPMQDCCLQVTEEDIAYCMGYTDNHLADYEHEEGDDHPLRILSQENLTVASTFTDTLSMYSEVERQIRTEAYLRQTIGFYTQGYSENTHEINPHESVHSSRTRLEDLLGLTELYGSRRVLDSNNTPPSKIAPQFQSLSLCNVRDTEEYHGDGSVYSEPAYRPSDKICDSCKRSMSRPGSAVEQCDAYHDLGAAHNDPYGPFERYRPNDITDARIASLRHPDGASDPNLREENRLPVIGGPSSTFRELKSNLHLEVAPPIVTVEPRLEKCEKGDKNVARVVAGCKPDGYDSGHESTPRTGKHSPAATSRRAESGYDSVPRDSDASSLDSYPTRRAAVARAHAKKHTTAKYKQHSDRSFCSWLRNPFTCKYADTDPEISDF-