Monarch geneset OGS2.0

DPOGS200259
TranscriptDPOGS200259-TA2151 bp
ProteinDPOGS200259-PA716 aa
Genomic positionDPSCF300026 - 1149000-1163757
RNAseq coverage33x (Rank: top 74%)
Annotation
HeliconiusHMEL0211725e-13790.07% 
BombyxBGIBMGA007234-TA0.085.87% 
DrosophilaKif19A-PB1e-15047.53% 
EBI UniRef50UniRef50_UPI00020622F52e-16548.87%UPI00020622F5 related cluster n=1 Tax=unknown RepID=UPI00020622F5
NCBI RefSeqXP_974613.27e-17951.90%PREDICTED: similar to GA22117-PA [Tribolium castaneum]
NCBI nr blastpgi|1892401501e-17751.90%PREDICTED: similar to GA22117-PA [Tribolium castaneum]
NCBI nr blastxgi|2700116818e-17851.65%hypothetical protein TcasGA2_TC005733 [Tribolium castaneum]
Group
Gene OntologyGO:00070181.2e-107microtubule-based movement
GO:00055241.2e-107ATP binding
GO:00037771.2e-107microtubule motor activity
KEGG pathway 
InterPro domain[20-345] IPR0017521.2e-107Kinesin, motor domain
Orthology groupMCL15850 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200259-TA
ATGACGCAGAGTAGCTCCGGTTCAGACAAGTTTCACGGCAGCGGACGCTCTGTTAGTGAAGAAAAGTTAATGGTCGCTGTGCGTGTGCGGCCCCTTCGTGCTGATGAAGGTCCCCGGATTGTCCACGTGGTCAGCGATAAGATGTTAGTCCTGGAAGAGGAAGCTGACATGCGTAAAGACGTTCTTCGCCAACGACGTGTCAATGACAAACATTACATCTACGACCGTGTTTTCGCTGAGGAAAGCACTCAAGAAGAGGTGTACGAGGCAGTTTGCGCGCCGCTCGTTGGTGACACCCTGAACGGTATAGCGGGTGCTATATTTGCTTACGGCGCCACGGGGGCTGGTAAAACACACACCATGACTGGCCTCATGTCACGAGCCCTCAACCATCTTTTCACATCAATCGGTGAAAGCGACGAGCCAAACTCCTTTGAGGTGAAGATGTCCTATATTGAAATTTACAATGAAAATATTCGTGATCTTTTGAACCCTGGAGCTGGTTTCTTGGAACTTCGAGACGAGGGCAGCAGCGGTCCCTCGATTGTGGCTGGTTTGAGTGAGATCCGTGCCGAAAATGCGACTCATGTAGCCGAGCTCCTAGCTAAAGGGGATCGCTCGCGGACTGCCGAGTCTACGTATGCGAATCAACACTCGTCCAGAGGTCACGCGTTACTTAGTGTATCTGTGAGCAAGACAGTTACAAAGGGTGTCCAGCGAGGTCGCTTGTTCCTCATCGACCTAGCTGGTTCTGAACGGGCGGGGGCGAGGGCTCGGAGACTAGAGGGCGCTCATATCAACCGCTCGTTACTCGCTCTTGGTAATTGCATCATGGCGCTGTCAGGTGGCGCGAGGTATGTGAACTACCGTGACTCCAAGTTGACCCGCCTCCTGCGCGAGGTTCTGGGCGGCAGGTGTCGCACCGCGATGGTGGCTCACGTGTCCCCGGCGGCCGGCCACAGAGACACCACGCGCTCGACGCTACATTACGCGCAGAGAGCGTCCGCCATCACCAACAAGGTGGAACGTGAGTTCATTGAAACTCCGATGCATTTATCTCAATACCGAACAGTCATCAGTGAACTACGAGAGGAGATAGCTCGCCTCAAGACCAAGATGAGAGACGACCGTACAAAAAGCAGAGAGGAACCAATAATAGACGAGACGATATCGAAAGAAGAGTCCGAAGAGAACTCAGCGCATCTGAAGAGTCTGAGGGAGGCGATCGTGTCTACTTTCAAACAACAGATGAGGCTGCGGCGCCGGCTGATGGAGCTGGACAGCCACCTGCTGGGGCTCGCCCTGGACGCGGAGAGGCAGCACGCGGCCATCTCTCACTGGGAGGCTCGCTTTAACAGGCTATACAAACCTATCAACTTCACCGGCTCCAGAATGAGCACACAGCAAAGTTATAGGGGCGGTGGGGGCAGTAGCGGCAGCGAGCGCGCGGAGGCCGAGGTCTCAGTGGAACAGGCCTGGGCTGAACTCGCGGCTGTCGAGAGGGAACAGGAAGCCGCGAGGACAGAGAGGCTGAGGGTTGAGAGACAGCTGGAGCAAGTCAGGCTGAGAGGAGCCCAGCTCGAACAGGAACTACCAGCGCACATATCAAGTGGTCCTGAACGAGAAGTATTAGCTTTGGTTTGTCGAGTTCATGAACTGGAGGCTGACAAGTTAGCGCTGCAGGGAGAACGTGCAGCGAGATCACACGAACTAAGGCGGAGGGATCTAGCGCTGCAGCGGCGAGACGCGCAGAGGAGGCTCACTGATGAAATTATTACCAGGCAGAGGCGGGCGCTGGAGGAGTACGGCGCAGGGGCGCAGGCGGATCTGGCGCAGCTCTACGAACTGTACCAGCAAGAGATCCACGCCTCCACCTACACGGAGACCAACGAGTACTACTCCCCGTACCGTTTACCGCCGATATCCACCAGCATGTCAGAATTGACTTGGTCGGAGAGCTCGAGCGGCTCGGGTAGAGGCTCGAGCTCGGGCTCGGGAGGAACGTTTGCCTTGGACCGTCTGCCACAGCTCGCGCCCACTCCTCGACCGCGCACCCGGTATAGTCAAGCGAGTGCCACAACCCTGAAGCGTTACTCGAGCGATGACAGTCTCGTCATCACCGCGCCGCGCGCCAGCGACAGGGCCGCTTGA

Protein sequence:

>DPOGS200259-PA
MTQSSSGSDKFHGSGRSVSEEKLMVAVRVRPLRADEGPRIVHVVSDKMLVLEEEADMRKDVLRQRRVNDKHYIYDRVFAEESTQEEVYEAVCAPLVGDTLNGIAGAIFAYGATGAGKTHTMTGLMSRALNHLFTSIGESDEPNSFEVKMSYIEIYNENIRDLLNPGAGFLELRDEGSSGPSIVAGLSEIRAENATHVAELLAKGDRSRTAESTYANQHSSRGHALLSVSVSKTVTKGVQRGRLFLIDLAGSERAGARARRLEGAHINRSLLALGNCIMALSGGARYVNYRDSKLTRLLREVLGGRCRTAMVAHVSPAAGHRDTTRSTLHYAQRASAITNKVEREFIETPMHLSQYRTVISELREEIARLKTKMRDDRTKSREEPIIDETISKEESEENSAHLKSLREAIVSTFKQQMRLRRRLMELDSHLLGLALDAERQHAAISHWEARFNRLYKPINFTGSRMSTQQSYRGGGGSSGSERAEAEVSVEQAWAELAAVEREQEAARTERLRVERQLEQVRLRGAQLEQELPAHISSGPEREVLALVCRVHELEADKLALQGERAARSHELRRRDLALQRRDAQRRLTDEIITRQRRALEEYGAGAQADLAQLYELYQQEIHASTYTETNEYYSPYRLPPISTSMSELTWSESSSGSGRGSSSGSGGTFALDRLPQLAPTPRPRTRYSQASATTLKRYSSDDSLVITAPRASDRAA-