Monarch geneset OGS2.0

DPOGS202795
TranscriptDPOGS202795-TA6021 bp
ProteinDPOGS202795-PA2006 aa
Genomic positionDPSCF300018 - 680360-697819
RNAseq coverage37x (Rank: top 73%)
Annotation
HeliconiusHMEL0124550.035.36% 
BombyxBGIBMGA010475-TA0.082.33% 
Drosophilakl-2-PB0.050.94% 
EBI UniRef50UniRef50_UPI0002247D2D0.055.06%UPI0002247D2D related cluster n=1 Tax=unknown RepID=UPI0002247D2D
NCBI RefSeqXP_967448.20.057.15%PREDICTED: similar to dynein axonemal heavy chain-like protein [Tribolium castaneum]
NCBI nr blastpgi|1892409710.057.15%PREDICTED: similar to dynein axonemal heavy chain-like protein [Tribolium castaneum]
NCBI nr blastxgi|1892409710.057.13%PREDICTED: similar to dynein axonemal heavy chain-like protein [Tribolium castaneum]
Group
Gene OntologyGO:00070181e-118microtubule-based movement
GO:00302861e-118dynein complex
GO:00037771e-118microtubule motor activity
KEGG pathwaynvi:1001213030.0 
 K10408 (DNAH)maps-> Huntington's disease
InterPro domain[1141-1533] IPR0042731e-118Dynein heavy chain
Orthology groupMCL10001 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202795-TA
ATGTTCAACAATATAGCTGCTAAGCTGCTGCCAACTCCCAGCAAGATGCATTATCTGTTTAACTTGAGAGATATATCCAAAATATTTCAGGGTTTACTGAGAAGCAACAAAGACTACCAGAACACTAAACCACGGTTTTTAAGGCTGTGGGTCCACGAGTGCTTCAGAGTGTTCGGTGATCGTTTGACCGAAGAAAAGGACCGAGATTGGTTCATGAATCACATGGGTGACATGTTGGGTAAGCACTTCGAGCTGACCTTCCACGCGCTCTGCCCCTCCAAGAGCCCGCCCATCTTCGGACACTTCCTCAATCCCTTCGAGGTGTACGACGATTTGAATGACCCCAACGCTTTAAGAAAGTACATAGCGAAACCAGATGGAGGAGTACAACAGCTGCCCGGGCGTCGTGAAGATGGACCTGGTGCTGTTCAAGGACGCCATCGAACACATCTGCCGCATCGTGCGCGTCATCTCGCAGCAGCGAGGGAACATGCTCTGCGCGGCTCCGGTCGACAAAGTTTAACCCGAGTAGCGTCCTACATCTGCGAGTGCAACTCGTTCCAAGTCGTGGTTACCAAGACCTACGGCGTCAAAGATTTCCGAGAAGATTTGAAGGTGCTATACACAAGCTGCGGAGTAGACACTAAGAAGACTACATTTATTTTTTGCGACACTCAGATAGTCGAGGAAACATTCACAGAGATCGTCAACAATTTGCTCAGCAGCGGAGAGGTCACCAACCTTTACAAACCTGATGAATTTGAAGACATTAAATCGGCTCTGGAGAAGCCCATGAAGAACGCGAATATACTACAGACAGCTGAGAGCGTGTATCTGTTCTTGGTAGAGCGAGTGAGAGCCAACATGCGGATAGTGCTGTGCTTCAGTCCCATTGGAGATGAATTTAGGAATCGCATTCGTCAGTACCCTGCACTTATCAATGCTACAACCACGAATTGGTTCCTGGAATGGCCGCGGGAGGCGCTGCTCGAGGTCGCCTATAAGTTTCTTGATGGAGTAGAGCTGTTGGCTTCTATCACTGGAACCAGGGTTCGCAGGAAGGAGTCCCTGGTGGAGAGTCGAGAGGATAGCATGAGGGCGGCAGTAGCGTCGACAATGTCGCTGATACACTCGTCCGTGGGGCGCGCGTCCGTTAAGATGTGGCAGGAGATGAGGCGAACCAACTACGTGACGCCCACCAACTACCTCGAGCTGGTGGCCGGATACAAGGAAATGCTCAAGAGCAAACGAATTGAGATTGCTCTGCAAGCGAACAAGCTTCGGAACGGCCTGGGGAAGGTCGAGGAGACGACTAACTTAGTGGGGCAGATGTCGGAAGAACTCGCTCTCGCTCAGGTTCAAGTCGCTGAGTACACGGAGCAGTGTATAGAGTATATGGGCGTGATCAACGTGCAGCAACGCAACGCTGACGAGCAGCAACGCAGTGTGGCCGCGCGCAGCAAGAAGACCATGGAGGAGGAGGTGCAGTGCAAGAAGCTGGCCGATGCCGCCATGAGAGACCTCGCCAGCGCCATGCCAGCTCTCGAGGAAGCCGTTAAGGCTTTAGACGCTTTGAACAAGAAGGACATCACTGAGGTCAAGTCTTACGCCAAGCCGCCGCAGAAGGTGGAGATGGTCTTGGAAGCTGTGCTTATATTGCTCCAGAAAGAACCAACCTGGGCTGAGGCGAAGAGACAACTCGGTGATCAGTACTTCTTAGATCGGTTGCGTGATTTCGACAAAGATAACATATCTGACAAGACGTTGAAAAAGATTGGAACTTACACCGCCAAGCCAGATTTTGATCCAGAGATCGTTGGCACTGTGTCGCTAGCGGCCAAGTCGCTGTGTTTGTGGGTGCGGGCTATAGAGAAATATGGGAAAGTTTACAAAATAGTGAAACCGAAAAAGGAACGTCTAGAAGAAGCGCTGGAGTCTTTGAGGGAGAAACAAAGGATATTGGCCGAAGCCCGCGCCAGGCTGAGGGAGCTGAGTGAGATGATCGCCCGCCTGCAGAAGGAGTACGACGAGAAACTGGCGCAGAAGGAGGAGCTGGAGAGGAAGTCGCGTCTGCTACAGCTCAAGCTGGAGAGAGCTGAGGCTCTCATCACAGGCTTATCTGGTGAAAGGGAAAGATGGGAACAAACCGTGGAACGTCTCGACAAGGAGTTCGAGAACCTTCCAGGGGACTGTCTGATTGCGACCGGCTTTGTGGCTTACCTCGGACCCTTCATCTCCGAATACCGGGAAGACCTCATGGGTGACTGGTTCAGAGAGGTTTTCGACCAAACTCTACCAGTGACGATGGACTTGAGCATGAAGACGTTTCTTCTCGATGACGCTACGCTCAGAGATTGGAATTATATGGGACTGCCAGATGATAACTTCAGCGCCGAGAATGGTATCATCGTAGTCCGAGCGACTCGCTGGCCGCTGGCGGTCGACCCTCAGGGACAGGCGCTCATTTGGATCTCACGCCTGGAGGAAAACAACCAGATACAGATAGTGGACTTTGGTCAGCCGAACTACCTTAAGATCATGGAGAATTGTTTATCGCAAGGCAAACCTATCATCATTCAAAACGTCGGGGAAGTTCTCGATCCGTCCATCGCTCCGATATTGGACAAAGCGATTGTCAAAATCGGTTCCGATATGGTCATCAAGTTCAATGAGAAAATGGTCCCATACAATCCAGAATTCAAGATGTACTTGACCACTAAGTTAGGTAATCCTGTTTATAGTCCGGAAACCTTGACGAAGACGACGATGGTGAACTTCGCGGTGAAGGAGCAGGGTCTGACGTCACAGCTGCTGAGCATCGTAGTGAGGAAGGAGAGGCCGCAGCTCGAGCAGATGAAGGACACCCTGGTCATGACCATAGCCAATAATAAGAAAGTACTTATGGACCTGGAGAACGACCTCCTGCGCATCATGTACGAGTCGCAGGTGCCTCTGTTGGAGAACGAAGAACTGTTCCAGACGTTGCAGGTGTCGCAGAGGACATCCCTCGAGGTGAAGGAAGCTCTCATCACCTCGCAAGATACTGAAAAAGAAATCGATGCGGCGAGACAGGGTTACGTACCCGTAGCAACTCGAGCTGCGGTTCTGTTCTTCGCTCTCAACGACCTAGCTCGTATCGACCCCATGTACCAGTTCTCCCTGGACGCCTACAACGATCTCTTCACCTACTCCATCGACCGAAGCCCTAAGGGCGGCGACCTCGAGGACCGGATCACTAACTTGAATGAATTCCACACCTTCGCCGTGTACAAGAACACATGTCGCGCGGTGTTTGAGAGACACAAGCTGTTGCTGTCTTTCCACATGGTGTCCCGAATCCTGTTCCAGGCCGGCAAGATGAGCAACCACGAGTACCTGTTCCTGCTGAAGGGCGGAGTGGTGCTGGATCGATCTGAACAACCCGATAACCCTACCAATTGGTTGCCGGAAGACTGCTGGGACAACATTACAGAGCTGGACAAACTGCCGGGCTTCCACGGCATCTGTGATACTTTTGAGACCTTCTCCAAGGAGTGGAAGGAGTGGTACCTGCATCCTGAGCCGGAGTCACAGCCGCTCATAGGCGAATGGAACGAGATTTGCAACGAGTTCCAACGCATCCTGTTGGTGCGGTCGCTGCGCGTGGATCGCGTGTCGTCCTGTGTAGCGGCCTACGTCGTTAACACGCTCGGCCCGCGGTACGTCGAACCCCCCGTCCTGGACATACGGGCTGCTTGGGAAGAGTCTACCTGGAAGACGTCTCTGCTGTTCGTGTTGTCCCCGGGCGTTGACCCCACGTCGGCCTTGATCCAGCTAGCCACCGATACAAAGATGTTCGATAAGTTCCAATCACTTAGCTTGGGTCAAGGTCAGGCGCCCACTGCTGCCAGGATGTTGAGTCAGGGTATGAAGGAAGGTGGCTGGGTGTTCCTGGCAAACTGTCACCTGGCGTGCGTATGGCTCGGCGCGCTGCGAGGCCTCGACAACCCCAAGATACACCCGCGGTTCCGGTTGTGGCTGTCGTCGATGCCTGATGACAAGTTCCCTCTCAACATACTGCAGAGGAGCATCAAGATGACCACAGAACCACCTCAGGGCTTGAAGGGCAACCTCGTCCGTATATTCGCGAACCTCAACGAGGAGAAGTTCGACCAGGCCACCCCCAAGTATCGACGTCTGTTGTTCTGTGTGTCCTTCTTCCACTGTTCCCTCATAGCTCGCAAACGGTTCCGACAACTTGGATATAACGCTGTTTACAGCTTCAATGATTCTGATTTTGAGGTATCCGATAACCTCCTAGCAAATTATCTGGAAGAGTATGAGGAGGTCCCTTGGGATGCCCTGCGCTACTTATTCTCTATCATCAACTACGGCGGTCACATCACCGACGACTGGGACAAGCGAGTTCTCATCGCATACATCAACCAGTTCTTCTGCGAGGAAGCCCTCGATACGCCATTCTATAGGTTGTCCAGTATCCCAGCCTACCACCTCCCCCGCGACGGTAGCCTGGAATCGTACCGCGACTTCCTGGACCTCCTGCCGGCCTCCGAGCGAGCCGAATCCGTCGGCCAGCACGCCTCCGCTGATGTAGCGACACTCGCACAGGGCTTGAAGGGCAACCTCGTCCGTATATTCGCGAACCTCAACGAGGAGAAGTTCGACCAGGCCACCCCCAAGTATCGACGTCTGTTGTTCTGTGTGTCCTTCTTCCACTGTTCCCTCATAGCTCGCAAACGGTTCCGACAACTTGGATATAACGCTGTTTACAGCTTCAACGATTCTGATTTTGAGGTATCCGATAACCTCCTAGCCAATTATCTGGAAGAGTACGAGGAGGTCCCTTGGGATGCCCTGCGCTACTTATTCTCTATCATCAACTACGGCGGTCACATCACCGACGACTGGGACAAGCGAGTGCTCATCGCATACATCAACCAGTTCTTCTGCGAGGAAGCCCTCGATACGCCATTCTATAGGTTATCCAGTATCCCAGCCTACCACCTCCCCCGCGACGGTAGCCTGGAATCGTACCGCGACTTCCTGGACCTCCTGCCGGCCTCCGAGCGAGCCGAATCCGTCGGCCAGCACGCCTCCGCTGACGTAGCGACACTCGCACAGGACGCCCGCATAATGTGCTCCACGTTGTTCGCGCTCGCCTCCACCGGCGGCGGCGGGGCGGGGGGAGGGGAGGATCAAAAGGTCGATGAGCTGGCTGCGGAGATGCTGTCGAAGTTGCCGAGTCGTATAGACGTGGAGACCACGGAGAGGATGATGGGTCCGGAGATCGTGATGCCCATGTGTGTCAGTCTGCTGCAGGAGATCGGATACTACAATGTGCTCATCAGCACCATCATCGCCGGCCTCAAGGAACTGCGACGTGCCATTGAGGGTCTGGTGGTGATGTCCGAGATGTTGGAGATCATGTACACGTGTATATTCGAGGGCCGCGTGCCGTCCTTCTGGCAGAAAGGTCGTCCGTCTATAAAGCCGTTGGGCGCGTGGTGTAGGGAGCTGTTCCTCCGCGGCTCTCACCTGACGTCGTGGTCCAACGCGCCCCGCTCTCCCCCCACCCTCTGCTGGCTGCCGTCCCTCGTCGCGCCGACAGGATTCCTCACCGCTGTGATGCAGACGACAGCGCGCGGGGAGAGCTGGCCCATCGACACTCTGTGCTGGGAGTTCACAGTCATGCCGCTGGAGGAGACTGCCTTCGTCAGACCGCCCCGAGACGGCGGGGTTTACATACGGGGTCTGTTCCTGGAGGGCGCCAGTTGGTTCCGGAAGGACGGTCACCTGCAGGAGCCGCTTCCCATGCAGCTGGTGTTCCCCATGACTCCGATACACTTCAAACCCATACGAGCCACCGGGAGGCGCCTGAGAAATCGTTACGTATGTCCCTGTTACTACTACCCTCTCCGTATGGGCGCCTTCGTGGTGGCCGTCGACCTTCACTCCGGCAAGGAGTCTTCCGATTTCTGGGTCAAGCGAGGCACCGCGCTGCTTTGCACTCTGGCCACTTAA

Protein sequence:

>DPOGS202795-PA
MFNNIAAKLLPTPSKMHYLFNLRDISKIFQGLLRSNKDYQNTKPRFLRLWVHECFRVFGDRLTEEKDRDWFMNHMGDMLGKHFELTFHALCPSKSPPIFGHFLNPFEVYDDLNDPNALRKYIAKPDGGVQQLPGRREDGPGAVQGRHRTHLPHRARHLAAAREHALRGSGRQSLTRVASYICECNSFQVVVTKTYGVKDFREDLKVLYTSCGVDTKKTTFIFCDTQIVEETFTEIVNNLLSSGEVTNLYKPDEFEDIKSALEKPMKNANILQTAESVYLFLVERVRANMRIVLCFSPIGDEFRNRIRQYPALINATTTNWFLEWPREALLEVAYKFLDGVELLASITGTRVRRKESLVESREDSMRAAVASTMSLIHSSVGRASVKMWQEMRRTNYVTPTNYLELVAGYKEMLKSKRIEIALQANKLRNGLGKVEETTNLVGQMSEELALAQVQVAEYTEQCIEYMGVINVQQRNADEQQRSVAARSKKTMEEEVQCKKLADAAMRDLASAMPALEEAVKALDALNKKDITEVKSYAKPPQKVEMVLEAVLILLQKEPTWAEAKRQLGDQYFLDRLRDFDKDNISDKTLKKIGTYTAKPDFDPEIVGTVSLAAKSLCLWVRAIEKYGKVYKIVKPKKERLEEALESLREKQRILAEARARLRELSEMIARLQKEYDEKLAQKEELERKSRLLQLKLERAEALITGLSGERERWEQTVERLDKEFENLPGDCLIATGFVAYLGPFISEYREDLMGDWFREVFDQTLPVTMDLSMKTFLLDDATLRDWNYMGLPDDNFSAENGIIVVRATRWPLAVDPQGQALIWISRLEENNQIQIVDFGQPNYLKIMENCLSQGKPIIIQNVGEVLDPSIAPILDKAIVKIGSDMVIKFNEKMVPYNPEFKMYLTTKLGNPVYSPETLTKTTMVNFAVKEQGLTSQLLSIVVRKERPQLEQMKDTLVMTIANNKKVLMDLENDLLRIMYESQVPLLENEELFQTLQVSQRTSLEVKEALITSQDTEKEIDAARQGYVPVATRAAVLFFALNDLARIDPMYQFSLDAYNDLFTYSIDRSPKGGDLEDRITNLNEFHTFAVYKNTCRAVFERHKLLLSFHMVSRILFQAGKMSNHEYLFLLKGGVVLDRSEQPDNPTNWLPEDCWDNITELDKLPGFHGICDTFETFSKEWKEWYLHPEPESQPLIGEWNEICNEFQRILLVRSLRVDRVSSCVAAYVVNTLGPRYVEPPVLDIRAAWEESTWKTSLLFVLSPGVDPTSALIQLATDTKMFDKFQSLSLGQGQAPTAARMLSQGMKEGGWVFLANCHLACVWLGALRGLDNPKIHPRFRLWLSSMPDDKFPLNILQRSIKMTTEPPQGLKGNLVRIFANLNEEKFDQATPKYRRLLFCVSFFHCSLIARKRFRQLGYNAVYSFNDSDFEVSDNLLANYLEEYEEVPWDALRYLFSIINYGGHITDDWDKRVLIAYINQFFCEEALDTPFYRLSSIPAYHLPRDGSLESYRDFLDLLPASERAESVGQHASADVATLAQGLKGNLVRIFANLNEEKFDQATPKYRRLLFCVSFFHCSLIARKRFRQLGYNAVYSFNDSDFEVSDNLLANYLEEYEEVPWDALRYLFSIINYGGHITDDWDKRVLIAYINQFFCEEALDTPFYRLSSIPAYHLPRDGSLESYRDFLDLLPASERAESVGQHASADVATLAQDARIMCSTLFALASTGGGGAGGGEDQKVDELAAEMLSKLPSRIDVETTERMMGPEIVMPMCVSLLQEIGYYNVLISTIIAGLKELRRAIEGLVVMSEMLEIMYTCIFEGRVPSFWQKGRPSIKPLGAWCRELFLRGSHLTSWSNAPRSPPTLCWLPSLVAPTGFLTAVMQTTARGESWPIDTLCWEFTVMPLEETAFVRPPRDGGVYIRGLFLEGASWFRKDGHLQEPLPMQLVFPMTPIHFKPIRATGRRLRNRYVCPCYYYPLRMGAFVVAVDLHSGKESSDFWVKRGTALLCTLAT-