DPGLEAN15547 in OGS1.0

New model in OGS2.0DPOGS207000 
Genomic Positionscaffold1:+ 534836-561233
See gene structure
CDS Length3369
Paired RNAseq reads  2179
Single RNAseq reads  5100
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012919 (3e-113)
Best Drosophila hit  myosin heavy chain-like, isoform B (3e-145)
Best Human hitmyosin-XVIIIa isoform b (6e-104)
Best NR hit (blastp)  PREDICTED: similar to CG31045-PA [Nasonia vitripennis] (0.0)
Best NR hit (blastx)  conserved hypothetical protein [Culex quinquefasciatus] (8e-163)
GeneOntology terms



  
GO:0016459 myosin complex
GO:0042623 ATPase activity, coupled
GO:0005515 protein binding
GO:0005524 ATP binding
GO:0003774 motor activity
InterPro families  IPR001609 Myosin head, motor domain
Orthology groupMCL10441

Nucleotide sequence:

ATGTTTCACATCCAAGGGACATTTGAGCAGTCGTTACGCTGTGGCGACGATTTGTCACCC
AGTGGTAATAAGACCTTGGCGGCTCTTGCAAAAATTCACAAACACTGCACGGGCAAATCA
CACGCTCATGGTTGTGTGGATTCCAAACACAATCCTCTTGCCAAATCCGACTCGAACAAC
AGCTTGCAATCCAAAAAATCCTACCGCAGCGATTGCATCATAGAAATTAAAGAAGCTGTT
AAAAACGACAATTGCTCATCGAAAAACAACGGGAACACGAAAACATTCAGTAATAGTACC
AGCTCTTATGATGGTCACAGCGACACTGACTCAGAAAAAACTTTTAAGTTCCGAAATGAA
AACAAGCCGGCCGACCCCTGTGAACGTTTGTTCTCCCAAAACACAAAAGCCTCTCTCATG
AAAACAGCCCGCATGCGATATGCTGATGATAGTGTAACACCTCCAACTAATGATGAAGTT
GATGGGGACAAGTGGAAAAATAATTTTACCAGGACTTCATCACGTGGAAGCTATAAGGAA
AGGAAATCAGCTGTTGTAACAATTGAGGATATTAAATCTGACGGAATAAAGAACACTGAT
GATTTTGGCCAGTTAGAACATCTAGATGAGAAGAAAAAAGTACAAACTCATCCAAAAAAT
ACTGTAACACGTGTAGGTTCTTCTAATTCCATTACCAATGGAAAGACCAGTTCGAACGGA
ACCAAACCACCGTGGCGCGGCGCTAGCAAAAACGGAATACCAAATGAGACATTTGTCCGA
ACCCCATCATTACGAGGCAGTATAAGAAAGAAAAAGAACGACGGTGTTCCTTGGCGTAAG
CTGTATGAATACTCGCTGTGGAGAAAGTGTGGCGTCAATTTTGCCTCAAATGGAGCTGAC
TCGGATCGTGCCCTCCTTAGACCCTGGAGCGAGTTTTCCTCTCAATCGCCGCCACCATAC
GAGGCAAAGTCCGAAGAGCAGCTCGCATGCGAGAAAGAATGGCTACAAGCCGGTCAGATG
TGGCTGGCCCATCGTGGCGGGTTCACAGCTGTAGTGAGAGAGGGAGATGCTGAACCTGGC
CGGGCTAAAGTCAGGGTACTACAGACGGGAGAGATTATCACAGTCGATGAAGACGATCTA
GAGAAGGCTAATCCCCCTCAATTAGAGCGATGCGAGGACATTGCATCGTTGCGTTGCTTG
AACGAGTGCGGTGCGTTGCACGTGCTGCGTTCGCGTTATGCCGCTGCGTTGCCGCACGCT
CGTGCCGGACATGCTCTACTTGTGCTTGGACCACCCCGCCGTTCTACTCCCGTTTATACA
GAAAAGGTGGCGGCTATGTTCCGCAGCTGCCGCACCGACGACATGCCGCCGCACGTTTTT
GCGGCAGCTCAATCAGCACACCGCGCCATGTTGGCCGCGAGAAGAGACAGGGCTATTGTG
TTCCTAGGAAGGTCCGGTTCTGGTAAGACCTCTGCTATGCGTCACTGTGTTTGGTATCTG
GCTACAGCATATCCAGCGCAAGGCTCCAAACTCACACCGGAGAGGCTGGAAGCTGCTCTG
GATGTACTGCACATATTCGGCTCCAGTCGAAGCGCCTCCAATCTCCACGCCTCTCGTTTC
GTTTCTCTTACGTCGCTGGACTTCGACGGTGCGGGTGCGTTAGTGTCAGCTTCAGTACAA
GCGCTCCTGCCAGACCTCAGACCTGATCAGTCTCCACTGAGGGCGTTACACACATTGTTC
CACGGCTGTGATGCTCGTCTTCGTCGCGAGCTTCTGTTGGACCAGGCTCCAGTGAACGCT
CCCAATCCGTACATCAACTCCTCTGATAAGACTGACGCTCCGACCGAATTCGTTGCTTTG
AAGGAGGCGTTGCAACTTTTATCAGTCACCGAACAAGAGCAATTGGCCGTGTGGAAGATT
ATAGCTGCTATATGTCACTTGGGATGGGCCGGAGTTGCTAGAGCTAACTCTGGTTCCGGT
GTGAGGTATCAGTTCGCTAGTACAGGTTGTGCGGGTCGAGCTGGTCGTCTACTTGGCGTC
ACAGCTGAGGAACTCGCCAAGGCTTGCTTCGCCCCCGCCAGCCCGCCCTCACCGCAGCCG
CCTACTGGATTGAGAACTCCATCGCCTTCGAACGAGAAGGAAGCGCCAGGATCGGACGCT
CTCGATGCTTTCGCCACCGGACTGTATAGCGAAGTATTCAACGTTATCGTAGGATTCGTC
AACAGGAGTACGTCGACATCAAGCCGCACTTCCAGCTCTTTGCTACTGCTGGATTCTCCA
GGTGCTGATAACCCTATGTGTTCCGGACAACAAAGCGGCGCCGGGCTGACCAAGCTACTG
TCCAATTATATGCAGGAACGTCTACAAGCCGTGTTCCACGAGGCCATGCTGGTTGCTCCA
CGCGAACGGTACGCGCAAGAAGGAGTCACCATAGACGACGGTAACGACGAAGATTGGCTA
TCAGAGTCTGTCAACCCGGGGCCGATGGTGGATTTGTTGGACAAGTCACCTCAGAACACA
ATAGTCAGAAGTTCTCAGGCCGATCTGAGAGATTGCGACCGGCGGGGACTTCTCTGGTTG
TTGGACGAGGAATCTATGTACCCGGGTTCTTCTGACGATACTTTCTTAGAGCGCGTCATG
TCCCAGTACGGCGCCCCCCATCATACACACTACCTAATAAAGAAGGCTCCTCACAATAGA
CAATTCATACTTCAGCACTTACAGGGCACTAATCCTGTTTTATATGATGTCTCCGGGTGG
GTGAAGGCCAGCCGCGAGAATCCAGCTATGAAGAGAGCTCATACTTTGCTTCAGGAAAGC
CAAAATCCCTTGTTGTCGTCGTGTGAACGTACGACTGACGCTGGCACGCTGCGAAGAGCC
GCATCCGTACGACGAGCGCTGGCTTCTGGCACCGCTGGTATGAAGCGTCGGTCAACAGCT
CTCCAAGCCAAGTTCATAGCTGACGGGGTCGTGGATACCTTACGCCGCTGCGGGTCAGGC
GGGATACAGTTTGTTTGCTGTCTCCTCACCAACCAACCTAACGAGACCCCCGACGATGTA
AACGTACCGCTATTAAGATCCCAGTTCCGCGGCTTCCAATTACTAGACGCGGCTAGATTG
TATAAACAAGGATTCCCTGAACATATGCCATTATCGGAATTCGCTAGGAGATATCGACTA
TTGGCAACGTCGGAGAAGGAGGAATCTGAAACATCCCAGCAGTCAACGACGTTATCGGAC
AGACAGATCGTGGACGAGATGCTGTTGGTGCTAGACCTAGATGTGACGAGCTACAGACTC
GGACCCACTCAGGCCGTGCCGGAGAGCTCGCCCGGCGCTCGCGCCAGTCTCTCACCGTAT
CTCGACTAA

Protein sequence:

MFHIQGTFEQSLRCGDDLSPSGNKTLAALAKIHKHCTGKSHAHGCVDSKHNPLAKSDSNN
SLQSKKSYRSDCIIEIKEAVKNDNCSSKNNGNTKTFSNSTSSYDGHSDTDSEKTFKFRNE
NKPADPCERLFSQNTKASLMKTARMRYADDSVTPPTNDEVDGDKWKNNFTRTSSRGSYKE
RKSAVVTIEDIKSDGIKNTDDFGQLEHLDEKKKVQTHPKNTVTRVGSSNSITNGKTSSNG
TKPPWRGASKNGIPNETFVRTPSLRGSIRKKKNDGVPWRKLYEYSLWRKCGVNFASNGAD
SDRALLRPWSEFSSQSPPPYEAKSEEQLACEKEWLQAGQMWLAHRGGFTAVVREGDAEPG
RAKVRVLQTGEIITVDEDDLEKANPPQLERCEDIASLRCLNECGALHVLRSRYAAALPHA
RAGHALLVLGPPRRSTPVYTEKVAAMFRSCRTDDMPPHVFAAAQSAHRAMLAARRDRAIV
FLGRSGSGKTSAMRHCVWYLATAYPAQGSKLTPERLEAALDVLHIFGSSRSASNLHASRF
VSLTSLDFDGAGALVSASVQALLPDLRPDQSPLRALHTLFHGCDARLRRELLLDQAPVNA
PNPYINSSDKTDAPTEFVALKEALQLLSVTEQEQLAVWKIIAAICHLGWAGVARANSGSG
VRYQFASTGCAGRAGRLLGVTAEELAKACFAPASPPSPQPPTGLRTPSPSNEKEAPGSDA
LDAFATGLYSEVFNVIVGFVNRSTSTSSRTSSSLLLLDSPGADNPMCSGQQSGAGLTKLL
SNYMQERLQAVFHEAMLVAPRERYAQEGVTIDDGNDEDWLSESVNPGPMVDLLDKSPQNT
IVRSSQADLRDCDRRGLLWLLDEESMYPGSSDDTFLERVMSQYGAPHHTHYLIKKAPHNR
QFILQHLQGTNPVLYDVSGWVKASRENPAMKRAHTLLQESQNPLLSSCERTTDAGTLRRA
ASVRRALASGTAGMKRRSTALQAKFIADGVVDTLRRCGSGGIQFVCCLLTNQPNETPDDV
NVPLLRSQFRGFQLLDAARLYKQGFPEHMPLSEFARRYRLLATSEKEESETSQQSTTLSD
RQIVDEMLLVLDLDVTSYRLGPTQAVPESSPGARASLSPYLD