New model in OGS2.0 | DPOGS207000  |
---|---|
Genomic Position | scaffold1:+ 534836-561233 |
CDS Length (bp) | 3369 |
Paired RNAseq reads   | 2179 |
Single RNAseq reads   | 5100 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA012919 (3e-113) |
Best Drosophila hit   | myosin heavy chain-like, isoform B (3e-145) |
Best Human hit | myosin-XVIIIa isoform b (6e-104) |
Best NR hit (blastp)   | PREDICTED: similar to CG31045-PA [Nasonia vitripennis] (0.0) |
Best NR hit (blastx)   | conserved hypothetical protein [Culex quinquefasciatus] (8e-163) |
GeneOntology terms    | GO:0016459 myosin complex; GO:0042623 ATPase activity, coupled; GO:0005515 protein binding; GO:0005524 ATP binding; GO:0003774 motor activity |
InterPro families   | IPR001609 Myosin head, motor domain |
Orthology group | MCL10441 |
Nucleotide sequence:
ATGTTTCACATCCAAGGGACATTTGAGCAGTCGTTACGCTGTGGCGACGATTTGTCACCC
AGTGGTAATAAGACCTTGGCGGCTCTTGCAAAAATTCACAAACACTGCACGGGCAAATCA
CACGCTCATGGTTGTGTGGATTCCAAACACAATCCTCTTGCCAAATCCGACTCGAACAAC
AGCTTGCAATCCAAAAAATCCTACCGCAGCGATTGCATCATAGAAATTAAAGAAGCTGTT
AAAAACGACAATTGCTCATCGAAAAACAACGGGAACACGAAAACATTCAGTAATAGTACC
AGCTCTTATGATGGTCACAGCGACACTGACTCAGAAAAAACTTTTAAGTTCCGAAATGAA
AACAAGCCGGCCGACCCCTGTGAACGTTTGTTCTCCCAAAACACAAAAGCCTCTCTCATG
AAAACAGCCCGCATGCGATATGCTGATGATAGTGTAACACCTCCAACTAATGATGAAGTT
GATGGGGACAAGTGGAAAAATAATTTTACCAGGACTTCATCACGTGGAAGCTATAAGGAA
AGGAAATCAGCTGTTGTAACAATTGAGGATATTAAATCTGACGGAATAAAGAACACTGAT
GATTTTGGCCAGTTAGAACATCTAGATGAGAAGAAAAAAGTACAAACTCATCCAAAAAAT
ACTGTAACACGTGTAGGTTCTTCTAATTCCATTACCAATGGAAAGACCAGTTCGAACGGA
ACCAAACCACCGTGGCGCGGCGCTAGCAAAAACGGAATACCAAATGAGACATTTGTCCGA
ACCCCATCATTACGAGGCAGTATAAGAAAGAAAAAGAACGACGGTGTTCCTTGGCGTAAG
CTGTATGAATACTCGCTGTGGAGAAAGTGTGGCGTCAATTTTGCCTCAAATGGAGCTGAC
TCGGATCGTGCCCTCCTTAGACCCTGGAGCGAGTTTTCCTCTCAATCGCCGCCACCATAC
GAGGCAAAGTCCGAAGAGCAGCTCGCATGCGAGAAAGAATGGCTACAAGCCGGTCAGATG
TGGCTGGCCCATCGTGGCGGGTTCACAGCTGTAGTGAGAGAGGGAGATGCTGAACCTGGC
CGGGCTAAAGTCAGGGTACTACAGACGGGAGAGATTATCACAGTCGATGAAGACGATCTA
GAGAAGGCTAATCCCCCTCAATTAGAGCGATGCGAGGACATTGCATCGTTGCGTTGCTTG
AACGAGTGCGGTGCGTTGCACGTGCTGCGTTCGCGTTATGCCGCTGCGTTGCCGCACGCT
CGTGCCGGACATGCTCTACTTGTGCTTGGACCACCCCGCCGTTCTACTCCCGTTTATACA
GAAAAGGTGGCGGCTATGTTCCGCAGCTGCCGCACCGACGACATGCCGCCGCACGTTTTT
GCGGCAGCTCAATCAGCACACCGCGCCATGTTGGCCGCGAGAAGAGACAGGGCTATTGTG
TTCCTAGGAAGGTCCGGTTCTGGTAAGACCTCTGCTATGCGTCACTGTGTTTGGTATCTG
GCTACAGCATATCCAGCGCAAGGCTCCAAACTCACACCGGAGAGGCTGGAAGCTGCTCTG
GATGTACTGCACATATTCGGCTCCAGTCGAAGCGCCTCCAATCTCCACGCCTCTCGTTTC
GTTTCTCTTACGTCGCTGGACTTCGACGGTGCGGGTGCGTTAGTGTCAGCTTCAGTACAA
GCGCTCCTGCCAGACCTCAGACCTGATCAGTCTCCACTGAGGGCGTTACACACATTGTTC
CACGGCTGTGATGCTCGTCTTCGTCGCGAGCTTCTGTTGGACCAGGCTCCAGTGAACGCT
CCCAATCCGTACATCAACTCCTCTGATAAGACTGACGCTCCGACCGAATTCGTTGCTTTG
AAGGAGGCGTTGCAACTTTTATCAGTCACCGAACAAGAGCAATTGGCCGTGTGGAAGATT
ATAGCTGCTATATGTCACTTGGGATGGGCCGGAGTTGCTAGAGCTAACTCTGGTTCCGGT
GTGAGGTATCAGTTCGCTAGTACAGGTTGTGCGGGTCGAGCTGGTCGTCTACTTGGCGTC
ACAGCTGAGGAACTCGCCAAGGCTTGCTTCGCCCCCGCCAGCCCGCCCTCACCGCAGCCG
CCTACTGGATTGAGAACTCCATCGCCTTCGAACGAGAAGGAAGCGCCAGGATCGGACGCT
CTCGATGCTTTCGCCACCGGACTGTATAGCGAAGTATTCAACGTTATCGTAGGATTCGTC
AACAGGAGTACGTCGACATCAAGCCGCACTTCCAGCTCTTTGCTACTGCTGGATTCTCCA
GGTGCTGATAACCCTATGTGTTCCGGACAACAAAGCGGCGCCGGGCTGACCAAGCTACTG
TCCAATTATATGCAGGAACGTCTACAAGCCGTGTTCCACGAGGCCATGCTGGTTGCTCCA
CGCGAACGGTACGCGCAAGAAGGAGTCACCATAGACGACGGTAACGACGAAGATTGGCTA
TCAGAGTCTGTCAACCCGGGGCCGATGGTGGATTTGTTGGACAAGTCACCTCAGAACACA
ATAGTCAGAAGTTCTCAGGCCGATCTGAGAGATTGCGACCGGCGGGGACTTCTCTGGTTG
TTGGACGAGGAATCTATGTACCCGGGTTCTTCTGACGATACTTTCTTAGAGCGCGTCATG
TCCCAGTACGGCGCCCCCCATCATACACACTACCTAATAAAGAAGGCTCCTCACAATAGA
CAATTCATACTTCAGCACTTACAGGGCACTAATCCTGTTTTATATGATGTCTCCGGGTGG
GTGAAGGCCAGCCGCGAGAATCCAGCTATGAAGAGAGCTCATACTTTGCTTCAGGAAAGC
CAAAATCCCTTGTTGTCGTCGTGTGAACGTACGACTGACGCTGGCACGCTGCGAAGAGCC
GCATCCGTACGACGAGCGCTGGCTTCTGGCACCGCTGGTATGAAGCGTCGGTCAACAGCT
CTCCAAGCCAAGTTCATAGCTGACGGGGTCGTGGATACCTTACGCCGCTGCGGGTCAGGC
GGGATACAGTTTGTTTGCTGTCTCCTCACCAACCAACCTAACGAGACCCCCGACGATGTA
AACGTACCGCTATTAAGATCCCAGTTCCGCGGCTTCCAATTACTAGACGCGGCTAGATTG
TATAAACAAGGATTCCCTGAACATATGCCATTATCGGAATTCGCTAGGAGATATCGACTA
TTGGCAACGTCGGAGAAGGAGGAATCTGAAACATCCCAGCAGTCAACGACGTTATCGGAC
AGACAGATCGTGGACGAGATGCTGTTGGTGCTAGACCTAGATGTGACGAGCTACAGACTC
GGACCCACTCAGGCCGTGCCGGAGAGCTCGCCCGGCGCTCGCGCCAGTCTCTCACCGTAT
CTCGACTAA
Protein sequence:
MFHIQGTFEQSLRCGDDLSPSGNKTLAALAKIHKHCTGKSHAHGCVDSKHNPLAKSDSNN
SLQSKKSYRSDCIIEIKEAVKNDNCSSKNNGNTKTFSNSTSSYDGHSDTDSEKTFKFRNE
NKPADPCERLFSQNTKASLMKTARMRYADDSVTPPTNDEVDGDKWKNNFTRTSSRGSYKE
RKSAVVTIEDIKSDGIKNTDDFGQLEHLDEKKKVQTHPKNTVTRVGSSNSITNGKTSSNG
TKPPWRGASKNGIPNETFVRTPSLRGSIRKKKNDGVPWRKLYEYSLWRKCGVNFASNGAD
SDRALLRPWSEFSSQSPPPYEAKSEEQLACEKEWLQAGQMWLAHRGGFTAVVREGDAEPG
RAKVRVLQTGEIITVDEDDLEKANPPQLERCEDIASLRCLNECGALHVLRSRYAAALPHA
RAGHALLVLGPPRRSTPVYTEKVAAMFRSCRTDDMPPHVFAAAQSAHRAMLAARRDRAIV
FLGRSGSGKTSAMRHCVWYLATAYPAQGSKLTPERLEAALDVLHIFGSSRSASNLHASRF
VSLTSLDFDGAGALVSASVQALLPDLRPDQSPLRALHTLFHGCDARLRRELLLDQAPVNA
PNPYINSSDKTDAPTEFVALKEALQLLSVTEQEQLAVWKIIAAICHLGWAGVARANSGSG
VRYQFASTGCAGRAGRLLGVTAEELAKACFAPASPPSPQPPTGLRTPSPSNEKEAPGSDA
LDAFATGLYSEVFNVIVGFVNRSTSTSSRTSSSLLLLDSPGADNPMCSGQQSGAGLTKLL
SNYMQERLQAVFHEAMLVAPRERYAQEGVTIDDGNDEDWLSESVNPGPMVDLLDKSPQNT
IVRSSQADLRDCDRRGLLWLLDEESMYPGSSDDTFLERVMSQYGAPHHTHYLIKKAPHNR
QFILQHLQGTNPVLYDVSGWVKASRENPAMKRAHTLLQESQNPLLSSCERTTDAGTLRRA
ASVRRALASGTAGMKRRSTALQAKFIADGVVDTLRRCGSGGIQFVCCLLTNQPNETPDDV
NVPLLRSQFRGFQLLDAARLYKQGFPEHMPLSEFARRYRLLATSEKEESETSQQSTTLSD
RQIVDEMLLVLDLDVTSYRLGPTQAVPESSPGARASLSPYLD
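
The CDS length reported in the table and the two sequences above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code and a CDS that includes its terminal stop codon; the function names `expected_protein_length` and `has_complete_orf_ends` are hypothetical helpers introduced here for illustration and are not part of the database.

```python
# Stop codons under the standard genetic code (an assumption for this record).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def expected_protein_length(cds_length: int) -> int:
    """Return the number of residues encoded by a CDS of the given length.

    Assumes the CDS length counts the terminal stop codon, which encodes
    no residue.
    """
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


def has_complete_orf_ends(cds: str) -> bool:
    """Check that a CDS is in frame, starts with ATG, and ends with a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    return len(cds) % 3 == 0 and cds.startswith("ATG") and cds[-3:] in STOP_CODONS


# CDS length taken from the table above.
print(expected_protein_length(3369))

# The first and last nine bases of the nucleotide sequence above, used here
# only to illustrate the start and stop codon check.
print(has_complete_orf_ends("ATGTTTCAC" + "CTCGACTAA"))
```

For this record the listed nucleotide sequence begins with ATG and ends with TAA, and a 3369-base CDS corresponds to 1122 residues once the stop codon is excluded.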