New model in OGS2.0 | DPOGS206998  |
---|---|
Genomic Position | scaffold1:+ 514497-520683 |
See gene structure | |
CDS Length | 1578 |
Paired RNAseq reads   | 193 |
Single RNAseq reads   | 604 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012918 (1e-131) |
Best Drosophila hit   | myosin heavy chain-like, isoform B (1e-34) |
Best Human hit | myosin-XVIIIa isoform b (6e-11) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC015276 [Tribolium castaneum] (3e-95) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC015276 [Tribolium castaneum] (1e-42) |
GeneOntology terms    | GO:0016459 myosin complex GO:0042623 ATPase activity, coupled GO:0005515 protein binding GO:0005524 ATP binding GO:0003774 motor activity |
InterPro families   | IPR001478 PDZ/DHR/GLGF |
Orthology group | MCL19686 |
Nucleotide sequence:
ATGTTCAATTTTATGAAGAAAACATCCGTTCTAACTAGCGACTCGGGAAGTTTCGAGGAG
AGAGATGGCGATAAAGAGCGTAGGAAAAAAGAGAAGAAGGAGAGGAAGGAGAGAGAAAAA
AGGGAGAGAATAGCGGCTGGGCTTGAAGAGCCTTTGAGATTAGAAGAGGTTCGAAGATCA
TTGAAATTACGAGGACGTCGCAAGGAGAAGGAAAAGTTACCGTCGGGTATTACAGCCGAC
TATACCGCCTCGCTTTTCGCCCATCTCGAAAAAGATACCAACTATACAAATTATAAGGTT
ATCTCGAATTCAGGTTCCAATTCGAACCTCAGCGACGAAAACAATCATTTGTCTCCGGGA
TACCCGAACCATAATTGGAATAAGAAAGAAGGCGTGCTGCAATCTGATAGTTCAGAGACG
TCGCTAAATTCGTTAAACAATCCGAATAATGTTAACATCAGCCCCAGACAGGCACCGAAC
TTACCACCCATACCACCGCGTCCACCGAAACGAGGAATCCTGAAGGGGCCACGTCTCAGC
AACACTAGCTCCGTCTCCCAAGAAAGCAATGTTCAAAACACCGATACGGTTGATTATATG
AACGGACAGGATCCAAATCTTCTCGCACGGAACACTCAACAAAACGAACTCATCTCGTAC
GCTATCCAACCCTCTAAGAGTACATCCTCTAGTGATGATATGCAACAGATACAGAATTTC
ACCAAAAAAGTCATACACAGCCCAGTTGAGAATCAATACAAAAATTATAAAAACAACTCC
AATCCATCGATCCGGACTCACGATGTAGACGAAGCGACAAAGACGAACGGGAACAGTTAT
CACGGGGTGACGTCTACCTCACCGAGTGCTGATTCATTAACTGATACCACGACAAACTCT
TCGTTCGCTACACCTCCATTTTCAACGTCCCCAGTCGGTGAATCTCAAGGTTTCCATAGA
TGGTCAAGGACGAGTACCTTCGACGATGTTTACTTGCCTCTTCCGTCTTTGTCTCCTCTT
CATTTGCCGAAGCCGAGACTGTTAACCATCCAACGCCAAAAGGCACCCAGGAATGATTTC
GGGTTTAGTCTTCGAAGAGCTATGATTCAAGAGAGAGTCTTCATTGGTGATATAAGAACT
CTCGCTGGTCACGAAGAGAAGGTTATAGCCAATGGTGATAATAAATACAACGACAGCAAT
GTGGTTATCAGCAAGCTGACGTATAAAGCTGTGATACTCGCTGAACCCGGCTCCTACCCA
GGTGCCAGTGAGACCGGCTTGTTACCAGGCGATAGACTCATCGAAGTCAATGACGTTAAC
GTAGAAGGGAGGAGTAGGGAAGAAGTGATTGATCTGATAAAAAGTAGCCAGGACTCTGTT
ACGGTTAAGGTGCAACCTATAGCTGAACTATGCGAACTATCGAGCCGCAGGGCTGCGGAC
GGCGGGGCGCAAGTTGAGTTATCAGAGAGCAATGTTAGAGGTGGAACGCTCAGTCGATCT
GGCAGTCGAAGATTCACTAGCACACAGGTTAGTTACATCACAGAGAACTCTGAGACTATA
CGATTATTACAAACATAA
Protein sequence:
MFNFMKKTSVLTSDSGSFEERDGDKERRKKEKKERKEREKRERIAAGLEEPLRLEEVRRS
LKLRGRRKEKEKLPSGITADYTASLFAHLEKDTNYTNYKVISNSGSNSNLSDENNHLSPG
YPNHNWNKKEGVLQSDSSETSLNSLNNPNNVNISPRQAPNLPPIPPRPPKRGILKGPRLS
NTSSVSQESNVQNTDTVDYMNGQDPNLLARNTQQNELISYAIQPSKSTSSSDDMQQIQNF
TKKVIHSPVENQYKNYKNNSNPSIRTHDVDEATKTNGNSYHGVTSTSPSADSLTDTTTNS
SFATPPFSTSPVGESQGFHRWSRTSTFDDVYLPLPSLSPLHLPKPRLLTIQRQKAPRNDF
GFSLRRAMIQERVFIGDIRTLAGHEEKVIANGDNKYNDSNVVISKLTYKAVILAEPGSYP
GASETGLLPGDRLIEVNDVNVEGRSREEVIDLIKSSQDSVTVKVQPIAELCELSSRRAAD
GGAQVELSESNVRGGTLSRSGSRRFTSTQVSYITENSETIRLLQT