New model in OGS2.0 | DPOGS202157  |
---|---|
Genomic Position | scaffold981:+ 33865-39801 |
See gene structure | |
CDS Length | 1476 |
Paired RNAseq reads   | 720 |
Single RNAseq reads   | 2053 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003431 (2e-93) |
Best Drosophila hit   | WASp, isoform B (8e-50) |
Best Human hit | wiskott-Aldrich syndrome protein (1e-46) |
Best NR hit (blastp)   | Neural Wiskott-Aldrich syndrome protein, putative [Pediculus humanus corporis] (4e-74) |
Best NR hit (blastx)   | PREDICTED: similar to Wiskott-Aldrich syndrome [Nasonia vitripennis] (1e-62) |
GeneOntology terms    | GO:0003779 actin binding GO:0007423 sensory organ development GO:0007413 axonal fasciculation GO:0008407 bristle morphogenesis GO:0007409 axonogenesis GO:0005515 protein binding GO:0016028 rhabdomere GO:0005902 microvillus GO:0005083 small GTPase regulator activity GO:0015629 actin cytoskeleton GO:0045886 negative regulation of synaptic growth at neuromuscular junction GO:0008356 asymmetric cell division GO:0045165 cell fate commitment GO:0009913 epidermal cell differentiation GO:0035017 cuticle pattern formation GO:0035212 cell competition in a multicellular organism GO:0007520 myoblast fusion GO:0034314 Arp2/3 complex-mediated actin nucleation GO:0016476 regulation of embryonic cell shape GO:0030833 regulation of actin filament polymerization GO:0003383 apical constriction |
InterPro families    | IPR000095 PAK-box/P21-Rho-binding IPR000697 EVH1 IPR003124 Actin-binding WH2 IPR011993 Pleckstrin homology-type IPR011026 Wiscott-Aldrich syndrome, C-terminal |
Orthology group | MCL11844 |
Nucleotide sequence:
ATGCCAAAGGGAGAGAACAGGCCGAGCGTCCTGTTGACTCCGGAAGAGAACGATCTGGTG
TTTAGCCTCATCGGAGCTAAATGTCAGAGTCTAGCGACAGCTGTAGTACAATTATTCACT
ACCGAGGGACCGGATCATTCAGAATGGAAGAAGAAAGACACGGGGGTGCTGTGCCTTATA
AAAGATAATAGCAAACGTTCATACTTCTTCCGGATCTACTGCCTCTATCGGAGGTCGTTG
ATTTGGGAACATGAAGTCTATCTGCAGATTGAATACAAAAATCCCAGACCGTATTTACAT
ACGTTTGAAGCCGAGGAATACATGACGGCATTTAATTTCGCAAATGAAATGGAGGCGACG
GTGCTAAGGAATATTCTTTTAGAGAAAATTGAACTGCGTAAACAAAGACGGCAAGTTCGT
AACAATCGTTCGATGATGGTCCCCCGTAATAACTCGACGGTTCATGAGTCTTCGTCGCGG
TACAACGGCGCCCCTCCCCCGCCGCCGCTCACCACCACCACCGCCACCACCAATACTAAG
ACTAACACCCTCAATTCCTTGAAAGGCTCGGGGAGGAAACCGAAAGCGCGCAAACTGACC
AAGGCTGACATCGGCATGCCGAAGGACTTCAAGCACGTGTCACACGTCGGATGGGACGCC
AACAAAGGGTTCGACGTGGATCTGCCGGAGGAGAAGCTCCGCTGGTTCTTCGACAAGGCG
GGCGTGTCGGAGACGCAGCTCAACGACCAGGAGACGAGGATGTTCATATACGACTTCATC
ATCAAGAACGGCGGAGCGGACGCGGTCAACGAGGACCTCACGGACGAACCGCCGCCGCCA
TACTCGGAGTCCCGGAGCCCCGCGCCGCCTGTCCCCGCCCGCGCCCCGCACCCCCCCGCG
CCCCCGTCTCGTGCTCCGCCCCCGCCGCCGGCGCGGTCCGTACCCCCTCCGCCGCCGCCG
CCAGCGACCCTCGCGCCGCGGAACCCTCCGCCGCCCAGACCGACACAACCCCCGGCCCCG
GCGCCGCCGTCCATGCCTCCCCCTCCTCCCCCGCCGTCGCTGGCTCCGCCTCCGCCGCCG
CCTCCCCCCGCACCGCCCGCGCCTCCCCCGCCGAGCTCCGAGGACAGGTCAGAGCTGCCC
GCCGCCAACAGTGACCCGCGGGCCGCTCTCATGGCGAGCATACGGAGCGGCAACAAGAAC
TTGAGGCCCGTGGATTCCGTATCTAAGTCTTCGGCCAGCACGGACGACAGCAGGAACAAC
TTATTGAGCGAAATCCGTCAAGGGATCACATTGAAATCGGTGCGGCGGGAGAGTGTCACC
GCGGGCGACGAGAAGACTACCAACAACGTAGAAAACGCGAGTGGCCTCGCCGGCGCGCTG
GCCCGGGCGCTCAAGGAGCGGGCGAGGGCGATACACTCCTCGGACGACGAAGACGACACC
GACAACACCACCAGCGACGGAGAGTGGGACTTCTAG
Protein sequence:
MPKGENRPSVLLTPEENDLVFSLIGAKCQSLATAVVQLFTTEGPDHSEWKKKDTGVLCLI
KDNSKRSYFFRIYCLYRRSLIWEHEVYLQIEYKNPRPYLHTFEAEEYMTAFNFANEMEAT
VLRNILLEKIELRKQRRQVRNNRSMMVPRNNSTVHESSSRYNGAPPPPPLTTTTATTNTK
TNTLNSLKGSGRKPKARKLTKADIGMPKDFKHVSHVGWDANKGFDVDLPEEKLRWFFDKA
GVSETQLNDQETRMFIYDFIIKNGGADAVNEDLTDEPPPPYSESRSPAPPVPARAPHPPA
PPSRAPPPPPARSVPPPPPPPATLAPRNPPPPRPTQPPAPAPPSMPPPPPPPSLAPPPPP
PPPAPPAPPPPSSEDRSELPAANSDPRAALMASIRSGNKNLRPVDSVSKSSASTDDSRNN
LLSEIRQGITLKSVRRESVTAGDEKTTNNVENASGLAGALARALKERARAIHSSDDEDDT
DNTTSDGEWDF