DPGLEAN13191 in OGS1.0

New model in OGS2.0  DPOGS200827
Genomic Position  scaffold5537:+ 509-6899
CDS Length  1368
Paired RNAseq reads  1008
Single RNAseq reads  3227
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA009881 (0.0)
Best Drosophila hit  CG32138, isoform B (2e-100)
Best Human hit  formin-like protein 2 (4e-118)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC012762 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  AGAP004805-PB [Anopheles gambiae str. PEST] (9e-169)
GeneOntology terms
GO:0005737 cytoplasm
GO:0005575 cellular_component
GO:0003779 actin binding
GO:0017048 Rho GTPase binding
GO:0030036 actin cytoskeleton organization
GO:0016043 cellular component organization
GO:0003674 molecular_function
GO:0008150 biological_process
InterPro families
IPR010472 Diaphanous FH3
IPR010473 Diaphanous GTPase-binding
IPR014768 GTPase-binding/formin homology 3
IPR016024 Armadillo-type fold
Orthology group  MCL10661

Nucleotide sequence:

GCGTCCATGGACCTGCCGCCAGACAAGGCTAAGCTGCTGCGGAACTATGACTTGGAAAAG
AAATGGGAGATCATATGCGACCAAGACATGGTGCAGGCGAAGGACTCGCCCGCCCACTAT
CTCAACAAACTGAGGACCTACCTTGACCCTAAGGCGTCCAGGAGTCACAGAAAGAGAAAG
ATGGTCGGTGACTCCACGTCGACGCAGGTTCTTAGGGATTTAGAAATATCACTGCGAACC
AATCACATCGAATGGGTCCGTGAGTTCCTGAACGATCAGAATCAAGGTCTGGACGTGTTG
ATCGACTACCTCAGCTTCAGACTGAGCATGATGAGGCACGAACAGCGAATAGCACTCGCG
AGGAGCCACTCCACAGACGCCATCAACCAAGCGAACACGACGACTTCAGAGTGCAGCGGG
CCGGAGATGGGTGCGGGCTCCACGTGGCGGCGGAGGGCGAGGTCCGCGGACTCGGAGGGC
GAAGGTCCGGGGGCGGGGTCCCCGGCCGCGGCCAGACGGAGGACCAGGCATGCGGCGAGG
CTCAACATGGGCGCCTCCACTGATGATATACACGTCTGCATCATGTGCATGAGGGCCATC
ATGAACAATAAGTATGGCTTCAACATGGTGATCCAACATCGCGAGGCTATCAACAGCATA
GCCTTGTCCCTGGTACATCACTCGCTCAGAACGAAGGCGCTGGTGTTGGAACTGTTAGCG
GCCATCTGCCTGGTGAAGGGCGGTCATCAGATCATTCTCTCCGCCTTCGATAACTTCAAG
GAGGTGGTCGGTGAGCCGAGGAGGTTCCACACTCTCATGGAGTACTTCATGAACTATGAC
AGCTTCCATATTGAGTTCATGGTGGCGTGTATGCAGTTCGTCAACATAATAGTACATTCA
GTAGAGGACATGAACTTCCGGGTGCACCTCCAGTACGAGTTCACGGCGCTCAAACTAGAC
GACTACCTCGAGAGGCTGAGGCTCTGTGAGAGCGAAGACTTACAGGTCCAAATATCAGCG
TACCTCGACAACGTGTTCGACGTGGCGGCTCTCATGGAGGACAGCGAGACGAAGACGGCG
GCCTTGGAGAAGGTCAATGAACTGGAGGATGAATTGGGACATGCTCACGAGCGGCTGGCG
TCCTTGGAGAGAGAAGCCATCGCTAAACAAGCAACGCTGGAGGCGGAACTAGCGCAAGTT
AGACACGAGAGAGACCAGCTCGCTGAAGCACGGAGGCAGGTCGTGGAGGAGGTGTCGACT
CTGAGGAGAGCTCAGCAGGACTCGAGGAACAGGCAGTCGATGTTGGAGTCGAAGGTGCAG
GAACTGGAATCGCTGACCAAGTCACTACCACGAGGAGCCTCCAGTTAG

Protein sequence:

ASMDLPPDKAKLLRNYDLEKKWEIICDQDMVQAKDSPAHYLNKLRTYLDPKASRSHRKRK
MVGDSTSTQVLRDLEISLRTNHIEWVREFLNDQNQGLDVLIDYLSFRLSMMRHEQRIALA
RSHSTDAINQANTTTSECSGPEMGAGSTWRRRARSADSEGEGPGAGSPAAARRRTRHAAR
LNMGASTDDIHVCIMCMRAIMNNKYGFNMVIQHREAINSIALSLVHHSLRTKALVLELLA
AICLVKGGHQIILSAFDNFKEVVGEPRRFHTLMEYFMNYDSFHIEFMVACMQFVNIIVHS
VEDMNFRVHLQYEFTALKLDDYLERLRLCESEDLQVQISAYLDNVFDVAALMEDSETKTA
ALEKVNELEDELGHAHERLASLEREAIAKQATLEAELAQVRHERDQLAEARRQVVEEVST
LRRAQQDSRNRQSMLESKVQELESLTKSLPRGASS
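
Consistency check (illustrative sketch):

The listed CDS length of 1368 nt corresponds to 456 codons, which is 455 amino acids plus a terminal stop codon. The Python sketch below shows one way to check that the nucleotide and protein sequences above agree. It assumes the standard genetic code (NCBI translation table 1) and forward-strand, frame-0 translation. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example and are not part of any database tooling. Only the first and last lines of the nucleotide sequence are embedded to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# with bases T, C, A, G at each position; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a nucleotide string in frame 0 (hypothetical helper)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "GCGTCCATGGACCTGCCGCCAGACAAGGCTAAGCTGCTGCGGAACTATGACTTGGAAAAG"
last_line = "GAACTGGAATCGCTGACCAAGTCACTACCACGAGGAGCCTCCAGTTAG"

print(translate(first_line))  # ASMDLPPDKAKLLRNYDLEK
print(translate(last_line))   # ELESLTKSLPRGASS*
```

The two outputs match the start and end of the protein sequence above. The last line can be translated on its own because the 22 preceding lines total 1320 nt, a multiple of 3, so the reading frame is preserved.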