DPGLEAN20207 in OGS1.0

New model in OGS2.0  DPOGS214861
Genomic Position  scaffold809:- 40608-44914
CDS Length  1530
Paired RNAseq reads  1918
Single RNAseq reads  4791
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA010072 (1e-73)
Best Drosophila hit  cortactin (5e-72)
Best Human hit  src substrate cortactin isoform b (2e-94)
Best NR hit (blastp)  PREDICTED: cortactin [Taeniopygia guttata] (3e-127)
Best NR hit (blastx)  PREDICTED: cortactin [Taeniopygia guttata] (4e-126)
Gene Ontology terms
GO:0001726 ruffle
GO:0005515 protein binding
GO:0005737 cytoplasm
GO:0005938 cell cortex
GO:0006898 receptor-mediated endocytosis
GO:0030027 lamellipodium
InterPro families
IPR015503 Cortactin
IPR001452 Src homology-3 domain
IPR003134 Hs1/Cortactin
IPR000108 Neutrophil cytosol factor 2 p67phox
Orthology group  MCL17978

Nucleotide sequence:

ATGTGGAAAGCGGCCACTGATGTAGTGGCGCCCACACCGGCCGAGGCTGACGATTGGGAG
ACAGATCCCGACTTTGTGAATGATGTCACAGAACAGGAACAACGTTGGGGGCCAGGGGGA
AGACATGTAGAAGCTATTGATATGGCTAAACTCAGAGAGGAAGTTCTGGAAGCAGACAAG
CAAATTAAACAGAAGCAGTACGAGGAAGGGCCTAAACCCTCATATGGATATGGAGGGAAA
TTTGGTGTCCAACAAGACAGGATGGATAAATCAGCGGTCGGGCACGATTACGTCGGCAAA
ACAGAGAAGCATGTCTCGCAGAAAGATTACGCACAAGGTTTCGGCGGTAAGTTTGGCGTT
CAAACTGACCGTATGGACGCCAGCGCGGTGGGTCACGACTATGTGGGCGTCGTGTCCAAG
CACGCCTCGCAGACCGATCATAGTAGGGGCTTCGGGGGGAAGTACGGCGTGCAGACTGAC
AGAGTTGACAAGAGCGCGGCTGGTTGGGAACACAAGGAGCAGATAGAGAAGCATCCGTCG
CAGAAAGACTACTCGGTCGGCTTCGGAGGCAAGTTCGGTGTACAGGTCGACCGGCAGGAC
GCCAGCGCCGCCGACTGGGGACACAAGGAACCCACTGCGGCACACGAGTCGCAGACTGAT
CACTCCCGCGGTTTCGGTGGTAAGTTCGGGGTGCAGACGGACAGACAGGACGCGTCCGCC
GTCGGCTGGGATCACCAGGAGAAGACGGAGGCTCACGCTAGCCAAGTGGACCATAAGAAG
GGCTTCGGTGGTAAATTCGGTGTCCAAACTGACAGAGTGGATAAATGCGCCCAAGGTTTC
GACTCCGTGGAGAAGTCGGGCGGGTACAGTAGACCCAGGCCAGACATCGGCGGAGCCAAG
CCCAGCTCCATACGAGCCAAGTTTGAGAACATGGCCAAGGAAAAAGAACAGATCCTTCGA
GATCAATCCGTTCAGAAATTAAGACAGGAGAGGCAACAACTAGATCGTAGTTTGTCAGAA
AAAGAAAAACAACGTCTGGAGAAAGAAAAGGAGCAAAATCAAGAAGAGACGGCCAGCACG
AACGTGTTCAAGAAGACTGAAGGTGGTAACGCAGTGCCCGCGGCTGTGCAGGCTGTGCAG
GACGCGAGACAAGAGGTGGAGCAGGACGTTAGACAGGACTCTGTGCACGAGAAACAGGAA
GTGAAGCAGAGCAACCTGCCGGATGTGACTCTTGTGGGAGACGCCAAGGACGAAGACAAG
GAAGAGCATCCGCGGCAGCCCACGATAGTGGTGTCTCCTGTGGGCTGGGAGGGGGAGGGC
GAGGGCGAGGCGTGCGAGGCTGACGACGAGGACGGGTACACGGCCCGCGCGCTGTACGAC
TACCAGGCCGCGGCGCCCGACGAAATATCATTCGACCCCGACGACCTCATCACCAACATC
GTCATGATCGACGAGGGCTGGTGGCAGGGTCTGTGTAAGGGCGCATACGGCCTGTTCCCG
GCTAACTACGTACAGCTACAAGACAAATAA

Protein sequence:

MWKAATDVVAPTPAEADDWETDPDFVNDVTEQEQRWGPGGRHVEAIDMAKLREEVLEADK
QIKQKQYEEGPKPSYGYGGKFGVQQDRMDKSAVGHDYVGKTEKHVSQKDYAQGFGGKFGV
QTDRMDASAVGHDYVGVVSKHASQTDHSRGFGGKYGVQTDRVDKSAAGWEHKEQIEKHPS
QKDYSVGFGGKFGVQVDRQDASAADWGHKEPTAAHESQTDHSRGFGGKFGVQTDRQDASA
VGWDHQEKTEAHASQVDHKKGFGGKFGVQTDRVDKCAQGFDSVEKSGGYSRPRPDIGGAK
PSSIRAKFENMAKEKEQILRDQSVQKLRQERQQLDRSLSEKEKQRLEKEKEQNQEETAST
NVFKKTEGGNAVPAAVQAVQDARQEVEQDVRQDSVHEKQEVKQSNLPDVTLVGDAKDEDK
EEHPRQPTIVVSPVGWEGEGEGEACEADDEDGYTARALYDYQAAAPDEISFDPDDLITNI
VMIDEGWWQGLCKGAYGLFPANYVQLQDK
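
Consistency check (illustrative):

The nucleotide and protein listings above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene model. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration and are not part of the database record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), listed in TCAG order.
# Assumption: this table applies to the gene model in this record.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS to protein, stopping at the first stop codon.

    Whitespace and line breaks (as in the listing above) are ignored.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First four and last three codons of the nucleotide sequence above.
print(translate("ATGTGGAAAGCG"))  # MWKA
print(translate("GACAAATAA"))     # DK
```

A CDS length of 1530 nt corresponds to 510 codons, that is, 509 amino acids followed by one stop codon. Pasting the full nucleotide listing into `translate` and comparing the result with the protein listing is one way to confirm that the two agree.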