DPGLEAN03224 in OGS1.0

New model in OGS2.0  DPOGS213541
Genomic Position  scaffold66:- 63948-67998
CDS Length  1713
Paired RNAseq reads  661
Single RNAseq reads  1897
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA011823 (0.0)
Best Drosophila hit  p130CAS, isoform D (5e-64)
Best Human hit  breast cancer anti-estrogen resistance protein 1 isoform 2 (8e-27)
Best NR hit (blastp)  conserved hypothetical protein [Pediculus humanus corporis] (2e-123)
Best NR hit (blastx)  PREDICTED: similar to p130CAS CG1212-PA, isoform A [Apis mellifera] (6e-80)
Gene Ontology terms
GO:0005925 focal adhesion
GO:0007016 cytoskeletal anchoring at plasma membrane
GO:0007414 axonal defasciculation
InterPro families
IPR001452 Src homology-3 domain
IPR021901 CAS family, C-terminal domain of unknown function
IPR014928 Serine rich protein interaction
Orthology group  MCL15430

Nucleotide sequence:

ATGGCGCGTGCGTTGTACGACAATATAGCGGAGTCGCCGGACGAGTTGGCGTTCAGACGA
GGCGACCTCCTCACAGTCCTGGAGCAGAACACTGGCGGCAGCGAGGGCTGGTGGCTCTGC
TCGCTCAGAGGAAGACAGGGAATATGCCCCGGAAACAGGCTGCGTATAGTTGCCGGGGTT
TTCGACGCAAGTTCCGCTCTACAGAGGCGACGCACGCGCACCCCCGCCCCCGTCGCCCAC
CAACCACTCCCCTCACAACAGTCACCCGCCTTCACAAAAATCGAGGAATGCTCTCATTAC
GACGTCCCTCGGGCTCCGATGCCGGTTCAACGTATTATCGGTACGTACGATTGTCCCCGG
TCGCAAGGGGACTGGTACGACGCCCCTCGTGCGCCGCGGCCGGCCAGCGCGGACTCCGCC
TGCAGTGGCACGGGTTCCCTGACGTCGGCCACATCCAGCGCCTCCGCCAACTCGGGCAGT
TCAGCGAACTCAGCTTCCAGCACTTATGATGTACCCAGATCCCGAGCCTTGCCGCTGCCG
TGCGACGCCGCCATGGAGGCCTTAGAACGGTTACAGGAGGAGGCGTCCACGGCGGTTTCC
CGCCTGCTGTCCTATGTCACACCGGGCTGGCGGCGGCGCGGGGCGTTGCGACCGCGCGTG
CTAGACGTGCGTGTTGCGGGCGCTCGTCTGCGAGCAGCCTTACACGACCTCGCCGTGTTC
GCTGACGCTACACTGGCCAACGCTCATGATGCACAAGACAAAGGTATCGCAGTAAAGCTA
CGGCCACTAGTGAAGGCTTTAAAGGACGCCGAGCGGATCACACACGAGGCGACCAGCGCG
CTCGACGCCGGCGACTGGGCCCCGGAGCGGCTGGAGCGCGACAGGGAGCCCACGGACGGC
ACGCACGACGCGCTCGACCAGCTCGTCGCGTGTGCGCGCTCCCTCACCGAGGACGTGCGC
CGAGCCGCCTCCTTCATACACGGAAACGCCTCACTACTGTTCAGGCGTTCCGCGACAGTT
CCTGAACACGAGTGGACTGAGGAGTACGATTACGTGAGGTTGGAGTCCAGGAGTGCCGTT
GGTCGGAGGAACGCCGAGATCCGGGCAGCTTTGCCGGACAAACTCAGGGCCTCCTTCGAC
GCGCTCGTCCGCGACGCGGACCATGCGGGCGAGGTGAGTGCTGTAGCAGCTGCAACCCGC
CTTCCAGCGGATGATCGCCAGCTGGCCGCGTTCTACGCCGCGCAGACGGCTACGTACGGA
GCGCACCTCTCGACCGCCGTGGAAGCCTTCCTCAGGACCATCGACATGGGACAACCGCCT
GACGTGTTCCTCGCACACGGCAAGTTCGTGGTGCTCAGCGCGCACAGGATCGTACACGTC
GGGGACACCGTGCACAGGAGCGCCCAGCACTCGGGGCTGAAGTCAAAAATACTAAGGTGT
TCGGACGCACTATCGGACTCGTTAGCGGCGACCGTGGCCAAAACTAAAGCGGCGGCGCTG
CAGTTCCCTTGCGCGAGCGCGGTGGCCGAGATGGCTGAGGCGGCGCGGACCTTGGCCGCC
AGGGCGCAGGAGTTGAGACGAGCCCTAGTGAGAGCGGCCGAACCACCTCAAGACACACCC
TCTACCACGGTGCCGCCGTCCTCCACCACGACACCTCTCACCCCACTCACGCCGCTCGCA
CCTCACCCCACCACCACCCTCCCCGTATTATAA

Protein sequence:

MARALYDNIAESPDELAFRRGDLLTVLEQNTGGSEGWWLCSLRGRQGICPGNRLRIVAGV
FDASSALQRRRTRTPAPVAHQPLPSQQSPAFTKIEECSHYDVPRAPMPVQRIIGTYDCPR
SQGDWYDAPRAPRPASADSACSGTGSLTSATSSASANSGSSANSASSTYDVPRSRALPLP
CDAAMEALERLQEEASTAVSRLLSYVTPGWRRRGALRPRVLDVRVAGARLRAALHDLAVF
ADATLANAHDAQDKGIAVKLRPLVKALKDAERITHEATSALDAGDWAPERLERDREPTDG
THDALDQLVACARSLTEDVRRAASFIHGNASLLFRRSATVPEHEWTEEYDYVRLESRSAV
GRRNAEIRAALPDKLRASFDALVRDADHAGEVSAVAAATRLPADDRQLAAFYAAQTATYG
AHLSTAVEAFLRTIDMGQPPDVFLAHGKFVVLSAHRIVHVGDTVHRSAQHSGLKSKILRC
SDALSDSLAATVAKTKAAALQFPCASAVAEMAEAARTLAARAQELRRALVRAAEPPQDTP
STTVPPSSTTTPLTPLTPLAPHPTTTLPVL
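
Consistency check (illustrative sketch):

The CDS length, nucleotide sequence and protein sequence in this record should agree with one another: 1713 nt is 571 codons, that is, 570 residues plus a stop codon. The Python sketch below shows one way to check this. It assumes the standard genetic code (NCBI translation table 1) and an unambiguous A/C/G/T sequence. The names `translate` and `check_record` are hypothetical helpers written for this example and are not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; stop codons are written as '*'.

    Whitespace and line breaks are ignored. Ambiguous bases (e.g. N)
    are not handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def check_record(cds: str, protein: str, cds_length: int) -> bool:
    """Return True if the stated length, the CDS and the protein agree."""
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()
    translated = translate(cds)
    return (
        len(cds) == cds_length
        and translated.endswith("*")       # CDS ends in a stop codon
        and translated[:-1] == protein     # protein is listed without the stop
    )


# First four and last three codons of the nucleotide sequence above.
print(translate("ATGGCGCGTGCG"))  # MARA
print(translate("GTATTATAA"))     # VL*
```

Passing the full nucleotide and protein sequences from this record to `check_record` with `cds_length=1713` applies the same check to the whole entry.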