DPGLEAN15051 in OGS1.0

New model in OGS2.0DPOGS200102 
Genomic Positionscaffold2444:- 794-5333
See gene structure
CDS Length1539
Paired RNAseq reads  5720
Single RNAseq reads  13730
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004550 (4e-168)
Best Drosophila hit  arouser, isoform C (1e-65)
Best Human hitepidermal growth factor receptor kinase substrate 8 (1e-30)
Best NR hit (blastp)  PREDICTED: similar to EPS (human endocytosis) related family member (eps-8) [Apis mellifera] (2e-128)
Best NR hit (blastx)  GH10490 [Drosophila grimshawi] (1e-69)
GeneOntology terms  GO:0007173 epidermal growth factor receptor signaling pathway
InterPro families  IPR013625 Tensin phosphotyrosine-binding domain
Orthology groupMCL11926

Nucleotide sequence:

ATGGCGCTGCGAAACGCTGGCGGGGGGGCTCCCCGGGGTAGGGCTTCCGACGCCACTGCG
ACCCGAGCTCTTGACGAATTTGACACATTAGCTGGTCTGCGACGAAGTACTTATGCTCTA
GAACACCTGGCTACGTTTACAGTGACAAGGGAAACTGGGATAGTATACCCCGCTGATGGC
ATGAGAAGATTGCTTCAACTCGAAAAGACCAATGGAATATGGAGTCAAAAGATGCAACTG
TCTTTAGAAGGTCAATGGGTGCTAGTTATGGATTACGAAACGGGGTCCATCATGGAACGG
TTTCCAGCGTCGTGGGTGCATTCACCAACAGCATTCACATCCCCAGAACCAGCTGAACTG
TACAATAATGTGTTAGCATTTGTTGTCGAAGCCCCGGAGTCGGGAGGCTCTCCAGCGGGT
GCTAGGCGTGAGCTGCACATCTTCCAATGCCACGATGTTGGTGCACAAGCTCTTGTTGAA
GAACTTAATGCACTTAAGGGCGAAGGCGGCGGAGGCTCTGAAGGAGGAAGAGACTTCGTG
ATTGAAAGAGAAAGAGAGAGGGAAAGAGAAAATGATTTAGACAGACCAAGACGTCAGCAA
ATGTCCGCACAGCTCGGCCGCGGAGATCGACCAGATCGTGATCGTGGAGGGTCAGCTGGT
GAGCGCGATGATGCTTCATCTACAGGCTCTGAGAGACTATATGAGCAAGACATCGCGATC
CTCAATAGATGCTTCGACGACATAGAGAAGTTTATCGCCAGGCTGCAACATGCAGCGGCA
GCATCAAGAGAACTTGAGAGAAGGCGGAGATCGAGAGGAGGAAAGAGGAGTGCTGGGACT
GGAGAAGGAATGCTGGCACTAAGAACACGCCCTCCACCTGAAAGAGATTTTGTTGATGTG
CTCCAAAAGTTTAAACTGTCTTTCAACCTTCTGGCCCGTTTGCGAGCTCACATACATGAC
CCTAATGCTCCAGAATTAGTGCACTTTCTCTTCACACCATTGGCTCTGATAGTAGATGCA
GCACAGGATGTCGCAGACGGTCGTCTGCCAGCACGTGTGGTACAGCCACTGCTTACTCGA
GATGCACTTAATCTGCTCGCCAACTGCGTAACCAGCAAAGAAACTGAGCTATGGCACTCG
CTGGGTGATGCTTGGCTTATACCAAGAGAGCAATGGAAAACGACAATCCCTCCCTACCAG
CCGGTGTTTATGGACGGCTGGTCCCCAGATTACCAGGTGGACGACCAACCTTTGCGACGA
GCATCTCCAAGAAGAAGTGAAGCTGGTAGAGGTGGAACGGGTGCATTAGGAGAAGAAGCG
ATCAGGGAAGGAGCAGACAGGGCCGACGGCTACGGGTATGAGAGGGAAGAACCTGACCTA
TACGGTGAACAATACACACCTTACTCTAGAAATCCGCGTACTCTGACCCGAGAAGATTCT
GGCTCAGCGGCTTCCTCTCCAGAACGCGAACCACCATATAGAGCAGACAGAGATGAAGGT
AAGAGTCATAATTCTTTACCATTTTACAAACTGATATAA

Protein sequence:

MALRNAGGGAPRGRASDATATRALDEFDTLAGLRRSTYALEHLATFTVTRETGIVYPADG
MRRLLQLEKTNGIWSQKMQLSLEGQWVLVMDYETGSIMERFPASWVHSPTAFTSPEPAEL
YNNVLAFVVEAPESGGSPAGARRELHIFQCHDVGAQALVEELNALKGEGGGGSEGGRDFV
IERERERERENDLDRPRRQQMSAQLGRGDRPDRDRGGSAGERDDASSTGSERLYEQDIAI
LNRCFDDIEKFIARLQHAAAASRELERRRRSRGGKRSAGTGEGMLALRTRPPPERDFVDV
LQKFKLSFNLLARLRAHIHDPNAPELVHFLFTPLALIVDAAQDVADGRLPARVVQPLLTR
DALNLLANCVTSKETELWHSLGDAWLIPREQWKTTIPPYQPVFMDGWSPDYQVDDQPLRR
ASPRRSEAGRGGTGALGEEAIREGADRADGYGYEREEPDLYGEQYTPYSRNPRTLTREDS
GSAASSPEREPPYRADRDEGKSHNSLPFYKLI