DPGLEAN14084 in OGS1.0

New model in OGS2.0DPOGS212234 
Genomic Positionscaffold959:- 27150-35647
See gene structure
CDS Length3402
Paired RNAseq reads  294
Single RNAseq reads  716
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004446 (0.0)
Best Drosophila hit  CG10137 (2e-106)
Best Human hitglycine-, glutamate-, thienylcyclohexylpiperidine-binding protein (7e-45)
Best NR hit (blastp)  PREDICTED: similar to CG10137 CG10137-PA [Tribolium castaneum] (4e-148)
Best NR hit (blastx)  GJ18223 [Drosophila virilis] (7e-121)
GeneOntology terms

  
GO:0016595 glutamate binding
GO:0016596 thienylcyclohexylpiperidine binding
GO:0016594 glycine binding
InterPro families  IPR008979 Galactose-binding domain-like
Orthology groupMCL15356

Nucleotide sequence:

ATGCCGAAACGTATACCCTTCCACATAGTTTACGCAACCAGTGAAGATAGTTCATATCCA
GCATGCGAGTTGAACGCCCAGGGTCCTGCGGCTCGCGGATGGCGGAGCGCTGGTCCTCCG
CCCCACGAGCTCCTGCTGCGCCTCACCGCCGTTACCAGCATACACAAGCTACAGCTTCTA
GCTCATCATCAGCTGATACCTGCTTGCGTAGAAGTGTTAGTGTCTGGAGGTCTGCTGTCA
GAGGGAGCTGCGACACCGTGCGGAGCAACTTACACTAGTGTCGGTAGAGTGACACTCGCC
AAACCAGCGCCCCAAGCACGCACTAGAGAACTAAGATCGGCTGCTCTGCCCGAGCCAACG
GTAGCTCGCTTCGTAAAACTGAGGCTATCTGGACCACATCCACCAGCAAAAGACGATGAG
CAGGTTGCGTTAATGGCTGTAAACGTTCTTGGTGATGAGGTGGAAGACGTCGCCAAATCG
TTGCCAACAACAAAAGCTGAGGTGTGTTTCTCGCCTTACGATGACCTGGCCTTCGTTATG
TATGTGGACAATGAAATTGCGGATCTCGTTCGTAATTTAGATGAAAAGAAAAAAACAGCT
GTATGTGAAGAACGATTCGAATATGCACGACGGCTCAAATCAGCTGGTCAGGCTTTAGCT
GCTGCGGGCATCAGGATCGGGAGATGGAGACTTCGCAAGAGAACCGCCGCAGCTCGGGAT
GACTTCGAACTGGCGAGACGCATGAGAGACAGAATAGCAGACGCACTGATCGGCGTCCAA
GAAGACCCAGAGTTGAGGAGACTATTTGAAGATGATGGACCGGACACTCGCAACGACTCT
TCTATGCCCCAAGCCTACGACTTCTCCCACCATCTGTCGCCGTCCGTCGCTATGGGAGTT
CATAGCGTCGAAATTCCCTCGCCTGTACCGCCCATCGAACATTTACCAGAAAACGAATTC
AATGGAGATCACATCGACAGTCATAATATACTCGCTTCACCCGTCCATATTCTTGAAGAT
GAAACCGAAGTACCAGAAGAACCGGCCCAACCAGATGAACCGATCCAAGAAGATAAAACT
GAAGCTCAAAAGATAGAAGAAGAATTAAGAAAGGAGACTGAAAGTCCCCGTAGAAGTATA
ACTCCTACTGCCTCTAATGGTAATAGAGCATCAGAACTAAGCTATCCAGGTACATTAGTG
AGACGAAGAAACAAAAGTGCTGGTCCCAGGTCTACTTTTGAAGCTTATGAAGAAAGATTA
TTGCCTGCACTCAGACATTCACATACAAACGAATACCTCCGTGAGGCCCGTGAAGAAGAC
TGCACAGGAAGCTCTTCTTCACATCCTCGTGTAGTACACAAGTTGAATGAGCGGGAACGA
AAACAGGCCGCGCTGCCGATACTTATATTTGGATATCCTTTGGTTGAAAAATTCTTCTCC
AAAAGCTATTTGGACAAGGAAGAAGGTCTGGCGCGCCTGCGAGCTGAGTTGACGTCACCA
TCGAACGGCAGCACCAAGACGTCTCCGAACAAAACAGCGCGAGCAGCGGCGACTTTGCTC
CAGAGAGTTCTGAGAGATAAAGTATTCTCAGTCTACAGTCAAGCCAATGAAGTTGTCAGA
GTGCTTTTCAAAGAATTCGTCCCTGAAAGGGTTTGCGCAGCGGAAGTAGGTCGATGTCTG
GACAAACTCCTCCCTGAACTGCTGCGTGCTTGTGGGGACCCCGCCCCACGCGTGCATTCA
ACGGCTCAACACACCGTGCTCACAGTTGCTGACTGTCCTCTAGTCAGAAGCCTACACACA
ATTCCACAACAGCTTGTTCGACCTGTAGCTGCTTCCATGCATCCTCGACTAGCTCTCTCT
CGTCTTCAGATGCTGGAACAACTCATCCTGAGCCATGGAATCTCGACCGACAAGAATAGT
GGTCTGACGGTGCGTCGTCTAGCGGAGTGTGGTGCTGCAGGGGCTCAACACGCAGCGGGC
TCAGTCAGAGCTGCTGCTGAAAGAATTCTCTTAGCAGCATACGCAAGATCCCCTAGAGTT
GTCAGAGCACAACTTCCGCCAGACGATGCTGTCACCAGAAGAAATCTAATTTACAGACAC
CTCTTTCAACAATTTGATAGAATTGATATGCAGAAAATGCTAAATCAAGCACCTACAGAA
GAACAACTTCTTAATGGAGATCAGTCCATTGCTGATTCAAACTTAGAAGCTAGCGTAACA
CAGTCTACACGAAGCGGGACTACGGTTAGTGGAATGACCACATCTTATGGAATGACGTCT
TCTATGGATGCCACATCATCCTATAGCTTAAAATCAAGTGCCAGTGGTGGCACCCTGGCT
CCTTCTAGTTTAAGTGGAAGTTTTACAACGTCGAGAACAAAAAGCAGTTTAAAAAAAACA
CCCACTAAAAAATACACACCGACAAAATCATCCAAAGACGCTACCAATTATCCTGGCTAC
AACAAACTAAGACTTGATAGTGCCATTAGTCCAAAACATTCCCCAAGATCATCAGTCGGT
GGGAATGAAAAGGTCCATTTCCAGGAACGTCAAACGGAGGAAGTTGTGTTCCGTCGTACA
AGCAGGAACTTAGAAAACCGCCACTCCATGATCCACTACGATCATGACTTGTCTAAACCC
CAACTGAAAGAACGTCCAGTCACGGTTTACGAACCTCTACATTTAGAGTATAGAGACTCC
CCTACTATAGGCTCGCCAAAAAATTCCAAAAATGACAACCGAAGCATGGACTCCCTTCCT
ATGGACTCGCCTCAAATGTCAAGAAACGATATGAGATGCGACTCTGATAGCAGAAGTTTG
GATTCCCCTAAATTAAAGGCCGACTATTTTAGAGATGTGGGCTTGGAATCCCCAAAATTA
GTAGCCGGGGTTAGAAATTTGCATTTGGATGAACAAAGCCAATTGGATGAAAGTGGATAT
TATAGTCCAGGACGAAGACAGCAGACGCAAAACAATGAGCCATACGAAGCTTATGAAGGA
GTAGCAGCTGATGCTAGCAGTGAAACCACGCCGGAGCCAGTAACGAGCACATCTTGCACC
TGGTGCGGTAGACGCGTGCGCACTGCTGCATTGGAGGCACACTACTGGCGAAGGTGCGTG
CTTCTCGCTCGATGCCCGCACTGTCATCTTGCTCTAGAAGCCCGGGCTCTACACTCGCAT
TTACTGGAAGAGTGCTCGCTTAGCGAAGGATTGTGGAAGGCGTGCCAGAAATGTGGCGCG
GCCTTACGTTCAGACGAAAGTGAATATCACGTCAACTGCACACCTTTAGGCTTGGATGAG
TGGAAGTGTCCGTACTGTTTGACCAACATATTAGCTCGCGACCTTCCTTGGCAACGTCAT
CTGATGCAGTGTCCTCGCAACCCGAGACTAACACAACACTAA

Protein sequence:

MPKRIPFHIVYATSEDSSYPACELNAQGPAARGWRSAGPPPHELLLRLTAVTSIHKLQLL
AHHQLIPACVEVLVSGGLLSEGAATPCGATYTSVGRVTLAKPAPQARTRELRSAALPEPT
VARFVKLRLSGPHPPAKDDEQVALMAVNVLGDEVEDVAKSLPTTKAEVCFSPYDDLAFVM
YVDNEIADLVRNLDEKKKTAVCEERFEYARRLKSAGQALAAAGIRIGRWRLRKRTAAARD
DFELARRMRDRIADALIGVQEDPELRRLFEDDGPDTRNDSSMPQAYDFSHHLSPSVAMGV
HSVEIPSPVPPIEHLPENEFNGDHIDSHNILASPVHILEDETEVPEEPAQPDEPIQEDKT
EAQKIEEELRKETESPRRSITPTASNGNRASELSYPGTLVRRRNKSAGPRSTFEAYEERL
LPALRHSHTNEYLREAREEDCTGSSSSHPRVVHKLNERERKQAALPILIFGYPLVEKFFS
KSYLDKEEGLARLRAELTSPSNGSTKTSPNKTARAAATLLQRVLRDKVFSVYSQANEVVR
VLFKEFVPERVCAAEVGRCLDKLLPELLRACGDPAPRVHSTAQHTVLTVADCPLVRSLHT
IPQQLVRPVAASMHPRLALSRLQMLEQLILSHGISTDKNSGLTVRRLAECGAAGAQHAAG
SVRAAAERILLAAYARSPRVVRAQLPPDDAVTRRNLIYRHLFQQFDRIDMQKMLNQAPTE
EQLLNGDQSIADSNLEASVTQSTRSGTTVSGMTTSYGMTSSMDATSSYSLKSSASGGTLA
PSSLSGSFTTSRTKSSLKKTPTKKYTPTKSSKDATNYPGYNKLRLDSAISPKHSPRSSVG
GNEKVHFQERQTEEVVFRRTSRNLENRHSMIHYDHDLSKPQLKERPVTVYEPLHLEYRDS
PTIGSPKNSKNDNRSMDSLPMDSPQMSRNDMRCDSDSRSLDSPKLKADYFRDVGLESPKL
VAGVRNLHLDEQSQLDESGYYSPGRRQQTQNNEPYEAYEGVAADASSETTPEPVTSTSCT
WCGRRVRTAALEAHYWRRCVLLARCPHCHLALEARALHSHLLEECSLSEGLWKACQKCGA
ALRSDESEYHVNCTPLGLDEWKCPYCLTNILARDLPWQRHLMQCPRNPRLTQH