New model in OGS2.0 | DPOGS212234  |
---|---|
Genomic Position | scaffold959:- 27150-35647 |
See gene structure | |
CDS Length | 3402 |
Paired RNAseq reads   | 294 |
Single RNAseq reads   | 716 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA004446 (0.0) |
Best Drosophila hit   | CG10137 (2e-106) |
Best Human hit | glycine-, glutamate-, thienylcyclohexylpiperidine-binding protein (7e-45) |
Best NR hit (blastp)   | PREDICTED: similar to CG10137 CG10137-PA [Tribolium castaneum] (4e-148) |
Best NR hit (blastx)   | GJ18223 [Drosophila virilis] (7e-121) |
GeneOntology terms    | GO:0016595 glutamate binding GO:0016596 thienylcyclohexylpiperidine binding GO:0016594 glycine binding |
InterPro families   | IPR008979 Galactose-binding domain-like |
Orthology group | MCL15356 |
Nucleotide sequence:
ATGCCGAAACGTATACCCTTCCACATAGTTTACGCAACCAGTGAAGATAGTTCATATCCA
GCATGCGAGTTGAACGCCCAGGGTCCTGCGGCTCGCGGATGGCGGAGCGCTGGTCCTCCG
CCCCACGAGCTCCTGCTGCGCCTCACCGCCGTTACCAGCATACACAAGCTACAGCTTCTA
GCTCATCATCAGCTGATACCTGCTTGCGTAGAAGTGTTAGTGTCTGGAGGTCTGCTGTCA
GAGGGAGCTGCGACACCGTGCGGAGCAACTTACACTAGTGTCGGTAGAGTGACACTCGCC
AAACCAGCGCCCCAAGCACGCACTAGAGAACTAAGATCGGCTGCTCTGCCCGAGCCAACG
GTAGCTCGCTTCGTAAAACTGAGGCTATCTGGACCACATCCACCAGCAAAAGACGATGAG
CAGGTTGCGTTAATGGCTGTAAACGTTCTTGGTGATGAGGTGGAAGACGTCGCCAAATCG
TTGCCAACAACAAAAGCTGAGGTGTGTTTCTCGCCTTACGATGACCTGGCCTTCGTTATG
TATGTGGACAATGAAATTGCGGATCTCGTTCGTAATTTAGATGAAAAGAAAAAAACAGCT
GTATGTGAAGAACGATTCGAATATGCACGACGGCTCAAATCAGCTGGTCAGGCTTTAGCT
GCTGCGGGCATCAGGATCGGGAGATGGAGACTTCGCAAGAGAACCGCCGCAGCTCGGGAT
GACTTCGAACTGGCGAGACGCATGAGAGACAGAATAGCAGACGCACTGATCGGCGTCCAA
GAAGACCCAGAGTTGAGGAGACTATTTGAAGATGATGGACCGGACACTCGCAACGACTCT
TCTATGCCCCAAGCCTACGACTTCTCCCACCATCTGTCGCCGTCCGTCGCTATGGGAGTT
CATAGCGTCGAAATTCCCTCGCCTGTACCGCCCATCGAACATTTACCAGAAAACGAATTC
AATGGAGATCACATCGACAGTCATAATATACTCGCTTCACCCGTCCATATTCTTGAAGAT
GAAACCGAAGTACCAGAAGAACCGGCCCAACCAGATGAACCGATCCAAGAAGATAAAACT
GAAGCTCAAAAGATAGAAGAAGAATTAAGAAAGGAGACTGAAAGTCCCCGTAGAAGTATA
ACTCCTACTGCCTCTAATGGTAATAGAGCATCAGAACTAAGCTATCCAGGTACATTAGTG
AGACGAAGAAACAAAAGTGCTGGTCCCAGGTCTACTTTTGAAGCTTATGAAGAAAGATTA
TTGCCTGCACTCAGACATTCACATACAAACGAATACCTCCGTGAGGCCCGTGAAGAAGAC
TGCACAGGAAGCTCTTCTTCACATCCTCGTGTAGTACACAAGTTGAATGAGCGGGAACGA
AAACAGGCCGCGCTGCCGATACTTATATTTGGATATCCTTTGGTTGAAAAATTCTTCTCC
AAAAGCTATTTGGACAAGGAAGAAGGTCTGGCGCGCCTGCGAGCTGAGTTGACGTCACCA
TCGAACGGCAGCACCAAGACGTCTCCGAACAAAACAGCGCGAGCAGCGGCGACTTTGCTC
CAGAGAGTTCTGAGAGATAAAGTATTCTCAGTCTACAGTCAAGCCAATGAAGTTGTCAGA
GTGCTTTTCAAAGAATTCGTCCCTGAAAGGGTTTGCGCAGCGGAAGTAGGTCGATGTCTG
GACAAACTCCTCCCTGAACTGCTGCGTGCTTGTGGGGACCCCGCCCCACGCGTGCATTCA
ACGGCTCAACACACCGTGCTCACAGTTGCTGACTGTCCTCTAGTCAGAAGCCTACACACA
ATTCCACAACAGCTTGTTCGACCTGTAGCTGCTTCCATGCATCCTCGACTAGCTCTCTCT
CGTCTTCAGATGCTGGAACAACTCATCCTGAGCCATGGAATCTCGACCGACAAGAATAGT
GGTCTGACGGTGCGTCGTCTAGCGGAGTGTGGTGCTGCAGGGGCTCAACACGCAGCGGGC
TCAGTCAGAGCTGCTGCTGAAAGAATTCTCTTAGCAGCATACGCAAGATCCCCTAGAGTT
GTCAGAGCACAACTTCCGCCAGACGATGCTGTCACCAGAAGAAATCTAATTTACAGACAC
CTCTTTCAACAATTTGATAGAATTGATATGCAGAAAATGCTAAATCAAGCACCTACAGAA
GAACAACTTCTTAATGGAGATCAGTCCATTGCTGATTCAAACTTAGAAGCTAGCGTAACA
CAGTCTACACGAAGCGGGACTACGGTTAGTGGAATGACCACATCTTATGGAATGACGTCT
TCTATGGATGCCACATCATCCTATAGCTTAAAATCAAGTGCCAGTGGTGGCACCCTGGCT
CCTTCTAGTTTAAGTGGAAGTTTTACAACGTCGAGAACAAAAAGCAGTTTAAAAAAAACA
CCCACTAAAAAATACACACCGACAAAATCATCCAAAGACGCTACCAATTATCCTGGCTAC
AACAAACTAAGACTTGATAGTGCCATTAGTCCAAAACATTCCCCAAGATCATCAGTCGGT
GGGAATGAAAAGGTCCATTTCCAGGAACGTCAAACGGAGGAAGTTGTGTTCCGTCGTACA
AGCAGGAACTTAGAAAACCGCCACTCCATGATCCACTACGATCATGACTTGTCTAAACCC
CAACTGAAAGAACGTCCAGTCACGGTTTACGAACCTCTACATTTAGAGTATAGAGACTCC
CCTACTATAGGCTCGCCAAAAAATTCCAAAAATGACAACCGAAGCATGGACTCCCTTCCT
ATGGACTCGCCTCAAATGTCAAGAAACGATATGAGATGCGACTCTGATAGCAGAAGTTTG
GATTCCCCTAAATTAAAGGCCGACTATTTTAGAGATGTGGGCTTGGAATCCCCAAAATTA
GTAGCCGGGGTTAGAAATTTGCATTTGGATGAACAAAGCCAATTGGATGAAAGTGGATAT
TATAGTCCAGGACGAAGACAGCAGACGCAAAACAATGAGCCATACGAAGCTTATGAAGGA
GTAGCAGCTGATGCTAGCAGTGAAACCACGCCGGAGCCAGTAACGAGCACATCTTGCACC
TGGTGCGGTAGACGCGTGCGCACTGCTGCATTGGAGGCACACTACTGGCGAAGGTGCGTG
CTTCTCGCTCGATGCCCGCACTGTCATCTTGCTCTAGAAGCCCGGGCTCTACACTCGCAT
TTACTGGAAGAGTGCTCGCTTAGCGAAGGATTGTGGAAGGCGTGCCAGAAATGTGGCGCG
GCCTTACGTTCAGACGAAAGTGAATATCACGTCAACTGCACACCTTTAGGCTTGGATGAG
TGGAAGTGTCCGTACTGTTTGACCAACATATTAGCTCGCGACCTTCCTTGGCAACGTCAT
CTGATGCAGTGTCCTCGCAACCCGAGACTAACACAACACTAA
Protein sequence:
MPKRIPFHIVYATSEDSSYPACELNAQGPAARGWRSAGPPPHELLLRLTAVTSIHKLQLL
AHHQLIPACVEVLVSGGLLSEGAATPCGATYTSVGRVTLAKPAPQARTRELRSAALPEPT
VARFVKLRLSGPHPPAKDDEQVALMAVNVLGDEVEDVAKSLPTTKAEVCFSPYDDLAFVM
YVDNEIADLVRNLDEKKKTAVCEERFEYARRLKSAGQALAAAGIRIGRWRLRKRTAAARD
DFELARRMRDRIADALIGVQEDPELRRLFEDDGPDTRNDSSMPQAYDFSHHLSPSVAMGV
HSVEIPSPVPPIEHLPENEFNGDHIDSHNILASPVHILEDETEVPEEPAQPDEPIQEDKT
EAQKIEEELRKETESPRRSITPTASNGNRASELSYPGTLVRRRNKSAGPRSTFEAYEERL
LPALRHSHTNEYLREAREEDCTGSSSSHPRVVHKLNERERKQAALPILIFGYPLVEKFFS
KSYLDKEEGLARLRAELTSPSNGSTKTSPNKTARAAATLLQRVLRDKVFSVYSQANEVVR
VLFKEFVPERVCAAEVGRCLDKLLPELLRACGDPAPRVHSTAQHTVLTVADCPLVRSLHT
IPQQLVRPVAASMHPRLALSRLQMLEQLILSHGISTDKNSGLTVRRLAECGAAGAQHAAG
SVRAAAERILLAAYARSPRVVRAQLPPDDAVTRRNLIYRHLFQQFDRIDMQKMLNQAPTE
EQLLNGDQSIADSNLEASVTQSTRSGTTVSGMTTSYGMTSSMDATSSYSLKSSASGGTLA
PSSLSGSFTTSRTKSSLKKTPTKKYTPTKSSKDATNYPGYNKLRLDSAISPKHSPRSSVG
GNEKVHFQERQTEEVVFRRTSRNLENRHSMIHYDHDLSKPQLKERPVTVYEPLHLEYRDS
PTIGSPKNSKNDNRSMDSLPMDSPQMSRNDMRCDSDSRSLDSPKLKADYFRDVGLESPKL
VAGVRNLHLDEQSQLDESGYYSPGRRQQTQNNEPYEAYEGVAADASSETTPEPVTSTSCT
WCGRRVRTAALEAHYWRRCVLLARCPHCHLALEARALHSHLLEECSLSEGLWKACQKCGA
ALRSDESEYHVNCTPLGLDEWKCPYCLTNILARDLPWQRHLMQCPRNPRLTQH