DPGLEAN07603 in OGS1.0

New model in OGS2.0  DPOGS206111
Genomic Position  scaffold89:+ 10826-12061
CDS Length  1236
Paired RNAseq reads  390
Single RNAseq reads  1179
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA006832 (6e-151)
Best Drosophila hit  TBPH, isoform F (2e-96)
Best Human hit  TAR DNA-binding protein 43 (4e-85)
Best NR hit (blastp)  PREDICTED: similar to TBPH CG10327-PA, isoform A [Apis mellifera] (2e-133)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC006055 [Tribolium castaneum] (3e-110)
GeneOntology terms
GO:0003729 mRNA binding
GO:0003676 nucleic acid binding
GO:0000166 nucleotide binding
GO:0008344 adult locomotory behavior
GO:0007528 neuromuscular junction development
InterPro families
IPR000504 RNA recognition motif domain
IPR012677 Nucleotide-binding, alpha-beta plait
Orthology group  MCL11836

Nucleotide sequence:

ATGTCCTTCGAGTACTTGCCCGTGGCCGAAGATGAAAATGAAGAACCCATAGAACTTCCA
ATTGAAGAAGATGGGACTTTGATGTTAACAACTGTATCCGCGCAGTTTCCTGGCTGTTGT
GGACTCAAGTATCGTCATCCTGAAACGAAAACGTTTAGAGGTATAAGATTACGAGATGGT
AGGCTTTACCCACCACCAGAAGGTTGGGGAAATCAACTGTACATATGCAGTTTTCCGAAA
GAAAATAAACGAAAATCTGGTGACAATTCTGAAACATCTTCGGTGAAAAGTAAACGGAAT
GATAATTTGTGCTCAGATTTAATAGTGTTGGGGTTACCATGGAAAGCAACAGAGCAAACC
GTGCGGGAGTATTTCGAAAAGTTTGGCGAAGTGTTAATGGCTCAATTAAAACGTGATCCT
AAAACCGGTATGTCAAAAGGCTTCGCTTTTATTAGATTTTCATCTTACACATCTCAGATG
AGAGTCTTAGCTCAAAGACATATGATTGATGGACGTTGGTGTGATGTACGGATACCTAAC
TCAAAGGAAGGTTCTGTCACATCTATGCCTTGTAAAGTTTTTGTTGGCCGCTGTACAGAA
GATTTAACAGCCAATGATTTAAGAGAATATTTTTCACAATTTGGTGAAGTAACAGATGTT
TTTATTCCAAAGCCTTTTAGGGCATTCAGCTTTATAACATTTTTGGATCCTGAAGTTGCA
CAAAGCTTATGTGGTCAAGACCACATTATAAAAGGAGTATCTGTAAATGTGTCTAATGCA
TCACCTAAACAAAATAAAAGTGGTTCTAATCAACGAAACTTACCAAGTAGAAACTATGAA
GAAGGACATCCACACAGTGCCTCAAACAATAATTCATGGAGTAGCCGTAATATGGATATG
GTGAATATGCAAGCCTTAGGATTGTCTGGCCAACACGGTCAAACCGCCGTGGCCGGTGGT
GGAGGGCAAGGCCAAGGTGGAAGTATGCCACTCGGCATGGGTGGTTTGCCAGTAAATCAA
GCTCTAGTAGCTGCTGCACTAAATCAGGCAGCAGGCTGGGGTTTAATTAATAATATACCA
TCGGGGGGATCAGATCAAGGTGCCTTTGCTGGACCGGCTTCTTCTGCTCCACCAGCACCA
CCTAACTTCCTGTCATGGATGCAACAGGGCAATTCTGGACAAGGACCTTCTAGTCAGTGG
GGACAGAGACACCAATCCCAAGGCCACTCCGTTTGA

Protein sequence:

MSFEYLPVAEDENEEPIELPIEEDGTLMLTTVSAQFPGCCGLKYRHPETKTFRGIRLRDG
RLYPPPEGWGNQLYICSFPKENKRKSGDNSETSSVKSKRNDNLCSDLIVLGLPWKATEQT
VREYFEKFGEVLMAQLKRDPKTGMSKGFAFIRFSSYTSQMRVLAQRHMIDGRWCDVRIPN
SKEGSVTSMPCKVFVGRCTEDLTANDLREYFSQFGEVTDVFIPKPFRAFSFITFLDPEVA
QSLCGQDHIIKGVSVNVSNASPKQNKSGSNQRNLPSRNYEEGHPHSASNNNSWSSRNMDM
VNMQALGLSGQHGQTAVAGGGGQGQGGSMPLGMGGLPVNQALVAAALNQAAGWGLINNIP
SGGSDQGAFAGPASSAPPAPPNFLSWMQQGNSGQGPSSQWGQRHQSQGHSV
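
Consistency check (illustrative sketch):

The genomic position, CDS length, and protein length in this record can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the original record. It assumes that the coordinates are 1-based and inclusive, that the position string always has the form shown above, and that the CDS includes its stop codon. The function names are hypothetical.

```python
import re


def parse_position(position):
    """Split a 'scaffold:strand start-end' string into its parts.

    The format is assumed from this record's Genomic Position field.
    """
    match = re.fullmatch(r"(\S+):([+-])\s*(\d+)-(\d+)", position.strip())
    if match is None:
        raise ValueError(f"unrecognized position string: {position!r}")
    scaffold, strand, start, end = match.groups()
    return scaffold, strand, int(start), int(end)


def span_length(start, end):
    """Length of a genomic span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length):
    """Residue count for a CDS, assuming the CDS ends with a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


scaffold, strand, start, end = parse_position("scaffold89:+ 10826-12061")
print(span_length(start, end))        # 1236, equal to the listed CDS Length
print(expected_protein_length(1236))  # 411, the residue count of the protein above
```

Under these assumptions the genomic span equals the CDS length, which is what one would expect if the model has no introns, although the record itself does not state this.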