DPGLEAN04650 in OGS1.0

New model in OGS2.0  DPOGS203821
Genomic Position  scaffold120:+ 341091-343766
CDS Length  1806
Paired RNAseq reads  39100
Single RNAseq reads  110454
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA003469 (0.0)
Best Drosophila hit  polyA-binding protein, isoform G (0.0)
Best Human hit  polyadenylate-binding protein 4 isoform 3 (0.0)
Best NR hit (blastp)  poly A binding protein [Bombyx mori] (0.0)
Best NR hit (blastx)  poly A binding protein [Bombyx mori] (0.0)
Gene Ontology terms
GO:0000166 nucleotide binding
GO:0003723 RNA binding
InterPro families
IPR000504 RNA recognition motif domain
IPR002004 Polyadenylate-binding protein/Hyperplastic disc protein
IPR006515 Polyadenylate binding protein, human types 1, 2, 3, 4
IPR012677 Nucleotide-binding, alpha-beta plait
IPR003954 RNA recognition motif domain, eukaryote
Orthology group  MCL10381

Nucleotide sequence:

ATGGCATCGCTATATGTCGGGGACTTGCACTCCGACATCACCGAGGCCATGTTGTTTGAA
AAATTTTCTCCTGCTGGTCCTGTTCTCTCTATTCGCGTATGCAGAGATATGATAACTCGT
AGGTCTCTTGGCTACGCTTACGTAAATTTTCAGCAGCCTTCTGATGCCGAAAGAGCACTA
GACACTATGAATTTTGATATGATCAAAGGTAGGCCAATTAGAATTATGTGGTCCCAAAGA
GATCCATCCCTCCGCAAATCTGGAGTTGGAAATGTTTTTATTAAAAATCTTGACAAAGCC
ATAGATAACAAGGCCATGTATGATACATTCTCTGCTTTTGGCAATATATTGAGTTGTAAG
GTAGCTCAAGATGAAAATGGGGCATCTAAGGGATATGGTTTTGTTCACTTTGAAACAGAA
GAAGCTGCTAATAAATCCATTGAAAAAGTAAATGGAATGTTGCTAAATGGAAAAAAGGTT
TACGTAGGCAGATTTATCCCTCGTAAGGAACGCGAAAAGGAACTGGGAGAGAAAGCTAAA
TTGTTCACTAATGTTTACGTCAAGAACTTCGGCGAAGATTTCTCGGATGAAATGCTAAGA
GATATGTTTGAAAAATATGGCAGAATAACTAGCCATAAAGTAATGTATAAAGAGGATGGC
TCATCCAGAGGTTTTGGTTTTGTAGCCTTTGAAGATCCAGATGCTGCCGAAAGGGCATGT
CTTGAGCTTAATGGCAAAGAACTTGTTGAAGGAAAACCTCTATATGTAGGACGTGCTCAG
AAGAAAGCTGAACGCCAAAAAGAACTAAAGCGTAAATTTGAGCAGTTAAAATCTGAACGT
TTGACTCGTTATCAAGGAGTTAATCTGTATGTGAAGAATTTGGATGACACAATTGATGAT
GAGAGACTCCGTAAGGAATTTGCACCATTTGGTACTATTACTTCAGCCAAGGTTATGTTG
GAAGATGGTCGTAGCAAAGGGTTTGGGTTTGTATGTTTCTCATCTCCTGAAGAAGCTACT
AAAGCTGTTACTGAAATGAATGGAAGAATTGTAGGTACTAAACCTCTTTATGTAGCTCTT
GCTCAAAGGAAAGAAGACCGCAAAGCTCATTTGACTTCACAATACATGCAACGCATGGCA
AGTATGAGAATGCAACAAATGGGTCAAATATTCCAACCAGGCAGTGCTGGAGGTTACTTC
GTCCCAACTATTCCCCCAGCCCAAAGATTCTATGGCCCTGCTCAAATTACTCAGATGAGA
CCTTCACAGAGATGGACTGCACAGCCTCCTGTAAGACCCAGCACTCAAACTGCTGCCTCA
GCATATCCAAACATGCAAGCACCATTTAGACCCACTACACGTGGACCAACTCAAACAGCT
TTGCGCACTTCTCTTGGAGCTAGACCTATAACAGGTCAACAGGGTGTAGCTGCAGCACCA
TCTATTCGTGCACCTCTTGTGCCAAGCGGTCGTACTGCTGGCTACAAATATACATCAACT
GTGCGCAACCCACCAGCTCCACAGCCAGCTGTTCACATCCAGGGTCAAGAGCCATTGACA
GCTTCCATGTTAGCTGCTGCACCACTTCAAGAACAAAAACAAATGCTTGGAGAACGTCTC
TTCCCTCTCATTCAGAGAATGCACCCTGATCTTGCTGGCAAAATTACTGGAATGCTTCTA
GAAATAGATAATTCTGAACTTTTACATATGTTAGAGCATGGAGAGTCTCTTAAAGCGAAG
GTTGATGAGGCTGTTGCTGTTTTGCAAGCTCACCAAGCTAAACAGCAAGCCACTAAGAAA
GATTAA

Protein sequence:

MASLYVGDLHSDITEAMLFEKFSPAGPVLSIRVCRDMITRRSLGYAYVNFQQPSDAERAL
DTMNFDMIKGRPIRIMWSQRDPSLRKSGVGNVFIKNLDKAIDNKAMYDTFSAFGNILSCK
VAQDENGASKGYGFVHFETEEAANKSIEKVNGMLLNGKKVYVGRFIPRKEREKELGEKAK
LFTNVYVKNFGEDFSDEMLRDMFEKYGRITSHKVMYKEDGSSRGFGFVAFEDPDAAERAC
LELNGKELVEGKPLYVGRAQKKAERQKELKRKFEQLKSERLTRYQGVNLYVKNLDDTIDD
ERLRKEFAPFGTITSAKVMLEDGRSKGFGFVCFSSPEEATKAVTEMNGRIVGTKPLYVAL
AQRKEDRKAHLTSQYMQRMASMRMQQMGQIFQPGSAGGYFVPTIPPAQRFYGPAQITQMR
PSQRWTAQPPVRPSTQTAASAYPNMQAPFRPTTRGPTQTALRTSLGARPITGQQGVAAAP
SIRAPLVPSGRTAGYKYTSTVRNPPAPQPAVHIQGQEPLTASMLAAAPLQEQKQMLGERL
FPLIQRMHPDLAGKITGMLLEIDNSELLHMLEHGESLKAKVDEAVAVLQAHQAKQQATKK
D
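
The figures in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original annotation. The function names are hypothetical, and it assumes the genomic coordinates are 1-based and inclusive and that the CDS length counts the terminal stop codon (the nucleotide sequence above ends in TAA).

```python
def expected_protein_length(cds_length: int, includes_stop: bool = True) -> int:
    """Return the number of residues encoded by a CDS of the given length.

    Assumption: the CDS is a whole number of codons; if includes_stop is True,
    the last codon is a stop codon and encodes no residue.
    """
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length // 3
    return codons - 1 if includes_stop else codons


def genomic_span(start: int, end: int) -> int:
    """Return the length of a 1-based, inclusive genomic interval (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Values taken from the record above.
cds_length = 1806
start, end = 341091, 343766

print(expected_protein_length(cds_length))  # 601
print(genomic_span(start, end))             # 2676
```

Under these assumptions, the 1806 nt CDS corresponds to 602 codons, that is 601 residues plus a stop, which matches the protein sequence listed above. The genomic span of 2676 bp is longer than the CDS, which would be consistent with the gene model containing introns.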