DPGLEAN22524 in OGS1.0

New model in OGS2.0DPOGS203043 
Genomic Positionscaffold558:- 37540-42283
See gene structure
CDS Length1401
Paired RNAseq reads  10087
Single RNAseq reads  26829
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006796 (9e-51)
Best Drosophila hit  cabeza (4e-36)
Best Human hitRNA-binding protein EWS isoform 3 (5e-20)
Best NR hit (blastp)  GF19034 [Drosophila ananassae] (9e-39)
Best NR hit (blastx)  conserved hypothetical protein [Culex quinquefasciatus] (1e-36)
GeneOntology terms







  
GO:0016251 general RNA polymerase II transcription factor activity
GO:0006367 transcription initiation from RNA polymerase II promoter
GO:0005669 transcription factor TFIID complex
GO:0003729 mRNA binding
GO:0005634 nucleus
GO:0003676 nucleic acid binding
GO:0008270 zinc ion binding
GO:0000398 nuclear mRNA splicing, via spliceosome
GO:0071013 catalytic step 2 spliceosome
InterPro families
  
IPR012677 Nucleotide-binding, alpha-beta plait
IPR000504 RNA recognition motif domain
Orthology groupND

Nucleotide sequence:

ATGGGTGATCCTTATGCTTCTGGTGACTATAGCGGCGGAGGGTATCCTGCTCAATATTCA
ATGCCGCCTCCAGCAGTTAGTTCGGGAGATAATAGTTTTAATTCCGGTCAACAACCAGGA
GGCTACAATCAAAATAGTTATAGTCAAAATTCAGGTGCGGCTTGGAATCCACCAAGCTCC
GGAAGTGGAAGTGGTGGAAATTATGGTGGTAATCAAAACGAAAGCAATTTTAACTATGGA
CCCTCTTATGGGGGTGGTGGCAGCGGCGGGGGTGGTGGTGGCTATGACCGAAACAATGGA
AATAACTACAATGTGAGTGGGGGAGGTGATCGAGGAAGCAGTAATTACGGTGGAGGAGAC
CGTGGAAGTGGTTATGGAAACAGTGACCGCAGTGGTGGAAACTTTGGTGGCAATGACAGA
GGTGGCTCCAACTATGGTGGAGACAGAGGCGGCGGAGGTTACTCTGGAGACCGAGGCTAT
GGCGGAGGCGGAGGTGATAGATCAAGCTACAATAGAGATGGGGGGAATAGAGAAGGAGGC
TATGGTGGCGGCGGCAGGGGGGGTGGTTATAATAAAGGTGGCGGTGGAGGCTACGGCGGA
GATCGAGGTGGGGGTGATATGATAACACAGGAAGACACAATCTTCATCCAAGGCATGAAC
CCATCGACAACTGAGGACGAGTTATGTCAACATTTTGGTGCTATTGGCATAATTAAGACC
GATAAAAAAACACAAAGGCCGAAAGTTTGGATGTATAAAGACAAAGCTACTGGTCAGCCC
AAAGGAGAAGCAACAGTCACTTATGAAGACTCGAACGCTGCCTCATCTGCTATTCAGTGG
TTCGACGGGAAGGACTTCAACGGTGCCACCGTGAAAGTATCTCTTGCTCAGAGGCAGAAT
ACCTGGGGTGGCAATAAGGGTGGTGGTGGAGGAGGAGGCGGATACCGCGGTGGTAGAGGA
GGTGGCGGCGGCGGCGGTGGAGGAGGAGGAGGAGGAGGCCGTGGTGGCGGTGGTGGCGGT
GGTGGTGGGGAGGTGGAGGCGGAGGCGGACCACCTTCCGGCAACCGAGCCGGTGACTGGA
GATGTCCGAATCCCAGCTGCGGGAACACTAACTTTTCATGGCGGAAAGCTTGCAATAGAT
GCAACGAAGAGAAGCCCGGCGGAGGTGGTAACGGTGGTGGTGGAGGGGGAGGTGGCGGTG
GTCCACCTCCTAGTCGCGGAGGCGGTGGCGGAAACGGACCTCCAGGACGTGGGGGACGTG
GAGGAGGTCGCGGCGGTGGCGGAAGAGGAGGAGGAGGAGGTGGTGGCGGCGGTTGGGGTG
GAGGACGACCAGATCGTAGCGGAGACCGTGGAGGAGACCGACGAGGCTCCGGCGGTGGAG
ACCGTGGCGGCGGTGCCATGA

Protein sequence:

MGDPYASGDYSGGGYPAQYSMPPPAVSSGDNSFNSGQQPGGYNQNSYSQNSGAAWNPPSS
GSGSGGNYGGNQNESNFNYGPSYGGGGSGGGGGGYDRNNGNNYNVSGGGDRGSSNYGGGD
RGSGYGNSDRSGGNFGGNDRGGSNYGGDRGGGGYSGDRGYGGGGGDRSSYNRDGGNREGG
YGGGGRGGGYNKGGGGGYGGDRGGGDMITQEDTIFIQGMNPSTTEDELCQHFGAIGIIKT
DKKTQRPKVWMYKDKATGQPKGEATVTYEDSNAASSAIQWFDGKDFNGATVKVSLAQRQN
TWGGNKGGGGGGGGYRGGRGGGGGGGGGGGGGGRGGGGGGGGGEVEAEADHLPATEPVTG
DVRIPAAGTLTFHGGKLAIDATKRSPAEVVTVVVEGEVAVVHLLVAEAVAETDLQDVGDV
EEVAAVAEEEEEEVVAAVGVEDDQIVAETVEETDEAPAVETVAAVP