New model in OGS2.0 | DPOGS203043  |
---|---|
Genomic Position | scaffold558:- 37540-42283 |
See gene structure | |
CDS Length | 1401 |
Paired RNAseq reads   | 10087 |
Single RNAseq reads   | 26829 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA006796 (9e-51) |
Best Drosophila hit   | cabeza (4e-36) |
Best Human hit | RNA-binding protein EWS isoform 3 (5e-20) |
Best NR hit (blastp)   | GF19034 [Drosophila ananassae] (9e-39) |
Best NR hit (blastx)   | conserved hypothetical protein [Culex quinquefasciatus] (1e-36) |
GeneOntology terms    | GO:0016251 general RNA polymerase II transcription factor activity GO:0006367 transcription initiation from RNA polymerase II promoter GO:0005669 transcription factor TFIID complex GO:0003729 mRNA binding GO:0005634 nucleus GO:0003676 nucleic acid binding GO:0008270 zinc ion binding GO:0000398 nuclear mRNA splicing, via spliceosome GO:0071013 catalytic step 2 spliceosome |
InterPro families    | IPR012677 Nucleotide-binding, alpha-beta plait IPR000504 RNA recognition motif domain |
Orthology group | ND |
Nucleotide sequence:
ATGGGTGATCCTTATGCTTCTGGTGACTATAGCGGCGGAGGGTATCCTGCTCAATATTCA
ATGCCGCCTCCAGCAGTTAGTTCGGGAGATAATAGTTTTAATTCCGGTCAACAACCAGGA
GGCTACAATCAAAATAGTTATAGTCAAAATTCAGGTGCGGCTTGGAATCCACCAAGCTCC
GGAAGTGGAAGTGGTGGAAATTATGGTGGTAATCAAAACGAAAGCAATTTTAACTATGGA
CCCTCTTATGGGGGTGGTGGCAGCGGCGGGGGTGGTGGTGGCTATGACCGAAACAATGGA
AATAACTACAATGTGAGTGGGGGAGGTGATCGAGGAAGCAGTAATTACGGTGGAGGAGAC
CGTGGAAGTGGTTATGGAAACAGTGACCGCAGTGGTGGAAACTTTGGTGGCAATGACAGA
GGTGGCTCCAACTATGGTGGAGACAGAGGCGGCGGAGGTTACTCTGGAGACCGAGGCTAT
GGCGGAGGCGGAGGTGATAGATCAAGCTACAATAGAGATGGGGGGAATAGAGAAGGAGGC
TATGGTGGCGGCGGCAGGGGGGGTGGTTATAATAAAGGTGGCGGTGGAGGCTACGGCGGA
GATCGAGGTGGGGGTGATATGATAACACAGGAAGACACAATCTTCATCCAAGGCATGAAC
CCATCGACAACTGAGGACGAGTTATGTCAACATTTTGGTGCTATTGGCATAATTAAGACC
GATAAAAAAACACAAAGGCCGAAAGTTTGGATGTATAAAGACAAAGCTACTGGTCAGCCC
AAAGGAGAAGCAACAGTCACTTATGAAGACTCGAACGCTGCCTCATCTGCTATTCAGTGG
TTCGACGGGAAGGACTTCAACGGTGCCACCGTGAAAGTATCTCTTGCTCAGAGGCAGAAT
ACCTGGGGTGGCAATAAGGGTGGTGGTGGAGGAGGAGGCGGATACCGCGGTGGTAGAGGA
GGTGGCGGCGGCGGCGGTGGAGGAGGAGGAGGAGGAGGCCGTGGTGGCGGTGGTGGCGGT
GGTGGTGGGGAGGTGGAGGCGGAGGCGGACCACCTTCCGGCAACCGAGCCGGTGACTGGA
GATGTCCGAATCCCAGCTGCGGGAACACTAACTTTTCATGGCGGAAAGCTTGCAATAGAT
GCAACGAAGAGAAGCCCGGCGGAGGTGGTAACGGTGGTGGTGGAGGGGGAGGTGGCGGTG
GTCCACCTCCTAGTCGCGGAGGCGGTGGCGGAAACGGACCTCCAGGACGTGGGGGACGTG
GAGGAGGTCGCGGCGGTGGCGGAAGAGGAGGAGGAGGAGGTGGTGGCGGCGGTTGGGGTG
GAGGACGACCAGATCGTAGCGGAGACCGTGGAGGAGACCGACGAGGCTCCGGCGGTGGAG
ACCGTGGCGGCGGTGCCATGA
Protein sequence:
MGDPYASGDYSGGGYPAQYSMPPPAVSSGDNSFNSGQQPGGYNQNSYSQNSGAAWNPPSS
GSGSGGNYGGNQNESNFNYGPSYGGGGSGGGGGGYDRNNGNNYNVSGGGDRGSSNYGGGD
RGSGYGNSDRSGGNFGGNDRGGSNYGGDRGGGGYSGDRGYGGGGGDRSSYNRDGGNREGG
YGGGGRGGGYNKGGGGGYGGDRGGGDMITQEDTIFIQGMNPSTTEDELCQHFGAIGIIKT
DKKTQRPKVWMYKDKATGQPKGEATVTYEDSNAASSAIQWFDGKDFNGATVKVSLAQRQN
TWGGNKGGGGGGGGYRGGRGGGGGGGGGGGGGGRGGGGGGGGGEVEAEADHLPATEPVTG
DVRIPAAGTLTFHGGKLAIDATKRSPAEVVTVVVEGEVAVVHLLVAEAVAETDLQDVGDV
EEVAAVAEEEEEEVVAAVGVEDDQIVAETVEETDEAPAVETVAAVP