New model in OGS2.0 | DPOGS214304  |
---|---|
Genomic Position | scaffold979:- 79303-80859 |
See gene structure | |
CDS Length | 1557 |
Paired RNAseq reads   | 2137 |
Single RNAseq reads   | 5134 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA004109 (3e-171) |
Best Drosophila hit   | spenito, isoform A (8e-88) |
Best Human hit | putative RNA-binding protein 15B (3e-55) |
Best NR hit (blastp)   | RNA recognition motif protein split ends [Aedes aegypti] (0.0) |
Best NR hit (blastx)   | RNA recognition motif protein split ends [Culex quinquefasciatus] (0.0) |
GeneOntology terms    | GO:0003729 mRNA binding GO:0045449 regulation of transcription GO:0000166 nucleotide binding GO:0003676 nucleic acid binding GO:0071011 precatalytic spliceosome GO:0000398 nuclear mRNA splicing, via spliceosome |
InterPro families    | IPR000504 RNA recognition motif domain IPR012677 Nucleotide-binding, alpha-beta plait IPR016194 Spen Paralogue and Orthologue SPOC, C-terminal-like IPR010912 Spen paralogue/orthologue C-terminal, metazoa IPR012921 Spen paralogue and orthologue SPOC, C-terminal |
Orthology group | MCL11927 |
Nucleotide sequence:
ATGCATGAGTATCCCATGGCAGGTCCTCACGGGCCTCCAATGCACCATCGCCCACCCATG
CATCATCCACCTCATCCTCATTACATGCCACGCCCTTACATGCCGCGTCCTCATCACCCA
CCATTTGAAAAAATGGAAAACAAAAAAGACAAGTTCCCTAATTACTTACATCATGTTCAA
CCAGAAGATGATCCTCTTGCAACAAGAACTTTGTTTGCTGGGAACTTGGAAATAAATATA
TCAGATGAGGAATTAAGAAGAATCTTTGGTCGTTATGGGATTGTTGAAGATATTGACATT
AAAAGGCCTCCTCCAGGCACTGGAAATGCATTTGCATTTGTTCGCTATCAAACATTAGAC
ATGGCTCACCGAGCCAAAGTAGAGCTATCTGGCCAGTATATTGGTAAATTTCAATGTAAA
ATTGGATATGGCAAAGCTACACCGACTACTCGTGTTTGGGTTGGTGGGCTAGGTCCATGG
ACATCAGTAGCCCAATTAGAAAGAGAATTTGACAGATTTGGGGCCATAAAGAAAATTGAA
TATGCTAAAGGTGAACCTCATGCATATATACTGTATGATTCGATAGATGCAGCTCAAGCT
GCTGTAAAAGAAATGAGAGGCTTTCCATTAGGTGGACCAGACAGGCGCCTTAGGATTGAT
TTTGCAGATGTCGGCACTGGGGGACCATACAGACCGAAACCATATGCAGCACCCGTTGAA
GAAGGTCGTTCTGAAGGTTATGAAGGATATGAAGGTTCTTGGGAGGATGGTTATAGTTAT
GGTTCTGGTTATAGAGGTAGGGGCGGCCACCGTGGGCGAGGTCGTGGTATGTATCGTGGA
GTGTATCACGGCAGCGCTGATTATAGGGATGAGGAATGGAGGAGAGCACCAGATGCTGAA
TATGACAGTAGAGCTCGTCGTTCTGGTTCCCGAGAACCTGGCGTTGACAGATCACGTTCC
CGTTCTCCACGTCGTCGTTCTCCCGACAGTGATTCTGATGGATCTCCCCGACGTAGCAGT
GGCATGCTTGCCTCAGCTAGAACACTCCCTGAGGTTGTTCGTAAAGCTACAACAATCTGG
AATGGTGCCCTCATACTCAAGAATTCCTTGTTTCCAACTAAATTCCACCTTACAGATGGA
GATTCAGACATAATTGACAGTTTAATGAAAGATGAGGAAGGTAAAAATCAATTGAGGATT
ACACAAAGGCTTCGTCTGGATCAGCCAAAGTTAGATGATGTACAAAAACGTATTGCTACT
TCTAGTTCACACGCTATCTTCCTTGGTGTGGCAGGATCAACGGCTTCCATTACAAATGAA
GATGCAAGCATACAGACAAGGCCTATGAGGAATTTAGTTTCCTATTTGAAACAAAAAGAG
GCTGCTGGAGTTATATCATTGTTGAATAAAGAAACTGAAGCCACTGGGGTTTTGTACTCT
TTCCCTCCCTGTGACTTCTCCACGGAACTGCTCAAGAGAACTTGTCACAACCTGACTGAG
GAGAGTTTGAAGGAGGATCATTTAGTTATAGTGGTAGTAAGGGGCGGTTCTGCATAG
Protein sequence:
MHEYPMAGPHGPPMHHRPPMHHPPHPHYMPRPYMPRPHHPPFEKMENKKDKFPNYLHHVQ
PEDDPLATRTLFAGNLEINISDEELRRIFGRYGIVEDIDIKRPPPGTGNAFAFVRYQTLD
MAHRAKVELSGQYIGKFQCKIGYGKATPTTRVWVGGLGPWTSVAQLEREFDRFGAIKKIE
YAKGEPHAYILYDSIDAAQAAVKEMRGFPLGGPDRRLRIDFADVGTGGPYRPKPYAAPVE
EGRSEGYEGYEGSWEDGYSYGSGYRGRGGHRGRGRGMYRGVYHGSADYRDEEWRRAPDAE
YDSRARRSGSREPGVDRSRSRSPRRRSPDSDSDGSPRRSSGMLASARTLPEVVRKATTIW
NGALILKNSLFPTKFHLTDGDSDIIDSLMKDEEGKNQLRITQRLRLDQPKLDDVQKRIAT
SSSHAIFLGVAGSTASITNEDASIQTRPMRNLVSYLKQKEAAGVISLLNKETEATGVLYS
FPPCDFSTELLKRTCHNLTEESLKEDHLVIVVVRGGSA