DPGLEAN16440 in OGS1.0

New model in OGS2.0DPOGS214304 
Genomic Positionscaffold979:- 79303-80859
See gene structure
CDS Length1557
Paired RNAseq reads  2137
Single RNAseq reads  5134
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004109 (3e-171)
Best Drosophila hit  spenito, isoform A (8e-88)
Best Human hitputative RNA-binding protein 15B (3e-55)
Best NR hit (blastp)  RNA recognition motif protein split ends [Aedes aegypti] (0.0)
Best NR hit (blastx)  RNA recognition motif protein split ends [Culex quinquefasciatus] (0.0)
GeneOntology terms




  
GO:0003729 mRNA binding
GO:0045449 regulation of transcription
GO:0000166 nucleotide binding
GO:0003676 nucleic acid binding
GO:0071011 precatalytic spliceosome
GO:0000398 nuclear mRNA splicing, via spliceosome
InterPro families



  
IPR000504 RNA recognition motif domain
IPR012677 Nucleotide-binding, alpha-beta plait
IPR016194 Spen Paralogue and Orthologue SPOC, C-terminal-like
IPR010912 Spen paralogue/orthologue C-terminal, metazoa
IPR012921 Spen paralogue and orthologue SPOC, C-terminal
Orthology groupMCL11927

Nucleotide sequence:

ATGCATGAGTATCCCATGGCAGGTCCTCACGGGCCTCCAATGCACCATCGCCCACCCATG
CATCATCCACCTCATCCTCATTACATGCCACGCCCTTACATGCCGCGTCCTCATCACCCA
CCATTTGAAAAAATGGAAAACAAAAAAGACAAGTTCCCTAATTACTTACATCATGTTCAA
CCAGAAGATGATCCTCTTGCAACAAGAACTTTGTTTGCTGGGAACTTGGAAATAAATATA
TCAGATGAGGAATTAAGAAGAATCTTTGGTCGTTATGGGATTGTTGAAGATATTGACATT
AAAAGGCCTCCTCCAGGCACTGGAAATGCATTTGCATTTGTTCGCTATCAAACATTAGAC
ATGGCTCACCGAGCCAAAGTAGAGCTATCTGGCCAGTATATTGGTAAATTTCAATGTAAA
ATTGGATATGGCAAAGCTACACCGACTACTCGTGTTTGGGTTGGTGGGCTAGGTCCATGG
ACATCAGTAGCCCAATTAGAAAGAGAATTTGACAGATTTGGGGCCATAAAGAAAATTGAA
TATGCTAAAGGTGAACCTCATGCATATATACTGTATGATTCGATAGATGCAGCTCAAGCT
GCTGTAAAAGAAATGAGAGGCTTTCCATTAGGTGGACCAGACAGGCGCCTTAGGATTGAT
TTTGCAGATGTCGGCACTGGGGGACCATACAGACCGAAACCATATGCAGCACCCGTTGAA
GAAGGTCGTTCTGAAGGTTATGAAGGATATGAAGGTTCTTGGGAGGATGGTTATAGTTAT
GGTTCTGGTTATAGAGGTAGGGGCGGCCACCGTGGGCGAGGTCGTGGTATGTATCGTGGA
GTGTATCACGGCAGCGCTGATTATAGGGATGAGGAATGGAGGAGAGCACCAGATGCTGAA
TATGACAGTAGAGCTCGTCGTTCTGGTTCCCGAGAACCTGGCGTTGACAGATCACGTTCC
CGTTCTCCACGTCGTCGTTCTCCCGACAGTGATTCTGATGGATCTCCCCGACGTAGCAGT
GGCATGCTTGCCTCAGCTAGAACACTCCCTGAGGTTGTTCGTAAAGCTACAACAATCTGG
AATGGTGCCCTCATACTCAAGAATTCCTTGTTTCCAACTAAATTCCACCTTACAGATGGA
GATTCAGACATAATTGACAGTTTAATGAAAGATGAGGAAGGTAAAAATCAATTGAGGATT
ACACAAAGGCTTCGTCTGGATCAGCCAAAGTTAGATGATGTACAAAAACGTATTGCTACT
TCTAGTTCACACGCTATCTTCCTTGGTGTGGCAGGATCAACGGCTTCCATTACAAATGAA
GATGCAAGCATACAGACAAGGCCTATGAGGAATTTAGTTTCCTATTTGAAACAAAAAGAG
GCTGCTGGAGTTATATCATTGTTGAATAAAGAAACTGAAGCCACTGGGGTTTTGTACTCT
TTCCCTCCCTGTGACTTCTCCACGGAACTGCTCAAGAGAACTTGTCACAACCTGACTGAG
GAGAGTTTGAAGGAGGATCATTTAGTTATAGTGGTAGTAAGGGGCGGTTCTGCATAG

Protein sequence:

MHEYPMAGPHGPPMHHRPPMHHPPHPHYMPRPYMPRPHHPPFEKMENKKDKFPNYLHHVQ
PEDDPLATRTLFAGNLEINISDEELRRIFGRYGIVEDIDIKRPPPGTGNAFAFVRYQTLD
MAHRAKVELSGQYIGKFQCKIGYGKATPTTRVWVGGLGPWTSVAQLEREFDRFGAIKKIE
YAKGEPHAYILYDSIDAAQAAVKEMRGFPLGGPDRRLRIDFADVGTGGPYRPKPYAAPVE
EGRSEGYEGYEGSWEDGYSYGSGYRGRGGHRGRGRGMYRGVYHGSADYRDEEWRRAPDAE
YDSRARRSGSREPGVDRSRSRSPRRRSPDSDSDGSPRRSSGMLASARTLPEVVRKATTIW
NGALILKNSLFPTKFHLTDGDSDIIDSLMKDEEGKNQLRITQRLRLDQPKLDDVQKRIAT
SSSHAIFLGVAGSTASITNEDASIQTRPMRNLVSYLKQKEAAGVISLLNKETEATGVLYS
FPPCDFSTELLKRTCHNLTEESLKEDHLVIVVVRGGSA