DPGLEAN08019 in OGS1.0

New model in OGS2.0  DPOGS212023
Genomic Position  scaffold221:- 329426-335616
CDS Length  1227
Paired RNAseq reads  2215
Single RNAseq reads  7440
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA010178 (4e-78)
Best Drosophila hit  B52, isoform C (3e-70)
Best Human hit  serine/arginine-rich splicing factor 4 (6e-58)
Best NR hit (blastp)  splicing factor arginine/serine-rich 6 [Bombyx mori] (2e-92)
Best NR hit (blastx)  splicing factor arginine/serine-rich 6 [Bombyx mori] (6e-78)
GeneOntology terms
GO:0005634 nucleus
GO:0000398 nuclear mRNA splicing, via spliceosome
GO:0003729 mRNA binding
GO:0006376 mRNA splice site selection
GO:0035062 omega speckle
GO:0016607 nuclear speck
GO:0005515 protein binding
GO:0003676 nucleic acid binding
GO:0000166 nucleotide binding
GO:0000381 regulation of alternative nuclear mRNA splicing, via spliceosome
GO:0048024 regulation of nuclear mRNA splicing, via spliceosome
GO:0071011 precatalytic spliceosome
GO:0071013 catalytic step 2 spliceosome
InterPro families
IPR000504 RNA recognition motif domain
IPR012677 Nucleotide-binding, alpha-beta plait
Orthology group  MCL11360

Nucleotide sequence:

ATGGTTGGATCCCGGGTGTACGTTGGCGGGCTGCCTTTTGGTGTTAGAGAAAGAGACTTA
GAAAAGTTTTTTAAAGGATTTGGAAGAATAAGAGACATCCTTATTAAGAATGGATATGGT
TTTGTGGAATTTGAAGACTACAGAGATGCTGATGATGCAGTCTATGAATTAAATGGAAAA
GAATTGCTTGGTGAAAGGGTGGTGGTGGAGCCGGCGCGGGGCATCGACCGCAGCGCGGAC
CGCTACCGTCGCGACCGCTACTACGAGCGCGACCGCGGCCGATCGAGATACGATGACTAC
AATTATAGATACGGGCCGCCGACGCGTACGGAGTATCGACTTATAGTTGAAAACCTATCC
AGCCGCATTAGCTGGCAGGATTTGAAGGATTACATGCGTCAGGCTGGCGAAGTTACTTAC
GCGGATGCTCACAAGCAACATAGAAACGAAGGGGTTGTGGAGTTCGCAACTCATTCAGAC
ATGCGAGCTGCTATCGAGAAATTGGACGGTACTGAGCTGAACGGCCGCCGCGTCCGCCTG
GTGGAGGACCGACGTTCGTCCAGACGACGCAGCCGCTCCTCTTCCTCAAGGAGCCGCTCA
CGGTCACGAGACAGGCGCCGCTCACGATCCAGGTCTCGTTCTCGTGGCTCCCGCAGCCGC
TCCAAATCCAAATCTCGTCCAAAGAGCAAGAGCCCAGCTGCCAAATCTCATTCGAGATCT
CGCTCCAAAGACCGCAGCCGTTCCAGATCCGCTTCCCGCAAGTCTGAGCGCGGGTCGGCA
TCACGTCCGTCCCGTGAGCGTTCCGCGGGACGGAAGTCCGCAGAGCGGAACGGAAGGTCC
GCATCGCGCTCCAAGTCTCGCTCACCTATGGATGATAATAATATTATTCATATGACTATA
TTAAACTCACATTTATTAGGGAGCGATCTCGCTCACGCAGCAAGGAGGCGGGATCACCCA
AACGAGAGGAAGAGCGTCGTGAGAGCAAGTCTCGTTCAAGGTCTCGCTCCCGGTCTGGAT
CGCGCGAGCGGTCAGTCTCACGCGAGCGGTCGCGCTCCGCCTCGCCCAGGCAAAATGGAG
ACGAACGGGCCGCCGACGAGCGCTCCCCGCGCAGCGGAGACTGAGGGGGCACTAGGGAGT
GATGCGGGGGCGGGAGGGGGGAAGCTCACAGTAGACTGGACGGACGGTTACATGGCAAAG
AAAAATCGTACATCTGTTACTATCTGA

Protein sequence:

MVGSRVYVGGLPFGVRERDLEKFFKGFGRIRDILIKNGYGFVEFEDYRDADDAVYELNGK
ELLGERVVVEPARGIDRSADRYRRDRYYERDRGRSRYDDYNYRYGPPTRTEYRLIVENLS
SRISWQDLKDYMRQAGEVTYADAHKQHRNEGVVEFATHSDMRAAIEKLDGTELNGRRVRL
VEDRRSSRRRSRSSSSRSRSRSRDRRRSRSRSRSRGSRSRSKSKSRPKSKSPAAKSHSRS
RSKDRSRSRSASRKSERGSASRPSRERSAGRKSAERNGRSASRSKSRSPMDDNNIIHMTI
LNSHLLGSDLAHAARRRDHPNERKSVVRASLVQGLAPGLDRASGQSHASGRAPPRPGKME
TNGPPTSAPRAAETEGALGSDAGAGGGKLTVDWTDGYMAKKNRTSVTI