DPGLEAN03830 in OGS1.0

New model in OGS2.0DPOGS215939 
Genomic Positionscaffold720:- 13043-17763
See gene structure
CDS Length1767
Paired RNAseq reads  1346
Single RNAseq reads  3795
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001870 (6e-127)
Best Drosophila hit  Srp54 (3e-78)
Best Human hitsplicing factor, arginine/serine-rich 12 isoform a (7e-50)
Best NR hit (blastp)  PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis] (1e-109)
Best NR hit (blastx)  PREDICTED: similar to Srp54 CG4602-PA [Apis mellifera] (9e-93)
GeneOntology terms








  
GO:0008187 poly-pyrimidine tract binding
GO:0003729 mRNA binding
GO:0000398 nuclear mRNA splicing, via spliceosome
GO:0005681 spliceosomal complex
GO:0006376 mRNA splice site selection
GO:0000166 nucleotide binding
GO:0005634 nucleus
GO:0000381 regulation of alternative nuclear mRNA splicing, via spliceosome
GO:0071013 catalytic step 2 spliceosome
GO:0071011 precatalytic spliceosome
InterPro families
  
IPR000504 RNA recognition motif domain
IPR012677 Nucleotide-binding, alpha-beta plait
Orthology groupMCL13431

Nucleotide sequence:

ATGGTTTCAAGCAGTACGAGGGTGATTCAAGTCACCAACATCGCCCCTCAAGCTACAAAA
GACCAAATGCAAACCTTATTTGGTTATTTAGGAAAAATTGATGATATAAGATTGTACCCA
ACAATAAGAGACGTATCATGTCCGGTACAGTCTCGTATATGTTACGTGAAATATTATGAT
TCGGCGACAGTCAATGTCGCCCAGCATATGACAAATACGGTGTTTATAGATCGTGCTTTA
ATCGTGATTCCCATGCAGTCAGGAGAGATTCCTGACGAGCACCGAGCTCTAGAGATGTCA
AGCAACGGAACTTTAGTGCCGGGTCTTAGTACAGTTGAACCACGATTACCAGCTCACGTG
ATCAACACTTTGGAAGGCGCACCACCCAACCAGGTCATTCAAACATACGATCCTAACATA
GCAGCAGCGGGATTACCACCGTACCCGCCGCTTCCAGCCATTTATGACTCGAGGAAAATA
GAAGAAATAAGAAGAACACTTTTACTTATAGATGTGGGCGAACTAACATCCCAACAACTC
ATTGATCATTTTTGTCAAGCTGGCGAAGTCAGCTACGTGCGATTTTGTGAACGGGAAGTT
GACAACTTAAAGTATGCGCTGATAGAAATGACAGAACAAGAAAGCATATCAAAGGCTCTT
CAGCTTAATGGAGTCGCATTAAATGGCCAAGTCATTAAGGTCCATCATTCTACGGTGGCT
ATATCAAAGCCTCAGGCTAAGAGCAATGAGGCAGCTCAAAGGGAGATCGAAGAGGCCATG
TGCAGAGTTAAGGAAGCCCAGAACTTGATATCGGCTGCCATCGACCCCGTTATTGGATTG
TTGTCTAAAGACAAAAGGACTCGTTCCCGGTCCCGGTCCCGCCGCCGCTCCCGGTCCCGG
TCTCGTCGTTCCCGGTCCCGTCACCGCTCTAAGCGATCCCGCTCACGGTCCAGACATCGC
TCCCGGAGATCACGCTCGAGGCACCGACACCGCACTAGGTCTCGTTCCCGTCACCGAAGC
TCACGGCGCTCCAGGTCCAGATCCAGACACCGCAGCTCGAGATCCAAACGAGAGAAGTCG
AAAGAACGCGATAGAAAAGACAAGAAAGACATCGGTGATAAGGAAAAGAGAGACAGTGAT
AAGACGAAGTCACCGCAGAAAGACGTGGGTAGGGACGGAAAAGACGAGCTCAAGATTGAC
ATCAGCGAGGTTGATACGAACGGCAGCTCGTATGAACATAAATCTAAAGCCTCCACACCC
GCTGATGATAAAGAAAAGACGACAGAGCTCGACAAGGACAAGTCGCCGAGGAAAAAGGAA
AGGTCCCGCTCCAAGGAAAGGAAGAGGGAACGGTCGCGGTCGAAACGAAGGTCGCGGTCA
CGATCAAGAAGGAAACGCTCGAGGTCACGTAAAAGATCGAGGTCCAGGGACAGAAAGAAA
TCCCGCTCCAGAGAGAGGAAGAAGTCGAGGTCGCGGGACAGAAAACGGTCCAGGTCCAGG
GACAGGAAGCGGACGAAGTCGAGGGAGAGGAAGAGGTCGCGGTCCAAAGATAGGAAAAGA
TCGCGCTCCAAGGATAGGAAGCGTTCGCGGTCACCCAGCAGGCGCTCCAAGAGCCGGTCC
CATAGAGATTCCAAAACGCCTCACGAGAGGAAGTCACGTGACCACTCGCCGCTACCAGCA
ATAATGGAAAAGACTCCACACAAAACTATAGACGTGACAGATGAAAAGAATTCCCCAGAC
AATATGGACATTTCAAATTCCCCATAA

Protein sequence:

MVSSSTRVIQVTNIAPQATKDQMQTLFGYLGKIDDIRLYPTIRDVSCPVQSRICYVKYYD
SATVNVAQHMTNTVFIDRALIVIPMQSGEIPDEHRALEMSSNGTLVPGLSTVEPRLPAHV
INTLEGAPPNQVIQTYDPNIAAAGLPPYPPLPAIYDSRKIEEIRRTLLLIDVGELTSQQL
IDHFCQAGEVSYVRFCEREVDNLKYALIEMTEQESISKALQLNGVALNGQVIKVHHSTVA
ISKPQAKSNEAAQREIEEAMCRVKEAQNLISAAIDPVIGLLSKDKRTRSRSRSRRRSRSR
SRRSRSRHRSKRSRSRSRHRSRRSRSRHRHRTRSRSRHRSSRRSRSRSRHRSSRSKREKS
KERDRKDKKDIGDKEKRDSDKTKSPQKDVGRDGKDELKIDISEVDTNGSSYEHKSKASTP
ADDKEKTTELDKDKSPRKKERSRSKERKRERSRSKRRSRSRSRRKRSRSRKRSRSRDRKK
SRSRERKKSRSRDRKRSRSRDRKRTKSRERKRSRSKDRKRSRSKDRKRSRSPSRRSKSRS
HRDSKTPHERKSRDHSPLPAIMEKTPHKTIDVTDEKNSPDNMDISNSP