DPGLEAN03904 in OGS1.0

New model in OGS2.0  DPOGS215370
Genomic Position  scaffold1021:+ 8373-14266
CDS Length  1776
Paired RNAseq reads  869
Single RNAseq reads  2266
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA012387 (7e-10)
Best Drosophila hit  poly U binding factor 68kD, isoform A (6e-109)
Best Human hit  poly(U)-binding-splicing factor PUF60 isoform b (2e-88)
Best NR hit (blastp)  PREDICTED: similar to poly U binding factor 68kD CG12085-PA, isoform A [Apis mellifera] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC000381 [Tribolium castaneum] (1e-132)
GeneOntology terms
GO:0008266 poly(U) RNA binding
GO:0000398 nuclear mRNA splicing, via spliceosome
GO:0003729 mRNA binding
GO:0006376 mRNA splice site selection
GO:0007282 cystoblast division
GO:0000380 alternative nuclear mRNA splicing, via spliceosome
GO:0005634 nucleus
GO:0051726 regulation of cell cycle
GO:0000166 nucleotide binding
GO:0000381 regulation of alternative nuclear mRNA splicing, via spliceosome
GO:0005515 protein binding
GO:0043234 protein complex
GO:0071011 precatalytic spliceosome
GO:0071013 catalytic step 2 spliceosome
InterPro families
IPR000504 RNA recognition motif domain
IPR006532 Poly-U binding splicing factor, half-pint
IPR012677 Nucleotide-binding, alpha-beta plait
IPR003954 RNA recognition motif domain, eukaryote
Orthology group  MCL13141

Nucleotide sequence:

ATGGAATTGTTGCAGGGCATCGAGATGAACGCAGGGATGGGGGTGCCGGCTTCTTTGGGC
GTCGTCTGTCCGCCGAGTGCGGCGAGCTCGGTGGGCGCGGTGGGAGTGATGGGCGCGATG
GGTGGCGCTTGTCCGACCGTGAGCGGAGACTTCCACGCGGCGCCTATCTATGACCTGTTG
CAGGTCGGCGACGTGTTCACAGGGCCGGGCGCCAAGTGTTCTTCGCTGCCTGCCATCCTT
GGCGGAAACATGCCGCGACTGTCCTCGGAACAAGCGGACGCCGTGGCGCGAGCCAAGAAG
TACGCCATGGAGCAGAGCATCAAGATGGTGCTGATGAAGCAGACGCTGGCGCACCAACAG
CAGCAGATGGCCTCGCAAAGGACGCAGGTGCAGCGGCAACAGGCGCTCGCGCTCATGTGC
AGAGTGTACGTGGGGTCGATATCGTTCGAGCTCAAAGAGGACACGATCCGCCAAGCGTTC
CTGCCGTTCGGGCCGATCAAGTCCATCAACATGTCGTGGGACCCAGTCACTCAGAAACAC
AAAGGGTTCGCCTTCGTCGAGTACGAGATACCGGAGGCGGCGCAGCTGAGTCTCGAGCAG
ATGAACGGAGTGATGCTGGGCGGGAGAAATATCAAAGTGGTAGGGAGACCTTCCAACATG
CCGCAGGCCCAGGCCGTCATAGACGAGATACAGGAGGAAGCAAAGCAGTACAACAGAATA
TACGTCGCCTCCATACACCCGGAGCTGACGGAGGACGATATTAAGAACGTGTTCGAGGCG
TTCGGTCCCATCACGTATTGCAAGCTGGCATACGGAGCGTCCGCGCACAAACACAAGGGC
TACGGGTTCATCGAGTATGCGACTCTCCCGGCCGCGCTGGAGGCGATCGCCTCCATGAAC
CTGTTCGACCTCGGTGGCCAGTACCTGCGGGTGGGACGCGCCATCACTCCGCCCAATGCT
CTCGCCGGCCCGCCGCAAGCCTCCGCCATGCCGACCGCGGCCGCCGTGGCCGCCGCCGCC
GCCACCGCCAAGATACAGGCCATGGACGCCGTCGCCAGCAACGCCGTTGCGCTCGGACTG
ACCAAGCTCAACGCGCTCGGCGTTCCGCCCGCCGCCGCGCTGCCGACGCTCGCCGCCGCG
CTGCCGGTGGCGCTGCCCGCCGCTCTGCCCGCCGCGCTCCCGGTCACTCTGCCGACCGCG
CTTCCGGTCACTCTGCCGGTCACTCTGCCCGCCTCTCTGCCGGCCGCCCTGCCCCCGGCG
CCGGTCATCCCGCCGCCGGGTGTGGTGATCCCGCCGCCTCCCCGTCCGCCCGCGGCCGAG
CCCTCGGCGGACGGCGAGGGTGGCCAGCAGGCGGCGCTACAACGCAAGCTGCTGGACAGT
TCGCCGGATACGCTCCAGCAGCAGGAGTCTTTGTCGATCTCGGGTCAGTCGGCGCGACAC
CTCGTCATGCAGAGACTGATGAGGCGCCGCGCGAGCAGGACCGTGCTGCTCGAGAACATG
GTGGCGGCTCACGAGGTGGACGACGCGCTCCACCATGAGATACAGGAGGAGTGTTGCAAG
TGGGGCCGGGTGGAGAGACTAGTCATATACAACGAGAGACAAAGCGAGGACGATGACCCT
GCACATGCTGACGTTAAGATATTCGTCCAGTTCGCGGACCCCGAGGAGGCGGGAGCTGCG
GCCGGGGCTCTATCCGGCCGATACTTCGGAGGTCGTACGGTGCGCGCTCGGCTCTACGAC
CAGGACCTGTTCGACCACGGGGACCTCTCGGGCTGA

Protein sequence:

MELLQGIEMNAGMGVPASLGVVCPPSAASSVGAVGVMGAMGGACPTVSGDFHAAPIYDLL
QVGDVFTGPGAKCSSLPAILGGNMPRLSSEQADAVARAKKYAMEQSIKMVLMKQTLAHQQ
QQMASQRTQVQRQQALALMCRVYVGSISFELKEDTIRQAFLPFGPIKSINMSWDPVTQKH
KGFAFVEYEIPEAAQLSLEQMNGVMLGGRNIKVVGRPSNMPQAQAVIDEIQEEAKQYNRI
YVASIHPELTEDDIKNVFEAFGPITYCKLAYGASAHKHKGYGFIEYATLPAALEAIASMN
LFDLGGQYLRVGRAITPPNALAGPPQASAMPTAAAVAAAAATAKIQAMDAVASNAVALGL
TKLNALGVPPAAALPTLAAALPVALPAALPAALPVTLPTALPVTLPVTLPASLPAALPPA
PVIPPPGVVIPPPPRPPAAEPSADGEGGQQAALQRKLLDSSPDTLQQQESLSISGQSARH
LVMQRLMRRRASRTVLLENMVAAHEVDDALHHEIQEECCKWGRVERLVIYNERQSEDDDP
AHADVKIFVQFADPEEAGAAAGALSGRYFGGRTVRARLYDQDLFDHGDLSG
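
The nucleotide and protein listings above can be cross-checked against each other and against the stated CDS length. The following is a minimal sketch, assuming the CDS is translated in frame 1 with the standard genetic code; the helper name `translate` and the two sequence fragments passed to it are illustrative choices, not part of the database record.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so a sequence can be pasted
    in directly from a 60-column listing. Codons containing letters
    other than A, C, G, T raise KeyError.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 18 and last 36 nucleotides of the CDS listed in this record.
first_codons = "ATGGAATTGTTGCAGGGC"
last_codons = "CAGGACCTGTTCGACCACGGGGACCTCTCGGGCTGA"

print(translate(first_codons))  # MELLQG
print(translate(last_codons))   # QDLFDHGDLSG

# A 1776 nt CDS holds 592 codons: 591 residues plus one stop codon.
print(1776 // 3 - 1)            # 591
```

The two fragments translate to the start and end of the protein listing, and the expected 591 residues match its length (nine lines of 60 residues plus a final line of 51). Passing the full nucleotide listing to `translate` would extend the same check to the whole sequence.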