New model in OGS2.0 | DPOGS215370  |
---|---|
Genomic Position | scaffold1021:+ 8373-14266 |
See gene structure | |
CDS Length | 1776 |
Paired RNAseq reads   | 869 |
Single RNAseq reads   | 2266 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012387 (7e-10) |
Best Drosophila hit   | poly U binding factor 68kD, isoform A (6e-109) |
Best Human hit | poly(U)-binding-splicing factor PUF60 isoform b (2e-88) |
Best NR hit (blastp)   | PREDICTED: similar to poly U binding factor 68kD CG12085-PA, isoform A [Apis mellifera] (0.0) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC000381 [Tribolium castaneum] (1e-132) |
GeneOntology terms    | GO:0008266 poly(U) RNA binding GO:0000398 nuclear mRNA splicing, via spliceosome GO:0003729 mRNA binding GO:0006376 mRNA splice site selection GO:0007282 cystoblast division GO:0000380 alternative nuclear mRNA splicing, via spliceosome GO:0005634 nucleus GO:0051726 regulation of cell cycle GO:0000166 nucleotide binding GO:0000381 regulation of alternative nuclear mRNA splicing, via spliceosome GO:0005515 protein binding GO:0043234 protein complex GO:0071011 precatalytic spliceosome GO:0071013 catalytic step 2 spliceosome |
InterPro families    | IPR000504 RNA recognition motif domain IPR006532 Poly-U binding splicing factor, half-pint IPR012677 Nucleotide-binding, alpha-beta plait IPR003954 RNA recognition motif domain, eukaryote |
Orthology group | MCL13141 |
Nucleotide sequence:
ATGGAATTGTTGCAGGGCATCGAGATGAACGCAGGGATGGGGGTGCCGGCTTCTTTGGGC
GTCGTCTGTCCGCCGAGTGCGGCGAGCTCGGTGGGCGCGGTGGGAGTGATGGGCGCGATG
GGTGGCGCTTGTCCGACCGTGAGCGGAGACTTCCACGCGGCGCCTATCTATGACCTGTTG
CAGGTCGGCGACGTGTTCACAGGGCCGGGCGCCAAGTGTTCTTCGCTGCCTGCCATCCTT
GGCGGAAACATGCCGCGACTGTCCTCGGAACAAGCGGACGCCGTGGCGCGAGCCAAGAAG
TACGCCATGGAGCAGAGCATCAAGATGGTGCTGATGAAGCAGACGCTGGCGCACCAACAG
CAGCAGATGGCCTCGCAAAGGACGCAGGTGCAGCGGCAACAGGCGCTCGCGCTCATGTGC
AGAGTGTACGTGGGGTCGATATCGTTCGAGCTCAAAGAGGACACGATCCGCCAAGCGTTC
CTGCCGTTCGGGCCGATCAAGTCCATCAACATGTCGTGGGACCCAGTCACTCAGAAACAC
AAAGGGTTCGCCTTCGTCGAGTACGAGATACCGGAGGCGGCGCAGCTGAGTCTCGAGCAG
ATGAACGGAGTGATGCTGGGCGGGAGAAATATCAAAGTGGTAGGGAGACCTTCCAACATG
CCGCAGGCCCAGGCCGTCATAGACGAGATACAGGAGGAAGCAAAGCAGTACAACAGAATA
TACGTCGCCTCCATACACCCGGAGCTGACGGAGGACGATATTAAGAACGTGTTCGAGGCG
TTCGGTCCCATCACGTATTGCAAGCTGGCATACGGAGCGTCCGCGCACAAACACAAGGGC
TACGGGTTCATCGAGTATGCGACTCTCCCGGCCGCGCTGGAGGCGATCGCCTCCATGAAC
CTGTTCGACCTCGGTGGCCAGTACCTGCGGGTGGGACGCGCCATCACTCCGCCCAATGCT
CTCGCCGGCCCGCCGCAAGCCTCCGCCATGCCGACCGCGGCCGCCGTGGCCGCCGCCGCC
GCCACCGCCAAGATACAGGCCATGGACGCCGTCGCCAGCAACGCCGTTGCGCTCGGACTG
ACCAAGCTCAACGCGCTCGGCGTTCCGCCCGCCGCCGCGCTGCCGACGCTCGCCGCCGCG
CTGCCGGTGGCGCTGCCCGCCGCTCTGCCCGCCGCGCTCCCGGTCACTCTGCCGACCGCG
CTTCCGGTCACTCTGCCGGTCACTCTGCCCGCCTCTCTGCCGGCCGCCCTGCCCCCGGCG
CCGGTCATCCCGCCGCCGGGTGTGGTGATCCCGCCGCCTCCCCGTCCGCCCGCGGCCGAG
CCCTCGGCGGACGGCGAGGGTGGCCAGCAGGCGGCGCTACAACGCAAGCTGCTGGACAGT
TCGCCGGATACGCTCCAGCAGCAGGAGTCTTTGTCGATCTCGGGTCAGTCGGCGCGACAC
CTCGTCATGCAGAGACTGATGAGGCGCCGCGCGAGCAGGACCGTGCTGCTCGAGAACATG
GTGGCGGCTCACGAGGTGGACGACGCGCTCCACCATGAGATACAGGAGGAGTGTTGCAAG
TGGGGCCGGGTGGAGAGACTAGTCATATACAACGAGAGACAAAGCGAGGACGATGACCCT
GCACATGCTGACGTTAAGATATTCGTCCAGTTCGCGGACCCCGAGGAGGCGGGAGCTGCG
GCCGGGGCTCTATCCGGCCGATACTTCGGAGGTCGTACGGTGCGCGCTCGGCTCTACGAC
CAGGACCTGTTCGACCACGGGGACCTCTCGGGCTGA
Protein sequence:
MELLQGIEMNAGMGVPASLGVVCPPSAASSVGAVGVMGAMGGACPTVSGDFHAAPIYDLL
QVGDVFTGPGAKCSSLPAILGGNMPRLSSEQADAVARAKKYAMEQSIKMVLMKQTLAHQQ
QQMASQRTQVQRQQALALMCRVYVGSISFELKEDTIRQAFLPFGPIKSINMSWDPVTQKH
KGFAFVEYEIPEAAQLSLEQMNGVMLGGRNIKVVGRPSNMPQAQAVIDEIQEEAKQYNRI
YVASIHPELTEDDIKNVFEAFGPITYCKLAYGASAHKHKGYGFIEYATLPAALEAIASMN
LFDLGGQYLRVGRAITPPNALAGPPQASAMPTAAAVAAAAATAKIQAMDAVASNAVALGL
TKLNALGVPPAAALPTLAAALPVALPAALPAALPVTLPTALPVTLPVTLPASLPAALPPA
PVIPPPGVVIPPPPRPPAAEPSADGEGGQQAALQRKLLDSSPDTLQQQESLSISGQSARH
LVMQRLMRRRASRTVLLENMVAAHEVDDALHHEIQEECCKWGRVERLVIYNERQSEDDDP
AHADVKIFVQFADPEEAGAAAGALSGRYFGGRTVRARLYDQDLFDHGDLSG