DPGLEAN21414 in OGS1.0

New model in OGS2.0  DPOGS210557
Genomic Position  scaffold953:- 273-6458
CDS Length  1905
Paired RNAseq reads  3350
Single RNAseq reads  9019
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA013450 (3e-127)
Best Drosophila hit  splicing factor 1 (3e-113)
Best Human hit  splicing factor 1 isoform 1 (1e-91)
Best NR hit (blastp)  PREDICTED: similar to zinc finger protein [Nasonia vitripennis] (2e-160)
Best NR hit (blastx)  PREDICTED: similar to Splicing factor 1 CG5836-PA [Apis mellifera] (1e-130)
GeneOntology terms
GO:0005634 nucleus
GO:0005681 spliceosomal complex
GO:0000398 nuclear mRNA splicing, via spliceosome
GO:0003723 RNA binding
GO:0008270 zinc ion binding
GO:0071011 precatalytic spliceosome
GO:0002121 inter-male aggressive behavior
InterPro families
IPR013084 Zinc finger, CCHC retroviral-type
IPR004087 K Homology
IPR001878 Zinc finger, CCHC-type
IPR004088 K Homology, type 1
IPR018111 K Homology, type 1, subgroup
Orthology group  MCL13981

Nucleotide sequence:

ATGAGTTCTCGACATAGAGACAGAAGCCGATCACGGTCTCGTGATCGTGATCGTCTTAAG
GACAGGGATAAGGGACGGGAACGGGACCGGGATAAAGATAGGGAGAGAGAAAGGGACCGG
GACCGTGATAGGGATCGAGAACGCGACAGGGACCGCGATCGAGACCGTGACAGAGACCGA
GAGAGAGATAGAGATCGCGATCGAAACAGGGACCGGGAGAGAGACCGGGATCGAGATAGA
GAGCGTCATCGGTCTAAGAGGGACAAAGATCGTGATAGAAGCCGCAGCCGTGATCGCCAT
AAAGAAAAGAGACGCAGTCGCTCTCGAAGCCGTAGTAGGAGTCGCGGCAGAAAATCAAAA
GACAGGGATGGTACAATAGCTTTACTGGATCAAATGGTGGGCACCACTACCAAGGCGACG
GCTCGCCAGGTGGCCGTTCCCACCTCCATGAACCCAGCAACACAAGCCGCCATACTGGCA
GCAGCAGCCGTGGCTCAGCGGCGGCTGGCGGCGCCCGTGCAGCCCGCGGCGGCGGCGGCG
GCAGCCCTGTCTGCAGCCCTGTCCGCGGCCACCGCCATCCCGCCGCCCACCTCTGTACAG
CAGAAGCTGGAGCTGCTGCAGGCGCGCACTGAGGGACGGTACCGCGACAAGCAACCTCCC
GACCACCATCCGGACGACGACCACGACGACGGACAAGGTCCTCCCGGGGAGACGGCGGCC
GAGCGTCGGGCCCGGCGGCGCCGCACTCGCTGGATGGGCTCCGAGCACGACAAGACCTTC
ATCCCGGGCCTGCCCACCGTGCTGCCCTCCACGCTCACTCGCGAGCAGGAGGAGCAATAT
CTACTTCAGCTGCAGATCGAGGAGGTGAGCCGCAAGCTGCGCTCGGGCGACCTCGGCATC
CCGGCCAGCGTGGACGAGAGGTCGCCCTCGCCCGAGCCGATCTACTCCACGGACGGCAAG
AGGCTGAACACGCGCGAGTACCGCACGAGGCGGAAACTCGAGGAGGAGAGACACCGGCTC
GTCACCCGCATGCATCAGATCAACCCCGAGTTCAAGCCGCCGCCCGACTACAAGCCGCCC
ATCGTCCGTGTGCACGACAAGGTGATGATCCCTCAGGAGGAACACCCCGACATCAACTTC
GTGGGTCTGCTCATCGGCCCGCGAGGCAACACGCTCAAAGCGATGGAGAAGGAGACCGGC
GCCAAGATCATAATAAGAGGAAAGGGCTCCGTGAAGGAGGGAAAAGTCGGCAGGAAGGAC
GGCCAGCCGCTGCCCGGGGAAGACGAGCCTCTGCACGCCTACATCACCGCCACCAACGCC
GACTGCGTCAAGAAGGCCGTCGAGAAGATCAAGGAGGTGATCCGTCAGGGTGTGGAGGTG
CCCGAGGGACAGAACGACCTCCGCCGCATGCAGCTGAGGGAACTGGCGCAACTCAACGGG
ACTCTCAGGGAGAGCGACTCGCCGCGCTGCGCCAACTGCAGCGCCGCCGACCACAAGACG
TGGCTCTGTCCGGACAAGCCGAACGTGACGAACAGTATCGTGTGTTCATCGTGCGGCGGC
GCGGGACACATCGCGCGCGACTGCCGCGCCAAGAGACCGGGACACGCGCCGCCCGCCCTG
CATCACGACAAGGCTAAGATCGACGAGGAGTACATGTCGCTGATGGCGGAGCTGGGGGAG
GCGCCGCCCGGGGTCGGCGGAGTCACCGGCCCGTCCGCCGCGGCCGCTCGACGCACGCAC
GGACCCTTCGCCCCCGCGCCGCCGCCGCGGGCTATCATGCCGGCTCCCGGTATGCTGGCC
GGTGGTCCGTGGCGCGGGTTCGCTCCCCCGCCGCCCTCCCGCCGAGGAGGGGGGCGGCGT
CTGTTCGCTCCGCCGCCGCCGCCCCCGCCGGTCTCCTCCGCATAA

Protein sequence:

MSSRHRDRSRSRSRDRDRLKDRDKGRERDRDKDRERERDRDRDRDRERDRDRDRDRDRDR
ERDRDRDRNRDRERDRDRDRERHRSKRDKDRDRSRSRDRHKEKRRSRSRSRSRSRGRKSK
DRDGTIALLDQMVGTTTKATARQVAVPTSMNPATQAAILAAAAVAQRRLAAPVQPAAAAA
AALSAALSAATAIPPPTSVQQKLELLQARTEGRYRDKQPPDHHPDDDHDDGQGPPGETAA
ERRARRRRTRWMGSEHDKTFIPGLPTVLPSTLTREQEEQYLLQLQIEEVSRKLRSGDLGI
PASVDERSPSPEPIYSTDGKRLNTREYRTRRKLEEERHRLVTRMHQINPEFKPPPDYKPP
IVRVHDKVMIPQEEHPDINFVGLLIGPRGNTLKAMEKETGAKIIIRGKGSVKEGKVGRKD
GQPLPGEDEPLHAYITATNADCVKKAVEKIKEVIRQGVEVPEGQNDLRRMQLRELAQLNG
TLRESDSPRCANCSAADHKTWLCPDKPNVTNSIVCSSCGGAGHIARDCRAKRPGHAPPAL
HHDKAKIDEEYMSLMAELGEAPPGVGGVTGPSAAAARRTHGPFAPAPPPRAIMPAPGMLA
GGPWRGFAPPPPSRRGGGRRLFAPPPPPPPVSSA