DPGLEAN20155 in OGS1.0

New model in OGS2.0: DPOGS211156
Genomic Position: scaffold540:+ 42060-44273
CDS Length: 1569 bp
Paired RNAseq reads: 245
Single RNAseq reads: 915
Migratory profiles: Query via corresponding ESTs
Best Bombyx hit: BGIBMGA003142 (0.0)
Best Drosophila hit: CG6197 (6e-123)
Best Human hit: pre-mRNA-splicing factor SYF1 (7e-109)
Best NR hit (blastp): PREDICTED: similar to XPA-binding protein 2 [Tribolium castaneum] (7e-154)
Best NR hit (blastx): XPA-binding protein, putative [Pediculus humanus corporis] (1e-147)
GeneOntology terms:
GO:0005488 binding
GO:0000381 regulation of alternative nuclear mRNA splicing, via spliceosome
GO:0005634 nucleus
GO:0006911 phagocytosis, engulfment
GO:0071011 precatalytic spliceosome
GO:0000398 nuclear mRNA splicing, via spliceosome
GO:0071013 catalytic step 2 spliceosome
InterPro families:
IPR011990 Tetratricopeptide-like helical
IPR003107 RNA-processing protein, HAT helix
Orthology group: MCL16141

Nucleotide sequence:

ATGCCTAGAATATGGATGGACTATTGTACATTTTTGACTGATCAGTGGAAAATTACCGCT
ACAAGAAAAGCATTTGATTCTGCTTTACGGGCTTTACCAATTACACAGCATCACAGAATA
TGGCCCCTCTATTTAAATTTCTTGAAAAAGCATAATATTCCAGAAACTGCTGTTAGGGTA
TTCAGACGCTACCTAAAGTTGTGTCCCGAAGATACTGAAGAATATATTGATTATTTAATA
TCTATAGAAAAATTAGATGAAGCTGCTTTAAAATTAGCTCAACTTGTAAACAATGAGAAT
TTTCAATCCAAACATGGAAAATCCAACCACCAGCTTTGGAATGAATTGTGTGAACTAATA
TCCAAAAACCCAGACAAAATTCATTCACTTAATGTTGATGCCATAATAAGAGGTGGACTT
CGTCGCTACACTGACCAGCTGGGTCATCTGTGGAATTCACTAGCTGATTACTATGTTAGA
AGTGGGTTATTTGAGAGAGCCAGAGATATATATGAGGAAGCCATTCAGACTGTCACAACA
GTAAGAGATTTCACCCAAGTCTTTGATGCCTATGCTCAATTTGAAGAGTTGAGTTTGAGT
AAAAAGATGGAAGAAGTTGCAAAGAAACCCAACCCCACTGAGGATGAAGATATTGATTTA
GAATTACGTCTTGCTAGGTTTGAATATTTGATGGAAAGAAGATTGTTACTGCTAAATTCA
GTACTATTAAGACAAAATCCACATAATATTGCTGAATGGCACAAAAGAGTAAAGCTCTAT
GAAGGTAAACCTCATGAAATCATAGATACATATACAGAAGCTGTGCAGACAGTAGATCCA
AAATTAGCGGTAGGAAAACTTTATACACTGTGGGTTGGTTTTGCAAAATTTTATGAGAGC
AATGACCAAATTGATGATGCAAGGTTAATTTTTGAGAAAGCGACCCAAGCTGCAGAAATA
TATGGCGTACCCAAAACGCGACAAATATATGAAAAAGCAATCGAGACTCTACCAGATGAA
AAAGCTAGAGAGATGTGCTTGCGATTTTCGGAAATGGAAACGAAACTTGGGGAGATCGAC
AGAGCTCGTGCCATATACGCTCACTGTAGTCAGATGTGCGATCCAAGGATTACGACAGAA
TTCTGGAATACGTGGAAAGAATTTGAAGTAAGGCATGGTAATGAAGATACTATGAGGGAA
ATGCTTAGAATTAAGAGAAGTGTACAAGCTACTTATAACACGCAAGTCAATATGATGTCA
GCCCAAATGCTAGGCTCAGCTGCTCAGGCTGCGGGTACAATATCGGATCTTGCACCCGGA
ATGAAGGACGGCATGAGATTGTTGGAGGCTAAAGCTGCCGAAATGGCTGTCCAAAGCAAG
GGCAATATATTGTTCGTCAGAGGTGAAACACAAGGTCTCAAAGAAAACGATAAAGTTGTT
AATCCTGATGAAATTGATATTGATGACGAAGAATCTGATAATAGTAATGATGACGAGGAA
GTTGCACCTGTACAGAAAAAGGAAATTCCTGCAGCAGTGTTTGGAGGCTTGGTTCCAGAA
AATCAATAA

Protein sequence:

MPRIWMDYCTFLTDQWKITATRKAFDSALRALPITQHHRIWPLYLNFLKKHNIPETAVRV
FRRYLKLCPEDTEEYIDYLISIEKLDEAALKLAQLVNNENFQSKHGKSNHQLWNELCELI
SKNPDKIHSLNVDAIIRGGLRRYTDQLGHLWNSLADYYVRSGLFERARDIYEEAIQTVTT
VRDFTQVFDAYAQFEELSLSKKMEEVAKKPNPTEDEDIDLELRLARFEYLMERRLLLLNS
VLLRQNPHNIAEWHKRVKLYEGKPHEIIDTYTEAVQTVDPKLAVGKLYTLWVGFAKFYES
NDQIDDARLIFEKATQAAEIYGVPKTRQIYEKAIETLPDEKAREMCLRFSEMETKLGEID
RARAIYAHCSQMCDPRITTEFWNTWKEFEVRHGNEDTMREMLRIKRSVQATYNTQVNMMS
AQMLGSAAQAAGTISDLAPGMKDGMRLLEAKAAEMAVQSKGNILFVRGETQGLKENDKVV
NPDEIDIDDEESDNSNDDEEVAPVQKKEIPAAVFGGLVPENQ
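
Consistency check (illustrative):

The nucleotide and protein sequences above can be cross-checked against the stated CDS length. A CDS of 1569 bp corresponds to 523 codons, that is, 522 residues plus a stop codon, which matches the 522-residue protein listed. The Python sketch below is a minimal illustration and not part of the original record: it assumes the standard genetic code, and the names `translate`, `first_line`, and `last_line` are hypothetical helpers introduced here. Only the first and last lines of the nucleotide sequence are used for brevity.

```python
from itertools import product

# Standard genetic code (assumed here), laid out in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop whitespace and line breaks
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGCCTAGAATATGGATGGACTATTGTACATTTTTGACTGATCAGTGGAAAATTACCGCT"
last_line = "AATCAATAA"

print(translate(first_line))  # MPRIWMDYCTFLTDQWKITA
print(translate(last_line))   # NQ

# Expected protein length from the stated CDS length: codons minus the stop.
print(1569 // 3 - 1)          # 522
```

Passing the full 1569-nucleotide sequence to `translate` in the same way would allow the complete protein to be compared against the listed one.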