DPGLEAN01542 in OGS1.0

New model in OGS2.0DPOGS207852 
Genomic Positionscaffold1638:+ 384-19426
See gene structure
CDS Length7734
Paired RNAseq reads  15287
Single RNAseq reads  37982
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA009816 (1e-156)
Best Drosophila hit  pre-mRNA processing factor 8 (0.0)
Best Human hitpre-mRNA-processing-splicing factor 8 (0.0)
Best NR hit (blastp)  PREDICTED: similar to CG8877-PA [Apis mellifera] (0.0)
Best NR hit (blastx)  PREDICTED: similar to pre-mrna splicing factor prp8 [Tribolium castaneum] (0.0)
GeneOntology terms



  
GO:0000398 nuclear mRNA splicing, via spliceosome
GO:0005681 spliceosomal complex
GO:0005682 U5 snRNP
GO:0071013 catalytic step 2 spliceosome
GO:0071011 precatalytic spliceosome
InterPro families







  
IPR012592 PROCN
IPR021983 PRP8 domain IV core
IPR019580 Pre-mRNA-processing-splicing factor 8, U6-snRNA-binding
IPR012591 Pre-mRNA-processing-splicing factor 8
IPR019581 Pre-mRNA-processing-splicing factor 8, U5-snRNA-binding
IPR012984 PRO, C-terminal
IPR019582 RNA recognition motif, spliceosomal PrP8
IPR000432 DNA mismatch repair protein MutS, C-terminal domain
IPR000555 Mov34/MPN/PAD-1
Orthology groupMCL15755

Nucleotide sequence:

ATGAAGGAAAGTCAGCATATATTAAAAGGGCTTACATCTTCGAGTTTGGTCATAATCGAC
GAACTCTGCCGTGGAACGAACGTTGAAGAGGGTACGAGTATAGCGTGGTCGATTTGTGAG
GAACTCTTGATGAGTGAGGCGTATACATTTTTAACAACCCATTTCATGTATTTAACAAAA
CTTGAGGACTTATACTACAATGTTATAAATGTCCATACAGCTGTGAAAGAGGAATCCCAA
GGTCCAGATGTACTGGAAAAGAGATTGATATATCAACATAAAATTGAACCTGGAATTACA
CAGATTAAACATTACGGTATAGCATTAGCTGCTAAGACAAATCTACCACAAGATATTGTT
AGTTTGGCTAAAGAACTTGCGGAACTAATAGAAAGCAACACAAAGCCAATGTCAGGTTCA
TCGCAAAAAGAAACAGATTTAAAACTATTATATGATTTGAATGCCAAAATTCAGATGGAA
TCTAGAAAGAATTATAATAATGAAGAATCTATAAGAAATATATTGAGACAATTTAAGAAT
AAATATCCACACATAGTAGAGGGATTAAAGTTAGAAAGAAATTCGAGAAATATTCATAAT
TATTCATCCCATGAGAGTCCTAAAGTAACCGGCGCAGCGATGTCGCTGCCGCCATACCTA
TTGGGGCCCAACCCCTGGGCCACCATGATGGCGCAGCAGCAGCTAGCGGCAGCTCAACAA
GCAGCGCTCCAAGCGCATGCTGCCGCTGCTGCTGCTGCACCGCCCGTGCCGCCGACCCAG
CCACCTAAACCTCACCACATACCAGAAGAAAAGATCAAAGAGAAAGCTCAAAAATGGCTT
CAGCTGCAATCAAAGCGTTTCTCGGACAAGAGGAAATTTGGTTTTGTGGACGCCCAAAAG
GAAGATATGCCTCCGGAGCACATTCGAAAGATAATCCGAGATCATGGTGATATGACCAGT
CGCAAGTATCGTCATGACAAACGAGTGTATCTGGGAGCCCTTAAGTATATGCCACATGCT
GTAATGAAGCTTCTAGAAAACATGCCCATGCCCTGGGAACAGATCAGAGATGTCAATGTC
CTGTACCACATCACCGGTGCTATAACATTTGTCAATGAGATTCCCTGGGTCATAGAGCCA
GTGTATATCGCGCAGTGGGGCACAATGTGGATTATGATGCGTAGAGAGAAACGTGATCGT
CGGCATTTCAAGCGTATGAGATTCCCACCATTTGATGATGAAGAACCACCTTTGGATTAT
GCTGACAACATTTTGGATGTTGAACCTCTGGAACCCATACAAATTGAATTAGATCCGGAA
GAGGACGGAGCTGTGGCCTCATGGTTTTACGACCACAAACCTTTATTGGGAACAAAACAC
GTGAACGGCTCGACATACAGGAAGTGGAATCTTAGCTTACCACAGATGGCTACACTGTAT
CGTCTTGCAAATCAGCTTCTAACTGACTTAGTAGACGATAATTACTTCTACCTATTTGAT
TCTAAAAGTTTCTTTACCGCGAAAGCTCTAAACATGGCAATTCCCGGAGGTCCCAAGTTT
GAACCACTTGTCAAAGACAACTCTGCTGGTGATGAAGACTGGAATGAATTCAACGATATC
AACAAGATTATTATTCGTCAGCCGATCAGAACAGAGTACAGAATAGCTTTCCCATACCTT
TACAACAACTTGCCGCATTTCGTCCAATTATCCTGGTATCATACTCCCAATGTGGTGTAT
ATAAAAACAGAAGATCCCGACTTGCCAGCCTTCTACTTTGATCCGCTTATCAATCCAATC
TCTCACCGTCATACCGTGAAGTCATTAGATCCAATTCCGGAAGAGGAAGATTTCTTGCTA
CCTGAAGAAGTAACGCCATTCCTGCAGGAAACGGCTTTGTACACAGACAACACCGCTAAC
GGGATCGCTTTGCTGTGGGCTCCGCGACCTTTTAGTATGAGATCAGGTCGTTCCCGGCGA
GCGATCGACGTTCCTCTCGTGAAGACATGGTACAAGGAGCACTGCCCGCCAGGACAACCC
GTGAAAGTGCGTGTGTCATATCAAAAACTACTCAAGTACTACGTGCTTAATTCGCTCAAA
CATAGGCCACCTAAGCCACAGAAGAAAAGATATCTCTTCCGTTCGTTCAAATCGACGAAG
TTCTTCCAAACTACAACTTTGGATTGGGTGGAGGCTGGTCTCCAAGTATGCAGACAGGGC
TACAACATGCTCAATCTGCTGATACACAGAAAGAATCTTAATTATCTGCATTTAGATTAT
AATTTCAACTTGAAACCAGTCAAGACTCTCACTACTAAAGAGAGAAAGAAGTCTCGGTTT
GGTAATGCATTCCACTTGTGCCGCGAGATCCTCCGCCTCACTAAACTGATAGTGGATTCT
CATGTTCAATATCGTCTGAACAACGTGGACTCGTTCCAGCTAGCGGACGGTCTACAGTAC
ATTTTCGCTCACGTCGGCCAACTCACCGGCATGTACAGATACAAGTACAAACTCATGAGA
CAGATACGAATGTGCAAGGACTTGAAACATCTCATCTACTACAGATTTAATACGGGTCCA
GTATCTAAGGGTCCAGGATGTGGTTTCTGGGCACCTGGTTGGCGTGTGTGGTTGTTCTTC
ATGCGAGGTATCACGCCGCTACTTGAGCGATGGCTTGGAAACCTATTGTCGAGACAATTC
GAGGGTCGCCATTCGAAAGGGGTCGCAAAAACGGTGACGAAACAGCGCGTAGAGTCGCAC
TTCGACCTGGAACTGCGAGCATCCGTCATGCACGATATTGTGGACATGATGCCGGAAGGT
ATCAAACAGAATAAGGCCAGAACAATCCTACAGCATCTCTCTGAAGCCTGGAGATGCTGG
AAAGCTAATATTCCCTGGAAGGTCCCAGGTCTTCCTACTCCCATAGAGAACATGATACTT
CGTTACGTTAAAATGAAGGCGGACTGGTGGACAAATACAGCGCACTACAACAGGGAGAGA
ATACGTCGCGGTGCCACTGTAGACAAAACCGTCTGTAAGAAGAACTTGGGAAGACTAACA
CGATTGTACTTAAAGGCTGAGCAGGAAAGACAGCATAATTATTTAAAGGATGGTCCATAC
ATATCCCCTGAAGAAGCAGTCGCTATTTACACGACAACAGTCCATTGGCTCGAGTCCCGA
CGTTTCGCGCCCATACCATTCCCGCCCCTGTCATACAAACACGACACCAAACTACTCATA
TTGGCTTTGGAGAGACTGAAAGAAGCTTACAGCGTTAAGTCGAGGCTCAATCAAAGTCAA
AGGGAAGAACTGGGTCTTATAGAACAGGCGTATGATAACCCACACGAGGCGCTGTCTAGG
ATAAAACGTCATTTGCTCACACAGAGGACTTTCAGAGAAGTGGGCATAGAGTTTATGGAC
CTCTACTCACATCTAGTGCCAGTATACGACGTGGAACCTCTAGAGAAGATAACGGACGCG
TACCTCGATCAATATCTTTGGTATGAAGCTGACAAACGACGTCTTCTACCGCCGTGGGTG
AAACCCGCTGACACAGAGCCCAGTCCGCTCCTCGTCTATAAATGGTGTCAAGGTATCAAC
AATCTTCAAGATGTATGGGAGGTCGGCGAAGGTGAATGCAACGTTTTGCTAGAGTCGAGA
TTTGAAAAACTCTATGAGAAGATTGATCTGACACTGCTGAATCGTCTCTTGCGTTTGATA
GTGGACCACAACATTGCTGATTACATGACGGCTAAGAACAACGTCGTCATTAATTACAAG
GATATGAATCATACAAATTCCTATGGTATCATTCGGGGTTTGCAATTTGCTTCCTTCATA
GTTCAATACTATGGTCTTGTACTGGATCTGTTAGTGCTGGGTCTGCAACGGGCCAGCGAA
ATGGCTGGACCTCCCCAACTACCAAACGACTTCTTGTCTTACCAAGAGAGGCCGGCGGAG
CAGGCGCATCCTATAAGACTGTATTGCAGATACATTGACAGAATACATATTTTCTTCAGA
TTCACAGCAGAAGAAGCTCGCGACCTCATCCAAAGGTACCTGACGGAACATCCCGACCCC
AATAATGAGAATATCGTCGGCTACAACAATAAAAAGTGCTGGCCGCGTGACGCCAGAATG
AGACTCATGAAACACGATGTTAACTTGGGTCGAGCGGTGTTCTGGGATATTAAGAACCGT
CTTCCACGTTCCGTCACCACTATACAGTGGGAGAATAGTTTCGTCTCGGTCTACTCCAAG
GACAACCCCAACTTGTTATTCAACATGGCCGGATTTGAGTGCAGGATATTGCCTAAATGC
CGTAGTCTTCACGAAGAGTTGTCACATCGCGATGGTGTTTGGAATCTACAAAACGAAGTT
ACCAAGGAACGCACAGCTCAATGCTACCTGAGAGTGGATGACGAATCACTGGCACGGTTC
CACAACCGTGTTAGACAGATATTGATGGCCTCAGGTTCAACGACCTTCACCAAAATTGTC
AACAAATGGAATACCGCTTTAATCGGTCTCATGACGTACTTCCGTGAAGCCGTAGTAAAC
ACTCAAGAGCTATTAGACCTGCTAGTGAAATGCGAGAATAAAATTCAAACTCGTATTAAA
ATTGGTTTGAACTCAAAAATGCCTTCGCGTTTCCCGCCTGTTGTGTTCTACACGCCCAAA
GAGTTAGGCGGGCTTGGGATGTTGTCTATGGGTCATGTTCTGATTCCACAGTCGGATCTG
CGTTGGTCAAAACAAACAGACGTTGGCATCACTCACTTCCGATCGGGAATGTCACATGAT
GAAGATCAGCTGATTCCTAATCTTTATCGTTACATACAACCATGGGAGGCTGAGTTTGTC
GACTCACAGAGAGTATGGGCTGAGTACGCTCTCAAGAGACAGGAGGCCAATGCTCAGAAC
AGGCGTCTCACACTCGAAGATTTGGAAGACTCCTGGGATAGAGGTATACCAAGAATAAAT
ACACTCTTCCAAAAGGACAGACACACACTTGCATATGACAAAGGATGGCGTATTCGTACC
GAGTTTAAACAGTATCAAGTACTGAAACAAAACCCGTTCTGGTGGACACATCAGAGACAC
GACGGAAAATTATGGAATCTGAACAACTACCGTACTGATATGATACAGGCTTTGGGAGGA
GTAGAAGGAATTCTGGAACACACATTGTTTAAGGGCACCTACTTCCCTACTTGGGAGGGT
TTGTTCTGGGAGAAGGCATCCGGTTTCGAGGAGTCGATGAAATATAAAAAACTGACAAAC
GCTCAACGATCTGGTTTGAACCAGATTCCAAACCGACGGTTCACCTTATGGTGGTCACCG
ACCATCAACAGAGCCAATGTGTATGTTGGTTTCCAGGTGCAATTAGATTTGACAGGTATA
TTCATGCACGGCAAAATACCAACACTCAAGATATCTCTTATCCAGATATTTAGAGCTCAC
TTGTGGCAGAAAGTCCATGAGTCAATTGTTATGGACTTGTGTCAAGTGTTTGATCAAGAA
TTGGATGCTCTGGAAATAGAAACAGTACAAAAGGAAACCATTCATCCTCGAAAATCATAC
AAGATGAACTCCTCATGTGCAGACATTTTACTCTTCTCAGCCTACAAGTGGAATGTCTCC
CGTCCCTCACTGCTGGCTGACACAAAGGATACAATGGATAATACCACAACCCAGAAATAT
TGGTTGGATATACAATTACGTTGGGGAGACTATGACTCGCACGATGTCGAGAGATACGCT
CGAGCGAAGTTCTTGGACTACACCACGGATAACATGTCCATATATCCTTCGCCCACTGGA
CTGCTGATCGCTATAGATTTGGCTTATAACTTGCACAGTGCATATGGTAATTGGTTCCCG
GGATGCAAGCCGCTCATACAACAGGCGATGGCGAAAATCATGAAGGCAAATCCAGCCCTT
TATGTGCTAAGGGAGCGTATACGGAAGGCTTTACAGTTGTACTCGTCTGAACCTACCGAG
CCATACTTGTCCAGTCAGAATTATGGAGAGCTGTTCTCAAATCAGATTATTTGGTTTGTC
GACGACACGAACGTGTACCGTGTAACTATACACAAGACCTTTGAAGGAAATCTCACAACT
AAACCTATTAACGGAGCCATCTTCATATTCAACCCTCGGACTGGACAACTGTTCCTCAAG
ATCATCCACACCAGCGTGTGGGCCGGTCAGAAACGTCTTGGACAGCTCGCTAAATGGAAA
ACAGCTGAAGAAGTGGCCGCCCTGATTCGTTCCCTGCCTGTTGAAGAACAACCCAAACAG
ATTATTGTCACAAGAAAGGGAATGTTGGATCCACTTGAGGTGCACTTGCTAGACTTCCCC
AACATTGTCATCAAAGGTTCAGAACTGCAGCTACCTTTCCAAGCGTGTCTTAAAGTGGAG
AAATTCGGAGACCTCATCCTCAAGGCCACAGAGCCACAGATGGTGCTCTTCAACTTGTAT
GATGATTGGTTAAAGACTATATCTTCTTATACCGCATTCAGCAGATTGATACTCATTCTG
AGAGCGTTACACGTGAACACTGAGCGTACTAAGGTACTTCTGAAACCAGACAAGACTACA
CTCACTGAACCACATCACATCTGGCCCACACTCACCGATGATGACTGGATCAAGGTGGAA
GTGCAACTCAAGGACCTTATATTGGCTGACTACGGCAAAAAGAATAACGTAAACGTGGCA
TCACTGACACAATCAGAAATCCGCGACATTATACTTGGTATGGAAATATCAGCTCCGTCA
GCACAGAGGCAGCAGATAGCCGAGATTGAGAAACAGAGCAAGGAACAGAGCCAGCTCACA
GCAACCACGACCAGGACTGTTAACAAACACGGAGACGAGATCATCACCTCCACCACCAGC
AACTACGAGTCGCAGACCTTCAGTTCCAAAACCGAATGGCGTGTGAGAGCGATATCAGCG
ACCAATCTTCACTTGAGGACAAACCACATCTATGTAAGCTCTGATGACATCAAGGAAAGT
GGCTATACTTATATATTGCCAAAGAACTTGCTCAAGAAGTTTGTCACCATATCCGATTTG
AGAGCACAGATCGCCTGCTACCTGTACGGCACATCGCCTCCTGACAACCCTCAAGTCCGT
GAAGTACACTGCGCGGTTCTTCCTCCTCAATGGGGAACACATCAGACTGTACATCTACCG
CGACAACTTCCTAAACATCCAGCTTTAGCCCACCTTCAACCATTGGGATGGATGCACACT
CAGCCTAACGAACTGCCACAACTTTCGCCACAGGATATAACCACTCACGCCAAAATAATG
GCGGAAAATCAGACGTGGGACGGTGAGAAGACGATCATAATCACGTGCTCCTTCACACCG
GGGTCGTGTTCGCTGACTGCATACAAGTTGACACCGAGCGGATATGAATGGGGCGCCAAG
AACACGGACAAAGGCAATAATCCCAAGGGATATCTCCCTAGCCACTATGAGCGAGTGCAA
ATGTTACTGTCCGATCGATTCCTAGGATACTTCATGGTGCCTTCACAGGGTAGCTGGAAT
TATAACTTCATGGGTGTCCGTCACGATCCCAACATGAAGTATGGCGTTCAGCTGGGGAAT
CCCCGCGAGTTCTACCACGAGGTGCATCGACCTGCACACTTTATGAACTTCGCGGCAATG
GAGGATTCAGTCGCGCCCATACCAGCTGCTGACCGAGAGGATTTCTTCGCCTAG

Protein sequence:

MKESQHILKGLTSSSLVIIDELCRGTNVEEGTSIAWSICEELLMSEAYTFLTTHFMYLTK
LEDLYYNVINVHTAVKEESQGPDVLEKRLIYQHKIEPGITQIKHYGIALAAKTNLPQDIV
SLAKELAELIESNTKPMSGSSQKETDLKLLYDLNAKIQMESRKNYNNEESIRNILRQFKN
KYPHIVEGLKLERNSRNIHNYSSHESPKVTGAAMSLPPYLLGPNPWATMMAQQQLAAAQQ
AALQAHAAAAAAAPPVPPTQPPKPHHIPEEKIKEKAQKWLQLQSKRFSDKRKFGFVDAQK
EDMPPEHIRKIIRDHGDMTSRKYRHDKRVYLGALKYMPHAVMKLLENMPMPWEQIRDVNV
LYHITGAITFVNEIPWVIEPVYIAQWGTMWIMMRREKRDRRHFKRMRFPPFDDEEPPLDY
ADNILDVEPLEPIQIELDPEEDGAVASWFYDHKPLLGTKHVNGSTYRKWNLSLPQMATLY
RLANQLLTDLVDDNYFYLFDSKSFFTAKALNMAIPGGPKFEPLVKDNSAGDEDWNEFNDI
NKIIIRQPIRTEYRIAFPYLYNNLPHFVQLSWYHTPNVVYIKTEDPDLPAFYFDPLINPI
SHRHTVKSLDPIPEEEDFLLPEEVTPFLQETALYTDNTANGIALLWAPRPFSMRSGRSRR
AIDVPLVKTWYKEHCPPGQPVKVRVSYQKLLKYYVLNSLKHRPPKPQKKRYLFRSFKSTK
FFQTTTLDWVEAGLQVCRQGYNMLNLLIHRKNLNYLHLDYNFNLKPVKTLTTKERKKSRF
GNAFHLCREILRLTKLIVDSHVQYRLNNVDSFQLADGLQYIFAHVGQLTGMYRYKYKLMR
QIRMCKDLKHLIYYRFNTGPVSKGPGCGFWAPGWRVWLFFMRGITPLLERWLGNLLSRQF
EGRHSKGVAKTVTKQRVESHFDLELRASVMHDIVDMMPEGIKQNKARTILQHLSEAWRCW
KANIPWKVPGLPTPIENMILRYVKMKADWWTNTAHYNRERIRRGATVDKTVCKKNLGRLT
RLYLKAEQERQHNYLKDGPYISPEEAVAIYTTTVHWLESRRFAPIPFPPLSYKHDTKLLI
LALERLKEAYSVKSRLNQSQREELGLIEQAYDNPHEALSRIKRHLLTQRTFREVGIEFMD
LYSHLVPVYDVEPLEKITDAYLDQYLWYEADKRRLLPPWVKPADTEPSPLLVYKWCQGIN
NLQDVWEVGEGECNVLLESRFEKLYEKIDLTLLNRLLRLIVDHNIADYMTAKNNVVINYK
DMNHTNSYGIIRGLQFASFIVQYYGLVLDLLVLGLQRASEMAGPPQLPNDFLSYQERPAE
QAHPIRLYCRYIDRIHIFFRFTAEEARDLIQRYLTEHPDPNNENIVGYNNKKCWPRDARM
RLMKHDVNLGRAVFWDIKNRLPRSVTTIQWENSFVSVYSKDNPNLLFNMAGFECRILPKC
RSLHEELSHRDGVWNLQNEVTKERTAQCYLRVDDESLARFHNRVRQILMASGSTTFTKIV
NKWNTALIGLMTYFREAVVNTQELLDLLVKCENKIQTRIKIGLNSKMPSRFPPVVFYTPK
ELGGLGMLSMGHVLIPQSDLRWSKQTDVGITHFRSGMSHDEDQLIPNLYRYIQPWEAEFV
DSQRVWAEYALKRQEANAQNRRLTLEDLEDSWDRGIPRINTLFQKDRHTLAYDKGWRIRT
EFKQYQVLKQNPFWWTHQRHDGKLWNLNNYRTDMIQALGGVEGILEHTLFKGTYFPTWEG
LFWEKASGFEESMKYKKLTNAQRSGLNQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGI
FMHGKIPTLKISLIQIFRAHLWQKVHESIVMDLCQVFDQELDALEIETVQKETIHPRKSY
KMNSSCADILLFSAYKWNVSRPSLLADTKDTMDNTTTQKYWLDIQLRWGDYDSHDVERYA
RAKFLDYTTDNMSIYPSPTGLLIAIDLAYNLHSAYGNWFPGCKPLIQQAMAKIMKANPAL
YVLRERIRKALQLYSSEPTEPYLSSQNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTT
KPINGAIFIFNPRTGQLFLKIIHTSVWAGQKRLGQLAKWKTAEEVAALIRSLPVEEQPKQ
IIVTRKGMLDPLEVHLLDFPNIVIKGSELQLPFQACLKVEKFGDLILKATEPQMVLFNLY
DDWLKTISSYTAFSRLILILRALHVNTERTKVLLKPDKTTLTEPHHIWPTLTDDDWIKVE
VQLKDLILADYGKKNNVNVASLTQSEIRDIILGMEISAPSAQRQQIAEIEKQSKEQSQLT
ATTTRTVNKHGDEIITSTTSNYESQTFSSKTEWRVRAISATNLHLRTNHIYVSSDDIKES
GYTYILPKNLLKKFVTISDLRAQIACYLYGTSPPDNPQVREVHCAVLPPQWGTHQTVHLP
RQLPKHPALAHLQPLGWMHTQPNELPQLSPQDITTHAKIMAENQTWDGEKTIIITCSFTP
GSCSLTAYKLTPSGYEWGAKNTDKGNNPKGYLPSHYERVQMLLSDRFLGYFMVPSQGSWN
YNFMGVRHDPNMKYGVQLGNPREFYHEVHRPAHFMNFAAMEDSVAPIPAADREDFFA