New model in OGS2.0 | DPOGS207852  |
---|---|
Genomic Position | scaffold1638:+ 384-19426 |
CDS Length | 7734 |
Paired RNAseq reads   | 15287 |
Single RNAseq reads   | 37982 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA009816 (1e-156) |
Best Drosophila hit   | pre-mRNA processing factor 8 (0.0) |
Best Human hit | pre-mRNA-processing-splicing factor 8 (0.0) |
Best NR hit (blastp)   | PREDICTED: similar to CG8877-PA [Apis mellifera] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to pre-mrna splicing factor prp8 [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0000398 nuclear mRNA splicing, via spliceosome; GO:0005681 spliceosomal complex; GO:0005682 U5 snRNP; GO:0071013 catalytic step 2 spliceosome; GO:0071011 precatalytic spliceosome |
InterPro families    | IPR012592 PROCN; IPR021983 PRP8 domain IV core; IPR019580 Pre-mRNA-processing-splicing factor 8, U6-snRNA-binding; IPR012591 Pre-mRNA-processing-splicing factor 8; IPR019581 Pre-mRNA-processing-splicing factor 8, U5-snRNA-binding; IPR012984 PRO, C-terminal; IPR019582 RNA recognition motif, spliceosomal PrP8; IPR000432 DNA mismatch repair protein MutS, C-terminal domain; IPR000555 Mov34/MPN/PAD-1 |
Orthology group | MCL15755 |
Nucleotide sequence:
ATGAAGGAAAGTCAGCATATATTAAAAGGGCTTACATCTTCGAGTTTGGTCATAATCGAC
GAACTCTGCCGTGGAACGAACGTTGAAGAGGGTACGAGTATAGCGTGGTCGATTTGTGAG
GAACTCTTGATGAGTGAGGCGTATACATTTTTAACAACCCATTTCATGTATTTAACAAAA
CTTGAGGACTTATACTACAATGTTATAAATGTCCATACAGCTGTGAAAGAGGAATCCCAA
GGTCCAGATGTACTGGAAAAGAGATTGATATATCAACATAAAATTGAACCTGGAATTACA
CAGATTAAACATTACGGTATAGCATTAGCTGCTAAGACAAATCTACCACAAGATATTGTT
AGTTTGGCTAAAGAACTTGCGGAACTAATAGAAAGCAACACAAAGCCAATGTCAGGTTCA
TCGCAAAAAGAAACAGATTTAAAACTATTATATGATTTGAATGCCAAAATTCAGATGGAA
TCTAGAAAGAATTATAATAATGAAGAATCTATAAGAAATATATTGAGACAATTTAAGAAT
AAATATCCACACATAGTAGAGGGATTAAAGTTAGAAAGAAATTCGAGAAATATTCATAAT
TATTCATCCCATGAGAGTCCTAAAGTAACCGGCGCAGCGATGTCGCTGCCGCCATACCTA
TTGGGGCCCAACCCCTGGGCCACCATGATGGCGCAGCAGCAGCTAGCGGCAGCTCAACAA
GCAGCGCTCCAAGCGCATGCTGCCGCTGCTGCTGCTGCACCGCCCGTGCCGCCGACCCAG
CCACCTAAACCTCACCACATACCAGAAGAAAAGATCAAAGAGAAAGCTCAAAAATGGCTT
CAGCTGCAATCAAAGCGTTTCTCGGACAAGAGGAAATTTGGTTTTGTGGACGCCCAAAAG
GAAGATATGCCTCCGGAGCACATTCGAAAGATAATCCGAGATCATGGTGATATGACCAGT
CGCAAGTATCGTCATGACAAACGAGTGTATCTGGGAGCCCTTAAGTATATGCCACATGCT
GTAATGAAGCTTCTAGAAAACATGCCCATGCCCTGGGAACAGATCAGAGATGTCAATGTC
CTGTACCACATCACCGGTGCTATAACATTTGTCAATGAGATTCCCTGGGTCATAGAGCCA
GTGTATATCGCGCAGTGGGGCACAATGTGGATTATGATGCGTAGAGAGAAACGTGATCGT
CGGCATTTCAAGCGTATGAGATTCCCACCATTTGATGATGAAGAACCACCTTTGGATTAT
GCTGACAACATTTTGGATGTTGAACCTCTGGAACCCATACAAATTGAATTAGATCCGGAA
GAGGACGGAGCTGTGGCCTCATGGTTTTACGACCACAAACCTTTATTGGGAACAAAACAC
GTGAACGGCTCGACATACAGGAAGTGGAATCTTAGCTTACCACAGATGGCTACACTGTAT
CGTCTTGCAAATCAGCTTCTAACTGACTTAGTAGACGATAATTACTTCTACCTATTTGAT
TCTAAAAGTTTCTTTACCGCGAAAGCTCTAAACATGGCAATTCCCGGAGGTCCCAAGTTT
GAACCACTTGTCAAAGACAACTCTGCTGGTGATGAAGACTGGAATGAATTCAACGATATC
AACAAGATTATTATTCGTCAGCCGATCAGAACAGAGTACAGAATAGCTTTCCCATACCTT
TACAACAACTTGCCGCATTTCGTCCAATTATCCTGGTATCATACTCCCAATGTGGTGTAT
ATAAAAACAGAAGATCCCGACTTGCCAGCCTTCTACTTTGATCCGCTTATCAATCCAATC
TCTCACCGTCATACCGTGAAGTCATTAGATCCAATTCCGGAAGAGGAAGATTTCTTGCTA
CCTGAAGAAGTAACGCCATTCCTGCAGGAAACGGCTTTGTACACAGACAACACCGCTAAC
GGGATCGCTTTGCTGTGGGCTCCGCGACCTTTTAGTATGAGATCAGGTCGTTCCCGGCGA
GCGATCGACGTTCCTCTCGTGAAGACATGGTACAAGGAGCACTGCCCGCCAGGACAACCC
GTGAAAGTGCGTGTGTCATATCAAAAACTACTCAAGTACTACGTGCTTAATTCGCTCAAA
CATAGGCCACCTAAGCCACAGAAGAAAAGATATCTCTTCCGTTCGTTCAAATCGACGAAG
TTCTTCCAAACTACAACTTTGGATTGGGTGGAGGCTGGTCTCCAAGTATGCAGACAGGGC
TACAACATGCTCAATCTGCTGATACACAGAAAGAATCTTAATTATCTGCATTTAGATTAT
AATTTCAACTTGAAACCAGTCAAGACTCTCACTACTAAAGAGAGAAAGAAGTCTCGGTTT
GGTAATGCATTCCACTTGTGCCGCGAGATCCTCCGCCTCACTAAACTGATAGTGGATTCT
CATGTTCAATATCGTCTGAACAACGTGGACTCGTTCCAGCTAGCGGACGGTCTACAGTAC
ATTTTCGCTCACGTCGGCCAACTCACCGGCATGTACAGATACAAGTACAAACTCATGAGA
CAGATACGAATGTGCAAGGACTTGAAACATCTCATCTACTACAGATTTAATACGGGTCCA
GTATCTAAGGGTCCAGGATGTGGTTTCTGGGCACCTGGTTGGCGTGTGTGGTTGTTCTTC
ATGCGAGGTATCACGCCGCTACTTGAGCGATGGCTTGGAAACCTATTGTCGAGACAATTC
GAGGGTCGCCATTCGAAAGGGGTCGCAAAAACGGTGACGAAACAGCGCGTAGAGTCGCAC
TTCGACCTGGAACTGCGAGCATCCGTCATGCACGATATTGTGGACATGATGCCGGAAGGT
ATCAAACAGAATAAGGCCAGAACAATCCTACAGCATCTCTCTGAAGCCTGGAGATGCTGG
AAAGCTAATATTCCCTGGAAGGTCCCAGGTCTTCCTACTCCCATAGAGAACATGATACTT
CGTTACGTTAAAATGAAGGCGGACTGGTGGACAAATACAGCGCACTACAACAGGGAGAGA
ATACGTCGCGGTGCCACTGTAGACAAAACCGTCTGTAAGAAGAACTTGGGAAGACTAACA
CGATTGTACTTAAAGGCTGAGCAGGAAAGACAGCATAATTATTTAAAGGATGGTCCATAC
ATATCCCCTGAAGAAGCAGTCGCTATTTACACGACAACAGTCCATTGGCTCGAGTCCCGA
CGTTTCGCGCCCATACCATTCCCGCCCCTGTCATACAAACACGACACCAAACTACTCATA
TTGGCTTTGGAGAGACTGAAAGAAGCTTACAGCGTTAAGTCGAGGCTCAATCAAAGTCAA
AGGGAAGAACTGGGTCTTATAGAACAGGCGTATGATAACCCACACGAGGCGCTGTCTAGG
ATAAAACGTCATTTGCTCACACAGAGGACTTTCAGAGAAGTGGGCATAGAGTTTATGGAC
CTCTACTCACATCTAGTGCCAGTATACGACGTGGAACCTCTAGAGAAGATAACGGACGCG
TACCTCGATCAATATCTTTGGTATGAAGCTGACAAACGACGTCTTCTACCGCCGTGGGTG
AAACCCGCTGACACAGAGCCCAGTCCGCTCCTCGTCTATAAATGGTGTCAAGGTATCAAC
AATCTTCAAGATGTATGGGAGGTCGGCGAAGGTGAATGCAACGTTTTGCTAGAGTCGAGA
TTTGAAAAACTCTATGAGAAGATTGATCTGACACTGCTGAATCGTCTCTTGCGTTTGATA
GTGGACCACAACATTGCTGATTACATGACGGCTAAGAACAACGTCGTCATTAATTACAAG
GATATGAATCATACAAATTCCTATGGTATCATTCGGGGTTTGCAATTTGCTTCCTTCATA
GTTCAATACTATGGTCTTGTACTGGATCTGTTAGTGCTGGGTCTGCAACGGGCCAGCGAA
ATGGCTGGACCTCCCCAACTACCAAACGACTTCTTGTCTTACCAAGAGAGGCCGGCGGAG
CAGGCGCATCCTATAAGACTGTATTGCAGATACATTGACAGAATACATATTTTCTTCAGA
TTCACAGCAGAAGAAGCTCGCGACCTCATCCAAAGGTACCTGACGGAACATCCCGACCCC
AATAATGAGAATATCGTCGGCTACAACAATAAAAAGTGCTGGCCGCGTGACGCCAGAATG
AGACTCATGAAACACGATGTTAACTTGGGTCGAGCGGTGTTCTGGGATATTAAGAACCGT
CTTCCACGTTCCGTCACCACTATACAGTGGGAGAATAGTTTCGTCTCGGTCTACTCCAAG
GACAACCCCAACTTGTTATTCAACATGGCCGGATTTGAGTGCAGGATATTGCCTAAATGC
CGTAGTCTTCACGAAGAGTTGTCACATCGCGATGGTGTTTGGAATCTACAAAACGAAGTT
ACCAAGGAACGCACAGCTCAATGCTACCTGAGAGTGGATGACGAATCACTGGCACGGTTC
CACAACCGTGTTAGACAGATATTGATGGCCTCAGGTTCAACGACCTTCACCAAAATTGTC
AACAAATGGAATACCGCTTTAATCGGTCTCATGACGTACTTCCGTGAAGCCGTAGTAAAC
ACTCAAGAGCTATTAGACCTGCTAGTGAAATGCGAGAATAAAATTCAAACTCGTATTAAA
ATTGGTTTGAACTCAAAAATGCCTTCGCGTTTCCCGCCTGTTGTGTTCTACACGCCCAAA
GAGTTAGGCGGGCTTGGGATGTTGTCTATGGGTCATGTTCTGATTCCACAGTCGGATCTG
CGTTGGTCAAAACAAACAGACGTTGGCATCACTCACTTCCGATCGGGAATGTCACATGAT
GAAGATCAGCTGATTCCTAATCTTTATCGTTACATACAACCATGGGAGGCTGAGTTTGTC
GACTCACAGAGAGTATGGGCTGAGTACGCTCTCAAGAGACAGGAGGCCAATGCTCAGAAC
AGGCGTCTCACACTCGAAGATTTGGAAGACTCCTGGGATAGAGGTATACCAAGAATAAAT
ACACTCTTCCAAAAGGACAGACACACACTTGCATATGACAAAGGATGGCGTATTCGTACC
GAGTTTAAACAGTATCAAGTACTGAAACAAAACCCGTTCTGGTGGACACATCAGAGACAC
GACGGAAAATTATGGAATCTGAACAACTACCGTACTGATATGATACAGGCTTTGGGAGGA
GTAGAAGGAATTCTGGAACACACATTGTTTAAGGGCACCTACTTCCCTACTTGGGAGGGT
TTGTTCTGGGAGAAGGCATCCGGTTTCGAGGAGTCGATGAAATATAAAAAACTGACAAAC
GCTCAACGATCTGGTTTGAACCAGATTCCAAACCGACGGTTCACCTTATGGTGGTCACCG
ACCATCAACAGAGCCAATGTGTATGTTGGTTTCCAGGTGCAATTAGATTTGACAGGTATA
TTCATGCACGGCAAAATACCAACACTCAAGATATCTCTTATCCAGATATTTAGAGCTCAC
TTGTGGCAGAAAGTCCATGAGTCAATTGTTATGGACTTGTGTCAAGTGTTTGATCAAGAA
TTGGATGCTCTGGAAATAGAAACAGTACAAAAGGAAACCATTCATCCTCGAAAATCATAC
AAGATGAACTCCTCATGTGCAGACATTTTACTCTTCTCAGCCTACAAGTGGAATGTCTCC
CGTCCCTCACTGCTGGCTGACACAAAGGATACAATGGATAATACCACAACCCAGAAATAT
TGGTTGGATATACAATTACGTTGGGGAGACTATGACTCGCACGATGTCGAGAGATACGCT
CGAGCGAAGTTCTTGGACTACACCACGGATAACATGTCCATATATCCTTCGCCCACTGGA
CTGCTGATCGCTATAGATTTGGCTTATAACTTGCACAGTGCATATGGTAATTGGTTCCCG
GGATGCAAGCCGCTCATACAACAGGCGATGGCGAAAATCATGAAGGCAAATCCAGCCCTT
TATGTGCTAAGGGAGCGTATACGGAAGGCTTTACAGTTGTACTCGTCTGAACCTACCGAG
CCATACTTGTCCAGTCAGAATTATGGAGAGCTGTTCTCAAATCAGATTATTTGGTTTGTC
GACGACACGAACGTGTACCGTGTAACTATACACAAGACCTTTGAAGGAAATCTCACAACT
AAACCTATTAACGGAGCCATCTTCATATTCAACCCTCGGACTGGACAACTGTTCCTCAAG
ATCATCCACACCAGCGTGTGGGCCGGTCAGAAACGTCTTGGACAGCTCGCTAAATGGAAA
ACAGCTGAAGAAGTGGCCGCCCTGATTCGTTCCCTGCCTGTTGAAGAACAACCCAAACAG
ATTATTGTCACAAGAAAGGGAATGTTGGATCCACTTGAGGTGCACTTGCTAGACTTCCCC
AACATTGTCATCAAAGGTTCAGAACTGCAGCTACCTTTCCAAGCGTGTCTTAAAGTGGAG
AAATTCGGAGACCTCATCCTCAAGGCCACAGAGCCACAGATGGTGCTCTTCAACTTGTAT
GATGATTGGTTAAAGACTATATCTTCTTATACCGCATTCAGCAGATTGATACTCATTCTG
AGAGCGTTACACGTGAACACTGAGCGTACTAAGGTACTTCTGAAACCAGACAAGACTACA
CTCACTGAACCACATCACATCTGGCCCACACTCACCGATGATGACTGGATCAAGGTGGAA
GTGCAACTCAAGGACCTTATATTGGCTGACTACGGCAAAAAGAATAACGTAAACGTGGCA
TCACTGACACAATCAGAAATCCGCGACATTATACTTGGTATGGAAATATCAGCTCCGTCA
GCACAGAGGCAGCAGATAGCCGAGATTGAGAAACAGAGCAAGGAACAGAGCCAGCTCACA
GCAACCACGACCAGGACTGTTAACAAACACGGAGACGAGATCATCACCTCCACCACCAGC
AACTACGAGTCGCAGACCTTCAGTTCCAAAACCGAATGGCGTGTGAGAGCGATATCAGCG
ACCAATCTTCACTTGAGGACAAACCACATCTATGTAAGCTCTGATGACATCAAGGAAAGT
GGCTATACTTATATATTGCCAAAGAACTTGCTCAAGAAGTTTGTCACCATATCCGATTTG
AGAGCACAGATCGCCTGCTACCTGTACGGCACATCGCCTCCTGACAACCCTCAAGTCCGT
GAAGTACACTGCGCGGTTCTTCCTCCTCAATGGGGAACACATCAGACTGTACATCTACCG
CGACAACTTCCTAAACATCCAGCTTTAGCCCACCTTCAACCATTGGGATGGATGCACACT
CAGCCTAACGAACTGCCACAACTTTCGCCACAGGATATAACCACTCACGCCAAAATAATG
GCGGAAAATCAGACGTGGGACGGTGAGAAGACGATCATAATCACGTGCTCCTTCACACCG
GGGTCGTGTTCGCTGACTGCATACAAGTTGACACCGAGCGGATATGAATGGGGCGCCAAG
AACACGGACAAAGGCAATAATCCCAAGGGATATCTCCCTAGCCACTATGAGCGAGTGCAA
ATGTTACTGTCCGATCGATTCCTAGGATACTTCATGGTGCCTTCACAGGGTAGCTGGAAT
TATAACTTCATGGGTGTCCGTCACGATCCCAACATGAAGTATGGCGTTCAGCTGGGGAAT
CCCCGCGAGTTCTACCACGAGGTGCATCGACCTGCACACTTTATGAACTTCGCGGCAATG
GAGGATTCAGTCGCGCCCATACCAGCTGCTGACCGAGAGGATTTCTTCGCCTAG
Protein sequence:
MKESQHILKGLTSSSLVIIDELCRGTNVEEGTSIAWSICEELLMSEAYTFLTTHFMYLTK
LEDLYYNVINVHTAVKEESQGPDVLEKRLIYQHKIEPGITQIKHYGIALAAKTNLPQDIV
SLAKELAELIESNTKPMSGSSQKETDLKLLYDLNAKIQMESRKNYNNEESIRNILRQFKN
KYPHIVEGLKLERNSRNIHNYSSHESPKVTGAAMSLPPYLLGPNPWATMMAQQQLAAAQQ
AALQAHAAAAAAAPPVPPTQPPKPHHIPEEKIKEKAQKWLQLQSKRFSDKRKFGFVDAQK
EDMPPEHIRKIIRDHGDMTSRKYRHDKRVYLGALKYMPHAVMKLLENMPMPWEQIRDVNV
LYHITGAITFVNEIPWVIEPVYIAQWGTMWIMMRREKRDRRHFKRMRFPPFDDEEPPLDY
ADNILDVEPLEPIQIELDPEEDGAVASWFYDHKPLLGTKHVNGSTYRKWNLSLPQMATLY
RLANQLLTDLVDDNYFYLFDSKSFFTAKALNMAIPGGPKFEPLVKDNSAGDEDWNEFNDI
NKIIIRQPIRTEYRIAFPYLYNNLPHFVQLSWYHTPNVVYIKTEDPDLPAFYFDPLINPI
SHRHTVKSLDPIPEEEDFLLPEEVTPFLQETALYTDNTANGIALLWAPRPFSMRSGRSRR
AIDVPLVKTWYKEHCPPGQPVKVRVSYQKLLKYYVLNSLKHRPPKPQKKRYLFRSFKSTK
FFQTTTLDWVEAGLQVCRQGYNMLNLLIHRKNLNYLHLDYNFNLKPVKTLTTKERKKSRF
GNAFHLCREILRLTKLIVDSHVQYRLNNVDSFQLADGLQYIFAHVGQLTGMYRYKYKLMR
QIRMCKDLKHLIYYRFNTGPVSKGPGCGFWAPGWRVWLFFMRGITPLLERWLGNLLSRQF
EGRHSKGVAKTVTKQRVESHFDLELRASVMHDIVDMMPEGIKQNKARTILQHLSEAWRCW
KANIPWKVPGLPTPIENMILRYVKMKADWWTNTAHYNRERIRRGATVDKTVCKKNLGRLT
RLYLKAEQERQHNYLKDGPYISPEEAVAIYTTTVHWLESRRFAPIPFPPLSYKHDTKLLI
LALERLKEAYSVKSRLNQSQREELGLIEQAYDNPHEALSRIKRHLLTQRTFREVGIEFMD
LYSHLVPVYDVEPLEKITDAYLDQYLWYEADKRRLLPPWVKPADTEPSPLLVYKWCQGIN
NLQDVWEVGEGECNVLLESRFEKLYEKIDLTLLNRLLRLIVDHNIADYMTAKNNVVINYK
DMNHTNSYGIIRGLQFASFIVQYYGLVLDLLVLGLQRASEMAGPPQLPNDFLSYQERPAE
QAHPIRLYCRYIDRIHIFFRFTAEEARDLIQRYLTEHPDPNNENIVGYNNKKCWPRDARM
RLMKHDVNLGRAVFWDIKNRLPRSVTTIQWENSFVSVYSKDNPNLLFNMAGFECRILPKC
RSLHEELSHRDGVWNLQNEVTKERTAQCYLRVDDESLARFHNRVRQILMASGSTTFTKIV
NKWNTALIGLMTYFREAVVNTQELLDLLVKCENKIQTRIKIGLNSKMPSRFPPVVFYTPK
ELGGLGMLSMGHVLIPQSDLRWSKQTDVGITHFRSGMSHDEDQLIPNLYRYIQPWEAEFV
DSQRVWAEYALKRQEANAQNRRLTLEDLEDSWDRGIPRINTLFQKDRHTLAYDKGWRIRT
EFKQYQVLKQNPFWWTHQRHDGKLWNLNNYRTDMIQALGGVEGILEHTLFKGTYFPTWEG
LFWEKASGFEESMKYKKLTNAQRSGLNQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGI
FMHGKIPTLKISLIQIFRAHLWQKVHESIVMDLCQVFDQELDALEIETVQKETIHPRKSY
KMNSSCADILLFSAYKWNVSRPSLLADTKDTMDNTTTQKYWLDIQLRWGDYDSHDVERYA
RAKFLDYTTDNMSIYPSPTGLLIAIDLAYNLHSAYGNWFPGCKPLIQQAMAKIMKANPAL
YVLRERIRKALQLYSSEPTEPYLSSQNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTT
KPINGAIFIFNPRTGQLFLKIIHTSVWAGQKRLGQLAKWKTAEEVAALIRSLPVEEQPKQ
IIVTRKGMLDPLEVHLLDFPNIVIKGSELQLPFQACLKVEKFGDLILKATEPQMVLFNLY
DDWLKTISSYTAFSRLILILRALHVNTERTKVLLKPDKTTLTEPHHIWPTLTDDDWIKVE
VQLKDLILADYGKKNNVNVASLTQSEIRDIILGMEISAPSAQRQQIAEIEKQSKEQSQLT
ATTTRTVNKHGDEIITSTTSNYESQTFSSKTEWRVRAISATNLHLRTNHIYVSSDDIKES
GYTYILPKNLLKKFVTISDLRAQIACYLYGTSPPDNPQVREVHCAVLPPQWGTHQTVHLP
RQLPKHPALAHLQPLGWMHTQPNELPQLSPQDITTHAKIMAENQTWDGEKTIIITCSFTP
GSCSLTAYKLTPSGYEWGAKNTDKGNNPKGYLPSHYERVQMLLSDRFLGYFMVPSQGSWN
YNFMGVRHDPNMKYGVQLGNPREFYHEVHRPAHFMNFAAMEDSVAPIPAADREDFFA
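Consistency check (illustrative):
The nucleotide and protein sequences in this record can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the helper names `translate` and `matches_record` are hypothetical and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
# Assumption: this table is appropriate for the CDS in this record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon; stop codons become '*'.

    Whitespace and line breaks are ignored, so the sequence can be
    pasted in 60-column blocks. Codons with ambiguous bases map to 'X'.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError(f"CDS length {len(cds)} is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


def matches_record(cds: str, protein: str) -> bool:
    """True if the CDS ends in a stop codon and translates to the protein."""
    translated = translate(cds)
    expected = "".join(protein.split()).upper()
    return translated.endswith("*") and translated[:-1] == expected


# First 60 nt of the nucleotide sequence listed in this record.
first_block = "ATGAAGGAAAGTCAGCATATATTAAAAGGGCTTACATCTTCGAGTTTGGTCATAATCGAC"
print(translate(first_block))  # MKESQHILKGLTSSSLVIID
```

Passing the full nucleotide and protein blocks to `matches_record` should return `True` if the record is internally consistent: a CDS length of 7734 corresponds to 2578 codons, that is, 2577 residues plus the terminal stop codon.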