DPGLEAN05241 in OGS1.0

New model in OGS2.0DPOGS200625 
Genomic Positionscaffold3658:+ 3213-12091
See gene structure
CDS Length2886
Paired RNAseq reads  1345
Single RNAseq reads  3487
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008911 (0.0)
Best Drosophila hit  CG6841 (0.0)
Best Human hitpre-mRNA-processing factor 6 (0.0)
Best NR hit (blastp)  PREDICTED: similar to CG6841-PA [Apis mellifera] (0.0)
Best NR hit (blastx)  PREDICTED: similar to CG6841-PA [Apis mellifera] (0.0)
GeneOntology terms







  
GO:0005684 U2-type spliceosomal complex
GO:0000398 nuclear mRNA splicing, via spliceosome
GO:0005682 U5 snRNP
GO:0030532 small nuclear ribonucleoprotein complex
GO:0005488 binding
GO:0005634 nucleus
GO:0000381 regulation of alternative nuclear mRNA splicing, via spliceosome
GO:0071013 catalytic step 2 spliceosome
GO:0071011 precatalytic spliceosome
InterPro families


  
IPR003107 RNA-processing protein, HAT helix
IPR013026 Tetratricopeptide repeat-containing
IPR010491 PRP1 splicing factor, N-terminal
IPR011990 Tetratricopeptide-like helical
Orthology groupMCL14393

Nucleotide sequence:

ATGTCAGTTCCTCCGCAAGCATTTGTAAACAAAAACAAAAAACATTTTCTTGGTATTCCG
GCACCTCTTGGTTATGTGGCTGGTGTTGGTAGAGGAGCTACAGGTTTTACTACTAGATCA
GATATTGGACCCGCCAGAGACGCCAATGACGTATCTGATGATCGTCATGCACCCCCAGCA
GCCAAGCGAAAAAAAACTGAAGAGGAAGACGATGATGAAGACTTGAATGACTCTAATTAT
GACGAATTTTCAGGTTATAGTGGCTCTCTCTTTTCAAAGGATCCATATGACAAAGATGAT
GCAGAGGCAGATGCTATATATGAGTCAATTGATAAACGAATGGATGAAAAAAGAAAAGAG
TATAGAGAGAAGAGACTTAAAGAGGATTTGGAGAGATATCGCCAAGAGAGACCCAAAATA
CAACAACAGTTTTCTGATCTCAAACGGGAATTGAAAATGGTGTCGGAGGATGAATGGGCT
GCTATACCGGAAGTTGGTGACGCGAGGAACAGGAAGCAAAGGAATCCAAGAGCGGAGAAA
TTTACTCCTTTACCAGATAGTGTGTTATCTAGGAATCTTGGTGGAGAGTCTAGTTCAACA
ATTGATCCTAGTTCAGGCTTAGCTTCTATGATGCCGGGGGTTATGACACCTGGAATGCTG
ACACCTTCAGGTGATCTCGATCTACGTAAGATAGGTCAAGCGAGGAACACTTTAATGACG
GTGAAATTGTCACAAGTCTCCGACTCTGTGAGCGGTCAGACAGTGGTGGACCCTAAAGGT
TACTTAACTGACTTACAGTCCATGATACCTACCTATGGCGGTGACATTAATGACATCAAA
AAGGCCAGGCTGCTCCTCAAGTCGGTGAGGGAAACCAATCCTAACCATCCACCAGCTTGG
ATTGCTAGTGCTAGATTAGAAGAAGTTACTGGTAAAATCCAGTCAGCCCGCAACCTCATA
ATGAAGGGTTGTGAGGTTAACCCCAGCAGTGAAGAGCTCTGGTTGGAAGCGGCTCGTCTA
CAACCACCGGATACAGCTCGGGCGGTTATAGCACACGCCGCCCGCAACCTGCCTCATAGT
GTACGAGTTTGGGTGAAGGCGGCTGAACTGGAACAAGAACCAAAGGCTAAACGTCGTGTT
TACAGAAAGGCGTTGGAGCATATACCAAATTCAGTGCGTTTGTGGAAAGCGGCCGTCGAA
TTGGAGAACCCTGAAGATGCTAGGATCCTGCTTTCAAGGGCCGTGGAGTGTTGTCCGACG
AGCGTAGAACTATGGCTGGCTCTGGCTAGACTGGAAACATATGAAAATGCAAGAAAAGTA
CTAAATAAGGCACGTGAAAATATTCCCACCGATAGACAGATCTGGGTAACAGCTGCTAAA
CTTGAAGAGGCTCAAGGCAACACTCATATGGTAGAAAAGATTATAGACCGTGCCATAACG
TCGCTTAGTGCTAATGGCGTTGAAATAAACAGAGAGCATTGGTTCAAAGAGGCGATGGAG
GCTGAGAAATCTGGAGCAGTTCATACGTGTCAGGTGATCGGTCACGGCATTGAACCAGAG
GATCAAAAACATACTTGGATGGAGGATGCTGATGCTTGCGCCAACGAAGGTGCGTACGAG
TGTGCCCGGGCGGTGTATGGGTACGCGCTATCAGTTTTCCCCTCGAAGAAGTCCATCTGG
CTGAGAGCCGCCTACCTCGAGAAGCAGCATGGTACGAGGGCGACGTTGGAGGCTCTGTTA
CAGAGGGCGGTCGCTCACTGTCCCAAGAGCGAAGTCCTATGGCTCATGGGGGCGAAGTCC
AAGTGGCTAGCGGGTGACGTGAGAGCGGCTAGACAGATCCTGTCGTTAGCTTTCCAAGCC
AATCCTAACTCGGAGGAGATCTGGCTGGCCGCTGTCAAACTGGAGAGCGAGAACAAAGAA
TATGATCGAGCCAGGAGGTTGTTGGAGAAAGCCAGAGCGTCCGCACCCACACCTAGGGTC
ATGATAAAATCAGCAAAACTAGAATGGGCTTTGAACAAATTAGACGTAGCCCTGAACCTG
CTGTCAGAAGCTATCACAATATTTGGGGATTACGCGAAGCTACACATGATGAAAGGACAG
ATAGAGGAGCAGATGGGGAGGGATAGTGACGCACACAACACGTACACACAAGGGTTGAAG
AAGTGCGCTACCAGTGTCCCTATGTGGATACTGCTGTCGAGATTGGAAGAAAAACTCAAA
CACGTCACCAAAGCCAGATCTGTGTTGGAGAAGGCGCGTCTCAGGAATCAGAAGAACGCT
GAGTTATGGTTGGAGAGTGTTCGCCTGGAACAGCGAGCTGGTTGTGTGGAAGCGGCCGGC
TCCTTGTTGGCGAAGGCGCTCCAGGAGTGTCCTACGGCCGGCAGACTGTGGGCCCTCGCC
GTCTTCATGGAGCCCCGCCCGCAGAGGAAGACTAAGAGTGTGGATGCCCTGAAGAAATGT
GAACACGACGCTCACGTCCTGCTGGCGGTGTCGCAGCTGTTCTGGACGGAGAGGAAATTA
AATAAATGCAGGGAATGGTTCAACAGAACTGTGGATGCTCTGAAGAAATGTGAACACGAC
GCTCACGTCCTGCTGGCGGTGTCGCAGCTGTTCTGGACGGAGAGGAAATTAAATAAATGC
AGAGAATGGTTCAACAGAACTGTGAAAATCGACCCGGATCTCGGTGACGCTTGGGCTTAC
TTCTACAAATTCGAATTGCACCACGGCAACGAACAGCAACAGGAAGACGTGAAGAACAGG
TGCAAGGCCGCCGAACCCCACCACGGAGAGAACTGGTGCAAGGTCTCCAAAGACATAGCC
AACTGGTGTTACAATACAGAACAGATATTGTTACTGGTGGCTAAGAATCTACCCGTGCCC
ACGTAG

Protein sequence:

MSVPPQAFVNKNKKHFLGIPAPLGYVAGVGRGATGFTTRSDIGPARDANDVSDDRHAPPA
AKRKKTEEEDDDEDLNDSNYDEFSGYSGSLFSKDPYDKDDAEADAIYESIDKRMDEKRKE
YREKRLKEDLERYRQERPKIQQQFSDLKRELKMVSEDEWAAIPEVGDARNRKQRNPRAEK
FTPLPDSVLSRNLGGESSSTIDPSSGLASMMPGVMTPGMLTPSGDLDLRKIGQARNTLMT
VKLSQVSDSVSGQTVVDPKGYLTDLQSMIPTYGGDINDIKKARLLLKSVRETNPNHPPAW
IASARLEEVTGKIQSARNLIMKGCEVNPSSEELWLEAARLQPPDTARAVIAHAARNLPHS
VRVWVKAAELEQEPKAKRRVYRKALEHIPNSVRLWKAAVELENPEDARILLSRAVECCPT
SVELWLALARLETYENARKVLNKARENIPTDRQIWVTAAKLEEAQGNTHMVEKIIDRAIT
SLSANGVEINREHWFKEAMEAEKSGAVHTCQVIGHGIEPEDQKHTWMEDADACANEGAYE
CARAVYGYALSVFPSKKSIWLRAAYLEKQHGTRATLEALLQRAVAHCPKSEVLWLMGAKS
KWLAGDVRAARQILSLAFQANPNSEEIWLAAVKLESENKEYDRARRLLEKARASAPTPRV
MIKSAKLEWALNKLDVALNLLSEAITIFGDYAKLHMMKGQIEEQMGRDSDAHNTYTQGLK
KCATSVPMWILLSRLEEKLKHVTKARSVLEKARLRNQKNAELWLESVRLEQRAGCVEAAG
SLLAKALQECPTAGRLWALAVFMEPRPQRKTKSVDALKKCEHDAHVLLAVSQLFWTERKL
NKCREWFNRTVDALKKCEHDAHVLLAVSQLFWTERKLNKCREWFNRTVKIDPDLGDAWAY
FYKFELHHGNEQQQEDVKNRCKAAEPHHGENWCKVSKDIANWCYNTEQILLLVAKNLPVP
T