New model in OGS2.0 | DPOGS200625  |
---|---|
Genomic Position | scaffold3658:+ 3213-12091 |
See gene structure | |
CDS Length | 2886 |
Paired RNAseq reads   | 1345 |
Single RNAseq reads   | 3487 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA008911 (0.0) |
Best Drosophila hit   | CG6841 (0.0) |
Best Human hit | pre-mRNA-processing factor 6 (0.0) |
Best NR hit (blastp)   | PREDICTED: similar to CG6841-PA [Apis mellifera] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to CG6841-PA [Apis mellifera] (0.0) |
GeneOntology terms    | GO:0005684 U2-type spliceosomal complex GO:0000398 nuclear mRNA splicing, via spliceosome GO:0005682 U5 snRNP GO:0030532 small nuclear ribonucleoprotein complex GO:0005488 binding GO:0005634 nucleus GO:0000381 regulation of alternative nuclear mRNA splicing, via spliceosome GO:0071013 catalytic step 2 spliceosome GO:0071011 precatalytic spliceosome |
InterPro families    | IPR003107 RNA-processing protein, HAT helix IPR013026 Tetratricopeptide repeat-containing IPR010491 PRP1 splicing factor, N-terminal IPR011990 Tetratricopeptide-like helical |
Orthology group | MCL14393 |
Nucleotide sequence:
ATGTCAGTTCCTCCGCAAGCATTTGTAAACAAAAACAAAAAACATTTTCTTGGTATTCCG
GCACCTCTTGGTTATGTGGCTGGTGTTGGTAGAGGAGCTACAGGTTTTACTACTAGATCA
GATATTGGACCCGCCAGAGACGCCAATGACGTATCTGATGATCGTCATGCACCCCCAGCA
GCCAAGCGAAAAAAAACTGAAGAGGAAGACGATGATGAAGACTTGAATGACTCTAATTAT
GACGAATTTTCAGGTTATAGTGGCTCTCTCTTTTCAAAGGATCCATATGACAAAGATGAT
GCAGAGGCAGATGCTATATATGAGTCAATTGATAAACGAATGGATGAAAAAAGAAAAGAG
TATAGAGAGAAGAGACTTAAAGAGGATTTGGAGAGATATCGCCAAGAGAGACCCAAAATA
CAACAACAGTTTTCTGATCTCAAACGGGAATTGAAAATGGTGTCGGAGGATGAATGGGCT
GCTATACCGGAAGTTGGTGACGCGAGGAACAGGAAGCAAAGGAATCCAAGAGCGGAGAAA
TTTACTCCTTTACCAGATAGTGTGTTATCTAGGAATCTTGGTGGAGAGTCTAGTTCAACA
ATTGATCCTAGTTCAGGCTTAGCTTCTATGATGCCGGGGGTTATGACACCTGGAATGCTG
ACACCTTCAGGTGATCTCGATCTACGTAAGATAGGTCAAGCGAGGAACACTTTAATGACG
GTGAAATTGTCACAAGTCTCCGACTCTGTGAGCGGTCAGACAGTGGTGGACCCTAAAGGT
TACTTAACTGACTTACAGTCCATGATACCTACCTATGGCGGTGACATTAATGACATCAAA
AAGGCCAGGCTGCTCCTCAAGTCGGTGAGGGAAACCAATCCTAACCATCCACCAGCTTGG
ATTGCTAGTGCTAGATTAGAAGAAGTTACTGGTAAAATCCAGTCAGCCCGCAACCTCATA
ATGAAGGGTTGTGAGGTTAACCCCAGCAGTGAAGAGCTCTGGTTGGAAGCGGCTCGTCTA
CAACCACCGGATACAGCTCGGGCGGTTATAGCACACGCCGCCCGCAACCTGCCTCATAGT
GTACGAGTTTGGGTGAAGGCGGCTGAACTGGAACAAGAACCAAAGGCTAAACGTCGTGTT
TACAGAAAGGCGTTGGAGCATATACCAAATTCAGTGCGTTTGTGGAAAGCGGCCGTCGAA
TTGGAGAACCCTGAAGATGCTAGGATCCTGCTTTCAAGGGCCGTGGAGTGTTGTCCGACG
AGCGTAGAACTATGGCTGGCTCTGGCTAGACTGGAAACATATGAAAATGCAAGAAAAGTA
CTAAATAAGGCACGTGAAAATATTCCCACCGATAGACAGATCTGGGTAACAGCTGCTAAA
CTTGAAGAGGCTCAAGGCAACACTCATATGGTAGAAAAGATTATAGACCGTGCCATAACG
TCGCTTAGTGCTAATGGCGTTGAAATAAACAGAGAGCATTGGTTCAAAGAGGCGATGGAG
GCTGAGAAATCTGGAGCAGTTCATACGTGTCAGGTGATCGGTCACGGCATTGAACCAGAG
GATCAAAAACATACTTGGATGGAGGATGCTGATGCTTGCGCCAACGAAGGTGCGTACGAG
TGTGCCCGGGCGGTGTATGGGTACGCGCTATCAGTTTTCCCCTCGAAGAAGTCCATCTGG
CTGAGAGCCGCCTACCTCGAGAAGCAGCATGGTACGAGGGCGACGTTGGAGGCTCTGTTA
CAGAGGGCGGTCGCTCACTGTCCCAAGAGCGAAGTCCTATGGCTCATGGGGGCGAAGTCC
AAGTGGCTAGCGGGTGACGTGAGAGCGGCTAGACAGATCCTGTCGTTAGCTTTCCAAGCC
AATCCTAACTCGGAGGAGATCTGGCTGGCCGCTGTCAAACTGGAGAGCGAGAACAAAGAA
TATGATCGAGCCAGGAGGTTGTTGGAGAAAGCCAGAGCGTCCGCACCCACACCTAGGGTC
ATGATAAAATCAGCAAAACTAGAATGGGCTTTGAACAAATTAGACGTAGCCCTGAACCTG
CTGTCAGAAGCTATCACAATATTTGGGGATTACGCGAAGCTACACATGATGAAAGGACAG
ATAGAGGAGCAGATGGGGAGGGATAGTGACGCACACAACACGTACACACAAGGGTTGAAG
AAGTGCGCTACCAGTGTCCCTATGTGGATACTGCTGTCGAGATTGGAAGAAAAACTCAAA
CACGTCACCAAAGCCAGATCTGTGTTGGAGAAGGCGCGTCTCAGGAATCAGAAGAACGCT
GAGTTATGGTTGGAGAGTGTTCGCCTGGAACAGCGAGCTGGTTGTGTGGAAGCGGCCGGC
TCCTTGTTGGCGAAGGCGCTCCAGGAGTGTCCTACGGCCGGCAGACTGTGGGCCCTCGCC
GTCTTCATGGAGCCCCGCCCGCAGAGGAAGACTAAGAGTGTGGATGCCCTGAAGAAATGT
GAACACGACGCTCACGTCCTGCTGGCGGTGTCGCAGCTGTTCTGGACGGAGAGGAAATTA
AATAAATGCAGGGAATGGTTCAACAGAACTGTGGATGCTCTGAAGAAATGTGAACACGAC
GCTCACGTCCTGCTGGCGGTGTCGCAGCTGTTCTGGACGGAGAGGAAATTAAATAAATGC
AGAGAATGGTTCAACAGAACTGTGAAAATCGACCCGGATCTCGGTGACGCTTGGGCTTAC
TTCTACAAATTCGAATTGCACCACGGCAACGAACAGCAACAGGAAGACGTGAAGAACAGG
TGCAAGGCCGCCGAACCCCACCACGGAGAGAACTGGTGCAAGGTCTCCAAAGACATAGCC
AACTGGTGTTACAATACAGAACAGATATTGTTACTGGTGGCTAAGAATCTACCCGTGCCC
ACGTAG
Protein sequence:
MSVPPQAFVNKNKKHFLGIPAPLGYVAGVGRGATGFTTRSDIGPARDANDVSDDRHAPPA
AKRKKTEEEDDDEDLNDSNYDEFSGYSGSLFSKDPYDKDDAEADAIYESIDKRMDEKRKE
YREKRLKEDLERYRQERPKIQQQFSDLKRELKMVSEDEWAAIPEVGDARNRKQRNPRAEK
FTPLPDSVLSRNLGGESSSTIDPSSGLASMMPGVMTPGMLTPSGDLDLRKIGQARNTLMT
VKLSQVSDSVSGQTVVDPKGYLTDLQSMIPTYGGDINDIKKARLLLKSVRETNPNHPPAW
IASARLEEVTGKIQSARNLIMKGCEVNPSSEELWLEAARLQPPDTARAVIAHAARNLPHS
VRVWVKAAELEQEPKAKRRVYRKALEHIPNSVRLWKAAVELENPEDARILLSRAVECCPT
SVELWLALARLETYENARKVLNKARENIPTDRQIWVTAAKLEEAQGNTHMVEKIIDRAIT
SLSANGVEINREHWFKEAMEAEKSGAVHTCQVIGHGIEPEDQKHTWMEDADACANEGAYE
CARAVYGYALSVFPSKKSIWLRAAYLEKQHGTRATLEALLQRAVAHCPKSEVLWLMGAKS
KWLAGDVRAARQILSLAFQANPNSEEIWLAAVKLESENKEYDRARRLLEKARASAPTPRV
MIKSAKLEWALNKLDVALNLLSEAITIFGDYAKLHMMKGQIEEQMGRDSDAHNTYTQGLK
KCATSVPMWILLSRLEEKLKHVTKARSVLEKARLRNQKNAELWLESVRLEQRAGCVEAAG
SLLAKALQECPTAGRLWALAVFMEPRPQRKTKSVDALKKCEHDAHVLLAVSQLFWTERKL
NKCREWFNRTVDALKKCEHDAHVLLAVSQLFWTERKLNKCREWFNRTVKIDPDLGDAWAY
FYKFELHHGNEQQQEDVKNRCKAAEPHHGENWCKVSKDIANWCYNTEQILLLVAKNLPVP
T