New model in OGS2.0 | DPOGS204107  |
---|---|
Genomic Position | scaffold731:+ 28501-32187 |
See gene structure | |
CDS Length | 2193 |
Paired RNAseq reads   | 2773 |
Single RNAseq reads   | 6670 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA013594 (0.0) |
Best Drosophila hit   | CG16941 (8e-151) |
Best Human hit | splicing factor 3A subunit 1 isoform 1 (1e-119) |
Best NR hit (blastp)   | PREDICTED: similar to spliceosome associated protein [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to spliceosome associated protein [Tribolium castaneum] (2e-162) |
GeneOntology terms    | GO:0005681 spliceosomal complex GO:0000398 nuclear mRNA splicing, via spliceosome GO:0005686 U2 snRNP GO:0003723 RNA binding GO:0005634 nucleus GO:0000381 regulation of alternative nuclear mRNA splicing, via spliceosome GO:0007052 mitotic spindle organization GO:0071011 precatalytic spliceosome GO:0071013 catalytic step 2 spliceosome |
InterPro families    | IPR000061 SWAP/Surp IPR000626 Ubiquitin IPR019955 Ubiquitin supergroup IPR022030 Pre-mRNA splicing factor PRP21-like protein |
Orthology group | MCL12521 |
Nucleotide sequence:
ATGCCTCCGGGGCAGCAGCATGAAGGTCTGCCAGAGGAGCCTCAAGCGCCCCCGCCACCC
GTTATCGGCATCATATATCCACCACCTGAAGTCAGGAATATCGTGGACAAAACAGCAAGC
TTCGTAGCCCGAAACGGCCCCGAATTTGAGGCTCGTATTAGACAGAATGAACTCGGCAAT
CCAAAATTCAACTTTCTCAACTCTGGGGATCCCTACCATGCATATTACCAGCATAAGGTC
AAAGAAATCCGGGAGGGCAAAGGTGATTCTATTGGCCCCGCTCCAGTCGCCGCTGTCCAG
CGTCCCGGCCTCGCTCCAGCCACTGCTGCCCGTCAACAGGAGCTTCTTAAAGCAGCCGCG
CCTCTCGAACCACCGCCGCCTCGTGATCCCCCTCCAGACTTCGAGTTTATAGCTGATCCT
CCGTCTATATCAGCCCTCGAACTCGACATTGTCAAACTTACTGCACAATTCGTTGCCCGG
AACGGTCGACAGTTTTTAACAGATTTGATGAAGAAGGAGGAAAGAAACCATCAGTTTGAC
TTTCTGCGACCACAGCATTCGCTGTTCCAGTACTTCACGAGGCTATTGGAGCAGTACACT
AAGGTATTGCTGCCGCCAAAGGAATTAGTGTCAAAATTAGGAGCGGAGATTCGCAGCGGA
GTGCTGGAGCAGGCTCGCAGTAGAGCCGCGTGGCATTCGCACCAGGCGCGGAGGAAGGCG
GCCGATGAAGCCAAGATAGAAAAAGAGAGACTCGCCTACGCTTCTATAGACTGGCACGAC
TTCGTAGTAGTGGAGACTGTGGACTATCCCGCGGGAGAACCGGGCGACTTCCCTCCGCCG
ACCACTCCGCTGGAAGTAGGAGCGAGGGTACTGGCCCAGGAACGAGGAGACGACATCATA
CAACCTAACGAGGAAGACGACACGGAGATGCAGCTGGAGTCGGAATCAGAGTCCGAATCG
GACGAGGACCTGGCCGAGATGGAGGACAGGACGCACCAGCAACAACGACCCGACGACAAC
AGGGTCCAGGACATGGAGGAGGAGTCCTCGTCAGACGACGACGGACCTCCCGAGGCCCGC
GCCGGTGAGGAGGCTCCCATGCCACCGCGGCCGGATCGCGTAGTTGTTAAGAAGTACGAC
CCGAAACGCGCTAGACCGCAACCAGCTCCCGCTAGCGAGGAATGGCTGGTGTCGCCCATC
ACAGGAGAGAAGATCCCCGCGAACAAGGTGACGGACCACGTGCGTATCGGATTGCTCGAC
CCTCGCTGGCTGGAACAGAGAGACCGCGCTGCAGCGGAGCGCTCAGATAGAGACGAAGCC
CTTGCCCCCGGGGCGGCTATTGAAGCGTCGCTTAAACAGTTGGCCGAGCGTCGTACTGAC
ATCTTCGGTGTTGGAGACGAGGAGACAGCCATCGGTAAGAAGATAGGAGAAGAGGAGAAA
AGACGCGACGAGAGAGTCACCTGGGACGGACACACGTCCAGCGTAGAGGCTGCGACCAGG
GCGGCGCGGGCTCACATCACACTGGAAGACCAGATACAACAGATACATAAGGTCAAAGGT
CTTCTACCTGATGAGGAAAAAGAGAAGATCGGTCCCAAGCCGGTGCCGCCAGGTCGTGTG
GGGGTTGCGATCAGACCGCCTCCTCCGCCCGCCCAGCCAAGAGTACAACCTCCAGCCGCC
CCCGCGCCCGCCCCCGTCTTGCTACAACCTATACCACCACTGATGGTAATACCACCAATG
GCACCACGACCACCGCTGATAGCCACCCAAGTCGCCACTCCCTATGGGTACCCCGTCCCG
GGAGTGCCCGCAGTCCCGGGGGTTCCGGGAGTCCAAGATGATGAAGAGCCTACAGCCAAG
AGACCGAGGACTGAAGACGCTCTCGAACCAGAACAAGCGTGGCTTGCTTCTCACCCTGGA
TCTGTACCTATACAGGTGTCGGTGCCGATGGCCCCCGAGCGTTCCGAGTGGCGTCTGGAC
GGGCGGACGCTGTCCCTGTCCCTGCCCCTGGCGGCTCCGGTCTCTGAACTCAAAGCATCA
CTACAGCGAACCACTAACATGCCCACCGCTAAACAGAAGCTCTACTATGAGGGTCTATTC
TTCAAGGACACCAACACCCTGGCGTACTACAACGTGCCTCCCGGAGCCGTCATACAGCTG
CAGATCAAGGAGCGCGGAGGAAGGAAGAAGTAG
Protein sequence:
MPPGQQHEGLPEEPQAPPPPVIGIIYPPPEVRNIVDKTASFVARNGPEFEARIRQNELGN
PKFNFLNSGDPYHAYYQHKVKEIREGKGDSIGPAPVAAVQRPGLAPATAARQQELLKAAA
PLEPPPPRDPPPDFEFIADPPSISALELDIVKLTAQFVARNGRQFLTDLMKKEERNHQFD
FLRPQHSLFQYFTRLLEQYTKVLLPPKELVSKLGAEIRSGVLEQARSRAAWHSHQARRKA
ADEAKIEKERLAYASIDWHDFVVVETVDYPAGEPGDFPPPTTPLEVGARVLAQERGDDII
QPNEEDDTEMQLESESESESDEDLAEMEDRTHQQQRPDDNRVQDMEEESSSDDDGPPEAR
AGEEAPMPPRPDRVVVKKYDPKRARPQPAPASEEWLVSPITGEKIPANKVTDHVRIGLLD
PRWLEQRDRAAAERSDRDEALAPGAAIEASLKQLAERRTDIFGVGDEETAIGKKIGEEEK
RRDERVTWDGHTSSVEAATRAARAHITLEDQIQQIHKVKGLLPDEEKEKIGPKPVPPGRV
GVAIRPPPPPAQPRVQPPAAPAPAPVLLQPIPPLMVIPPMAPRPPLIATQVATPYGYPVP
GVPAVPGVPGVQDDEEPTAKRPRTEDALEPEQAWLASHPGSVPIQVSVPMAPERSEWRLD
GRTLSLSLPLAAPVSELKASLQRTTNMPTAKQKLYYEGLFFKDTNTLAYYNVPPGAVIQL
QIKERGGRKK