New model in OGS2.0 | DPOGS207257  |
---|---|
Genomic Position | scaffold76:+ 45142-52346 |
CDS Length | 3651 |
Paired RNAseq reads   | 9214 |
Single RNAseq reads   | 20880 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA000829 (0.0) |
Best Drosophila hit   | CG13900, isoform A (0.0) |
Best Human hit | splicing factor 3B subunit 3 (0.0) |
Best NR hit (blastp)   | Splicing factor 3B subunit, putative [Pediculus humanus corporis] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to splicing factor 3b, subunit 3 isoform 1 [Apis mellifera] (0.0) |
GeneOntology terms    | GO:0005634 nucleus; GO:0005681 spliceosomal complex; GO:0006397 mRNA processing; GO:0003676 nucleic acid binding; GO:0005575 cellular_component; GO:0003674 molecular_function; GO:0008150 biological_process; GO:0008380 RNA splicing |
InterPro families   | IPR004871 Cleavage/polyadenylation specificity factor, A subunit, C-terminal |
Orthology group | MCL13039 |
Nucleotide sequence:
ATGTATCTTTACAATTTAACGCTGCAAGGCTCAACGGCCATTTCGCACGCCGTGCATGGG
AATTTTTCTGGCACGAAACAGCAAGAAATAATTATATCTCGAGGCAAAACTTTAGAATTG
CTAAGACCTGATCCAAATACTGGAAAAGTGCACACTTTAATGAAAGTGGAGATTTTCGGC
GTCATTCGTTCTATGATGTCATTTCGACTCACCGGCGGCACTAAAGATTATATAGTTGTT
GGATCTGACTCGGGGCGCATTGTTATACTTGAATATATACCTGCTAAGAACATTCTCGAA
AAAGTACATCAAGAGACATTTGGGAAATCGGGATGTAGAAGAATAGTACCAGGGCAGTAC
CTAGCAATTGATCCGAAAGGCAGAGCTGTTATGATAGGTGCAATTGAAAAGCAAAAATTG
GTATACATTTTGAACAGAGATGCTGAAGCTAGATTAACAATCTCGTCACCGCTTGAAGCT
CACAAATCTAACACATTAGTATACCACATGGTGGGAGTTGATGTTGGCTTTGAAAACCCC
ATGTTTGCTTGCTTGGAGATAGACTATGAAGAGGCAGACTCTGATCCCACAGGGGAAGCA
GCTCAAAAGACACAGCAGACATTAACATTTTATGAGCTGGATTTGGGTTTAAATCATGTA
GTAAGAAAATATTCAGAACCTCTTGAAGAACATGCCAATTTCCTTATAACGGTACCTGGT
GGTAATGATGGCCCGTCAGGTGTACTTATCTGTTCAGAAAATTATCTAACCTACAAAAAT
TTGGGAGACCAGCATGATATTAGATGTCCTATTCCTAGAAGGAGAAATGATTTAGACGAC
CCAGAAAGGGGTATGATCTTTGTCTGCTCGGCCACACACAAGACAAAATCGATGTTCTTT
TTCCTGGCACAAACTGAACAGGGTGATATATTTAAAATCACTATAGAAACCGATGAAGAT
ATGGTGACGGAAATTAAACTGAAATACTTTGATACTGTACCAGTTGCAACTGCTATGTGC
GTTCTGAAGACTGGCTTTCTTTTTGTTGCTTGTGAATTTGGCAACCACTATTTATACCAA
ATTGCTCACTTGGGTGATGAAGATGATGAACCAGAATTCAGTTCTGCAATGCCATTAGAG
GAAGGAGACACATTTTTCTTTGCTCCCCGACCTCTCAGGAACTTGGTGCTGGTTGATGAA
TTGGATTCCCTCTCACCCATACTCGCTTGCCATGTGGCAGACTTAACTGGTGAAGATACA
CCTCAAGTGTATTTAGCATGTGGCAGAGGACCAAGATCTTCACTGAGAGCCTTAAGACAT
GGTTTAGAAGTAGCAGAGATGGCTGTATCAGAACTACCCGGTTCACCAAATGCAGTATGG
ACTGTGCGACGGCACAAAGATGATGACTATGATTCGTACATCATAGTGAGTTTCGTAAAC
GCTACGTTGGTGCTATCTATCGGTGAGACTGTGGAAGAGGTGACGGACTCTGGTTTTCTC
GGAACCACACCGACATTGAGCTGCCACGCACTTGGAAGTGATGCATTGGTTCAAGTATAT
CCTGATGGTATAAGACATATCAGGGCTGACAAACGAGTTAACGAGTGGAAGGCACCCGGC
AAGAAGTCTATTGTGAAATGTGCCGTCAATCAAAGACAAGTTGTCATAGCACTGACTGGA
GGTGAACTGGTGTACTTTGAAATGGACCCGACTGGCCAATTGAATGAGTACACTGAACGA
AAGAAGTTGTCATCTGATGTATCCTGTATGGCACTGGGATCAGTAGCTACTGGAGAACAG
AGAGCTTGGTTCCTAGCTGTTGGTTTAGTTGACAATACTGTCAGAATTATTTCACTGGAT
CCTGCTGATTGTCTAGCACCTCGTTCAATGCAAGCCCTGCCTGCCAGCCCCGAGTCCTTG
TGTATTGTTGATCAACCCTTTGAGTCTGGTGCCAAATCTGCTTTACACCTTAACATTGGC
TTAAGTAATGGAGTATTACTACGTACAACTCTGGACTCTGTTAGTGGTGATTTAGCTGAT
ACAAGAACAAGATACCTGGGATCTCGCCCTGTGAAACTTTTCAAAGTTAGAGTGCAGTCA
GCGGAAGCAGTGCTGGCTGTGTCTTCGAGGACATGGCTCGGTTATCAATATCAGAACAGA
TTCCATCTAACGCCATTGTCATATGAATGTCTAGAGTATGCTGCGGGATTTAGCTCTGAA
CAATGTACCGAGGGTATAGTGGCCATTTCATCAAATACACTAAGAATTTTAGCCCTAGAA
AAATTGGGTGCCGTATTCAATCAAACATTCCAACAATTAGATTACACACCAAGAAAGTTT
GTTATAAATAGTGATAACAATCACATCATAGTTTTGGAGACTGACCACAATGCTTACACT
GAAGAAATGAAGAAGCAAAGAAGAGTGCAAATGGCACAAGAAATGAGAGAAGCTGCTGCT
GGGGGAACTCCCGAGGAACAACAACTAGCAAATGAAATGGCCGACGCGTTCCTTTCAGAT
GTGTTGCCAGAAAATATATTTTCTTCCCCGAAAGCTGGTGCCGGCATGTGGGCGTCTCAG
ATCCGTATACTGGACATGAGTGGCGGCGTGGGCGGGTGTAGCACTGTGTGTCTACTACCG
CTGGAACAGAACGAGGCGGCCGTGTCTTTGTGTGTAGTACGATGGGCCGCTCTCACTGAC
AACACACCACATCTAGTAGTAGGGGTTGCCAAGGACGCTCTGCTGTCACCACGTAGCTGC
TCTGAGGGCAGTCTACATGTTTATAAGATTTATAATACTGGAAAATTGGAATTGGTACAT
AAAACACCAATAGATGAATACCCTGGAGCGTTGGCAGCATTCAATGGCAAGCTGCTGGCA
GGAGTGGGGCGGATGTTGAGGTTGTACGACATTGGTAGAAGGAAACTATTACGGAAGTGC
GAAAACAGACACATTCCAAACCTCATAGCGGATATCAAAACTATAAGGCAGAGAATATTC
GTATCGGACGTCCAAGAATCCGTGTTCTGTGTTAAATACAAGAAGAGGGAAAACCAGCTG
ATTATTTTCGCCGACGACACCAATCCCAGGTGGATCACCAACACTTGTATTCTAGACTAC
GACACGGTCGCTATGGCCGACAAGTTTGGCAACGTAGCCGTTTTGAGACTGCCTCAGTCT
GTGAGCGACGATGTGGATGAGGATCCGACTGGAAACAAAGCGCTCTGGGACAGAGGTCTT
CTGAATGGAGCGTCTCAAAAGGGTGACATCACTGTTAATTTCCACGTTGGAGAGACTGTG
ACGTCTTTGCAAAGAGCTACTCTAATCCCGGGCGGTTCGGAGGCGCTCTTGTACGCCACA
GTGAGCGGAGCACTGGGAGTGTTCCTACCGTTCACCTCCAGGGAAGATCACGACTTCTTC
CAGCACCTTGAAATGCACATGAGGAGTGAAAACTCACCTCTGTGCGGACGAGACCACTTG
TCATTCAGAAGCTACTATTATCCAGTAAAGAATGTGATAGACGGCGACCTCTGCGAACAG
TTCAACTCGCTGGAGCCGGCGAAACAGAAAGCCATCGCCGGAGACCTGGAGCGAACTCCG
GCCGAGGTGTCCAAGAAGCTGGAGGACATCAGAACTAGATACGCCTTTTAA
Protein sequence:
MYLYNLTLQGSTAISHAVHGNFSGTKQQEIIISRGKTLELLRPDPNTGKVHTLMKVEIFG
VIRSMMSFRLTGGTKDYIVVGSDSGRIVILEYIPAKNILEKVHQETFGKSGCRRIVPGQY
LAIDPKGRAVMIGAIEKQKLVYILNRDAEARLTISSPLEAHKSNTLVYHMVGVDVGFENP
MFACLEIDYEEADSDPTGEAAQKTQQTLTFYELDLGLNHVVRKYSEPLEEHANFLITVPG
GNDGPSGVLICSENYLTYKNLGDQHDIRCPIPRRRNDLDDPERGMIFVCSATHKTKSMFF
FLAQTEQGDIFKITIETDEDMVTEIKLKYFDTVPVATAMCVLKTGFLFVACEFGNHYLYQ
IAHLGDEDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDELDSLSPILACHVADLTGEDT
PQVYLACGRGPRSSLRALRHGLEVAEMAVSELPGSPNAVWTVRRHKDDDYDSYIIVSFVN
ATLVLSIGETVEEVTDSGFLGTTPTLSCHALGSDALVQVYPDGIRHIRADKRVNEWKAPG
KKSIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTERKKLSSDVSCMALGSVATGEQ
RAWFLAVGLVDNTVRIISLDPADCLAPRSMQALPASPESLCIVDQPFESGAKSALHLNIG
LSNGVLLRTTLDSVSGDLADTRTRYLGSRPVKLFKVRVQSAEAVLAVSSRTWLGYQYQNR
FHLTPLSYECLEYAAGFSSEQCTEGIVAISSNTLRILALEKLGAVFNQTFQQLDYTPRKF
VINSDNNHIIVLETDHNAYTEEMKKQRRVQMAQEMREAAAGGTPEEQQLANEMADAFLSD
VLPENIFSSPKAGAGMWASQIRILDMSGGVGGCSTVCLLPLEQNEAAVSLCVVRWAALTD
NTPHLVVGVAKDALLSPRSCSEGSLHVYKIYNTGKLELVHKTPIDEYPGALAAFNGKLLA
GVGRMLRLYDIGRRKLLRKCENRHIPNLIADIKTIRQRIFVSDVQESVFCVKYKKRENQL
IIFADDTNPRWITNTCILDYDTVAMADKFGNVAVLRLPQSVSDDVDEDPTGNKALWDRGL
LNGASQKGDITVNFHVGETVTSLQRATLIPGGSEALLYATVSGALGVFLPFTSREDHDFF
QHLEMHMRSENSPLCGRDHLSFRSYYYPVKNVIDGDLCEQFNSLEPAKQKAIAGDLERTP
AEVSKKLEDIRTRYAF
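
The CDS length, nucleotide sequence, and protein sequence listed above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; the helper names `translate` and `expected_protein_length` are hypothetical and not part of any annotation pipeline. It translates the first and last lines of the nucleotide sequence and compares the reported CDS length (3651) with the number of residues it implies.

```python
# Standard genetic code packed in TCAG order (first, second, third base).
# Assumption: the standard code is appropriate for this nuclear gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int, has_stop: bool = True) -> int:
    """Number of residues implied by a CDS length, optionally minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - (1 if has_stop else 0)


# First and last lines of the nucleotide sequence in this record.
# Every full line holds 60 bases (20 codons), so each line starts in frame.
first_line = "ATGTATCTTTACAATTTAACGCTGCAAGGCTCAACGGCCATTTCGCACGCCGTGCATGGG"
last_line = "GCCGAGGTGTCCAAGAAGCTGGAGGACATCAGAACTAGATACGCCTTTTAA"

n_terminus = translate(first_line)   # should match the start of the protein sequence
c_terminus = translate(last_line)    # should end in the stop codon '*'
protein_length = expected_protein_length(3651)
```

Under these assumptions, `n_terminus` reproduces the first 20 residues of the protein sequence, `c_terminus` reproduces its final line followed by a stop, and `protein_length` is 1216, which matches 20 full lines of 60 residues plus the 16-residue last line.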