DPGLEAN19648 in OGS1.0

New model in OGS2.0  DPOGS207257
Genomic Position  scaffold76:+ 45142-52346
CDS Length  3651
Paired RNAseq reads  9214
Single RNAseq reads  20880
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA000829 (0.0)
Best Drosophila hit  CG13900, isoform A (0.0)
Best Human hit  splicing factor 3B subunit 3 (0.0)
Best NR hit (blastp)  Splicing factor 3B subunit, putative [Pediculus humanus corporis] (0.0)
Best NR hit (blastx)  PREDICTED: similar to splicing factor 3b, subunit 3 isoform 1 [Apis mellifera] (0.0)
GeneOntology terms
GO:0005634 nucleus
GO:0005681 spliceosomal complex
GO:0006397 mRNA processing
GO:0003676 nucleic acid binding
GO:0005575 cellular_component
GO:0003674 molecular_function
GO:0008150 biological_process
GO:0008380 RNA splicing
InterPro families  IPR004871 Cleavage/polyadenylation specificity factor, A subunit, C-terminal
Orthology group  MCL13039

Nucleotide sequence:

ATGTATCTTTACAATTTAACGCTGCAAGGCTCAACGGCCATTTCGCACGCCGTGCATGGG
AATTTTTCTGGCACGAAACAGCAAGAAATAATTATATCTCGAGGCAAAACTTTAGAATTG
CTAAGACCTGATCCAAATACTGGAAAAGTGCACACTTTAATGAAAGTGGAGATTTTCGGC
GTCATTCGTTCTATGATGTCATTTCGACTCACCGGCGGCACTAAAGATTATATAGTTGTT
GGATCTGACTCGGGGCGCATTGTTATACTTGAATATATACCTGCTAAGAACATTCTCGAA
AAAGTACATCAAGAGACATTTGGGAAATCGGGATGTAGAAGAATAGTACCAGGGCAGTAC
CTAGCAATTGATCCGAAAGGCAGAGCTGTTATGATAGGTGCAATTGAAAAGCAAAAATTG
GTATACATTTTGAACAGAGATGCTGAAGCTAGATTAACAATCTCGTCACCGCTTGAAGCT
CACAAATCTAACACATTAGTATACCACATGGTGGGAGTTGATGTTGGCTTTGAAAACCCC
ATGTTTGCTTGCTTGGAGATAGACTATGAAGAGGCAGACTCTGATCCCACAGGGGAAGCA
GCTCAAAAGACACAGCAGACATTAACATTTTATGAGCTGGATTTGGGTTTAAATCATGTA
GTAAGAAAATATTCAGAACCTCTTGAAGAACATGCCAATTTCCTTATAACGGTACCTGGT
GGTAATGATGGCCCGTCAGGTGTACTTATCTGTTCAGAAAATTATCTAACCTACAAAAAT
TTGGGAGACCAGCATGATATTAGATGTCCTATTCCTAGAAGGAGAAATGATTTAGACGAC
CCAGAAAGGGGTATGATCTTTGTCTGCTCGGCCACACACAAGACAAAATCGATGTTCTTT
TTCCTGGCACAAACTGAACAGGGTGATATATTTAAAATCACTATAGAAACCGATGAAGAT
ATGGTGACGGAAATTAAACTGAAATACTTTGATACTGTACCAGTTGCAACTGCTATGTGC
GTTCTGAAGACTGGCTTTCTTTTTGTTGCTTGTGAATTTGGCAACCACTATTTATACCAA
ATTGCTCACTTGGGTGATGAAGATGATGAACCAGAATTCAGTTCTGCAATGCCATTAGAG
GAAGGAGACACATTTTTCTTTGCTCCCCGACCTCTCAGGAACTTGGTGCTGGTTGATGAA
TTGGATTCCCTCTCACCCATACTCGCTTGCCATGTGGCAGACTTAACTGGTGAAGATACA
CCTCAAGTGTATTTAGCATGTGGCAGAGGACCAAGATCTTCACTGAGAGCCTTAAGACAT
GGTTTAGAAGTAGCAGAGATGGCTGTATCAGAACTACCCGGTTCACCAAATGCAGTATGG
ACTGTGCGACGGCACAAAGATGATGACTATGATTCGTACATCATAGTGAGTTTCGTAAAC
GCTACGTTGGTGCTATCTATCGGTGAGACTGTGGAAGAGGTGACGGACTCTGGTTTTCTC
GGAACCACACCGACATTGAGCTGCCACGCACTTGGAAGTGATGCATTGGTTCAAGTATAT
CCTGATGGTATAAGACATATCAGGGCTGACAAACGAGTTAACGAGTGGAAGGCACCCGGC
AAGAAGTCTATTGTGAAATGTGCCGTCAATCAAAGACAAGTTGTCATAGCACTGACTGGA
GGTGAACTGGTGTACTTTGAAATGGACCCGACTGGCCAATTGAATGAGTACACTGAACGA
AAGAAGTTGTCATCTGATGTATCCTGTATGGCACTGGGATCAGTAGCTACTGGAGAACAG
AGAGCTTGGTTCCTAGCTGTTGGTTTAGTTGACAATACTGTCAGAATTATTTCACTGGAT
CCTGCTGATTGTCTAGCACCTCGTTCAATGCAAGCCCTGCCTGCCAGCCCCGAGTCCTTG
TGTATTGTTGATCAACCCTTTGAGTCTGGTGCCAAATCTGCTTTACACCTTAACATTGGC
TTAAGTAATGGAGTATTACTACGTACAACTCTGGACTCTGTTAGTGGTGATTTAGCTGAT
ACAAGAACAAGATACCTGGGATCTCGCCCTGTGAAACTTTTCAAAGTTAGAGTGCAGTCA
GCGGAAGCAGTGCTGGCTGTGTCTTCGAGGACATGGCTCGGTTATCAATATCAGAACAGA
TTCCATCTAACGCCATTGTCATATGAATGTCTAGAGTATGCTGCGGGATTTAGCTCTGAA
CAATGTACCGAGGGTATAGTGGCCATTTCATCAAATACACTAAGAATTTTAGCCCTAGAA
AAATTGGGTGCCGTATTCAATCAAACATTCCAACAATTAGATTACACACCAAGAAAGTTT
GTTATAAATAGTGATAACAATCACATCATAGTTTTGGAGACTGACCACAATGCTTACACT
GAAGAAATGAAGAAGCAAAGAAGAGTGCAAATGGCACAAGAAATGAGAGAAGCTGCTGCT
GGGGGAACTCCCGAGGAACAACAACTAGCAAATGAAATGGCCGACGCGTTCCTTTCAGAT
GTGTTGCCAGAAAATATATTTTCTTCCCCGAAAGCTGGTGCCGGCATGTGGGCGTCTCAG
ATCCGTATACTGGACATGAGTGGCGGCGTGGGCGGGTGTAGCACTGTGTGTCTACTACCG
CTGGAACAGAACGAGGCGGCCGTGTCTTTGTGTGTAGTACGATGGGCCGCTCTCACTGAC
AACACACCACATCTAGTAGTAGGGGTTGCCAAGGACGCTCTGCTGTCACCACGTAGCTGC
TCTGAGGGCAGTCTACATGTTTATAAGATTTATAATACTGGAAAATTGGAATTGGTACAT
AAAACACCAATAGATGAATACCCTGGAGCGTTGGCAGCATTCAATGGCAAGCTGCTGGCA
GGAGTGGGGCGGATGTTGAGGTTGTACGACATTGGTAGAAGGAAACTATTACGGAAGTGC
GAAAACAGACACATTCCAAACCTCATAGCGGATATCAAAACTATAAGGCAGAGAATATTC
GTATCGGACGTCCAAGAATCCGTGTTCTGTGTTAAATACAAGAAGAGGGAAAACCAGCTG
ATTATTTTCGCCGACGACACCAATCCCAGGTGGATCACCAACACTTGTATTCTAGACTAC
GACACGGTCGCTATGGCCGACAAGTTTGGCAACGTAGCCGTTTTGAGACTGCCTCAGTCT
GTGAGCGACGATGTGGATGAGGATCCGACTGGAAACAAAGCGCTCTGGGACAGAGGTCTT
CTGAATGGAGCGTCTCAAAAGGGTGACATCACTGTTAATTTCCACGTTGGAGAGACTGTG
ACGTCTTTGCAAAGAGCTACTCTAATCCCGGGCGGTTCGGAGGCGCTCTTGTACGCCACA
GTGAGCGGAGCACTGGGAGTGTTCCTACCGTTCACCTCCAGGGAAGATCACGACTTCTTC
CAGCACCTTGAAATGCACATGAGGAGTGAAAACTCACCTCTGTGCGGACGAGACCACTTG
TCATTCAGAAGCTACTATTATCCAGTAAAGAATGTGATAGACGGCGACCTCTGCGAACAG
TTCAACTCGCTGGAGCCGGCGAAACAGAAAGCCATCGCCGGAGACCTGGAGCGAACTCCG
GCCGAGGTGTCCAAGAAGCTGGAGGACATCAGAACTAGATACGCCTTTTAA

Protein sequence:

MYLYNLTLQGSTAISHAVHGNFSGTKQQEIIISRGKTLELLRPDPNTGKVHTLMKVEIFG
VIRSMMSFRLTGGTKDYIVVGSDSGRIVILEYIPAKNILEKVHQETFGKSGCRRIVPGQY
LAIDPKGRAVMIGAIEKQKLVYILNRDAEARLTISSPLEAHKSNTLVYHMVGVDVGFENP
MFACLEIDYEEADSDPTGEAAQKTQQTLTFYELDLGLNHVVRKYSEPLEEHANFLITVPG
GNDGPSGVLICSENYLTYKNLGDQHDIRCPIPRRRNDLDDPERGMIFVCSATHKTKSMFF
FLAQTEQGDIFKITIETDEDMVTEIKLKYFDTVPVATAMCVLKTGFLFVACEFGNHYLYQ
IAHLGDEDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDELDSLSPILACHVADLTGEDT
PQVYLACGRGPRSSLRALRHGLEVAEMAVSELPGSPNAVWTVRRHKDDDYDSYIIVSFVN
ATLVLSIGETVEEVTDSGFLGTTPTLSCHALGSDALVQVYPDGIRHIRADKRVNEWKAPG
KKSIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTERKKLSSDVSCMALGSVATGEQ
RAWFLAVGLVDNTVRIISLDPADCLAPRSMQALPASPESLCIVDQPFESGAKSALHLNIG
LSNGVLLRTTLDSVSGDLADTRTRYLGSRPVKLFKVRVQSAEAVLAVSSRTWLGYQYQNR
FHLTPLSYECLEYAAGFSSEQCTEGIVAISSNTLRILALEKLGAVFNQTFQQLDYTPRKF
VINSDNNHIIVLETDHNAYTEEMKKQRRVQMAQEMREAAAGGTPEEQQLANEMADAFLSD
VLPENIFSSPKAGAGMWASQIRILDMSGGVGGCSTVCLLPLEQNEAAVSLCVVRWAALTD
NTPHLVVGVAKDALLSPRSCSEGSLHVYKIYNTGKLELVHKTPIDEYPGALAAFNGKLLA
GVGRMLRLYDIGRRKLLRKCENRHIPNLIADIKTIRQRIFVSDVQESVFCVKYKKRENQL
IIFADDTNPRWITNTCILDYDTVAMADKFGNVAVLRLPQSVSDDVDEDPTGNKALWDRGL
LNGASQKGDITVNFHVGETVTSLQRATLIPGGSEALLYATVSGALGVFLPFTSREDHDFF
QHLEMHMRSENSPLCGRDHLSFRSYYYPVKNVIDGDLCEQFNSLEPAKQKAIAGDLERTP
AEVSKKLEDIRTRYAF
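
The CDS Length field and the protein sequence can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `expected_protein_length` is hypothetical and not part of the database, and it assumes the reported CDS length includes the terminal stop codon (the nucleotide sequence above ends in TAA).

```python
def expected_protein_length(cds_length: int) -> int:
    """Number of residues encoded by a CDS whose length includes the stop codon.

    Assumption: the CDS is complete, in frame, and ends with one stop codon.
    """
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon, which encodes no residue.
    return cds_length // 3 - 1


cds_length = 3651  # value of the "CDS Length" field in this record
print(expected_protein_length(cds_length))  # 1216
```

3651 nt correspond to 1217 codons, that is 1216 residues plus the stop codon, which agrees with the protein sequence listed above (20 lines of 60 residues plus a final line of 16).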