DPGLEAN16094 in OGS1.0

New model in OGS2.0DPOGS209458 
Genomic Positionscaffold24:+ 76630-82965
See gene structure
CDS Length2220
Paired RNAseq reads  1965
Single RNAseq reads  5182
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005838 (8e-133)
Best Drosophila hit  CG6686, isoform B (2e-58)
Best Human hitU4/U6.U5 tri-snRNP-associated protein 1 (1e-44)
Best NR hit (blastp)  PREDICTED: similar to CG6686-PB, isoform B isoform 1 [Apis mellifera] (4e-146)
Best NR hit (blastx)  PREDICTED: similar to CG6686-PB, isoform B isoform 1 [Apis mellifera] (5e-109)
GeneOntology terms
  
GO:0000398 nuclear mRNA splicing, via spliceosome
GO:0071011 precatalytic spliceosome
InterPro families  IPR005011 SART-1 protein
Orthology groupMCL11416

Nucleotide sequence:

ATGGGTTCGAAGAAACATAAAAAAGAATCGAAAAAGAGGAAGCACAGAAGTAGATCCAGG
TCTCCTTTGGATGGTGAAGAGCGTGAACGAAAGAGGCACAGGAAACACAAGGATCGCAAG
AAAGATCGCTCTCCGGATGTGGAGGAGGTGCCGGTGGACTCGCACCTGGAGCGCGAGGCG
GAACCCTCGCCCGCGCCCAGCAGCAGCTCCAGCGCTAGGAGACATGAAGGAAGAGACAAA
AGCCAGGAGTCGGAACGTGACTCGTCACCCACGCCCGCCTCGGGGTCGGCTCAGGAGAGT
CTCTCCATAGAAGAGACCAACAAACTGCGAGCCAAGCTGGGGCTGAAACCACTTGAAGTG
CCAGAGAAACCTGCCGACGACGGTAAGTTCAAGGATGACCTGGGAGAGTTCTACCACGTG
CCCGCCTCCAACATCGCCGCGCAGAAGAAAGCGGACAAGTTGAGAGAGAAGCTCGCCGAG
AGGAGGGACAAGAGGGACATCGACAACAAGTTACAGACCTCCCTGCTGGCCGAGGGTTCC
GATGATGAAGACGCCTCCGCCTGGGTCAGGAAGACGCGGGACATGGAGAAACAGAAGCAG
GAAGCCGCCAAGAGAGCGGCTCTGCTGGACGAGATGGACGAGGTGTTCGGTGTGGGCGCG
CTCGTGGCGGACGACCAGCACCGAGACCGACAGGAGGCCTACCACCAGGAACATCTCAAG
GGGCTGCGGGTGGCGCACACTCTGGACGCGTTGCCCGAGGAGCGCGAGACGGTGTTGACG
CTGGCGGACAAGGAGGTGCTAGCGGACGACGACGAGGATGTACTCGTCAATGTGAACATC
GTGGACGACGAGAAATATAAAAAGAACATCGAGGAGCGCAAGAAGGCCCGCACGGGCTAC
CAGGCCTACGACGAGGAGGCCGACATACAGGCCGCCCTGGGGTACAGCAGACCCGTGCTG
GCCAAGTACGACGACGAGATAGAACCCAGCAAGGGAGACAAGACCAGGGGCTTCTTCATA
GGAGACGAGGACGCGCTCATGGAGCAGAAACTGAAGGACATGATGCGTGCGGAACTGATA
GCGGGCGGTCCGGATAAAGTGCTGGAATCGCTCCAGAGCACCGGCCTCAGACCCGCCAGC
GACTACCTGCAGCCCGACGAGCTCCAGGCGAGGTTCAAGAAAGTTAAGAAGAAGGGCAAA
ATACGTAAGAAGGCCAAGCAGGAGCCCATAGACGTGGAGGAGCACGAGGCGGGCTCCGTG
CCTCTGGACACCGATGACACGGAGATCAGCCAGGAGGTGACTGCGCCTGTTCTGGACGAA
GACGAAGTGGAGGTGGACACGGAGTTACAGGCGGCGCTCCATCGCGCCAGGAGACTGAGG
CAGGGAGGGGACCAACACAGGACTCCTAAGCTGGAGGAGATCCTCCAACAAATTAAAGAA
GAGAAAACGGAAGAGAGTCAAGAAGCCGGGGGCAGCATGGTGCTGGATGCCACCGCCGAG
TTCTGTCGCACGCTCGGGGACATACCGACATACGGCCTGGCGGGGAACAGGGAACATACC
GCCGAGATCATGGACTTCGATCGCGAGGAAGCCGAGCCGGAGCCGGAGAGCGGAGCCAGC
GGCGGCGCCTGGAGCAGGGTCGACGTGCGCACAGACCGGCCTGCCGACCTCGAGCATGAA
GGAGCGTCGGGCGCCGGGCTGGAGGCAGAGCCCGCACTGGGGGCCGGCGTGGCGGGCGCG
CTGCGACTGGCGCTCAGCAAGGGCTATCTGGAGAGAGACGGCGCACTGCCCGCTCCTAGA
CCGACACGCTCCTCACTGGCCGCGCTCGCCGCGCTGCACTACTCCATCGAGGACAAGACT
TACGGCGAGGACGACAAGTACGGTCGGCGCGAGAGAGGAGGTCACTCGGGACCTCTGAGC
GAGTTCAGGGAGAAGAGTGACTTCAGACCCGACATCAAGCTGGAGTACGTCGACGACGAC
GGGCGACCGCTCTGTCCCAAGGAGGCCTTCCGCTACCTCTCCCACAAGTTCCACGGCAAG
GGACCCGGCAAGAACAAGCAGGAGAAGAGAATCAAAAAGGCCGTGCAGGAGGGACTGATG
AAGAAGATGAGTTCCACGGACACGCCTCTCAACACGTTACAGATGCTGCAGCAGAAACAA
CGAGAGACACAGTCGCCCTTCGTGGTGCTCAGCGGCGCCAAGAGAGACGCGCCCAACTGA

Protein sequence:

MGSKKHKKESKKRKHRSRSRSPLDGEERERKRHRKHKDRKKDRSPDVEEVPVDSHLEREA
EPSPAPSSSSSARRHEGRDKSQESERDSSPTPASGSAQESLSIEETNKLRAKLGLKPLEV
PEKPADDGKFKDDLGEFYHVPASNIAAQKKADKLREKLAERRDKRDIDNKLQTSLLAEGS
DDEDASAWVRKTRDMEKQKQEAAKRAALLDEMDEVFGVGALVADDQHRDRQEAYHQEHLK
GLRVAHTLDALPEERETVLTLADKEVLADDDEDVLVNVNIVDDEKYKKNIEERKKARTGY
QAYDEEADIQAALGYSRPVLAKYDDEIEPSKGDKTRGFFIGDEDALMEQKLKDMMRAELI
AGGPDKVLESLQSTGLRPASDYLQPDELQARFKKVKKKGKIRKKAKQEPIDVEEHEAGSV
PLDTDDTEISQEVTAPVLDEDEVEVDTELQAALHRARRLRQGGDQHRTPKLEEILQQIKE
EKTEESQEAGGSMVLDATAEFCRTLGDIPTYGLAGNREHTAEIMDFDREEAEPEPESGAS
GGAWSRVDVRTDRPADLEHEGASGAGLEAEPALGAGVAGALRLALSKGYLERDGALPAPR
PTRSSLAALAALHYSIEDKTYGEDDKYGRRERGGHSGPLSEFREKSDFRPDIKLEYVDDD
GRPLCPKEAFRYLSHKFHGKGPGKNKQEKRIKKAVQEGLMKKMSSTDTPLNTLQMLQQKQ
RETQSPFVVLSGAKRDAPN