New model in OGS2.0 | DPOGS209458  |
---|---|
Genomic Position | scaffold24:+ 76630-82965 |
See gene structure | |
CDS Length | 2220 |
Paired RNAseq reads   | 1965 |
Single RNAseq reads   | 5182 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA005838 (8e-133) |
Best Drosophila hit   | CG6686, isoform B (2e-58) |
Best Human hit | U4/U6.U5 tri-snRNP-associated protein 1 (1e-44) |
Best NR hit (blastp)   | PREDICTED: similar to CG6686-PB, isoform B isoform 1 [Apis mellifera] (4e-146) |
Best NR hit (blastx)   | PREDICTED: similar to CG6686-PB, isoform B isoform 1 [Apis mellifera] (5e-109) |
GeneOntology terms    | GO:0000398 nuclear mRNA splicing, via spliceosome GO:0071011 precatalytic spliceosome |
InterPro families   | IPR005011 SART-1 protein |
Orthology group | MCL11416 |
Nucleotide sequence:
ATGGGTTCGAAGAAACATAAAAAAGAATCGAAAAAGAGGAAGCACAGAAGTAGATCCAGG
TCTCCTTTGGATGGTGAAGAGCGTGAACGAAAGAGGCACAGGAAACACAAGGATCGCAAG
AAAGATCGCTCTCCGGATGTGGAGGAGGTGCCGGTGGACTCGCACCTGGAGCGCGAGGCG
GAACCCTCGCCCGCGCCCAGCAGCAGCTCCAGCGCTAGGAGACATGAAGGAAGAGACAAA
AGCCAGGAGTCGGAACGTGACTCGTCACCCACGCCCGCCTCGGGGTCGGCTCAGGAGAGT
CTCTCCATAGAAGAGACCAACAAACTGCGAGCCAAGCTGGGGCTGAAACCACTTGAAGTG
CCAGAGAAACCTGCCGACGACGGTAAGTTCAAGGATGACCTGGGAGAGTTCTACCACGTG
CCCGCCTCCAACATCGCCGCGCAGAAGAAAGCGGACAAGTTGAGAGAGAAGCTCGCCGAG
AGGAGGGACAAGAGGGACATCGACAACAAGTTACAGACCTCCCTGCTGGCCGAGGGTTCC
GATGATGAAGACGCCTCCGCCTGGGTCAGGAAGACGCGGGACATGGAGAAACAGAAGCAG
GAAGCCGCCAAGAGAGCGGCTCTGCTGGACGAGATGGACGAGGTGTTCGGTGTGGGCGCG
CTCGTGGCGGACGACCAGCACCGAGACCGACAGGAGGCCTACCACCAGGAACATCTCAAG
GGGCTGCGGGTGGCGCACACTCTGGACGCGTTGCCCGAGGAGCGCGAGACGGTGTTGACG
CTGGCGGACAAGGAGGTGCTAGCGGACGACGACGAGGATGTACTCGTCAATGTGAACATC
GTGGACGACGAGAAATATAAAAAGAACATCGAGGAGCGCAAGAAGGCCCGCACGGGCTAC
CAGGCCTACGACGAGGAGGCCGACATACAGGCCGCCCTGGGGTACAGCAGACCCGTGCTG
GCCAAGTACGACGACGAGATAGAACCCAGCAAGGGAGACAAGACCAGGGGCTTCTTCATA
GGAGACGAGGACGCGCTCATGGAGCAGAAACTGAAGGACATGATGCGTGCGGAACTGATA
GCGGGCGGTCCGGATAAAGTGCTGGAATCGCTCCAGAGCACCGGCCTCAGACCCGCCAGC
GACTACCTGCAGCCCGACGAGCTCCAGGCGAGGTTCAAGAAAGTTAAGAAGAAGGGCAAA
ATACGTAAGAAGGCCAAGCAGGAGCCCATAGACGTGGAGGAGCACGAGGCGGGCTCCGTG
CCTCTGGACACCGATGACACGGAGATCAGCCAGGAGGTGACTGCGCCTGTTCTGGACGAA
GACGAAGTGGAGGTGGACACGGAGTTACAGGCGGCGCTCCATCGCGCCAGGAGACTGAGG
CAGGGAGGGGACCAACACAGGACTCCTAAGCTGGAGGAGATCCTCCAACAAATTAAAGAA
GAGAAAACGGAAGAGAGTCAAGAAGCCGGGGGCAGCATGGTGCTGGATGCCACCGCCGAG
TTCTGTCGCACGCTCGGGGACATACCGACATACGGCCTGGCGGGGAACAGGGAACATACC
GCCGAGATCATGGACTTCGATCGCGAGGAAGCCGAGCCGGAGCCGGAGAGCGGAGCCAGC
GGCGGCGCCTGGAGCAGGGTCGACGTGCGCACAGACCGGCCTGCCGACCTCGAGCATGAA
GGAGCGTCGGGCGCCGGGCTGGAGGCAGAGCCCGCACTGGGGGCCGGCGTGGCGGGCGCG
CTGCGACTGGCGCTCAGCAAGGGCTATCTGGAGAGAGACGGCGCACTGCCCGCTCCTAGA
CCGACACGCTCCTCACTGGCCGCGCTCGCCGCGCTGCACTACTCCATCGAGGACAAGACT
TACGGCGAGGACGACAAGTACGGTCGGCGCGAGAGAGGAGGTCACTCGGGACCTCTGAGC
GAGTTCAGGGAGAAGAGTGACTTCAGACCCGACATCAAGCTGGAGTACGTCGACGACGAC
GGGCGACCGCTCTGTCCCAAGGAGGCCTTCCGCTACCTCTCCCACAAGTTCCACGGCAAG
GGACCCGGCAAGAACAAGCAGGAGAAGAGAATCAAAAAGGCCGTGCAGGAGGGACTGATG
AAGAAGATGAGTTCCACGGACACGCCTCTCAACACGTTACAGATGCTGCAGCAGAAACAA
CGAGAGACACAGTCGCCCTTCGTGGTGCTCAGCGGCGCCAAGAGAGACGCGCCCAACTGA
Protein sequence:
MGSKKHKKESKKRKHRSRSRSPLDGEERERKRHRKHKDRKKDRSPDVEEVPVDSHLEREA
EPSPAPSSSSSARRHEGRDKSQESERDSSPTPASGSAQESLSIEETNKLRAKLGLKPLEV
PEKPADDGKFKDDLGEFYHVPASNIAAQKKADKLREKLAERRDKRDIDNKLQTSLLAEGS
DDEDASAWVRKTRDMEKQKQEAAKRAALLDEMDEVFGVGALVADDQHRDRQEAYHQEHLK
GLRVAHTLDALPEERETVLTLADKEVLADDDEDVLVNVNIVDDEKYKKNIEERKKARTGY
QAYDEEADIQAALGYSRPVLAKYDDEIEPSKGDKTRGFFIGDEDALMEQKLKDMMRAELI
AGGPDKVLESLQSTGLRPASDYLQPDELQARFKKVKKKGKIRKKAKQEPIDVEEHEAGSV
PLDTDDTEISQEVTAPVLDEDEVEVDTELQAALHRARRLRQGGDQHRTPKLEEILQQIKE
EKTEESQEAGGSMVLDATAEFCRTLGDIPTYGLAGNREHTAEIMDFDREEAEPEPESGAS
GGAWSRVDVRTDRPADLEHEGASGAGLEAEPALGAGVAGALRLALSKGYLERDGALPAPR
PTRSSLAALAALHYSIEDKTYGEDDKYGRRERGGHSGPLSEFREKSDFRPDIKLEYVDDD
GRPLCPKEAFRYLSHKFHGKGPGKNKQEKRIKKAVQEGLMKKMSSTDTPLNTLQMLQQKQ
RETQSPFVVLSGAKRDAPN