New model in OGS2.0 | DPOGS210492  |
---|---|
Genomic Position | scaffold296:- 53051-57310 |
CDS Length | 1158 |
Paired RNAseq reads   | 274 |
Single RNAseq reads   | 861 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA012583 (8e-103) |
Best Drosophila hit   | cleavage and polyadenylation specificity factor 73 (2e-103) |
Best Human hit | cleavage and polyadenylation specificity factor subunit 3 (3e-91) |
Best NR hit (blastp)   | PREDICTED: similar to cleavage and polyadenylation specificity factor [Nasonia vitripennis] (5e-139) |
Best NR hit (blastx)   | PREDICTED: similar to cleavage and polyadenylation specificity factor [Nasonia vitripennis] (7e-122) |
GeneOntology terms    | GO:0006379 mRNA cleavage; GO:0005847 mRNA cleavage and polyadenylation specificity factor complex; GO:0006378 mRNA polyadenylation; GO:0016787 hydrolase activity; GO:0006398 histone mRNA 3'-end processing |
InterPro families    | IPR021718 Pre-mRNA 3'-end-processing endonuclease polyadenylation factor C-term; IPR022712 Beta-Casp domain; IPR011108 RNA-metabolising metallo-beta-lactamase |
Orthology group | MCL40761 |
Nucleotide sequence:
ATGAGGTGTCGCTGTGATTCCAAATACAGTACTCTTCATTTGAGATCCCACCAGGCACCG
GGCATCGATCACTTCGAGGACATAGGTCCGTGTGTGATCATGGCTTCCCCGGGTATGATG
CAGTCGGGCCTCTCCCGGGAACTGTTCGAGTCGTGGTGCACGGATCCCAAGAACGGCGTC
ATCATAGCAGGTTACTGCGTGGAAGGCACCCTGGCCAAAACTATACTGTCGGAGCCGGAA
GAGATCACGACTATGTCAGGACAGAAACTTCCGCTGAAGATGTCCGTGGATTACATATCG
TTCTCCGCGCACACGGACTACCAACAGACCTCAGAGTTTATCAACATTCTGAAGCCTCCT
CATGTGGTGTTAGTTCACGGGGAACAGAACGAGATGTCTCGTCTGAAGGCGGCCCTGCAG
CGCGAACACCGCGGCCGCCTCGCCATACACACGCCCAGGAACACGCAACAGCTGGCCCTC
ACCTTCAGAGGCGACAAGACCGCTAAGGTAATGGGGTCCCTGGCCATGGAGGCGCCGGTG
CCGGGCGCACAGCTCCAGGGTGTTCTGGTCAAGAGGAACTTTAACTATCACATCCTGGCG
CCCTCCGACTTGAACAAGTACACGGACCTGTCCCAGTCGTCGGTGTCTCAGCGCGTGTCA
GTGTGGTGCGGAGCTCCGGTGGGTCTGGTCCGACACGCCGTGATGCGCCTGGCGGGGCCC
GTGGTGTTCCTGAGCGACACTCGCTGGAGGCTCTACGGCTGCATCGACCTCACGCTGGAC
CTGCCGCTCGTCACGCTGGAGTGGCAGGCGGCGCCGGTGTCTGACATGTTCGCGGACGCG
GTGGTGGCGGCGCTGCTGGCGGCCCCGGCCTCCGCCCCCGGGCCCGCGCCCAACGCGCCC
CTCGCACACAAACTGGACAAGATGCATTTCAAGGAGTGTGTGATCGAGATGTTGTCGGAG
ATGTTCGGCGAGGCGGCCGTGGCCAAGATGTTCCGCGGAGAGCGACTCACGGTCACGCTC
AACGAGCGCCAGGCGCACCTAGACCTCGCCACCATGGAGGTGAAGTGTCCCGAGGACGAG
TCTCTGGAGCGCACAATCCAGTCCGCCATCAGCAAGCTGCACGCCGCCCTCTCGCCCGTC
CGGCCTCCCGCACCCTGA
Protein sequence:
MRCRCDSKYSTLHLRSHQAPGIDHFEDIGPCVIMASPGMMQSGLSRELFESWCTDPKNGV
IIAGYCVEGTLAKTILSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFINILKPP
HVVLVHGEQNEMSRLKAALQREHRGRLAIHTPRNTQQLALTFRGDKTAKVMGSLAMEAPV
PGAQLQGVLVKRNFNYHILAPSDLNKYTDLSQSSVSQRVSVWCGAPVGLVRHAVMRLAGP
VVFLSDTRWRLYGCIDLTLDLPLVTLEWQAAPVSDMFADAVVAALLAAPASAPGPAPNAP
LAHKLDKMHFKECVIEMLSEMFGEAAVAKMFRGERLTVTLNERQAHLDLATMEVKCPEDE
SLERTIQSAISKLHAALSPVRPPAP
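
The CDS length, nucleotide sequence, and protein sequence in a record like this can be cross-checked against each other. The following is a minimal sketch of such a check, not part of the database itself; the function name `check_cds` and the restriction to the three standard stop codons are assumptions made for illustration.

```python
# Hypothetical helper: sanity-check a CDS against its listed translation.
# Assumes the standard genetic code stop codons (TAA, TAG, TGA).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_cds(cds, protein):
    """Return a list of inconsistencies between a CDS and its protein."""
    # Drop line breaks and spaces so sequences can be pasted as shown above.
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()

    problems = []
    if len(cds) % 3 != 0:
        problems.append("CDS length is not a multiple of 3")
    if not cds.startswith("ATG"):
        problems.append("CDS does not start with ATG")
    if cds[-3:] not in STOP_CODONS:
        problems.append("CDS does not end with a stop codon")

    # One residue per codon, minus the terminal stop codon.
    expected = len(cds) // 3 - 1
    if len(protein) != expected:
        problems.append(
            f"protein has {len(protein)} residues, expected {expected}"
        )
    return problems
```

For this record, a CDS length of 1158 nt corresponds to 386 codons, that is, 385 residues plus a terminal stop codon (the nucleotide sequence ends in TGA). Passing the two sequences above to `check_cds` should return an empty list if the listing is intact.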