DPGLEAN14387 in OGS1.0

New model in OGS2.0  DPOGS210492
Genomic Position  scaffold296:- 53051-57310
CDS Length  1158
Paired RNAseq reads  274
Single RNAseq reads  861
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA012583 (8e-103)
Best Drosophila hit  cleavage and polyadenylation specificity factor 73 (2e-103)
Best Human hit  cleavage and polyadenylation specificity factor subunit 3 (3e-91)
Best NR hit (blastp)  PREDICTED: similar to cleavage and polyadenylation specificity factor [Nasonia vitripennis] (5e-139)
Best NR hit (blastx)  PREDICTED: similar to cleavage and polyadenylation specificity factor [Nasonia vitripennis] (7e-122)
Gene Ontology terms
GO:0006379 mRNA cleavage
GO:0005847 mRNA cleavage and polyadenylation specificity factor complex
GO:0006378 mRNA polyadenylation
GO:0016787 hydrolase activity
GO:0006398 histone mRNA 3'-end processing
InterPro families
IPR021718 Pre-mRNA 3'-end-processing endonuclease polyadenylation factor C-term
IPR022712 Beta-Casp domain
IPR011108 RNA-metabolising metallo-beta-lactamase
Orthology group  MCL40761

Nucleotide sequence:

ATGAGGTGTCGCTGTGATTCCAAATACAGTACTCTTCATTTGAGATCCCACCAGGCACCG
GGCATCGATCACTTCGAGGACATAGGTCCGTGTGTGATCATGGCTTCCCCGGGTATGATG
CAGTCGGGCCTCTCCCGGGAACTGTTCGAGTCGTGGTGCACGGATCCCAAGAACGGCGTC
ATCATAGCAGGTTACTGCGTGGAAGGCACCCTGGCCAAAACTATACTGTCGGAGCCGGAA
GAGATCACGACTATGTCAGGACAGAAACTTCCGCTGAAGATGTCCGTGGATTACATATCG
TTCTCCGCGCACACGGACTACCAACAGACCTCAGAGTTTATCAACATTCTGAAGCCTCCT
CATGTGGTGTTAGTTCACGGGGAACAGAACGAGATGTCTCGTCTGAAGGCGGCCCTGCAG
CGCGAACACCGCGGCCGCCTCGCCATACACACGCCCAGGAACACGCAACAGCTGGCCCTC
ACCTTCAGAGGCGACAAGACCGCTAAGGTAATGGGGTCCCTGGCCATGGAGGCGCCGGTG
CCGGGCGCACAGCTCCAGGGTGTTCTGGTCAAGAGGAACTTTAACTATCACATCCTGGCG
CCCTCCGACTTGAACAAGTACACGGACCTGTCCCAGTCGTCGGTGTCTCAGCGCGTGTCA
GTGTGGTGCGGAGCTCCGGTGGGTCTGGTCCGACACGCCGTGATGCGCCTGGCGGGGCCC
GTGGTGTTCCTGAGCGACACTCGCTGGAGGCTCTACGGCTGCATCGACCTCACGCTGGAC
CTGCCGCTCGTCACGCTGGAGTGGCAGGCGGCGCCGGTGTCTGACATGTTCGCGGACGCG
GTGGTGGCGGCGCTGCTGGCGGCCCCGGCCTCCGCCCCCGGGCCCGCGCCCAACGCGCCC
CTCGCACACAAACTGGACAAGATGCATTTCAAGGAGTGTGTGATCGAGATGTTGTCGGAG
ATGTTCGGCGAGGCGGCCGTGGCCAAGATGTTCCGCGGAGAGCGACTCACGGTCACGCTC
AACGAGCGCCAGGCGCACCTAGACCTCGCCACCATGGAGGTGAAGTGTCCCGAGGACGAG
TCTCTGGAGCGCACAATCCAGTCCGCCATCAGCAAGCTGCACGCCGCCCTCTCGCCCGTC
CGGCCTCCCGCACCCTGA

Protein sequence:

MRCRCDSKYSTLHLRSHQAPGIDHFEDIGPCVIMASPGMMQSGLSRELFESWCTDPKNGV
IIAGYCVEGTLAKTILSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFINILKPP
HVVLVHGEQNEMSRLKAALQREHRGRLAIHTPRNTQQLALTFRGDKTAKVMGSLAMEAPV
PGAQLQGVLVKRNFNYHILAPSDLNKYTDLSQSSVSQRVSVWCGAPVGLVRHAVMRLAGP
VVFLSDTRWRLYGCIDLTLDLPLVTLEWQAAPVSDMFADAVVAALLAAPASAPGPAPNAP
LAHKLDKMHFKECVIEMLSEMFGEAAVAKMFRGERLTVTLNERQAHLDLATMEVKCPEDE
SLERTIQSAISKLHAALSPVRPPAP
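
Consistency check (illustrative sketch):

The record lists a CDS Length of 1158 and a protein of 385 residues, which agree if the CDS includes the terminal stop codon: 3 x (385 + 1) = 1158. The Python sketch below shows one way to check this, assuming the standard genetic code applies to this nuclear gene. The helper names `translate` and `lengths_consistent` are hypothetical and are not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# the standard nuclear code is the right table for this gene model).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; a stop codon is written as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks from the listing
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def lengths_consistent(cds_length: int, protein_length: int) -> bool:
    """True if the CDS length equals 3 * (residues + one stop codon)."""
    return cds_length == 3 * (protein_length + 1)


# First 15 and last 18 nucleotides of the sequence listed above.
print(translate("ATGAGGTGTCGCTGT"))      # start of the protein
print(translate("CGGCCTCCCGCACCCTGA"))   # end of the protein plus stop
print(lengths_consistent(1158, 385))
```

Passing the full nucleotide sequence to `translate` and stripping the trailing `*` should reproduce the protein sequence listed above if the record is internally consistent.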