DPGLEAN19195 in OGS1.0

New model in OGS2.0  DPOGS206430
Genomic Position  scaffold1235:- 6672-17971
CDS Length  2709
Paired RNAseq reads  776
Single RNAseq reads  2059
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA013842 (0.0)
Best Drosophila hit  cleavage and polyadenylation specificity factor 160, isoform A (3e-175)
Best Human hit  cleavage and polyadenylation specificity factor subunit 1 (5e-158)
Best NR hit (blastp)  PREDICTED: similar to cleavage and polyadenylation specificity factor cpsf [Nasonia vitripennis] (0.0)
Best NR hit (blastx)  PREDICTED: similar to cleavage and polyadenylation specific factor 1 [Apis mellifera] (0.0)
GeneOntology terms
GO:0005847 mRNA cleavage and polyadenylation specificity factor complex
GO:0006378 mRNA polyadenylation
GO:0006379 mRNA cleavage
GO:0003730 mRNA 3'-UTR binding
GO:0003729 mRNA binding
GO:0005515 protein binding
InterPro families
IPR019781 WD40 repeat, subgroup
IPR015943 WD40/YVTN repeat-like-containing domain
Orthology group  MCL12453
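
The Genomic Position field above packs scaffold, strand and coordinates into one string. A minimal sketch of how it could be split into parts is shown here. The names `Locus` and `parse_locus` are hypothetical and not part of the database, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """Hypothetical container for a 'scaffold:strand start-end' field."""
    scaffold: str
    strand: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings such as "scaffold1235:- 6672-17971".
LOCUS_RE = re.compile(
    r"^(?P<scaffold>[^:\s]+):(?P<strand>[+-])\s*(?P<start>\d+)-(?P<end>\d+)$"
)


def parse_locus(text: str) -> Locus:
    """Split a genomic position string into scaffold, strand, start, end."""
    match = LOCUS_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised genomic position: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return Locus(match["scaffold"], match["strand"], start, end)


locus = parse_locus("scaffold1235:- 6672-17971")
print(locus.scaffold, locus.strand, locus.span)
```

Under the inclusive-coordinate assumption the gene model spans 11300 bp on the minus strand, compared with a CDS length of 2709.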

Nucleotide sequence:

ATGTTTTCTATTTGTCGTCAAACTCACCCTGCTACGGGTATTGAGCACGCGATTAGCTGC
TGTTTCTTTAATAATGATGAAGTGTGCCTGATTACTGCTGGTGCTAATATAATAAAGGTT
TTCAGGCTTTTGCCTGAGGGCCACGCGAAAGAGGTTAATGCCGCTGGTCAACCGATTCCA
CCTAAAATGAAACTAGAATGTCTAGCCTCATACACTCTCTGGGGCAATGTGATGTCAATA
GCATCAGTGAAATGTCCAAGTGCTGGTCGTGACTTACTGCTGGTGTCATTCAAGGAGGCA
AAGTTATCTGTAGTGCAATATGATCCGCAAGTTAATAATCTCATTACACTTAGTATGCAT
TACTTTGAAGAAGATGATATGAAGGGTGGATGGACGACTCATCCCCACATACCCTGGATA
CGAGTGGACCCAGAATTCAGATGTGCCGTAATGTTATTGTATGGAAGAAAGTTAGCGGTG
TTGCCGTTCAGGAAAGATATAACCTCAGAAGAGGGTGACCCTTTGGAGGCTAAGCCATTA
GATTGTAAGAAAAATCAACCAATACAAACCATATCCAGAGCACCAACGCTAGCATCTTAT
GTGATAATATTGAAAGAACTCGATGAAAAGATAGACAATATATTGGACATACAGTTTCTC
TATGGTTACTATGAACCGACATTGTTGTTATTGTATGAACCGGTTAGGACATTTGCTGGA
CGTACGGCCGTTCGTAACGATACCTGTGCGATGGCTGGTGTTAGTCTCAATATGAGCGCC
AGAGTACATCCTGTTATTTGGTCTATAGGAGGATTGCCGTTTGACTGTATACAGGCTGTC
CCGATTCAGAAACCTTTAGGCGGTTGTTTAATAATGGCTGTGAATTCTTTGATATATTTG
AATCAATCTGTGCCGCCATACGGGGTCTCTCTGAACAGTATTGCTACACATACCACTAAT
TTTCCTCTACGTATTCAAGAAGGCGTCTGTATAACGCTGGACGGCGCTAGAGTGGTAGCT
CTAGGTGATACTCGATTGTCGCTCGCCCTCAAGGGTGGTCAACTGTACGTGTTGACCTTA
CTATCGGACTCCGTGAGGAGTGTACGGAGCTTTCACCTGGACCGCGCCGCAGCCTCTGTG
TTGACGTCCTGTATGTGTGTGATCGAAGAAGATTTTCTATTTCTTGGATCGAGACTTGGA
AATTCTTTGCTATTGAGAGTGACCGAGAGGGAAAATAGAATGTTGTTCTCAGTGGACAAG
CCTTTAGAAGCTACGGTCGACCTGACACTGTCCGAAACCGATAAAGATAAAGAACCATTG
CCGAAGGAACCCCAAAAAGAAATGTTAGATCCGCAAGCGAAGAAGCGTCGTTTGGACACC
ATAAGCGACTGTGTCGCCACCAATGTGGTGGAAATATCGGACAAGGACGAGCTGGAGGTG
TACGGCTCTGACATACGGACCTCCACCCAGCTCACCAGCTATGTGTTCGAGGTATGCGAT
TCCCTGTTGAACATATGTCCTATCGGCGACGTGTCTATGGGGGAGCTCCAGTTGGTGTCC
GAGGAGGGAGCGGGGAGGAGGTCGAGGCCGGCCCTCGAAATGGTCGCGTGTAGCGGACGA
GGGAAGAACGGAGCTTTGGCTGTGTTACAACGGTCGCTCACACCGCAGCTACTCACTGCC
TTCGATCTACCAGGCTGTATCGATATGTGGACGGTGATCGGAGAGGCGACGGAAGTCAAT
AGAGAAGCCCACAAAGATATGGAAGGCAGCCATGCTTACTTGATACTGACACAGGAAGAC
TCGAGTATGATTCTTCAAACCGGCCAAGAGATAAACGAAGTGGATAATTCTGGTTTTATG
ACGAGCGCCCCCACGGTGTTCGCGGGTAACTTAGGGAACAACAGGTTTATGGTCCAAGTT
ACCACAACAGCTATAAGACTTGTGAGAAATGGCGTGTTGGTTCAGTCTATCACGTTAGAG
TGGACGGCCCGCAGCGCGTGCACCGCCGACCCCTACCTGTGTGTGGTGTCCACTTGCGGC
CGGGCGCTGGTGCTCGCGCTCAGGGAGCTGCGGGCCAGGGACGCCACGTCAGCTCGGCTC
GCGCCAACGAGACAGGCGGTGCCTCACAGACCGGCCTTACTGAAAGCCGTTCCTTATCGA
GATCTCAGTGGGCTATTCACCAGCACAGACGACAACATACAGGTCAAAGGTGAGTTCACG
GGTAAAATGAAAGAGAAAAATATCAAGGCTGAAGGTTTCAAGGCGGACACAGTGTATGAA
TTGAACGATGAAGATGAGTTACTGTATGGAGGAGATCAGACGCCAGCGTCCATGGCTAGT
GTGAAGATATGGCACATCCCTGATGGTGGCCTATCTATGCACCTCACCGACTGGCTGGTT
GAGCTCCACGGGCACAAGAGGCGTGTGGCCTACATAGAGTGGCATCCCACGGCTGAGAAC
ATACTGTTTAGTGCTGGATTCGATTATCTGATTTTAGCTGACAGCTTAGAGTCCGTTCCT
ATACCGAACCAGACGGATGAAGACGAATTCAATACAGGGCATAGTAGTAACGCGGAGAGA
CTTCAAGAAATCCTAGTCGTCGGCCTGGGACATAAGGGGTCGAGGGTTCTCATGTTGCTG
AGGTGTGATGACGACCAGCTGATGATATATCAGGTCTGTAGTAGTATAGAAGTACCAGCA
GTAGTATAG

Protein sequence:

MFSICRQTHPATGIEHAISCCFFNNDEVCLITAGANIIKVFRLLPEGHAKEVNAAGQPIP
PKMKLECLASYTLWGNVMSIASVKCPSAGRDLLLVSFKEAKLSVVQYDPQVNNLITLSMH
YFEEDDMKGGWTTHPHIPWIRVDPEFRCAVMLLYGRKLAVLPFRKDITSEEGDPLEAKPL
DCKKNQPIQTISRAPTLASYVIILKELDEKIDNILDIQFLYGYYEPTLLLLYEPVRTFAG
RTAVRNDTCAMAGVSLNMSARVHPVIWSIGGLPFDCIQAVPIQKPLGGCLIMAVNSLIYL
NQSVPPYGVSLNSIATHTTNFPLRIQEGVCITLDGARVVALGDTRLSLALKGGQLYVLTL
LSDSVRSVRSFHLDRAAASVLTSCMCVIEEDFLFLGSRLGNSLLLRVTERENRMLFSVDK
PLEATVDLTLSETDKDKEPLPKEPQKEMLDPQAKKRRLDTISDCVATNVVEISDKDELEV
YGSDIRTSTQLTSYVFEVCDSLLNICPIGDVSMGELQLVSEEGAGRRSRPALEMVACSGR
GKNGALAVLQRSLTPQLLTAFDLPGCIDMWTVIGEATEVNREAHKDMEGSHAYLILTQED
SSMILQTGQEINEVDNSGFMTSAPTVFAGNLGNNRFMVQVTTTAIRLVRNGVLVQSITLE
WTARSACTADPYLCVVSTCGRALVLALRELRARDATSARLAPTRQAVPHRPALLKAVPYR
DLSGLFTSTDDNIQVKGEFTGKMKEKNIKAEGFKADTVYELNDEDELLYGGDQTPASMAS
VKIWHIPDGGLSMHLTDWLVELHGHKRRVAYIEWHPTAENILFSAGFDYLILADSLESVP
IPNQTDEDEFNTGHSSNAERLQEILVVGLGHKGSRVLMLLRCDDDQLMIYQVCSSIEVPA
VV
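
A CDS Length of 2709 corresponds to 903 codons, that is, 902 residues plus one stop codon, and the nucleotide listing ends in TAG. The sketch below shows one way such a record could be sanity-checked. The function name `check_cds` is hypothetical, the check assumes a complete CDS with the standard ATG start and standard stop codons, and it compares lengths only, without translating the sequence.

```python
from typing import List

# Assumption: standard genetic code stop codons.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_cds(cds: str, protein: str) -> List[str]:
    """Return a list of inconsistencies; an empty list means none were found."""
    # Drop line breaks and spaces so multi-line listings can be pasted directly.
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()

    problems = []
    if len(cds) % 3 != 0:
        problems.append(f"CDS length {len(cds)} is not a multiple of 3")
    if not cds.startswith("ATG"):
        problems.append("CDS does not start with ATG")
    if cds[-3:] not in STOP_CODONS:
        problems.append("CDS does not end with a stop codon")

    # One codon per residue, plus the terminal stop codon.
    expected_residues = len(cds) // 3 - 1
    if len(protein) != expected_residues:
        problems.append(
            f"protein has {len(protein)} residues, expected {expected_residues}"
        )
    return problems


# Toy example (not the record's sequence): ATG TTT TAG encodes M, F, stop.
print(check_cds("ATGTTTTAG", "MF"))
```

Passing the full nucleotide and protein listings from this record as the two arguments would apply the same checks to DPOGS206430.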