New model in OGS2.0 | DPOGS206430  |
---|---|
Genomic Position | scaffold1235:- 6672-17971 |
See gene structure | |
CDS Length | 2709 |
Paired RNAseq reads   | 776 |
Single RNAseq reads   | 2059 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA013842 (0.0) |
Best Drosophila hit   | cleavage and polyadenylation specificity factor 160, isoform A (3e-175) |
Best Human hit | cleavage and polyadenylation specificity factor subunit 1 (5e-158) |
Best NR hit (blastp)   | PREDICTED: similar to cleavage and polyadenylation specificity factor cpsf [Nasonia vitripennis] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to cleavage and polyadenylation specific factor 1 [Apis mellifera] (0.0) |
GeneOntology terms    | GO:0005847 mRNA cleavage and polyadenylation specificity factor complex GO:0006378 mRNA polyadenylation GO:0006379 mRNA cleavage GO:0003730 mRNA 3'-UTR binding GO:0003729 mRNA binding GO:0005515 protein binding |
InterPro families    | IPR019781 WD40 repeat, subgroup IPR015943 WD40/YVTN repeat-like-containing domain |
Orthology group | MCL12453 |
Nucleotide sequence:
ATGTTTTCTATTTGTCGTCAAACTCACCCTGCTACGGGTATTGAGCACGCGATTAGCTGC
TGTTTCTTTAATAATGATGAAGTGTGCCTGATTACTGCTGGTGCTAATATAATAAAGGTT
TTCAGGCTTTTGCCTGAGGGCCACGCGAAAGAGGTTAATGCCGCTGGTCAACCGATTCCA
CCTAAAATGAAACTAGAATGTCTAGCCTCATACACTCTCTGGGGCAATGTGATGTCAATA
GCATCAGTGAAATGTCCAAGTGCTGGTCGTGACTTACTGCTGGTGTCATTCAAGGAGGCA
AAGTTATCTGTAGTGCAATATGATCCGCAAGTTAATAATCTCATTACACTTAGTATGCAT
TACTTTGAAGAAGATGATATGAAGGGTGGATGGACGACTCATCCCCACATACCCTGGATA
CGAGTGGACCCAGAATTCAGATGTGCCGTAATGTTATTGTATGGAAGAAAGTTAGCGGTG
TTGCCGTTCAGGAAAGATATAACCTCAGAAGAGGGTGACCCTTTGGAGGCTAAGCCATTA
GATTGTAAGAAAAATCAACCAATACAAACCATATCCAGAGCACCAACGCTAGCATCTTAT
GTGATAATATTGAAAGAACTCGATGAAAAGATAGACAATATATTGGACATACAGTTTCTC
TATGGTTACTATGAACCGACATTGTTGTTATTGTATGAACCGGTTAGGACATTTGCTGGA
CGTACGGCCGTTCGTAACGATACCTGTGCGATGGCTGGTGTTAGTCTCAATATGAGCGCC
AGAGTACATCCTGTTATTTGGTCTATAGGAGGATTGCCGTTTGACTGTATACAGGCTGTC
CCGATTCAGAAACCTTTAGGCGGTTGTTTAATAATGGCTGTGAATTCTTTGATATATTTG
AATCAATCTGTGCCGCCATACGGGGTCTCTCTGAACAGTATTGCTACACATACCACTAAT
TTTCCTCTACGTATTCAAGAAGGCGTCTGTATAACGCTGGACGGCGCTAGAGTGGTAGCT
CTAGGTGATACTCGATTGTCGCTCGCCCTCAAGGGTGGTCAACTGTACGTGTTGACCTTA
CTATCGGACTCCGTGAGGAGTGTACGGAGCTTTCACCTGGACCGCGCCGCAGCCTCTGTG
TTGACGTCCTGTATGTGTGTGATCGAAGAAGATTTTCTATTTCTTGGATCGAGACTTGGA
AATTCTTTGCTATTGAGAGTGACCGAGAGGGAAAATAGAATGTTGTTCTCAGTGGACAAG
CCTTTAGAAGCTACGGTCGACCTGACACTGTCCGAAACCGATAAAGATAAAGAACCATTG
CCGAAGGAACCCCAAAAAGAAATGTTAGATCCGCAAGCGAAGAAGCGTCGTTTGGACACC
ATAAGCGACTGTGTCGCCACCAATGTGGTGGAAATATCGGACAAGGACGAGCTGGAGGTG
TACGGCTCTGACATACGGACCTCCACCCAGCTCACCAGCTATGTGTTCGAGGTATGCGAT
TCCCTGTTGAACATATGTCCTATCGGCGACGTGTCTATGGGGGAGCTCCAGTTGGTGTCC
GAGGAGGGAGCGGGGAGGAGGTCGAGGCCGGCCCTCGAAATGGTCGCGTGTAGCGGACGA
GGGAAGAACGGAGCTTTGGCTGTGTTACAACGGTCGCTCACACCGCAGCTACTCACTGCC
TTCGATCTACCAGGCTGTATCGATATGTGGACGGTGATCGGAGAGGCGACGGAAGTCAAT
AGAGAAGCCCACAAAGATATGGAAGGCAGCCATGCTTACTTGATACTGACACAGGAAGAC
TCGAGTATGATTCTTCAAACCGGCCAAGAGATAAACGAAGTGGATAATTCTGGTTTTATG
ACGAGCGCCCCCACGGTGTTCGCGGGTAACTTAGGGAACAACAGGTTTATGGTCCAAGTT
ACCACAACAGCTATAAGACTTGTGAGAAATGGCGTGTTGGTTCAGTCTATCACGTTAGAG
TGGACGGCCCGCAGCGCGTGCACCGCCGACCCCTACCTGTGTGTGGTGTCCACTTGCGGC
CGGGCGCTGGTGCTCGCGCTCAGGGAGCTGCGGGCCAGGGACGCCACGTCAGCTCGGCTC
GCGCCAACGAGACAGGCGGTGCCTCACAGACCGGCCTTACTGAAAGCCGTTCCTTATCGA
GATCTCAGTGGGCTATTCACCAGCACAGACGACAACATACAGGTCAAAGGTGAGTTCACG
GGTAAAATGAAAGAGAAAAATATCAAGGCTGAAGGTTTCAAGGCGGACACAGTGTATGAA
TTGAACGATGAAGATGAGTTACTGTATGGAGGAGATCAGACGCCAGCGTCCATGGCTAGT
GTGAAGATATGGCACATCCCTGATGGTGGCCTATCTATGCACCTCACCGACTGGCTGGTT
GAGCTCCACGGGCACAAGAGGCGTGTGGCCTACATAGAGTGGCATCCCACGGCTGAGAAC
ATACTGTTTAGTGCTGGATTCGATTATCTGATTTTAGCTGACAGCTTAGAGTCCGTTCCT
ATACCGAACCAGACGGATGAAGACGAATTCAATACAGGGCATAGTAGTAACGCGGAGAGA
CTTCAAGAAATCCTAGTCGTCGGCCTGGGACATAAGGGGTCGAGGGTTCTCATGTTGCTG
AGGTGTGATGACGACCAGCTGATGATATATCAGGTCTGTAGTAGTATAGAAGTACCAGCA
GTAGTATAG
Protein sequence:
MFSICRQTHPATGIEHAISCCFFNNDEVCLITAGANIIKVFRLLPEGHAKEVNAAGQPIP
PKMKLECLASYTLWGNVMSIASVKCPSAGRDLLLVSFKEAKLSVVQYDPQVNNLITLSMH
YFEEDDMKGGWTTHPHIPWIRVDPEFRCAVMLLYGRKLAVLPFRKDITSEEGDPLEAKPL
DCKKNQPIQTISRAPTLASYVIILKELDEKIDNILDIQFLYGYYEPTLLLLYEPVRTFAG
RTAVRNDTCAMAGVSLNMSARVHPVIWSIGGLPFDCIQAVPIQKPLGGCLIMAVNSLIYL
NQSVPPYGVSLNSIATHTTNFPLRIQEGVCITLDGARVVALGDTRLSLALKGGQLYVLTL
LSDSVRSVRSFHLDRAAASVLTSCMCVIEEDFLFLGSRLGNSLLLRVTERENRMLFSVDK
PLEATVDLTLSETDKDKEPLPKEPQKEMLDPQAKKRRLDTISDCVATNVVEISDKDELEV
YGSDIRTSTQLTSYVFEVCDSLLNICPIGDVSMGELQLVSEEGAGRRSRPALEMVACSGR
GKNGALAVLQRSLTPQLLTAFDLPGCIDMWTVIGEATEVNREAHKDMEGSHAYLILTQED
SSMILQTGQEINEVDNSGFMTSAPTVFAGNLGNNRFMVQVTTTAIRLVRNGVLVQSITLE
WTARSACTADPYLCVVSTCGRALVLALRELRARDATSARLAPTRQAVPHRPALLKAVPYR
DLSGLFTSTDDNIQVKGEFTGKMKEKNIKAEGFKADTVYELNDEDELLYGGDQTPASMAS
VKIWHIPDGGLSMHLTDWLVELHGHKRRVAYIEWHPTAENILFSAGFDYLILADSLESVP
IPNQTDEDEFNTGHSSNAERLQEILVVGLGHKGSRVLMLLRCDDDQLMIYQVCSSIEVPA
VV