New model in OGS2.0 | DPOGS214715  |
---|---|
Genomic Position | scaffold559:- 27507-45087 |
See gene structure | |
CDS Length | 2457 |
Paired RNAseq reads   | 1228 |
Single RNAseq reads   | 2926 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA008387 (3e-22) |
Best Drosophila hit   | cleavage and polyadenylation specificity factor 100, isoform A (0.0) |
Best Human hit | cleavage and polyadenylation specificity factor subunit 2 (4e-176) |
Best NR hit (blastp)   | PREDICTED: similar to Probable cleavage and polyadenylation specificity factor, 100 kDa subunit (CPSF 100 kDa subunit) [Apis mellifera] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to Probable cleavage and polyadenylation specificity factor, 100 kDa subunit (CPSF 100 kDa subunit) [Apis mellifera] (0.0) |
GeneOntology terms    | GO:0006379 mRNA cleavage GO:0005847 mRNA cleavage and polyadenylation specificity factor complex GO:0006378 mRNA polyadenylation GO:0003730 mRNA 3'-UTR binding GO:0016787 hydrolase activity GO:0006398 histone mRNA 3'-end processing |
InterPro families    | IPR022712 Beta-Casp domain IPR001279 Beta-lactamase-like |
Orthology group | MCL15608 |
Nucleotide sequence:
ATGACTTCTATTATTAAATTCCATTGCCTCTCAGGGGCTGGAGACGAGTCTCCTCCCTGC
TACGTGTTGCAAGTGGATGAATTTAAATTCCTCTTGGACTGTGGATGGGATGAAAAATTT
GATATGGATTTTATAAAGGAACTTAAAAGACATGTCAACTCTATAGATGCAGTCCTACTG
TCACATTCAGATCCCCTTCATCTCGGGGCCCTACCATATGCTGTCGGACAGCTCGGTTTA
AACTGTCCTATATATGCCACCCTCCCAATATACAAGATGGGCCAAATGTTCATGTATGAT
CTCTACCAATCACATAAAAATGTCTCCGAGTTTGATCTGTTCACATTAGATGATGTGGAC
ACAGCATTTGATAGAATCACACAACTTAAATATAATCAGAGTGTTGATATGAAGGGTAAA
GGGCTAGGCCTGCGTATAACTCCACTGCCAGCCGGACACCTCCTGGGCGGAACTGTGTGG
CGTATTGCAGCCCCAGGGGAAGAAGACATAGTGTACGCACCAGACTTCAACCACAAAAAG
GAGCGGCATCTGAATGGGTGCGAGATTGAGAAGATTATGAGGCCTTCATTACTGCTGCTC
GGAGCTATGAATGCTGATTACGTGCAGCAGAGACGGCGGCTAAGGGACGAAAAACTTATG
ACAACAATCCTTAGTACACTTCGGGGTGGTGGTTCAGTACTGGTGTGTACGGACACCGCG
GGACGGGTTCTAGAGCTGGCCCATATGTTGGACCAACTTTGGAGGAACAAGGATTCTGGT
CTTGTTGCATATTCTCTGTTGTTGTTGTCCAACGTCAGCTATAATGTTGTGGAGTTTGCC
AAGTCACAGATCGAATGGATGAGCGACAAATTGACCCGCGCCTTCGAAGGAGCTAGAAGC
AACCCTTTCGCGCTGAGGCACTTGCAACTGTGTCACTCCGTAGTCGAGGTCACTCGGACC
CCGGGGCCCAAAGTGGTGCTGGCGTCCTTCCCAGACTTAGAGACCGGTTTCGCAAGAGAT
CTTTTCCTGCAATGGGCCCCTAATTCACAGAATTCTATAGTACTAACTGCAAGGACCTCT
CCGGGGACCCTCGCCAGGGATCTGATTGAGAAAGGCGGTGACCGCACCATAGAATTGACG
GTGAGGAGGCGGGTCCGGCTGGAGGGGGCGGAGCTTGAGGAGTTCATGCAACAGAGGGTC
AAGGTCAACAACTCGGTCAAAGAGGAGACCGGTGGTATATCATCCGACTCCGAGTCCGAG
GGTGAGTTGGAGATGTGCGTGGTGACCGGCAAACACGACATACCGGTCCGGGGGGACGCC
AGGCCCGCGGGGTGCTTCAAGAGCAACAAAAGACACCACGCCATGTACCCCTGTACCGAG
GAAAGAGCGAGGGCCGACGACTACGGAGAGATTATACGGCCTGAAGACTACCGCCTGGCG
GAGGTCGTGGACGCCGAGGGAGAGATTCGGGACGTGCCGCCCGCCCCGACACACACACAG
GAACCGGAAGAGGAGATAACAGAGATCCCGAGTAAGTGTATCACGGCGACCAAGCAGCTG
CAGGTGAAGGCCAGCATCCAGTACATAGAACTGGAGGGCCGCTGTGACGGAGAGTCACTG
CTGCGAGTGGTGGCGGCCGCCAAACCTCGGGCGGTGGTGGCCCTGAGAGCCGGACCTACG
GCACTGGCCACCCTCAAAAAGCACTGTGACAGTGAGGGTATCGAGAAAGTCTTCACACCG
GGCCGCGGCGACACAGTGGATGCGACCACGGAGTCTCATATCTACCAGGTGAAGTTAACG
GACAGTGTGATGTGCGGTTTGTCCTGGCGCTCGGCCGGGGACGCGGAGCTGGCGTGGCTG
TCGGCCGTGGTGGCGCAGCCGAGGACCCGGGACACGCCCAGCGAGGAAGTGGCGGATGTG
GAGATGATGTCGCTGGAGGCTGCGGAGGGCGTGCCTCACGGCGCGTGGTTCGTGAACAGT
GTGAGGCTCTCGGAGCTGAGGGCGGCGCTCGCCCGGAACGGCCTCGGGGCGGAGTTCAGT
GCCGGGGCCCTGGAGTGCTGCAACGGAACCATCGCTATACGAAGATTGGAGAACGGTCGC
GTCGCCCTCGAGGGAGTGCTCTCTGAGGAGTATTTCAAAGTGCGGGAACTTTTGTACGAC
CAGTTCGCTATAGTTAAGAGACCGCGGACGGCTCCCAGTGGAAAGGATCTGTCGTTACTA
TTGAGACTCGACTCCAGGAACCGCCGGCATCGGCGCACAAAACACACCGACGCGAGTCTG
CTCACTGACGGGGAGCCAAACAGATCGCGTCTCGATGGAACCGCGAGCCTCGCCTCAACA
CCTATCGGAGCCCGGTGGAGGGGGCCGGGGGTCGGGGGGAGTGCGGCGCGTCTAGACTGC
GGCCTTGACCGCGCCTGTTTCCGCGCGTTCGAGGTCGCCTCTACTCGCAGCCTATAG
Protein sequence:
MTSIIKFHCLSGAGDESPPCYVLQVDEFKFLLDCGWDEKFDMDFIKELKRHVNSIDAVLL
SHSDPLHLGALPYAVGQLGLNCPIYATLPIYKMGQMFMYDLYQSHKNVSEFDLFTLDDVD
TAFDRITQLKYNQSVDMKGKGLGLRITPLPAGHLLGGTVWRIAAPGEEDIVYAPDFNHKK
ERHLNGCEIEKIMRPSLLLLGAMNADYVQQRRRLRDEKLMTTILSTLRGGGSVLVCTDTA
GRVLELAHMLDQLWRNKDSGLVAYSLLLLSNVSYNVVEFAKSQIEWMSDKLTRAFEGARS
NPFALRHLQLCHSVVEVTRTPGPKVVLASFPDLETGFARDLFLQWAPNSQNSIVLTARTS
PGTLARDLIEKGGDRTIELTVRRRVRLEGAELEEFMQQRVKVNNSVKEETGGISSDSESE
GELEMCVVTGKHDIPVRGDARPAGCFKSNKRHHAMYPCTEERARADDYGEIIRPEDYRLA
EVVDAEGEIRDVPPAPTHTQEPEEEITEIPSKCITATKQLQVKASIQYIELEGRCDGESL
LRVVAAAKPRAVVALRAGPTALATLKKHCDSEGIEKVFTPGRGDTVDATTESHIYQVKLT
DSVMCGLSWRSAGDAELAWLSAVVAQPRTRDTPSEEVADVEMMSLEAAEGVPHGAWFVNS
VRLSELRAALARNGLGAEFSAGALECCNGTIAIRRLENGRVALEGVLSEEYFKVRELLYD
QFAIVKRPRTAPSGKDLSLLLRLDSRNRRHRRTKHTDASLLTDGEPNRSRLDGTASLAST
PIGARWRGPGVGGSAARLDCGLDRACFRAFEVASTRSL