DPGLEAN11057 in OGS1.0

New model in OGS2.0DPOGS214715 
Genomic Positionscaffold559:- 27507-45087
See gene structure
CDS Length2457
Paired RNAseq reads  1228
Single RNAseq reads  2926
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008387 (3e-22)
Best Drosophila hit  cleavage and polyadenylation specificity factor 100, isoform A (0.0)
Best Human hitcleavage and polyadenylation specificity factor subunit 2 (4e-176)
Best NR hit (blastp)  PREDICTED: similar to Probable cleavage and polyadenylation specificity factor, 100 kDa subunit (CPSF 100 kDa subunit) [Apis mellifera] (0.0)
Best NR hit (blastx)  PREDICTED: similar to Probable cleavage and polyadenylation specificity factor, 100 kDa subunit (CPSF 100 kDa subunit) [Apis mellifera] (0.0)
GeneOntology terms




  
GO:0006379 mRNA cleavage
GO:0005847 mRNA cleavage and polyadenylation specificity factor complex
GO:0006378 mRNA polyadenylation
GO:0003730 mRNA 3'-UTR binding
GO:0016787 hydrolase activity
GO:0006398 histone mRNA 3'-end processing
InterPro families
  
IPR022712 Beta-Casp domain
IPR001279 Beta-lactamase-like
Orthology groupMCL15608

Nucleotide sequence:

ATGACTTCTATTATTAAATTCCATTGCCTCTCAGGGGCTGGAGACGAGTCTCCTCCCTGC
TACGTGTTGCAAGTGGATGAATTTAAATTCCTCTTGGACTGTGGATGGGATGAAAAATTT
GATATGGATTTTATAAAGGAACTTAAAAGACATGTCAACTCTATAGATGCAGTCCTACTG
TCACATTCAGATCCCCTTCATCTCGGGGCCCTACCATATGCTGTCGGACAGCTCGGTTTA
AACTGTCCTATATATGCCACCCTCCCAATATACAAGATGGGCCAAATGTTCATGTATGAT
CTCTACCAATCACATAAAAATGTCTCCGAGTTTGATCTGTTCACATTAGATGATGTGGAC
ACAGCATTTGATAGAATCACACAACTTAAATATAATCAGAGTGTTGATATGAAGGGTAAA
GGGCTAGGCCTGCGTATAACTCCACTGCCAGCCGGACACCTCCTGGGCGGAACTGTGTGG
CGTATTGCAGCCCCAGGGGAAGAAGACATAGTGTACGCACCAGACTTCAACCACAAAAAG
GAGCGGCATCTGAATGGGTGCGAGATTGAGAAGATTATGAGGCCTTCATTACTGCTGCTC
GGAGCTATGAATGCTGATTACGTGCAGCAGAGACGGCGGCTAAGGGACGAAAAACTTATG
ACAACAATCCTTAGTACACTTCGGGGTGGTGGTTCAGTACTGGTGTGTACGGACACCGCG
GGACGGGTTCTAGAGCTGGCCCATATGTTGGACCAACTTTGGAGGAACAAGGATTCTGGT
CTTGTTGCATATTCTCTGTTGTTGTTGTCCAACGTCAGCTATAATGTTGTGGAGTTTGCC
AAGTCACAGATCGAATGGATGAGCGACAAATTGACCCGCGCCTTCGAAGGAGCTAGAAGC
AACCCTTTCGCGCTGAGGCACTTGCAACTGTGTCACTCCGTAGTCGAGGTCACTCGGACC
CCGGGGCCCAAAGTGGTGCTGGCGTCCTTCCCAGACTTAGAGACCGGTTTCGCAAGAGAT
CTTTTCCTGCAATGGGCCCCTAATTCACAGAATTCTATAGTACTAACTGCAAGGACCTCT
CCGGGGACCCTCGCCAGGGATCTGATTGAGAAAGGCGGTGACCGCACCATAGAATTGACG
GTGAGGAGGCGGGTCCGGCTGGAGGGGGCGGAGCTTGAGGAGTTCATGCAACAGAGGGTC
AAGGTCAACAACTCGGTCAAAGAGGAGACCGGTGGTATATCATCCGACTCCGAGTCCGAG
GGTGAGTTGGAGATGTGCGTGGTGACCGGCAAACACGACATACCGGTCCGGGGGGACGCC
AGGCCCGCGGGGTGCTTCAAGAGCAACAAAAGACACCACGCCATGTACCCCTGTACCGAG
GAAAGAGCGAGGGCCGACGACTACGGAGAGATTATACGGCCTGAAGACTACCGCCTGGCG
GAGGTCGTGGACGCCGAGGGAGAGATTCGGGACGTGCCGCCCGCCCCGACACACACACAG
GAACCGGAAGAGGAGATAACAGAGATCCCGAGTAAGTGTATCACGGCGACCAAGCAGCTG
CAGGTGAAGGCCAGCATCCAGTACATAGAACTGGAGGGCCGCTGTGACGGAGAGTCACTG
CTGCGAGTGGTGGCGGCCGCCAAACCTCGGGCGGTGGTGGCCCTGAGAGCCGGACCTACG
GCACTGGCCACCCTCAAAAAGCACTGTGACAGTGAGGGTATCGAGAAAGTCTTCACACCG
GGCCGCGGCGACACAGTGGATGCGACCACGGAGTCTCATATCTACCAGGTGAAGTTAACG
GACAGTGTGATGTGCGGTTTGTCCTGGCGCTCGGCCGGGGACGCGGAGCTGGCGTGGCTG
TCGGCCGTGGTGGCGCAGCCGAGGACCCGGGACACGCCCAGCGAGGAAGTGGCGGATGTG
GAGATGATGTCGCTGGAGGCTGCGGAGGGCGTGCCTCACGGCGCGTGGTTCGTGAACAGT
GTGAGGCTCTCGGAGCTGAGGGCGGCGCTCGCCCGGAACGGCCTCGGGGCGGAGTTCAGT
GCCGGGGCCCTGGAGTGCTGCAACGGAACCATCGCTATACGAAGATTGGAGAACGGTCGC
GTCGCCCTCGAGGGAGTGCTCTCTGAGGAGTATTTCAAAGTGCGGGAACTTTTGTACGAC
CAGTTCGCTATAGTTAAGAGACCGCGGACGGCTCCCAGTGGAAAGGATCTGTCGTTACTA
TTGAGACTCGACTCCAGGAACCGCCGGCATCGGCGCACAAAACACACCGACGCGAGTCTG
CTCACTGACGGGGAGCCAAACAGATCGCGTCTCGATGGAACCGCGAGCCTCGCCTCAACA
CCTATCGGAGCCCGGTGGAGGGGGCCGGGGGTCGGGGGGAGTGCGGCGCGTCTAGACTGC
GGCCTTGACCGCGCCTGTTTCCGCGCGTTCGAGGTCGCCTCTACTCGCAGCCTATAG

Protein sequence:

MTSIIKFHCLSGAGDESPPCYVLQVDEFKFLLDCGWDEKFDMDFIKELKRHVNSIDAVLL
SHSDPLHLGALPYAVGQLGLNCPIYATLPIYKMGQMFMYDLYQSHKNVSEFDLFTLDDVD
TAFDRITQLKYNQSVDMKGKGLGLRITPLPAGHLLGGTVWRIAAPGEEDIVYAPDFNHKK
ERHLNGCEIEKIMRPSLLLLGAMNADYVQQRRRLRDEKLMTTILSTLRGGGSVLVCTDTA
GRVLELAHMLDQLWRNKDSGLVAYSLLLLSNVSYNVVEFAKSQIEWMSDKLTRAFEGARS
NPFALRHLQLCHSVVEVTRTPGPKVVLASFPDLETGFARDLFLQWAPNSQNSIVLTARTS
PGTLARDLIEKGGDRTIELTVRRRVRLEGAELEEFMQQRVKVNNSVKEETGGISSDSESE
GELEMCVVTGKHDIPVRGDARPAGCFKSNKRHHAMYPCTEERARADDYGEIIRPEDYRLA
EVVDAEGEIRDVPPAPTHTQEPEEEITEIPSKCITATKQLQVKASIQYIELEGRCDGESL
LRVVAAAKPRAVVALRAGPTALATLKKHCDSEGIEKVFTPGRGDTVDATTESHIYQVKLT
DSVMCGLSWRSAGDAELAWLSAVVAQPRTRDTPSEEVADVEMMSLEAAEGVPHGAWFVNS
VRLSELRAALARNGLGAEFSAGALECCNGTIAIRRLENGRVALEGVLSEEYFKVRELLYD
QFAIVKRPRTAPSGKDLSLLLRLDSRNRRHRRTKHTDASLLTDGEPNRSRLDGTASLAST
PIGARWRGPGVGGSAARLDCGLDRACFRAFEVASTRSL