DPGLEAN19550 in OGS1.0

New model in OGS2.0DPOGS205980 
Genomic Positionscaffold970:+ 18566-27961
See gene structure
CDS Length2073
Paired RNAseq reads  2539
Single RNAseq reads  7747
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA009409 (6e-29)
Best Drosophila hit  CG7185 (1e-46)
Best Human hitcleavage and polyadenylation specificity factor subunit 6 (2e-34)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC008014 [Tribolium castaneum] (1e-92)
Best NR hit (blastx)  PREDICTED: similar to CG7185-PA [Apis mellifera] (5e-56)
GeneOntology terms





  
GO:0003729 mRNA binding
GO:0006379 mRNA cleavage
GO:0005849 mRNA cleavage factor complex
GO:0003676 nucleic acid binding
GO:0000166 nucleotide binding
GO:0000381 regulation of alternative nuclear mRNA splicing, via spliceosome
GO:0005634 nucleus
InterPro families
  
IPR000504 RNA recognition motif domain
IPR012677 Nucleotide-binding, alpha-beta plait
Orthology groupMCL40419

Nucleotide sequence:

ATGGCGGATGGAGGCGTGGATATAGATTTATACGCTGATGATATTGAATCTGATTTTAAT
AGACAGGATGACTTTGGAGGAGAAAATGTCGACTTGTACGACGACGTAATAGCGGCGCCA
ACAGTGAAAACCGAGGATGGAGATGGGCCACCGAACTCATCAGCCGCTCCGCACTCCCAT
CCTCCTGAGGAGACAAACGGTTCCGTTCCTTACCACAACAACGCACCCAGTCACGGACAT
CATGGCCGCCGTTTTCAACTTTACGTCGGCAACCTCACTTGGTGGGCTACAGATCAAGAC
ATAGCTAACGCGATCGCTGATATCGGGGTCACGGATTTTCAAGATGTCAAGTTCTTCGAG
AACAGGGCGAATGGGCAGTCGAAGGGATTCTGCGTCGTTTCTCTGGGATCCGACCAGTCC
ATCAGAATGGTCATGGATCGACTACCCAAGAAGGAGATACATGGGCAGCACCCAGTAGTC
ACGCTACCCACCAAACAGGCTTTAAACCAGTTTGAAAGTCAATCTAAGACGCGATCGACC
CCGCCGGGTCCTAATCCTGGGATGAGGGGAGGACCTCACGGTCCAGGAGGACCGCCTGGA
CCCCATCCAGGAGAGTTTTTCGGTGGAGGCCCTAACGGTCCGGGTCCGAACGGGCCACGG
ATGATGATGCCAGGCCCGGGTCCGCATCACCAGCTGCGAGGTCCGCCGCCAGGGCCTCAT
GGCCCTCCGCCGCACCACATGCCACAACACCAGGGCCCCCCGCCGCATCACATGCCTCCA
CATCAAGGCCCCCCACCACATCAAGGGCCCCCGCAGCATAGGCCACCGATGCAGTTCCAG
GGTCCGCCTCAAATGCAGCGTACGGGTCCAGGAGGTCCCGGTCCGGGCGGTCCCGGCCCC
GGGCCTGGTCCTCGTGGTCCTGAATGGCCTCGTCACTTACCGCCCGCGCCACAGTACGGA
CCCCCGCAGCATCAAATGCCGCATCAGATGCCTCCACCTCATGCCGGCCTGCCTCCGCCG
CATCATCAGCTGCCGCCTCATCAGCAAGGCCCGCCCCGCGGACCGGCGCCATTACCGCAA
CACCCAGGTGTGGGCGGAGCGCCGGCTCCTCATGTGAACCCGGCCTTCTTCAGTCAACAG
CCTCCAGTACAACCCGCCCCCGCTGTACAGCCGCCGCAGCCCGCCCCGCCAGGACCGGGA
GGACCCTACGGCCACCCAGCGCACGCGCCCCCGCCCACACAGGGACCCCCGCCAGGCGCC
AGGCAGCCTTATGGAAGACCTCCAGCCAACTATCCCCCTACAGACGGTCGAGCTGCACAC
CACGCGGCGCCGCTGGGGGTCCCACACGCGCACGCTGCCCTAGCGCCGCACGTCCCCCAC
GCTCCCATCAGCGAGGCGGAGTTCGAGGAGGTGATGAGCAGGAACAGGACCGTTTCCTCG
AGCGCCATCGCCAGGGCTGTGAGCGACGCGGCCGCCGGGGAGTACGCCTCCGCCATAGAG
ACGCTCGTCACCGCCATATCCCTCATAAAGCAGAGCAAGGTGGCCCACGACGATCGATGC
AAAATATTAATATCGTCTCTACAAGACACTCTACACGGTGTAGAGACTAAGTCGTACGGC
GGCGAACGAAGACGATCGCGCTCTAGGGAGAGAGATGCTAGAGCTCATCACAGAGCCCCG
AGACGAAGGGAACGATCGGCTTCCAGATACCGGGATAGATCAAGGGATAGAGACGACAGG
GATCGGTACAGGGAAAGGTCGAGGGAACGCGACCGTGAGCGCGACCGAGACGCGTACTAC
AGGGACTATCGCGAGAGGGAAAGGGACAGGTCTAGGTCGAGGGACAGAGAACGCACGGAT
CATTATAATAGAGGACACAGTCGCGAGGAAAGGCCACGTAAGTCGCCTGTAGAAGCGGCG
GCGGAGGGAGGCGCCGGGGACGCGGCGGCCAAGGGTCGGGCGGCGGCGCCCTACTACGAC
GAGAGGTACCGCGAGAGACGGGACCGCGAGCCCCCGCCGGCAGACAGAGACCGCGACCGG
GACCGCGACCATCGCCGGGACGCCAGGCACTAG

Protein sequence:

MADGGVDIDLYADDIESDFNRQDDFGGENVDLYDDVIAAPTVKTEDGDGPPNSSAAPHSH
PPEETNGSVPYHNNAPSHGHHGRRFQLYVGNLTWWATDQDIANAIADIGVTDFQDVKFFE
NRANGQSKGFCVVSLGSDQSIRMVMDRLPKKEIHGQHPVVTLPTKQALNQFESQSKTRST
PPGPNPGMRGGPHGPGGPPGPHPGEFFGGGPNGPGPNGPRMMMPGPGPHHQLRGPPPGPH
GPPPHHMPQHQGPPPHHMPPHQGPPPHQGPPQHRPPMQFQGPPQMQRTGPGGPGPGGPGP
GPGPRGPEWPRHLPPAPQYGPPQHQMPHQMPPPHAGLPPPHHQLPPHQQGPPRGPAPLPQ
HPGVGGAPAPHVNPAFFSQQPPVQPAPAVQPPQPAPPGPGGPYGHPAHAPPPTQGPPPGA
RQPYGRPPANYPPTDGRAAHHAAPLGVPHAHAALAPHVPHAPISEAEFEEVMSRNRTVSS
SAIARAVSDAAAGEYASAIETLVTAISLIKQSKVAHDDRCKILISSLQDTLHGVETKSYG
GERRRSRSRERDARAHHRAPRRRERSASRYRDRSRDRDDRDRYRERSRERDRERDRDAYY
RDYRERERDRSRSRDRERTDHYNRGHSREERPRKSPVEAAAEGGAGDAAAKGRAAAPYYD
ERYRERRDREPPPADRDRDRDRDHRRDARH