DPGLEAN11947 in OGS1.0

New model in OGS2.0  DPOGS204562
Genomic Position  scaffold11038:- 496-3813
CDS Length  1167
Paired RNAseq reads  279
Single RNAseq reads  872
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA009595 (1e-145)
Best Drosophila hit  CstF-50, isoform A (5e-111)
Best Human hit  cleavage stimulation factor subunit 1 (2e-88)
Best NR hit (blastp)  PREDICTED: similar to CstF-50 CG2261-PA, isoform A isoform 1 [Apis mellifera] (1e-127)
Best NR hit (blastx)  PREDICTED: similar to CstF-50 CG2261-PA, isoform A isoform 1 [Apis mellifera] (6e-125)
GeneOntology terms
GO:0005848 mRNA cleavage stimulating factor complex
GO:0006379 mRNA cleavage
InterPro families
IPR019775 WD40 repeat, conserved site
IPR020472 G-protein beta WD-40 repeat
IPR001680 WD40 repeat
IPR019782 WD40 repeat 2
IPR017986 WD40-repeat-containing domain
IPR011046 WD40 repeat-like-containing domain
IPR019781 WD40 repeat, subgroup
IPR015943 WD40/YVTN repeat-like-containing domain
Orthology group  MCL13407

Nucleotide sequence:

TACAGTCAGTTACATTACGATGGCTTCCAACCTATAGCCGCGACTCTATCAGCCGCTGTA
CACGCAGACCCACCGTGCCCTCCGAGCGACAGATTATTAAATCTAATGATGGTCGGTCTA
CAACATGAACCGGACCGGAAGGACAGGCTGGCGGCATCCAGCGGGGCGGAACATCTGCTG
GGAACTACCGGCTTTGATCTCGAGTTTGAGATGGACGCGTCCTCGCTCGCCCCTGAGCCG
GCCACGTACGAGACGGCGTATGTGACGTCACACAAGATGTCTTGTAGGGCCGGAGCGTTC
AGCGCCTGTGGTCAGCTGGTGGCTACCGGCAGTGTGGATGCTAGCATTAAGATTCTGGAC
GTGGAGCGGATGTTGGCTAAATCAGCTCCCGAGGAAGTTGATCCCGGGAGAGAGCAACAG
GGACATCCGGTGATACGAACATTATACGATCACACCGACGAGATAACCGCCCTGGATTTC
TTCATATTTACTTACGTCCACAGTCAGTGTTTAAAGTTTCCCACAAAAGTTGAAACGCAT
GATATTCGTGTGATCGGATGCTACGCGCTTTTATTTTTCACATCTACCTCATCGGCTACG
GTTCCTCTACAGCGGACGCGATCACGAAGGTTAGACGAAAAACACTTTAGGGTTGACATA
GTTAGAAACGGTCGGAAACCGTTTTCACCGAACGGTAAGTATTACGCGTCGGGCAGCGCT
GATGGTAGCGTCAAGCTCTGGGACACCGTCTCCAACAGATGTTTTAACACGTTCACCAAC
GCTCACGAGGGTGCAGAAGTGTGTTCAGTGGCATTCACCAGGAACAGCAAGTATCTCCTC
ACATCTGGTTTGGATTCGTCTATAAAGTTGTGGGAGTTAGCGAGCAGCCGTTGTCTGATA
CAATATACGGGGGCCGGTACTACAGGTAAGCAGGAACACCACGCCCAGGCGATATTCAAT
CACACTGAGGACTACGTGATGTTCCCGGACGAGGCGACCACCTCGCTCTGCACCTGGCAC
TCCAGGTCAGCCAGCAGGTGCCAGCTGATGTCTCTGGGGCATAATGGAGCTGTTAGGTAC
ATAGTCCATTCTGGCACGGCTCCAGCGTTCCTCACCTGTAGCGATGATTACAGAGCCAGG
TTTTGGTACAGACGGAACACGCATTAA

Protein sequence:

YSQLHYDGFQPIAATLSAAVHADPPCPPSDRLLNLMMVGLQHEPDRKDRLAASSGAEHLL
GTTGFDLEFEMDASSLAPEPATYETAYVTSHKMSCRAGAFSACGQLVATGSVDASIKILD
VERMLAKSAPEEVDPGREQQGHPVIRTLYDHTDEITALDFFIFTYVHSQCLKFPTKVETH
DIRVIGCYALLFFTSTSSATVPLQRTRSRRLDEKHFRVDIVRNGRKPFSPNGKYYASGSA
DGSVKLWDTVSNRCFNTFTNAHEGAEVCSVAFTRNSKYLLTSGLDSSIKLWELASSRCLI
QYTGAGTTGKQEHHAQAIFNHTEDYVMFPDEATTSLCTWHSRSASRCQLMSLGHNGAVRY
IVHSGTAPAFLTCSDDYRARFWYRRNTH
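
The nucleotide and protein listings above can be cross-checked against each other and against the stated CDS length. The following is a minimal sketch in Python. It assumes the standard genetic code and that the CDS is read in frame from its first base. The names `translate`, `first_60_nt`, and `last_27_nt` are illustrative only and are not part of the record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order. Using this table is an assumption; the record does
# not state which table was used.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nt: str) -> str:
    """Translate a nucleotide string in frame 0; '*' marks a stop codon."""
    nt = "".join(nt.split()).upper()  # drop line breaks from the listing
    if len(nt) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Codons containing characters outside TCAG are reported as 'X'.
    return "".join(
        CODON_TABLE.get(nt[i:i + 3], "X") for i in range(0, len(nt), 3)
    )


# First and last lines of the nucleotide listing above.
first_60_nt = "TACAGTCAGTTACATTACGATGGCTTCCAACCTATAGCCGCGACTCTATCAGCCGCTGTA"
last_27_nt = "TTTTGGTACAGACGGAACACGCATTAA"

print(translate(first_60_nt))  # YSQLHYDGFQPIAATLSAAV
print(translate(last_27_nt))   # FWYRRNTH*
print(1167 // 3)               # 389 codons, including the stop
```

Both translated fragments agree with the start and end of the protein listing, and 389 codons minus the terminal stop gives the 388 residues shown, assuming the listings are complete. Passing the full nucleotide sequence to `translate` would extend the same check to every residue.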