New model in OGS2.0 | DPOGS204562  |
---|---|
Genomic Position | scaffold11038:- 496-3813 |
See gene structure | |
CDS Length | 1167 |
Paired RNAseq reads   | 279 |
Single RNAseq reads   | 872 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA009595 (1e-145) |
Best Drosophila hit   | CstF-50, isoform A (5e-111) |
Best Human hit | cleavage stimulation factor subunit 1 (2e-88) |
Best NR hit (blastp)   | PREDICTED: similar to CstF-50 CG2261-PA, isoform A isoform 1 [Apis mellifera] (1e-127) |
Best NR hit (blastx)   | PREDICTED: similar to CstF-50 CG2261-PA, isoform A isoform 1 [Apis mellifera] (6e-125) |
GeneOntology terms    | GO:0005848 mRNA cleavage stimulating factor complex GO:0006379 mRNA cleavage |
InterPro families    | IPR019775 WD40 repeat, conserved site IPR020472 G-protein beta WD-40 repeat IPR001680 WD40 repeat IPR019782 WD40 repeat 2 IPR017986 WD40-repeat-containing domain IPR011046 WD40 repeat-like-containing domain IPR019781 WD40 repeat, subgroup IPR015943 WD40/YVTN repeat-like-containing domain |
Orthology group | MCL13407 |
Nucleotide sequence:
TACAGTCAGTTACATTACGATGGCTTCCAACCTATAGCCGCGACTCTATCAGCCGCTGTA
CACGCAGACCCACCGTGCCCTCCGAGCGACAGATTATTAAATCTAATGATGGTCGGTCTA
CAACATGAACCGGACCGGAAGGACAGGCTGGCGGCATCCAGCGGGGCGGAACATCTGCTG
GGAACTACCGGCTTTGATCTCGAGTTTGAGATGGACGCGTCCTCGCTCGCCCCTGAGCCG
GCCACGTACGAGACGGCGTATGTGACGTCACACAAGATGTCTTGTAGGGCCGGAGCGTTC
AGCGCCTGTGGTCAGCTGGTGGCTACCGGCAGTGTGGATGCTAGCATTAAGATTCTGGAC
GTGGAGCGGATGTTGGCTAAATCAGCTCCCGAGGAAGTTGATCCCGGGAGAGAGCAACAG
GGACATCCGGTGATACGAACATTATACGATCACACCGACGAGATAACCGCCCTGGATTTC
TTCATATTTACTTACGTCCACAGTCAGTGTTTAAAGTTTCCCACAAAAGTTGAAACGCAT
GATATTCGTGTGATCGGATGCTACGCGCTTTTATTTTTCACATCTACCTCATCGGCTACG
GTTCCTCTACAGCGGACGCGATCACGAAGGTTAGACGAAAAACACTTTAGGGTTGACATA
GTTAGAAACGGTCGGAAACCGTTTTCACCGAACGGTAAGTATTACGCGTCGGGCAGCGCT
GATGGTAGCGTCAAGCTCTGGGACACCGTCTCCAACAGATGTTTTAACACGTTCACCAAC
GCTCACGAGGGTGCAGAAGTGTGTTCAGTGGCATTCACCAGGAACAGCAAGTATCTCCTC
ACATCTGGTTTGGATTCGTCTATAAAGTTGTGGGAGTTAGCGAGCAGCCGTTGTCTGATA
CAATATACGGGGGCCGGTACTACAGGTAAGCAGGAACACCACGCCCAGGCGATATTCAAT
CACACTGAGGACTACGTGATGTTCCCGGACGAGGCGACCACCTCGCTCTGCACCTGGCAC
TCCAGGTCAGCCAGCAGGTGCCAGCTGATGTCTCTGGGGCATAATGGAGCTGTTAGGTAC
ATAGTCCATTCTGGCACGGCTCCAGCGTTCCTCACCTGTAGCGATGATTACAGAGCCAGG
TTTTGGTACAGACGGAACACGCATTAA
Protein sequence:
YSQLHYDGFQPIAATLSAAVHADPPCPPSDRLLNLMMVGLQHEPDRKDRLAASSGAEHLL
GTTGFDLEFEMDASSLAPEPATYETAYVTSHKMSCRAGAFSACGQLVATGSVDASIKILD
VERMLAKSAPEEVDPGREQQGHPVIRTLYDHTDEITALDFFIFTYVHSQCLKFPTKVETH
DIRVIGCYALLFFTSTSSATVPLQRTRSRRLDEKHFRVDIVRNGRKPFSPNGKYYASGSA
DGSVKLWDTVSNRCFNTFTNAHEGAEVCSVAFTRNSKYLLTSGLDSSIKLWELASSRCLI
QYTGAGTTGKQEHHAQAIFNHTEDYVMFPDEATTSLCTWHSRSASRCQLMSLGHNGAVRY
IVHSGTAPAFLTCSDDYRARFWYRRNTH