DPGLEAN13008 in OGS1.0

New model in OGS2.0DPOGS204962 
Genomic Positionscaffold1156:+ 15811-25341
See gene structure
CDS Length2283
Paired RNAseq reads  610
Single RNAseq reads  2048
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011133 (0.0)
Best Drosophila hit  suppressor of forked, isoform E (0.0)
Best Human hitcleavage stimulation factor subunit 3 isoform 1 (0.0)
Best NR hit (blastp)  PREDICTED: similar to Protein suppressor of forked [Apis mellifera] (0.0)
Best NR hit (blastx)  PREDICTED: similar to Protein suppressor of forked [Apis mellifera] (0.0)
GeneOntology terms



  
GO:0006379 mRNA cleavage
GO:0005848 mRNA cleavage stimulating factor complex
GO:0005847 mRNA cleavage and polyadenylation specificity factor complex
GO:0005634 nucleus
GO:0016070 RNA metabolic process
InterPro families
  
IPR003107 RNA-processing protein, HAT helix
IPR008847 Suppressor of forked
Orthology groupMCL14263

Nucleotide sequence:

ATGAATGATGAAAATGCAGAGATTGACTGGGGTAATGAGAGGTTAAGCCGAGCCCAACGC
GCGGTTGAAGCGAACACGTATGATGTTGATTCGTGGTCTCTCTTGATACGCGAAGCCCAA
ACCCGGCCTATAAATGAGGTCAGAACGATGTATGAGAAGCTCATTACAGCTTTCCCAACA
ACAGGGAGGTATTGGAAGATTTATATCGAACAGGAGATGAAAGCGAGAAATTTTGAGAAG
GTCGAAAAGTTGTTTCAGAGGTGTCTCATGAAGATTCTAAACATTGAACTGTGGAGGCTA
TACCTAAACTACGTTAAGGAGACCAAGTGCATGTTGCCGACATACAAAGAGAAAATGGCG
CAGGCCTACGACTTCGCGTTGGACAAAATAGGTCTGGACATACACGCGTATCCTATATGG
AACGATTACGTAACATTCTTGAAGGCTGTCGAGGCTGTTGGCTCTTACGCTGAGAACCAG
AAGATATCAGCCGTTAGGAAGGTATACCAGAGAGCGGTCATTACCCCGATAATAGGTATT
GAGACGCTGTGGAAAGACTACATCGCTTTCGAACAAGGAATCAACACTATCATAGCTGAG
CGTATGGCTATGGAGCGATCACGGGAATATATGAACGCGAGGAGAGTAGCCAAAGAATTG
GAGACGGTCACCAGGGGCTTGAATAGGAACATGCCGGCCACCCCGCCCACCGCAGACAGG
GAGGAGATGAAGCAGGTGGAGTTGTGGAAGAAGTACATATCCTGGGAGCGCTCCAACCCC
CTCAGGTCGGAGGATACCGCTCTCGTGGCCAGGCGGGTGATGTTCGCTATAGAACAGTGT
CTGCTATGTCTGGCCCACCACCCGGATGTATGGCATCAGGCGGCGCAGTTCCTCGACCAT
TCATCTAAATTACTGCAAGAAAAGGGGGATTCGACAGCCGCCCGTCTGTTCTCCGAGGAG
GCCGGTGCAGTCTACGAGAGAGCCACATCCGGTCCGCTCAAACATTCCACCTTACTGCAC
TTCGCTCACGCCGACTATGAGGAGAGTCGGCTGCATTACAACAAGGTACACCAGGTATAC
ACTCGCTATCTGGATATGGCGGACATCGAACCCACGCTGGCCTACGTTCAATATATGAAG
TTTGCGAGACGAGCTGAAGGTATCAAGTCGGCTAGGACGGTGTTCAAACGAGCCAGGGAA
GACCCGAGATCCCGTTACCACGTGTTCGTGGCGGCCGCCCTCATGGAGTACTACTGCTCC
AAAGACAAGAACATTGCTTTCAGGATATTTGAGTTGGGCCTCAAGAAGTTCTCCCACATT
CCGGAGTATGTGTTGTGCTACATCGACTACCTGTCACATTTGAACGAGGATAACAACACC
CGCGTGTTGTTCGAGCGCGTCCTGTCATCTGGATGTCTGAAGCCGGAGAGTTCTGTTGAT
ATCTGGAATAGATTCCTGGAATTTGAATCCAATATTGGGGACCTCGTCAGTATAGTGAAG
GTTGAGAAACGGAGGCAGGCGGTTCTGGAAAAGATCAAAGAGTTTGAGGGTAAGGAGACG
GCTCAGCTCGTGGACAGATACAAGTTCCTGGATCTCTACCCCTGTACTATAGCTGAACTC
AAGTCCATAGGATACACAGAGGTAGCATCAATGTCGAACAAGTCCTGGGCTCTCGGAGGA
CCGCTTGCTGGCATCTCACCAGAATTGGCCGCTGTGATACTAGGACAGAAAGACAATGAT
CCGAACAAGGACATAGTTCGTCCAGACACAAGTCAAATGATACCCTACAAGCCAAAATCG
AACCCACTCCCCGGAGAGCATCCTATACCAGGTACGTACAAAGACAATGATCCGAACAAG
GACATAGTTCGTCCAGACACAAGTCAAATGATACCCTACAAGCCAAAGTCCAACCCACTC
CCCGGAGAGCATCCTATACCGGGCGGTTCCTTCCCGCTGCCTCCCGCGGCCGCCGCCCTG
TGCACGGCGATGCCTCCCCCCTCCAGCTACAGAGGACCCTTCGTGGCGGTGGACATGCTG
ATAGCACTCTTCAACAGGATCACACTACCCGACAAACCCGCTGCCCCGACCAATGAGAAC
GGCTGTGACACCAAACTGTTTGAGCTGGCTCGCTCCGTCCACTGGATCATGGACGATGAC
ACCACCAAGAATAATACGGCTCGCAGAAGGAAGCTGGGTTCGGACTCGGATGACGACGAG
CTGGGCGCGCCGCCGCCTCTCAACGACGTCTACAGACAGAGACAACAGAAGAGAGTCAAG
TGA

Protein sequence:

MNDENAEIDWGNERLSRAQRAVEANTYDVDSWSLLIREAQTRPINEVRTMYEKLITAFPT
TGRYWKIYIEQEMKARNFEKVEKLFQRCLMKILNIELWRLYLNYVKETKCMLPTYKEKMA
QAYDFALDKIGLDIHAYPIWNDYVTFLKAVEAVGSYAENQKISAVRKVYQRAVITPIIGI
ETLWKDYIAFEQGINTIIAERMAMERSREYMNARRVAKELETVTRGLNRNMPATPPTADR
EEMKQVELWKKYISWERSNPLRSEDTALVARRVMFAIEQCLLCLAHHPDVWHQAAQFLDH
SSKLLQEKGDSTAARLFSEEAGAVYERATSGPLKHSTLLHFAHADYEESRLHYNKVHQVY
TRYLDMADIEPTLAYVQYMKFARRAEGIKSARTVFKRAREDPRSRYHVFVAAALMEYYCS
KDKNIAFRIFELGLKKFSHIPEYVLCYIDYLSHLNEDNNTRVLFERVLSSGCLKPESSVD
IWNRFLEFESNIGDLVSIVKVEKRRQAVLEKIKEFEGKETAQLVDRYKFLDLYPCTIAEL
KSIGYTEVASMSNKSWALGGPLAGISPELAAVILGQKDNDPNKDIVRPDTSQMIPYKPKS
NPLPGEHPIPGTYKDNDPNKDIVRPDTSQMIPYKPKSNPLPGEHPIPGGSFPLPPAAAAL
CTAMPPPSSYRGPFVAVDMLIALFNRITLPDKPAAPTNENGCDTKLFELARSVHWIMDDD
TTKNNTARRRKLGSDSDDDELGAPPPLNDVYRQRQQKRVK