DPGLEAN08054 in OGS1.0

New model in OGS2.0DPOGS208703 
Genomic Positionscaffold661:- 100848-104402
See gene structure
CDS Length1416
Paired RNAseq reads  2459
Single RNAseq reads  6016
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003348 (3e-82)
Best Drosophila hit  Fip1 (8e-28)
Best Human hitpre-mRNA 3'-end-processing factor FIP1 isoform 3 (5e-25)
Best NR hit (blastp)  PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis] (2e-40)
Best NR hit (blastx)  PREDICTED: similar to CG1078 CG1078-PA [Tribolium castaneum] (4e-36)
GeneOntology terms

  
GO:0005515 protein binding
GO:0006398 histone mRNA 3'-end processing
GO:0005634 nucleus
InterPro families  IPR007854 Pre-mRNA polyadenylation factor Fip1
Orthology groupMCL16418

Nucleotide sequence:

ATGGCAGACGCTGCTGTAGAAACCGCTCCAGCAGATGAAAATGATGATAATTGGTTATAT
GGAGAATCAGCGAGTGAACAAATCGAAGCGGAAGCATCAACTACTGATGAAAAAACACAA
GAACAGGAAAATGCTAATAAGGAGAGAATAGATGATGATGGTAAAGTACAAGAAGAGACA
GAGACAAATGAGGGTAACAATGAAGACTCTCACTTCAACGATGAACACTTTGGTGAGGTG
GACAGAGATGATCAGGATCAGACGAATGGAGACGCCGACAGTCAGGACAACGGGGACACC
GACTCTGATGATAGTGATGACGTCAAAGTAACGATTGGAGAAATTAAGTCAGGACCACAA
GCTTATGCCAGTTTGAATATAAAACGTGGTGTTGGACTTGTAGCAGCTGGAACTGAGAAG
CCTCGTCAAGGTCCAGCGGCCGGTAACAAAGTGACTCTTGAAGACTTAGATGGTCCCGGA
AGCATCAACGGCGTGCCAGCGCTAGAGTTTAATATTGACACTATTGAAGATAAACCCTGG
AACAAACCTGGAGCTGACATATCTGATTACTTCAATTACGGTTTCAACGAGGTGACCTGG
AGCGCTTACTGTGAGCGTCAGAGACGGATGCGTGTTAGTGAGGCCGGTGTCGCTCTACAC
GCCGCCCCGCCGCCCCGCGCCGCCCCCACAGACAGACGGCAACAAGGTCCACCAAGACAT
GACGACATGCCGCCAGGGATGCCAAATAATTACCAGTCCAGAGAGAACACTATACAGGTG
ATGACAGCCGAGCGTCGTGAGTACGGCCGCGGTCAGGTGCGCGAGGCTGCACCACCCGCC
GACTACTTCAGCGCTCCGCCCCCCGACCACTACTACCAGCCTCCGCCTCACGCGCCTCAC
GCTCCACACCTCCCTCCACACCAGCACACACCACACTCATACGAAGAACCCTGGGCTCAT
CCAGAACAGACAGGCTGGGCGCCGTCAGATATAAAGGAACTAACGCCCGGACCCATGGGA
CCGCCGATGCCGCTAGGCATGCCCCCCGTACACATGCCCGCGCCCTACCCGACATACCGC
TCGCACGTCACACACACACACGAAAGAGACCGGGACCGGGAACGGGAAAGAGACCGAGAC
CGGGACCGGGACCGGGACAGGACCCGCGACGACCGTGACCGACGGGACCGGGACCGGAGG
GATGAGGAGGAAAGAGACAGGGATCGTGAACGTTCACGCTCCATTAAACCGGAGAGGATA
CGAGAGAAATCGTACCGCCGTGAGCGGTCTCGTTCACGTTCCCGCCGTCACAAGTCCCGG
TCCCGGTCTCCGAGACAACGGGAACGTTCCCGGGACAGGGAGAGGAGTATGAAGCCCAAG
AACAAGGATGCCAAGGAAAAGGACGAAGATAAATAA

Protein sequence:

MADAAVETAPADENDDNWLYGESASEQIEAEASTTDEKTQEQENANKERIDDDGKVQEET
ETNEGNNEDSHFNDEHFGEVDRDDQDQTNGDADSQDNGDTDSDDSDDVKVTIGEIKSGPQ
AYASLNIKRGVGLVAAGTEKPRQGPAAGNKVTLEDLDGPGSINGVPALEFNIDTIEDKPW
NKPGADISDYFNYGFNEVTWSAYCERQRRMRVSEAGVALHAAPPPRAAPTDRRQQGPPRH
DDMPPGMPNNYQSRENTIQVMTAERREYGRGQVREAAPPADYFSAPPPDHYYQPPPHAPH
APHLPPHQHTPHSYEEPWAHPEQTGWAPSDIKELTPGPMGPPMPLGMPPVHMPAPYPTYR
SHVTHTHERDRDRERERDRDRDRDRDRTRDDRDRRDRDRRDEEERDRDRERSRSIKPERI
REKSYRRERSRSRSRRHKSRSRSPRQRERSRDRERSMKPKNKDAKEKDEDK