New model in OGS2.0 | DPOGS208703  |
---|---|
Genomic Position | scaffold661:- 100848-104402 |
See gene structure | |
CDS Length | 1416 |
Paired RNAseq reads   | 2459 |
Single RNAseq reads   | 6016 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003348 (3e-82) |
Best Drosophila hit   | Fip1 (8e-28) |
Best Human hit | pre-mRNA 3'-end-processing factor FIP1 isoform 3 (5e-25) |
Best NR hit (blastp)   | PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis] (2e-40) |
Best NR hit (blastx)   | PREDICTED: similar to CG1078 CG1078-PA [Tribolium castaneum] (4e-36) |
GeneOntology terms    | GO:0005515 protein binding GO:0006398 histone mRNA 3'-end processing GO:0005634 nucleus |
InterPro families   | IPR007854 Pre-mRNA polyadenylation factor Fip1 |
Orthology group | MCL16418 |
Nucleotide sequence:
ATGGCAGACGCTGCTGTAGAAACCGCTCCAGCAGATGAAAATGATGATAATTGGTTATAT
GGAGAATCAGCGAGTGAACAAATCGAAGCGGAAGCATCAACTACTGATGAAAAAACACAA
GAACAGGAAAATGCTAATAAGGAGAGAATAGATGATGATGGTAAAGTACAAGAAGAGACA
GAGACAAATGAGGGTAACAATGAAGACTCTCACTTCAACGATGAACACTTTGGTGAGGTG
GACAGAGATGATCAGGATCAGACGAATGGAGACGCCGACAGTCAGGACAACGGGGACACC
GACTCTGATGATAGTGATGACGTCAAAGTAACGATTGGAGAAATTAAGTCAGGACCACAA
GCTTATGCCAGTTTGAATATAAAACGTGGTGTTGGACTTGTAGCAGCTGGAACTGAGAAG
CCTCGTCAAGGTCCAGCGGCCGGTAACAAAGTGACTCTTGAAGACTTAGATGGTCCCGGA
AGCATCAACGGCGTGCCAGCGCTAGAGTTTAATATTGACACTATTGAAGATAAACCCTGG
AACAAACCTGGAGCTGACATATCTGATTACTTCAATTACGGTTTCAACGAGGTGACCTGG
AGCGCTTACTGTGAGCGTCAGAGACGGATGCGTGTTAGTGAGGCCGGTGTCGCTCTACAC
GCCGCCCCGCCGCCCCGCGCCGCCCCCACAGACAGACGGCAACAAGGTCCACCAAGACAT
GACGACATGCCGCCAGGGATGCCAAATAATTACCAGTCCAGAGAGAACACTATACAGGTG
ATGACAGCCGAGCGTCGTGAGTACGGCCGCGGTCAGGTGCGCGAGGCTGCACCACCCGCC
GACTACTTCAGCGCTCCGCCCCCCGACCACTACTACCAGCCTCCGCCTCACGCGCCTCAC
GCTCCACACCTCCCTCCACACCAGCACACACCACACTCATACGAAGAACCCTGGGCTCAT
CCAGAACAGACAGGCTGGGCGCCGTCAGATATAAAGGAACTAACGCCCGGACCCATGGGA
CCGCCGATGCCGCTAGGCATGCCCCCCGTACACATGCCCGCGCCCTACCCGACATACCGC
TCGCACGTCACACACACACACGAAAGAGACCGGGACCGGGAACGGGAAAGAGACCGAGAC
CGGGACCGGGACCGGGACAGGACCCGCGACGACCGTGACCGACGGGACCGGGACCGGAGG
GATGAGGAGGAAAGAGACAGGGATCGTGAACGTTCACGCTCCATTAAACCGGAGAGGATA
CGAGAGAAATCGTACCGCCGTGAGCGGTCTCGTTCACGTTCCCGCCGTCACAAGTCCCGG
TCCCGGTCTCCGAGACAACGGGAACGTTCCCGGGACAGGGAGAGGAGTATGAAGCCCAAG
AACAAGGATGCCAAGGAAAAGGACGAAGATAAATAA
Protein sequence:
MADAAVETAPADENDDNWLYGESASEQIEAEASTTDEKTQEQENANKERIDDDGKVQEET
ETNEGNNEDSHFNDEHFGEVDRDDQDQTNGDADSQDNGDTDSDDSDDVKVTIGEIKSGPQ
AYASLNIKRGVGLVAAGTEKPRQGPAAGNKVTLEDLDGPGSINGVPALEFNIDTIEDKPW
NKPGADISDYFNYGFNEVTWSAYCERQRRMRVSEAGVALHAAPPPRAAPTDRRQQGPPRH
DDMPPGMPNNYQSRENTIQVMTAERREYGRGQVREAAPPADYFSAPPPDHYYQPPPHAPH
APHLPPHQHTPHSYEEPWAHPEQTGWAPSDIKELTPGPMGPPMPLGMPPVHMPAPYPTYR
SHVTHTHERDRDRERERDRDRDRDRDRTRDDRDRRDRDRRDEEERDRDRERSRSIKPERI
REKSYRRERSRSRSRRHKSRSRSPRQRERSRDRERSMKPKNKDAKEKDEDK