DPGLEAN14186 in OGS1.0

New model in OGS2.0DPOGS212853 
Genomic Positionscaffold96:+ 72785-99556
See gene structure
CDS Length1641
Paired RNAseq reads  8781
Single RNAseq reads  20535
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA000798 (1e-06)
Best Drosophila hit  pasilla, isoform G (8e-99)
Best Human hitRNA-binding protein Nova-1 isoform 2 (9e-64)
Best NR hit (blastp)  PREDICTED: similar to pasilla CG16765-PK [Tribolium castaneum] (3e-167)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC008998 [Tribolium castaneum] (1e-104)
GeneOntology terms

  
GO:0003729 mRNA binding
GO:0005634 nucleus
GO:0000398 nuclear mRNA splicing, via spliceosome
InterPro families

  
IPR004088 K Homology, type 1
IPR004087 K Homology
IPR018111 K Homology, type 1, subgroup
Orthology groupMCL11949

Nucleotide sequence:

ATGGCCGCTGACACAGGAATGGATACTTGCCCTAGTCCGGAAATCACGGACTCTAGAAAA
CGACCCCTGGATGGCGATTCAGAAAATGGGGATGTCAAGAGGTCACATTTTAGCTCGGTT
CAAGACTTGGTGACGGCCTTGCCACTGGCTAACGGCCATGGCAATATAACGTCTCATTTC
GCGGTGGAGCCGACGTACCACTTCAAGGTGCTGGTGCCGTCGATGGTGGCCGGCGCCATC
ATCGGCAAGGGTGGTGAGACCATAGCGCAACTGCAGAAGGACACGGGGGCCAGGGTCAAG
ATGTCCAAATCGCATGATTTCTATCCAGGTACTACAGAACGAGCGTGTCTCATAACGGGG
TCGGTGGAAGGCATCATGGTGGTGCTAGACTTCATCATGGAAAAGATCAAAGAGAAACCG
GAGCTGGTGAAACCCTTCCCGGAGGGCGTGGATGCCAAGATGCCGCAGGATAGAGACAAG
CAGGTGAAGATCTTGGTGCCGAACTCCACAGCTGGTATGATAATAGGAAAGGGGGGCAAC
TACATTAAACAAATCAAGGAACAGAGCGGCAGCTACGTACAGATATCTCAGAAGGCGAAG
GAACTGTCTCTCCAGGAGCGCTGCATCACTGTTGTCGGTGAGAAGGAGAACAACAAGAAG
GCCTGCCTGATGATCCTTCAGAAGGTGGTGGACGACCCTCAGTCCGGGTCCTGTCCCAAC
GTGTCGTACGCGGACGTGGCCGGGCCGGTCGCCAACTACAACCCCACCGGCTCGCCGTAC
GCCGTGCCCACCACTGAGAGTCACGCTCTGGTGGGTGGCGGGTCTGTGGGCGGCGTGGGC
GGTGCTGGCGCGCTAGGCGGCGTGGGCGGTGTGGGCGGCGTGCTCGTGAACGGTTCCGGT
CTGGGCTCGCTGTCCCTGTCTCTGTCGCTGGCGCCGCCCGGCACCCCGCCGCCGTCCCCG
CTCACACAGCACACGCTCGACCACATTAAGGCGGCGCTGCGTCAGGCGGGCTACTCGGAG
GCCGGGCTGAGCGAGATCGGCGCGGCGCTGGCTCTGCTGGTGAAGCACGGCGTGCTGGGC
CTGGCGCTGCCGGCCGCCCTGCCCGCGCCGCTGTCCGCCGCCTACTTCCCCCTGCAGCCC
AGCGACTCGCCCGCTGTCTTCGGACCGCTGGCGCAGGTCCAGCTCGGCGGCGCGCGCGGC
GGCTCGTTAGAGCGTTTCGCGGAGGTTGCATTCGAGGCGCTTCGCCCCCCAGCCGTGGCT
CCCATCTCGCTGTCGGGCGGCGTGGGGGGAGTGGGGGGCGTACCCGGCTTCCCCTCCGCC
AGCCTGCTGCCGCTCTCCAAGAGCCCCACGCCCGCCGACGCGGGCGCCAAGGACTCTAAG
AACGTCGAGATCCCGGAGGTCATCGTCGGGGCCATCCTGGGCCCCGGCGGCCGCAGCCTG
GTGGAGATCCAGCAGATGTCGGGTGCCAACATCCAGATTTCCAAAAAGGGCACGTTCGCC
CCGGGCACCCGCAACCGCATCGTGACCATCTCGGGCACCGCCACCGCCATCAGCAATGCG
CATTACCTCATCGAACAGAAGATCCAGGAGGAGGAGCTCAAGCGCACGCGCCACAACGCG
CTCTCCGGCCTCATGCAGTAG

Protein sequence:

MAADTGMDTCPSPEITDSRKRPLDGDSENGDVKRSHFSSVQDLVTALPLANGHGNITSHF
AVEPTYHFKVLVPSMVAGAIIGKGGETIAQLQKDTGARVKMSKSHDFYPGTTERACLITG
SVEGIMVVLDFIMEKIKEKPELVKPFPEGVDAKMPQDRDKQVKILVPNSTAGMIIGKGGN
YIKQIKEQSGSYVQISQKAKELSLQERCITVVGEKENNKKACLMILQKVVDDPQSGSCPN
VSYADVAGPVANYNPTGSPYAVPTTESHALVGGGSVGGVGGAGALGGVGGVGGVLVNGSG
LGSLSLSLSLAPPGTPPPSPLTQHTLDHIKAALRQAGYSEAGLSEIGAALALLVKHGVLG
LALPAALPAPLSAAYFPLQPSDSPAVFGPLAQVQLGGARGGSLERFAEVAFEALRPPAVA
PISLSGGVGGVGGVPGFPSASLLPLSKSPTPADAGAKDSKNVEIPEVIVGAILGPGGRSL
VEIQQMSGANIQISKKGTFAPGTRNRIVTISGTATAISNAHYLIEQKIQEEELKRTRHNA
LSGLMQ