DPGLEAN11059 in OGS1.0

New model in OGS2.0DPOGS214717 
Genomic Positionscaffold559:- 6588-9803
See gene structure
CDS Length1167
Paired RNAseq reads  3126
Single RNAseq reads  9047
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004892 (5e-167)
Best Drosophila hit  Rox8, isoform C (5e-119)
Best Human hitnucleolysin TIA-1 isoform p40 isoform 1 (4e-85)
Best NR hit (blastp)  TIA-1-related RNA binding protein [Spodoptera litura] (0.0)
Best NR hit (blastx)  TIA-1-related RNA binding protein [Spodoptera litura] (2e-174)
GeneOntology terms






  
GO:0008143 poly(A) RNA binding
GO:0003729 mRNA binding
GO:0005685 U1 snRNP
GO:0000398 nuclear mRNA splicing, via spliceosome
GO:0000166 nucleotide binding
GO:0003676 nucleic acid binding
GO:0000381 regulation of alternative nuclear mRNA splicing, via spliceosome
GO:0005634 nucleus
InterPro families

  
IPR012677 Nucleotide-binding, alpha-beta plait
IPR000504 RNA recognition motif domain
IPR003954 RNA recognition motif domain, eukaryote
Orthology groupMCL12700

Nucleotide sequence:

ATGGGCGACGAAAGCCACCCGAAAACTCTCTACGTCGGTAATTTAGACGCGAGTGTGACA
GAAGAATTTCTATGCGCGTTATTCGGCCAAATAGGTGAAGTTAAAGGCTGTAAAATAATA
CGTGAACCAGGTAATGATCCGTATGCTTTCCTTGAGTTTACAAATCACGCGTCGGCGGCT
ACGGCCCTGGCCGCCATGAACAGGCGAGTATTTCTCGAAAAGGAAATGAAGGTTAACTGG
GCTACGAGTCCTGGTAATCAGCCTAAAACGGATACAAGCAATCACCATCATATATTCGTT
GGTGATCTATCACCAGAAATAGAAACCCATATTCTACGTGAAGCTTTTGCTCCGTTTGGC
GAGATTTCGAACTGTCGAATAGTACGTGACCCACAAACGCTCAAATCCAAGGGTTATGCC
TTCGTTTCGTTTGTTAAAAAAGCAGATGCTGAAGCGGCAATACAGGCTATGAATGGACAA
TGGTTAGGATCGCGATCCATACGTACTAATTGGTCAACGCGTAAGCCGCCAACCAAAGGT
CCAAACGAAGGAGCACCGAGTAGCAAACGTGTAAAACAACCAACATTTGATGAAGTTTAC
AACCAGAGCTCACCGACGAATACTACAGTTTACTGCGGCGGTTTCACTAGTAATGTAATT
ACAGAAGAATTAATGCAAAGCACATTTTCACAATTCGGCCAGATTCAGGACGTAAGAGTG
TTCAGGGATAAGGGATATGCTTTTATTAGGTTCACAACTAAAGAAGCCGCGGCTCATGCT
ATAGAAGCGACACATAATACCGAAATTAGTGGACACACAGTGAAGTGTTTCTGGGGTAAA
GAGAACGGAGGAACAGAGAATCAGAGCACGACTAATCCACCCGCTGCACCTGCATCAATG
GGTGCACAAACACAATATCCCTATGCATACCAACAAGGGATGGGATACTGGTACACTCAG
GGTTATCCTGCGATCCAGGGCTACATGGCGCCAGGGTATTATCAGCAGTATGCAGCCGCC
TACAGTAACCCTCAAGCAGCTGCTGCGGCGGGCTACCGCATGAGTATGCCGGGTGGCGCA
GTGGGTGCCGTGGGTGCGGGCGGTTCGTGGGGCGGCGCGCCTCAGCCGCTGGTGTATTCG
GTGCCTTCGCAGTACCCCTCGCAGTAG

Protein sequence:

MGDESHPKTLYVGNLDASVTEEFLCALFGQIGEVKGCKIIREPGNDPYAFLEFTNHASAA
TALAAMNRRVFLEKEMKVNWATSPGNQPKTDTSNHHHIFVGDLSPEIETHILREAFAPFG
EISNCRIVRDPQTLKSKGYAFVSFVKKADAEAAIQAMNGQWLGSRSIRTNWSTRKPPTKG
PNEGAPSSKRVKQPTFDEVYNQSSPTNTTVYCGGFTSNVITEELMQSTFSQFGQIQDVRV
FRDKGYAFIRFTTKEAAAHAIEATHNTEISGHTVKCFWGKENGGTENQSTTNPPAAPASM
GAQTQYPYAYQQGMGYWYTQGYPAIQGYMAPGYYQQYAAAYSNPQAAAAAGYRMSMPGGA
VGAVGAGGSWGGAPQPLVYSVPSQYPSQ