DPGLEAN02050 in OGS1.0

New model in OGS2.0DPOGS214221 
Genomic Positionscaffold323:- 96592-98959
See gene structure
CDS Length1179
Paired RNAseq reads  150
Single RNAseq reads  463
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006197 (3e-37)
Best Drosophila hit  CG3294, isoform A (7e-37)
Best Human hitU2 small nuclear ribonucleoprotein auxiliary factor 35 kDa subunit-related protein 2 (2e-43)
Best NR hit (blastp)  PREDICTED: hypothetical protein [Nasonia vitripennis] (5e-68)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC010403 [Tribolium castaneum] (3e-60)
GeneOntology terms





  
GO:0000166 nucleotide binding
GO:0003723 RNA binding
GO:0005634 nucleus
GO:0005689 U12-type spliceosomal complex
GO:0005730 nucleolus
GO:0008270 zinc ion binding
GO:0046872 metal ion binding
InterPro families



  
IPR012677 Nucleotide-binding, alpha-beta plait
IPR009145 U2 auxiliary factor small subunit
IPR000504 RNA recognition motif domain
IPR000571 Zinc finger, CCCH-type
IPR003954 RNA recognition motif domain, eukaryote
Orthology groupMCL11452

Nucleotide sequence:

ATGGGAAAGCATAAAGAATGGAGAAGAATTGTGAAACGGGAAAGAAGAAGGAGGATACGC
AAACAGTTGGCTAAACAAAGAGATTGCTTACCTAATGGAAATGATAATAAAGATTGGATA
AAAGCACAAGAAGAATTAGAAACATATATTTTTGAACAAGTTGAAAATTCTAACAAGGTT
GAAAATGAAAAGTGGTTGGCCGCCGAAGCAGTAGCTTTAAAACACTGGAAGGAACTTCAA
ATCAAAAAAGAAATGTTGCTTAAGAAGCAGCTAGAACTACAAGCTAAGCTTCATCAGGAA
TGGGAGATGGAAAAAGAGAGAAAAGAAAAGGAGGCTCAACGTCTTAAAGAATTAGAAGAA
GAAAATGTCAAGAGACAGGAAGAGTTCATGAAAAACTTAGAAGAATTCCTAAATGGTGAT
TGTAAGGATCCTCCACAGGAGCTGCTTACATTGTATGAAAGTCGGCCAAATTGTGACCCA
TGTCCATTTTATGCTAAAACTGCATGTTGCCGGTTTGGAGATGAATGTTCTAGAAACCAC
AAGTATCCAGGTATCAGCAAGATACTCTTAGCTCCAAATCTTTTTGGACATTTTGGCTTG
GAGAATTCAAATTTTAATGAATATGACACAGATATTATGTTGGAATATGAAGATAGTGAT
ACTTACAAGGATTTCAAAGAATTTTTTTTTGACATATTGCCAGAATTTCAAAAATTTGGC
CAGGTTGTTGAGATTAAAGTTTGCAATAACTTTGAAAAGCATCTCCGAGGCAACACATAT
ATAGAGTACTCCGACGTTCGTAGTGCAGTTTCAGCTTACAGAGCCCTTCATACAAGATGG
TATGGCGGAAAACAACTCTCATTGCAATTTTGTAGACTGTTATCATGGAGTAGTGCAATA
TGTGGTTTACAAGTTACTGGACGATGTCCGAAAGGCAGGGCATGTAATTTTCTTCATGTT
TTTAAAAATCCTATAGATTTACACATAGCCTATGAAAAACGTTATTCAAAAAGGCAACAA
CACACGTCGTCACGTTCATGGAGGTGGTCCGAATCGCCTGAAAGAGAAAGTCCAACTTCA
AGAAGTAAAAGAAAAGACGATGGTCATTCCAAAAGTAGAGAAAGAAGACACTATCAACAT
CGATCACCTAGATCAAGATCACATCGATATAGGGATTAA

Protein sequence:

MGKHKEWRRIVKRERRRRIRKQLAKQRDCLPNGNDNKDWIKAQEELETYIFEQVENSNKV
ENEKWLAAEAVALKHWKELQIKKEMLLKKQLELQAKLHQEWEMEKERKEKEAQRLKELEE
ENVKRQEEFMKNLEEFLNGDCKDPPQELLTLYESRPNCDPCPFYAKTACCRFGDECSRNH
KYPGISKILLAPNLFGHFGLENSNFNEYDTDIMLEYEDSDTYKDFKEFFFDILPEFQKFG
QVVEIKVCNNFEKHLRGNTYIEYSDVRSAVSAYRALHTRWYGGKQLSLQFCRLLSWSSAI
CGLQVTGRCPKGRACNFLHVFKNPIDLHIAYEKRYSKRQQHTSSRSWRWSESPERESPTS
RSKRKDDGHSKSRERRHYQHRSPRSRSHRYRD