DPGLEAN11769 in OGS1.0

New model in OGS2.0DPOGS215581 
Genomic Positionscaffold392:- 21708-29833
See gene structure
CDS Length3921
Paired RNAseq reads  7702
Single RNAseq reads  21210
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA010991 (2e-48)
Best Drosophila hit  CG4266, isoform B (1e-59)
Best Human hitRNA-binding protein 16 (1e-54)
Best NR hit (blastp)  PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum] (9e-122)
Best NR hit (blastx)  PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis] (5e-66)
GeneOntology terms


  
GO:0000398 nuclear mRNA splicing, via spliceosome
GO:0003729 mRNA binding
GO:0000166 nucleotide binding
GO:0003676 nucleic acid binding
InterPro families



  
IPR006903 Domain of unknown function DUF618
IPR000504 RNA recognition motif domain
IPR006569 RNA polymerase II, large subunit, CTD
IPR008942 ENTH/VHS
IPR012677 Nucleotide-binding, alpha-beta plait
Orthology groupMCL16992

Nucleotide sequence:

ATGGCGGAGGTCAAAGCGTTCAACGCGGAGTTATCCGGGTTGTACGAGAATCGGCCTCCG
ATTTCCAAGGCAAAAATGAGTGCTATCACTAGAGGTGCTATCAAAGCCATAAAATTCTAC
AAGCACGTCGTACACAGTGTGGAAAAGTTCATACAGAAGTGTAAACCTGAATACAAAGTT
CCTGGTCTGTACGTGATCGATTCAATAGTAAGACAGTCACGGCACCAGTTTGGCCAGGAC
AAGGACGTGTTCGCGCCAAGGTTCGCCAAAAACATGCAACAGACATTCGCTAACCTGTTC
AGGTGTCCTGATGAAGATAAGCGCAACATAATCAGAGTTCTGAATCTGTGGCAGAAGAAT
AATGTGTTTGGACCTGAAGTGATCCAACCACTGCTAGATCTGGCCGACCCAAGTCATCCT
TTGCATTTGGAAATACAGAACCAGAATAACACAACCAATGGAAGCATAAACATGAGTCAT
AACACATCAGACAGCAAGATCTCCCCCGCTCGGCAGGACTCCCCACAGACATCCTCGCCC
ATGGGAGATGCATTTCAAGATGACTCCTCGCCTGGTCCTCAAGCCAAATTCAACCGCAAG
CTCCTCAATGATTTTGAGTATGAAAGTGAAGATGAACAAGAACCTCCCCCGCAGCCACCA
CATCCGCCACATCCGCCACACGCGCCACATCCGCCACATTCGAGCCACGCCACACACACG
ACACACAACCCCACAGATGCTCTTGGCAGTATACTAACAAATCCAGAAATTATGAGGCAG
CTACAGAGTCTACAAGCCCAGATGCAGCTCATGACGGGCATGCAGATCCCAAATTTGATG
CCGATGATGTCAGACATGCAACTTCAGCAGAACCAAAATTCGAACGCACCATTCTTAAAC
TCTCAGACAGAACAACAGAAACCCGCAGAGCCCAAGGAGGACCTCGCCAACGAGTCCGAC
ATAGAGTTCGTGGAGACCGGACCCCAGGTCATCGAGATACCCGACGCCAACGACTCCCGG
TCGCCCTCGCCGAGGCGACGACATCGCTCCCGGTCGAGGAGTCCACGCCGGAGGAGGAGG
ACGCGCTCCCCGCGGAGAAGGAGGGACAAGGACCGCGATCGAGACCGCGACAAGACGCAC
AAGGAGAGGGAGGCCGAGAAAGAGCGCCAGCGGGAGAGGGAGAAGAGGGGCCTACCACCC
ATCAAGAAAGAGAACCTCAGTGTGTGCAGCACCACGCTGTGGGTCGGCCGGCTGTCCAAA
CAGGCCACGCCCGAGGAGCTGTGGGACCTGTTCGGGGCCGTGGGCGGCGTGGCGGCCGTG
GACGTGGTGGCGCCTCGGGGCTGCGCTTTCGTCGTCATGGAGCGGCGCCGGGACGCCGCG
CGCGCGCTCGCCAAGCTGCACCGACACAAACTGCACTCCAAAGAGATAGACGTCGCCTGG
GCGGCCGGCAAGGGCGTCAAAGGCCGCGAATGGAAGGACTACTGGGAGGCGGAGCTCGGG
GTCGCTTACTTGCCCTGGAGCGCCCTCCACGCGCGCTGGCTGCTGGGCGCGCTGTCGCTG
GACGCGCTCGAGGACGGCGGGGCCGTGGATGAGGACACGCTGCCGCCTTGGCTCCCGCCC
AGGATACTGCCTAAGTCTGTCGGGGAGGCCGTGCCGTTGATGGGCGCGCTGCCCGCTCCG
CTGCCGCTGCCCACCGGGTTGCCGCGTCTGCCGCCACCCGGCCTGGGCGCTCCGCCACCC
GCCGCGGGCTACCCCGGCCTGGGCTCACTGGCGCCGCACCAGCTGCTGAACGAGTCGCCG
GGCTCTTCGGCGCCGGGTCTGCAGCGCGACCCGCTGCTAGCCTTCCCGCCCGCCCTGCCT
CCCCACACCATGCCGCAGCCCGGCTTGGTGGGTGGCTTCCTGGGCGGTCTGATGGGTGTC
GGAGTGGGACACATGAACGTCGGCGGGCTCGTGCTACCCCTTCACCCGGCCCACGCCCAC
GCGGCGCACTCCCACGCGCAGGTCCACACGCATGCACACCCGCACGCCCCGCCACACGTC
CCGCCGCACGCACTCGTCCCTCAGGTGGGTCAACGAGCCGAGGTGGCGGATGACGCCATG
GAGCTGGACAATGACGACCAGACGGACGAGCCCCCAGCCCCCGCGGCCCCTCCGGCCCCA
GCGCCGGCGCCCGCACTCGGTCTTCCGCCTCCCGCCGTGCCCCCGCCGTTGTCTATGGAC
CAGCTTCAGGTCCTGTTGTCGAAGCCGCCGCCGACTTTCAACTCCGCGGAGCCTCCGCCT
GGGTTCAATCCGGAGTCTTTCGAGACGGAGGAGACTCCGGACGAGCGCCGCGAACGGGAC
AAGGAGCGACGGGACAGAGACCGGGACAGGGACCGGGACCGACGGGACCGGCGCGACGAC
CGACCGGACCGACCCGGGGGGCGCAGGGAGAGGGACCGGCCGCGGGACAGGGACAGGGAG
AGAGATGAGCGCCGGGAGAGAGACCGAGGAGGACGGGAGAGAAGAGACCGGGACAGGGAC
AGGGAGAGGGACAGGTTCCCCAGGGAGAACAACAACGAGAAAAGTCAGAAGTCTCCACGG
AGTCAGGCCGGCGAGGCGGGCGGCGCGGAGAAGACGCTGCAGGAGAGGCTGTGGGAGATG
GCCAACGGGAAGACGAGCGACGGAGACGAGCTCGAGCCCCGAGCGGACAGGCCTCCGCTC
ATAGAACGACCGCCTCTCATGGAGCGGCCGCAGACAGCGGACAGCAAGGTTCGTCTCCGC
GGTCCCGGCGGAGGGGGTGGTCCTCGTCCGCCGCCGCGCGCACCGTGGCTGGCTCCGCGC
TTCAATGGTTTGGGTCCGCCTTTCGTACGTCCTCCATTCGAGAGGCCCCCGTTTGAGGGT
CCTCCGATGTTCGAGAGGCCGCCGTTCGGCCGGATGCCGTTCGACGGCGCGCGGCCTCCT
TTCGACGGTCCCCGGCCTCCGGGTCCACGCCTGCCCTTCGATGCGCCGAGGCCTTTCGAC
GGGCCGCGCCCCCCATTCGATGTACCGCGCCCTCCCTTTGACGTTCCGAGGCCACCGTTC
GAAGGTCAGCGACCTCCCTTCGACGGACCGCGACCTCTCTTCGACGGTCCGAGGCCTCCG
TTCGATGGTTTCGAAGGAGATAGATCATTCGACGGACCCAGATTCGATGGGCCCCCCGAG
TTCTTCGACAGAGGCAACAGAAGATTCGATGATAGAGATTTCAACGAGAGAGGCTGGAAC
GGAGATAGAGACTTCGACCGAAGGACAGAATGGGAGGACAGGAGGAGAGAACGCAGAGGG
AGAGATAACGAGGAACGGTTCAGGGAGCGAGGGGGGAGGGGAAGAAACTACGACGAGAGA
GCGAGACCGAGAGACGAGAGGAACACCCGGAGAGACAAGGACAGGAAGTCGAGATGGGGA
GCGGCGGACGAGGCGGGGCAGGGGACAGAAGACGGCAAGGGGAAAGACACCGCGAGTGAG
AGACGAGAGGCAGAGAACGACGACCGGAACGAAACACACGACACGCACACTAGCAGGACC
AGCGGAGAGGAACAGAGGTCAGAAGGTGACGTGGGGCGAGAGGACACCGGGGCAGGGGCA
GAGAAGGAACTGGAGGGCGAGCGGCTGAAGGTCGAGGAAGACGGGGGCAGTGAACACGAA
CAGATTGGACAGGACGGATACCAGCAACAGGACGAGACAGGGGATAAGAAAATAACAGAT
ACGACAGGGGAGGAAGAAAAAATACAAGCAGGGGGGAAACCGGGGAGGTCGGGGAGCGGG
GCGAGGGGTCGGCTTGACATAGGACCAGGAGGAAGATGGGATATTTCCAGTGGAAATGAT
TCGGGTGACGCGGCGCCGGGCCGGGTGACGTCGTGCGACTTGATGCGACGCGGACGAGCG
ACGCCGGGCGACGGCCGCTAG

Protein sequence:

MAEVKAFNAELSGLYENRPPISKAKMSAITRGAIKAIKFYKHVVHSVEKFIQKCKPEYKV
PGLYVIDSIVRQSRHQFGQDKDVFAPRFAKNMQQTFANLFRCPDEDKRNIIRVLNLWQKN
NVFGPEVIQPLLDLADPSHPLHLEIQNQNNTTNGSINMSHNTSDSKISPARQDSPQTSSP
MGDAFQDDSSPGPQAKFNRKLLNDFEYESEDEQEPPPQPPHPPHPPHAPHPPHSSHATHT
THNPTDALGSILTNPEIMRQLQSLQAQMQLMTGMQIPNLMPMMSDMQLQQNQNSNAPFLN
SQTEQQKPAEPKEDLANESDIEFVETGPQVIEIPDANDSRSPSPRRRHRSRSRSPRRRRR
TRSPRRRRDKDRDRDRDKTHKEREAEKERQREREKRGLPPIKKENLSVCSTTLWVGRLSK
QATPEELWDLFGAVGGVAAVDVVAPRGCAFVVMERRRDAARALAKLHRHKLHSKEIDVAW
AAGKGVKGREWKDYWEAELGVAYLPWSALHARWLLGALSLDALEDGGAVDEDTLPPWLPP
RILPKSVGEAVPLMGALPAPLPLPTGLPRLPPPGLGAPPPAAGYPGLGSLAPHQLLNESP
GSSAPGLQRDPLLAFPPALPPHTMPQPGLVGGFLGGLMGVGVGHMNVGGLVLPLHPAHAH
AAHSHAQVHTHAHPHAPPHVPPHALVPQVGQRAEVADDAMELDNDDQTDEPPAPAAPPAP
APAPALGLPPPAVPPPLSMDQLQVLLSKPPPTFNSAEPPPGFNPESFETEETPDERRERD
KERRDRDRDRDRDRRDRRDDRPDRPGGRRERDRPRDRDRERDERRERDRGGRERRDRDRD
RERDRFPRENNNEKSQKSPRSQAGEAGGAEKTLQERLWEMANGKTSDGDELEPRADRPPL
IERPPLMERPQTADSKVRLRGPGGGGGPRPPPRAPWLAPRFNGLGPPFVRPPFERPPFEG
PPMFERPPFGRMPFDGARPPFDGPRPPGPRLPFDAPRPFDGPRPPFDVPRPPFDVPRPPF
EGQRPPFDGPRPLFDGPRPPFDGFEGDRSFDGPRFDGPPEFFDRGNRRFDDRDFNERGWN
GDRDFDRRTEWEDRRRERRGRDNEERFRERGGRGRNYDERARPRDERNTRRDKDRKSRWG
AADEAGQGTEDGKGKDTASERREAENDDRNETHDTHTSRTSGEEQRSEGDVGREDTGAGA
EKELEGERLKVEEDGGSEHEQIGQDGYQQQDETGDKKITDTTGEEEKIQAGGKPGRSGSG
ARGRLDIGPGGRWDISSGNDSGDAAPGRVTSCDLMRRGRATPGDGR