DPGLEAN01779 in OGS1.0

New model in OGS2.0: DPOGS212437
Genomic Position: scaffold1732:- 127-6545
CDS Length: 2982
Paired RNAseq reads: 2868
Single RNAseq reads: 7125
Migratory profiles: Query via corresponding ESTs
Best Bombyx hit: BGIBMGA002895 (1e-52)
Best Drosophila hit: peanuts (0.0)
Best Human hit: ATP-dependent RNA helicase DHX8 (0.0)
Best NR hit (blastp): pre-mRNA-splicing factor ATP-dependent RNA helicase PRP22, putative [Pediculus humanus corporis] (0.0)
Best NR hit (blastx): PREDICTED: similar to CG8241 CG8241-PA [Tribolium castaneum] (0.0)
Gene Ontology terms:
GO:0007286 spermatid development
GO:0005681 spliceosomal complex
GO:0000398 nuclear mRNA splicing, via spliceosome
GO:0004004 ATP-dependent RNA helicase activity
GO:0003723 RNA binding
GO:0005524 ATP binding
GO:0005634 nucleus
GO:0000381 regulation of alternative nuclear mRNA splicing, via spliceosome
GO:0071013 catalytic step 2 spliceosome
GO:0071011 precatalytic spliceosome
InterPro families:
IPR003029 Ribosomal protein S1, RNA-binding domain
IPR014001 DEAD-like helicase
IPR001650 Helicase, C-terminal
IPR002464 DNA/RNA helicase, ATP-dependent, DEAH-box type, conserved site
IPR022967 RNA-binding domain, S1
IPR012340 Nucleic acid-binding, OB-fold
IPR011545 DNA/RNA helicase, DEAD/DEAH box type, N-terminal
IPR016027 Nucleic acid-binding, OB-fold-like
Orthology group: MCL10048

Nucleotide sequence:

ATGGATGAAGTTTCAAAGCTAGAACATCTTTCTCTGGTGTCCAAAATATGCACAGAGTTG
GACAACCATTTGGGATTGAATGATAAAGATTTAGCTGAGTTCATTATCGACTTGGCCGAC
AAGAACCCAAACTTTGACAATTTTAAAAAAGCACTTATCGAGAATGGTGCGGAATTTTCT
GACTCATTCATGACAAACCTGCTACGTATTATTCAACATATGAAACCTTCTGAAAATCAA
GCTGACGGGCCACAGAAAGAGATCAAGAGCAGTAATCCTCTGGCAAGCAAATTTCCAGGT
TTGGCTATTCCCAATGATAAACCTAGTAAATTTTCATCAGATGATGAAAGCGAAGATGAT
AATAAAAATACGAGAATAACATCAAAGGACATATTTAAAACTGAATCTAAAGCAAAGGAG
TCGTGTGTCGATGTTGAGGATGCGATGGCAGCGCTAGAGGCGCTAGCACCGTCCAATATA
AACAAAGGCAAGGATGAGTCAAAGAAAGATATCATAAAGAAGAGAGACCAGAGCGGCGAT
CGTAGCGAAACTATTAGGAAACGAGAAAGATCGAGGGAAAGAAAGCGCAGTAGAAGTAGG
GAGAGGAGACGCCGCAGTAAAGATAGAGACAGACGTAGCCGTAGTCGTAAGAGAAGTCGC
AGTAGAAGCAGAAGAAGTCACAGCAAAGAAAGAACAAATAGAAGTAGAGATCAGAGAAGT
AGAGACAGGGATAGACGAAGACGTTCCAGAAGTAAACCTCGACATAGATCCAGGTCTAGA
GATAGGCAACGGAGATCCAGATCTACTAATAGAAGATCTAGATCCAGATCCCGTTCATAT
GAAAGAAAAGAAAGAAATAGATACAATGATTATGGCAGAAATCAGAAAAGAAGAAGCGCT
GAGGTGGAGATGACCGATGATCCTGAACCGGGGAAAATTTACAATGGACGAGTGGCAAAC
ATAGTACCGTTTGGATGCTTCGTACAGATGGAAGGGCTGCGGAAGAGATGGGAGGGTCTC
GTACACATCTCTCAACTCAGAGCTGAAGGTAGAGTCACGAATGTATCCGATGTAGTGTCC
AGGGGTGACAAAGTTAAGGTATTACTGTCAGTGACCGGACAAAAGGTATCACTGACGATG
AAGGATGTCTGTCAAGAGTCCGGCAAGGATTTAAATCCGACTTCACATGCACATCTAGAG
GTGGAGCGTTCAGGTCGCAACCCGGACCGTCCCCCGGCCGTGTTGGCGGGACTCCAACTA
GACCCTGATGAAGACTCCAGCCGCAAACGGGTCACCAGGATATCCAGTCCCGAGCGATGG
GAGATCAAACAGATGATATCATCAGGTGTGATCGATAAAAGCGAGTTGCCAGATTTCGAT
GAAGAAACGGGTCTTCTGCCCAAAGAGGAGGACGGAGAAGCGGACATCGAGATAGAACTG
GTCGAGGAGGAACCGCCCTTCCTACAAGGTCACGGGCGAGCTCTACACGACCTGTCCCCT
GTTAGAATAGTCAAGAATCCTGATGGATCACTAGCGCAGGCCGCCATGATGCAGTCCGCT
CTGGCGAAGGAGAGAAGAGAACAGAAGATGATACAGAGAGAACAGGAGATGGAGAGTCTG
CCGACCGGTCTCAACAAAAACTGGATCGATCCTCTACCGGAAGCGGACGGGAGGGCGTTA
GCGGCTAACATGCGAGGCTCGGGCATAACACCGCAGGACTTGCCCGAGTGGAAGAAACAC
GTCATCGGGGGGAAGAAATCTTCATTCGGCAAGAAAACTAACCTGTCCCTCCTGGAGCAG
AGGCAGTCCCTGCCCATTTACAAGTTGAGAGACGAATTGACCAAGGCCATATCCGACAAC
CAGATCCTGATAGTGATAGGAGAGACGGGTTCCGGGAAGACGACTCAAATCACGCAGTAC
GTCTGCGAGTGTGGCGTGTCCGGGCGGGGCCGTGTGGCGTGCACCCAGCCCAGGAGAGTG
GCCGCCATGTCCGTCGCCAAGAGGGTCGCTGAGGAGTTCGGCTGCAGGCTGGGTCAAGAG
GTCGGCTACACCATACGATTTGAGGACTGCACCGGACCCGACACGGTCATCAAGTACATG
ACAGACGGTATGTTGCTCCGCGAGTGTCTGATGGATCTGGACCTGAAGAGCTACTCCGTC
ATCATGCTGGACGAGGCCCACGAGCGCACCATACACACGGACGTGCTGTTCGGCCTCCTC
AAACAAGCGGTCCAGAAACGACCGGAACTCAAACTGATCGTGACATCCGCCACCCTGGAC
GCCGTGAAATTCTCCCAGTACTTCTTCGAGGCCCCCATCTTCACCATACCCGGACGGACC
TTCCCCGTCGAGGTTCTGTACACAAAGGAACCGGAAACGGATTACCTGGACGCCTCCTTA
ATAACCGTCATGCAAATACATCTGCGTGAACCGCCCGGGGATATTCTGCTGTTTTTGACC
GGCCAGGAGGAAATCGACACCGCCTGCGAGATACTGTACGAGAGGATGAAGTCCCTCGGC
CCGGATGTACCTGAGCTGATCATTCTTCCGGTTTACTCCGCCCTTCCGTCTGAGATGCAG
ACCAGAATCTTCGAACCCGCTCCGCCTGGCTCGAGGAAGGTGGTGATAGCTACCAACATA
GCGGAGACCTCGCTCACCATAGACGGCATTTACTACGTAGTGGACCCCGGGTTCGTCAAA
CAGAAGGTCTACAATTCAAAGACCGGTATGGACTCGTTGGTCGTCACCCCGATCTCACAG
GCGGCGGCGAAAGTCGCTCGGCCAGCACGCCCAGCGACTGGCCCGGGGAAGTGTTACCGA
CTGTACACGGAGCGCGCATACCGGGATGAAATGTTGCCCACCCCTGTCCCGGAAATACAA
AGGACTAATCTCGCCACTACAGTATGTATAATAATATTTGTTAAACTAAAAACTAAATCA
CTCGCTTTTTTGGCAGCCAGGGAGTCGCAGGCAGATAATTAA

Protein sequence:

MDEVSKLEHLSLVSKICTELDNHLGLNDKDLAEFIIDLADKNPNFDNFKKALIENGAEFS
DSFMTNLLRIIQHMKPSENQADGPQKEIKSSNPLASKFPGLAIPNDKPSKFSSDDESEDD
NKNTRITSKDIFKTESKAKESCVDVEDAMAALEALAPSNINKGKDESKKDIIKKRDQSGD
RSETIRKRERSRERKRSRSRERRRRSKDRDRRSRSRKRSRSRSRRSHSKERTNRSRDQRS
RDRDRRRRSRSKPRHRSRSRDRQRRSRSTNRRSRSRSRSYERKERNRYNDYGRNQKRRSA
EVEMTDDPEPGKIYNGRVANIVPFGCFVQMEGLRKRWEGLVHISQLRAEGRVTNVSDVVS
RGDKVKVLLSVTGQKVSLTMKDVCQESGKDLNPTSHAHLEVERSGRNPDRPPAVLAGLQL
DPDEDSSRKRVTRISSPERWEIKQMISSGVIDKSELPDFDEETGLLPKEEDGEADIEIEL
VEEEPPFLQGHGRALHDLSPVRIVKNPDGSLAQAAMMQSALAKERREQKMIQREQEMESL
PTGLNKNWIDPLPEADGRALAANMRGSGITPQDLPEWKKHVIGGKKSSFGKKTNLSLLEQ
RQSLPIYKLRDELTKAISDNQILIVIGETGSGKTTQITQYVCECGVSGRGRVACTQPRRV
AAMSVAKRVAEEFGCRLGQEVGYTIRFEDCTGPDTVIKYMTDGMLLRECLMDLDLKSYSV
IMLDEAHERTIHTDVLFGLLKQAVQKRPELKLIVTSATLDAVKFSQYFFEAPIFTIPGRT
FPVEVLYTKEPETDYLDASLITVMQIHLREPPGDILLFLTGQEEIDTACEILYERMKSLG
PDVPELIILPVYSALPSEMQTRIFEPAPPGSRKVVIATNIAETSLTIDGIYYVVDPGFVK
QKVYNSKTGMDSLVVTPISQAAAKVARPARPATGPGKCYRLYTERAYRDEMLPTPVPEIQ
RTNLATTVCIIIFVKLKTKSLAFLAARESQADN