New model in OGS2.0 | DPOGS212437  |
---|---|
Genomic Position | scaffold1732:- 127-6545 |
See gene structure | |
CDS Length | 2982 |
Paired RNAseq reads   | 2868 |
Single RNAseq reads   | 7125 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA002895 (1e-52) |
Best Drosophila hit   | peanuts (0.0) |
Best Human hit | ATP-dependent RNA helicase DHX8 (0.0) |
Best NR hit (blastp)   | pre-mRNA-splicing factor ATP-dependent RNA helicase PRP22, putative [Pediculus humanus corporis] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to CG8241 CG8241-PA [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0007286 spermatid development GO:0005681 spliceosomal complex GO:0000398 nuclear mRNA splicing, via spliceosome GO:0004004 ATP-dependent RNA helicase activity GO:0003723 RNA binding GO:0005524 ATP binding GO:0005634 nucleus GO:0000381 regulation of alternative nuclear mRNA splicing, via spliceosome GO:0071013 catalytic step 2 spliceosome GO:0071011 precatalytic spliceosome |
InterPro families    | IPR003029 Ribosomal protein S1, RNA-binding domain IPR014001 DEAD-like helicase IPR001650 Helicase, C-terminal IPR002464 DNA/RNA helicase, ATP-dependent, DEAH-box type, conserved site IPR022967 RNA-binding domain, S1 IPR012340 Nucleic acid-binding, OB-fold IPR011545 DNA/RNA helicase, DEAD/DEAH box type, N-terminal IPR016027 Nucleic acid-binding, OB-fold-like |
Orthology group | MCL10048 |
Nucleotide sequence:
ATGGATGAAGTTTCAAAGCTAGAACATCTTTCTCTGGTGTCCAAAATATGCACAGAGTTG
GACAACCATTTGGGATTGAATGATAAAGATTTAGCTGAGTTCATTATCGACTTGGCCGAC
AAGAACCCAAACTTTGACAATTTTAAAAAAGCACTTATCGAGAATGGTGCGGAATTTTCT
GACTCATTCATGACAAACCTGCTACGTATTATTCAACATATGAAACCTTCTGAAAATCAA
GCTGACGGGCCACAGAAAGAGATCAAGAGCAGTAATCCTCTGGCAAGCAAATTTCCAGGT
TTGGCTATTCCCAATGATAAACCTAGTAAATTTTCATCAGATGATGAAAGCGAAGATGAT
AATAAAAATACGAGAATAACATCAAAGGACATATTTAAAACTGAATCTAAAGCAAAGGAG
TCGTGTGTCGATGTTGAGGATGCGATGGCAGCGCTAGAGGCGCTAGCACCGTCCAATATA
AACAAAGGCAAGGATGAGTCAAAGAAAGATATCATAAAGAAGAGAGACCAGAGCGGCGAT
CGTAGCGAAACTATTAGGAAACGAGAAAGATCGAGGGAAAGAAAGCGCAGTAGAAGTAGG
GAGAGGAGACGCCGCAGTAAAGATAGAGACAGACGTAGCCGTAGTCGTAAGAGAAGTCGC
AGTAGAAGCAGAAGAAGTCACAGCAAAGAAAGAACAAATAGAAGTAGAGATCAGAGAAGT
AGAGACAGGGATAGACGAAGACGTTCCAGAAGTAAACCTCGACATAGATCCAGGTCTAGA
GATAGGCAACGGAGATCCAGATCTACTAATAGAAGATCTAGATCCAGATCCCGTTCATAT
GAAAGAAAAGAAAGAAATAGATACAATGATTATGGCAGAAATCAGAAAAGAAGAAGCGCT
GAGGTGGAGATGACCGATGATCCTGAACCGGGGAAAATTTACAATGGACGAGTGGCAAAC
ATAGTACCGTTTGGATGCTTCGTACAGATGGAAGGGCTGCGGAAGAGATGGGAGGGTCTC
GTACACATCTCTCAACTCAGAGCTGAAGGTAGAGTCACGAATGTATCCGATGTAGTGTCC
AGGGGTGACAAAGTTAAGGTATTACTGTCAGTGACCGGACAAAAGGTATCACTGACGATG
AAGGATGTCTGTCAAGAGTCCGGCAAGGATTTAAATCCGACTTCACATGCACATCTAGAG
GTGGAGCGTTCAGGTCGCAACCCGGACCGTCCCCCGGCCGTGTTGGCGGGACTCCAACTA
GACCCTGATGAAGACTCCAGCCGCAAACGGGTCACCAGGATATCCAGTCCCGAGCGATGG
GAGATCAAACAGATGATATCATCAGGTGTGATCGATAAAAGCGAGTTGCCAGATTTCGAT
GAAGAAACGGGTCTTCTGCCCAAAGAGGAGGACGGAGAAGCGGACATCGAGATAGAACTG
GTCGAGGAGGAACCGCCCTTCCTACAAGGTCACGGGCGAGCTCTACACGACCTGTCCCCT
GTTAGAATAGTCAAGAATCCTGATGGATCACTAGCGCAGGCCGCCATGATGCAGTCCGCT
CTGGCGAAGGAGAGAAGAGAACAGAAGATGATACAGAGAGAACAGGAGATGGAGAGTCTG
CCGACCGGTCTCAACAAAAACTGGATCGATCCTCTACCGGAAGCGGACGGGAGGGCGTTA
GCGGCTAACATGCGAGGCTCGGGCATAACACCGCAGGACTTGCCCGAGTGGAAGAAACAC
GTCATCGGGGGGAAGAAATCTTCATTCGGCAAGAAAACTAACCTGTCCCTCCTGGAGCAG
AGGCAGTCCCTGCCCATTTACAAGTTGAGAGACGAATTGACCAAGGCCATATCCGACAAC
CAGATCCTGATAGTGATAGGAGAGACGGGTTCCGGGAAGACGACTCAAATCACGCAGTAC
GTCTGCGAGTGTGGCGTGTCCGGGCGGGGCCGTGTGGCGTGCACCCAGCCCAGGAGAGTG
GCCGCCATGTCCGTCGCCAAGAGGGTCGCTGAGGAGTTCGGCTGCAGGCTGGGTCAAGAG
GTCGGCTACACCATACGATTTGAGGACTGCACCGGACCCGACACGGTCATCAAGTACATG
ACAGACGGTATGTTGCTCCGCGAGTGTCTGATGGATCTGGACCTGAAGAGCTACTCCGTC
ATCATGCTGGACGAGGCCCACGAGCGCACCATACACACGGACGTGCTGTTCGGCCTCCTC
AAACAAGCGGTCCAGAAACGACCGGAACTCAAACTGATCGTGACATCCGCCACCCTGGAC
GCCGTGAAATTCTCCCAGTACTTCTTCGAGGCCCCCATCTTCACCATACCCGGACGGACC
TTCCCCGTCGAGGTTCTGTACACAAAGGAACCGGAAACGGATTACCTGGACGCCTCCTTA
ATAACCGTCATGCAAATACATCTGCGTGAACCGCCCGGGGATATTCTGCTGTTTTTGACC
GGCCAGGAGGAAATCGACACCGCCTGCGAGATACTGTACGAGAGGATGAAGTCCCTCGGC
CCGGATGTACCTGAGCTGATCATTCTTCCGGTTTACTCCGCCCTTCCGTCTGAGATGCAG
ACCAGAATCTTCGAACCCGCTCCGCCTGGCTCGAGGAAGGTGGTGATAGCTACCAACATA
GCGGAGACCTCGCTCACCATAGACGGCATTTACTACGTAGTGGACCCCGGGTTCGTCAAA
CAGAAGGTCTACAATTCAAAGACCGGTATGGACTCGTTGGTCGTCACCCCGATCTCACAG
GCGGCGGCGAAAGTCGCTCGGCCAGCACGCCCAGCGACTGGCCCGGGGAAGTGTTACCGA
CTGTACACGGAGCGCGCATACCGGGATGAAATGTTGCCCACCCCTGTCCCGGAAATACAA
AGGACTAATCTCGCCACTACAGTATGTATAATAATATTTGTTAAACTAAAAACTAAATCA
CTCGCTTTTTTGGCAGCCAGGGAGTCGCAGGCAGATAATTAA
Protein sequence:
MDEVSKLEHLSLVSKICTELDNHLGLNDKDLAEFIIDLADKNPNFDNFKKALIENGAEFS
DSFMTNLLRIIQHMKPSENQADGPQKEIKSSNPLASKFPGLAIPNDKPSKFSSDDESEDD
NKNTRITSKDIFKTESKAKESCVDVEDAMAALEALAPSNINKGKDESKKDIIKKRDQSGD
RSETIRKRERSRERKRSRSRERRRRSKDRDRRSRSRKRSRSRSRRSHSKERTNRSRDQRS
RDRDRRRRSRSKPRHRSRSRDRQRRSRSTNRRSRSRSRSYERKERNRYNDYGRNQKRRSA
EVEMTDDPEPGKIYNGRVANIVPFGCFVQMEGLRKRWEGLVHISQLRAEGRVTNVSDVVS
RGDKVKVLLSVTGQKVSLTMKDVCQESGKDLNPTSHAHLEVERSGRNPDRPPAVLAGLQL
DPDEDSSRKRVTRISSPERWEIKQMISSGVIDKSELPDFDEETGLLPKEEDGEADIEIEL
VEEEPPFLQGHGRALHDLSPVRIVKNPDGSLAQAAMMQSALAKERREQKMIQREQEMESL
PTGLNKNWIDPLPEADGRALAANMRGSGITPQDLPEWKKHVIGGKKSSFGKKTNLSLLEQ
RQSLPIYKLRDELTKAISDNQILIVIGETGSGKTTQITQYVCECGVSGRGRVACTQPRRV
AAMSVAKRVAEEFGCRLGQEVGYTIRFEDCTGPDTVIKYMTDGMLLRECLMDLDLKSYSV
IMLDEAHERTIHTDVLFGLLKQAVQKRPELKLIVTSATLDAVKFSQYFFEAPIFTIPGRT
FPVEVLYTKEPETDYLDASLITVMQIHLREPPGDILLFLTGQEEIDTACEILYERMKSLG
PDVPELIILPVYSALPSEMQTRIFEPAPPGSRKVVIATNIAETSLTIDGIYYVVDPGFVK
QKVYNSKTGMDSLVVTPISQAAAKVARPARPATGPGKCYRLYTERAYRDEMLPTPVPEIQ
RTNLATTVCIIIFVKLKTKSLAFLAARESQADN