New model in OGS2.0 | DPOGS205379  |
---|---|
Genomic Position | scaffold6950:11-14293 (- strand) |
CDS Length | 3207 |
Paired RNAseq reads   | 3477 |
Single RNAseq reads   | 8330 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA008779 (2e-10) |
Best Drosophila hit   | CG5931 (0.0) |
Best Human hit | U5 small nuclear ribonucleoprotein 200 kDa helicase (0.0) |
Best NR hit (blastp)   | PREDICTED: similar to pre-mRNA-splicing helicase BRR2 [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC016163 [Tribolium castaneum] (0.0) |
Gene Ontology terms | GO:0030532 small nuclear ribonucleoprotein complex; GO:0005682 U5 snRNP; GO:0008380 RNA splicing; GO:0003724 RNA helicase activity; GO:0004004 ATP-dependent RNA helicase activity; GO:0005681 spliceosomal complex; GO:0003676 nucleic acid binding; GO:0005524 ATP binding; GO:0000381 regulation of alternative nuclear mRNA splicing, via spliceosome; GO:0005634 nucleus; GO:0071011 precatalytic spliceosome; GO:0071013 catalytic step 2 spliceosome; GO:0000398 nuclear mRNA splicing, via spliceosome |
InterPro families | IPR004179 Sec63 domain; IPR011545 DNA/RNA helicase, DEAD/DEAH box type, N-terminal; IPR001650 Helicase, C-terminal; IPR023290 Sec63-domain; IPR014001 DEAD-like helicase |
Orthology group | MCL10194 |
Nucleotide sequence:
AGTCGCATCTCTACAGCGGCCCTGGAGTCGGATGACAATCTGCTGGTCTGCGCGCCCACG
GGAGCGGGCAAGACGAACGTGGCGCTGCTGTGCATCCTGAGGGCGTTGGGCAAACACGTC
AACCACGACGGCAGCGTCAACGTGGGCGACTTCAAGGTGGTGTACGTGGCGCCCATGAGG
TCGCTGGTGCAGGAGATGGTGGGCAATTTCAGCAAGCGGCTGGCGGCTTACAACATGAAG
GTGTCGGAGCTGACGGGTGACCACCAGCTCACCAGGGAACAGATAGACGACACGCAGCTC
ATAGTGTGCACGCCTGAAAAATGGGACATCGTCACTAGGAAAGGTAACTTTGGTTGGGAG
AGAGGAGCTTGGACGGTGGAGGCGAGGCGAGCGATATTGGTGTTCGTGCACTCCAGGAAG
GAGACGGGGAAGACGGCGCGGGCTGTCCGCGACACCTGCCTGGAGAAGGACACGCTCGGA
CATTTCCTTAGGGAGGGTTCCGCCAGCGTGGAGGTCCTGCGTACCGAGGCTGAGCAGGTG
AAGAACCCTGAACTCCGTGAGTTACTACCCTACGGTTTCGCCATACATCACGCGGGTATG
AGCCGAGTGGACAGGACCCTGGTAGAAGATCTGTTCGCTGACAGACATATACAGGTGTTG
GTATCAACGGCTACCCTGGCGTGGGGTGTGAACCTGCCAGCACACACGGTCATAGTGAAG
GGGACGCAGGTGTACAGCCCCGAGAAGGGCCGCTGGAGCGAGCTGGGGGCGCTGGACGTG
CTTCAGATGCTGGGAAGAGCTGGACGGCCGCAGTACGACACCAAGGGAGAAGGCATTCTG
ATAACAAATCACTCCGAGCTGCAATACTACCTGTCGCTGTTGAACCAGCAATTGCCGATC
GAATCACAGCTCGTCTCCAAGCTACCGGATATGCTGAACGCGGAGATAGTGCTGGGTTCC
GTACAGAGCGTGAGGGATGCTGTCACGTGGCTCGGTTACACGTACCTGTACGTGCGCATG
CTCCGCCAGCCCGCTCTGTACGGTGTGAGTGAGGACAGGCTGAAGGACGACGCGTTGTTG
GAACTTCACCGGGCCGACCTCGTTCACACCGCCGCCTCGCTGCTCGACAAAGCCGGTCTA
ATTAAATACGAGCGGAAATCCGGTCACTTCCAAGCCACTGAGCTGGGTCGTATAGCTTCT
CACTACTACTGCACCTATGAGACCATGCAGAACTACAACCAGCTGTTGAAGCCCACGTTG
GCTGAGATAGAGTTGTTCAGAGTGTTCTCGCTGTCCGCTGAGTTCAAGCACATCACTGTC
AGGGACGAGGAGAAGCTGGAGCTGCACAAGCTCATGGAGCGGGTTCCTATACCTATCAAG
GAGAGTATAGAAGAACCGTCAGCCAAGATAAACGTGCTGCTGCAAGCTTATATATCACAG
CTCAAGCTGGAAGGCTTCGCGCTGATGGCGGATATGGTGTACGTGACGCAATCAGCGAGT
AGACTGCTCAGGGCTATCTTCGAGATTGTGCTGCATCGAGGCTGGGCGCAGCTGGTCGAT
AAGACGCTGGCTCTGTGCAAGATGGTAGACAAACGGATGTGGCAGTCCATGTCGCCGTTG
AGACAATTCAGGAAAATGCCGGAGGAGGTGATCAAAAAGCTGGAGAAAAAGAACTTCCCC
TGGGAGAAGCTGTACGAGCTGGGCCCCAATGAGATCGGTGAACTGGTCCGAGCACCCAAG
TTAGGGAAGATGATACACAAATATGTACACCAGTTCCCCAAGTTAGAACTGGCAACACAC
ATACAGCCCATCACGAGGTCCACACTGAGGGTCGAGCTTACTATAACTCCTGACTTCCAA
TGGGATGAGAAGATCCACGGCCAGTCTGAAGCTTTCTGGATCCTGGTCGAGGATGTGGAC
TCTGAGACCGTGCTGCACCACGAGCAGCTGCTGCTGAAGCACAAGTACTGCCGCGACGAG
CAGCACGTGAAGCTGTTCGTGCCGGTCTTCGAGCCACTGCCTCCACAGTACTTCCTCAGG
GTTGTATCAGATCGGTGGATAGCCGCGGAAACGCAGCTTCCAGTGTCATTCCGGCACCTC
ATCCTTCCGGAGAAAAACCTGCCGCCCACCGAACTTCTGGACCTGCAACCTCTGCCGATA
TCCGCCCTCCGGAACGAGAAATGGGAACAGCTGTACAAAGACGCCTTCCCGCAATTCAAT
CCAGTGCAGACACAGGTCTTCAACGCTGTATACAACTCGGACGACAACGTGTTCGTGGGT
GCTCCATCAGGGTCTGGCAAGTCGGTGATCGCTGAGCTGGCACTTCTCCGACTCCTGACC
CACAGCCCAGCCTCCAGAGCGGTGTACCTGGTCCCTCATGATGCACTCGCTGATATAGTG
TTTGCGGACTGGTATCACAAGTTCGGTGCCCGCTTCAATCTGAAGGTGGTCCAGTTGACC
GGTGAGACGGCTACAGACCACAAGCTGATCAACAAGGGCCAGATCATAGTTACTACAGCT
GAGAAGTGGGACGTACTGTCTAGAAGGTGGAAGGTCCGTAAGAGCGTCCAGAGCGTTTCA
CTGCTGGTGGTGGACGCGCTGCAGCTGCTGGGCGCGCACGAGGGACCCGTGCTGGAGGTG
GTGTGCTCCAGGATGAGGTACATCGCCTCGCAGACAGGTCGTCCTATCCGTATCGTGGCT
CTGTCCCTGCCGCTGGCGGACGCTCGCGACGTCTGGCAGTGGCTTGGCTGTAATACTAAC
AGCGCCTTCAACTTCCATCCCAGCGTCAGGCCGCTACCGTTGGAGCTTCATGTTCAGAGT
CACTTGGACCACCGTCTCCACGACCACATGAACGCTGAGATCGTGACGAAGACGATAGAG
AACAAACAGGACGCGGTGGACTACCTCACCTGGACCTTCCTCTACAGGAGACTCACGCAG
AACCCCAACTACTACAACCTGCAGGGGGTCACGCACAGACATCTCTCCGATCACCTGTCA
GAGCTGGTCGAGTCAACCCTATCAGATCTGGAACAGTCGAAGTGTATAGCCATCGAGGAT
GATATGGACCTGCAGCCTCTCAACCTTGGAATGATAGCCTCCTACTACTACATAAACTAC
ACTACTATAGGTAAAATTAAAAATACACGTACAAAGTCACCGGCAAAGCTTAGTAAGAAA
CCTAATTTATTGCCAATTTTATTATAA
Protein sequence:
SRISTAALESDDNLLVCAPTGAGKTNVALLCILRALGKHVNHDGSVNVGDFKVVYVAPMR
SLVQEMVGNFSKRLAAYNMKVSELTGDHQLTREQIDDTQLIVCTPEKWDIVTRKGNFGWE
RGAWTVEARRAILVFVHSRKETGKTARAVRDTCLEKDTLGHFLREGSASVEVLRTEAEQV
KNPELRELLPYGFAIHHAGMSRVDRTLVEDLFADRHIQVLVSTATLAWGVNLPAHTVIVK
GTQVYSPEKGRWSELGALDVLQMLGRAGRPQYDTKGEGILITNHSELQYYLSLLNQQLPI
ESQLVSKLPDMLNAEIVLGSVQSVRDAVTWLGYTYLYVRMLRQPALYGVSEDRLKDDALL
ELHRADLVHTAASLLDKAGLIKYERKSGHFQATELGRIASHYYCTYETMQNYNQLLKPTL
AEIELFRVFSLSAEFKHITVRDEEKLELHKLMERVPIPIKESIEEPSAKINVLLQAYISQ
LKLEGFALMADMVYVTQSASRLLRAIFEIVLHRGWAQLVDKTLALCKMVDKRMWQSMSPL
RQFRKMPEEVIKKLEKKNFPWEKLYELGPNEIGELVRAPKLGKMIHKYVHQFPKLELATH
IQPITRSTLRVELTITPDFQWDEKIHGQSEAFWILVEDVDSETVLHHEQLLLKHKYCRDE
QHVKLFVPVFEPLPPQYFLRVVSDRWIAAETQLPVSFRHLILPEKNLPPTELLDLQPLPI
SALRNEKWEQLYKDAFPQFNPVQTQVFNAVYNSDDNVFVGAPSGSGKSVIAELALLRLLT
HSPASRAVYLVPHDALADIVFADWYHKFGARFNLKVVQLTGETATDHKLINKGQIIVTTA
EKWDVLSRRWKVRKSVQSVSLLVVDALQLLGAHEGPVLEVVCSRMRYIASQTGRPIRIVA
LSLPLADARDVWQWLGCNTNSAFNFHPSVRPLPLELHVQSHLDHRLHDHMNAEIVTKTIE
NKQDAVDYLTWTFLYRRLTQNPNYYNLQGVTHRHLSDHLSELVESTLSDLEQSKCIAIED
DMDLQPLNLGMIASYYYINYTTIGKIKNTRTKSPAKLSKKPNLLPILL
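
The nucleotide and protein sequences above can be cross-checked against each other. The following is a minimal sketch, assuming the standard nuclear genetic code and reading frame 1 (neither is stated in the record). The helper name `translate` and the variable names are illustrative and are not part of the database. With a CDS length of 3207, the sequence holds 3207 / 3 = 1069 codons, the last of which is the TAA stop, which is consistent with a 1068-residue protein.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the record uses the standard nuclear code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 1; '*' marks a stop codon,
    'X' marks any codon containing a non-ACGT character."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First and last lines of the nucleotide sequence in this record.
first_line = "AGTCGCATCTCTACAGCGGCCCTGGAGTCGGATGACAATCTGCTGGTCTGCGCGCCCACG"
last_line = "CCTAATTTATTGCCAATTTTATTATAA"

print(translate(first_line))  # SRISTAALESDDNLLVCAPT
print(translate(last_line))   # PNLLPILL*
```

Both outputs agree with the start and end of the protein sequence listed above. Applying the same function to the full concatenated nucleotide sequence should reproduce the whole protein followed by a single `*`.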