DPGLEAN09478 in OGS1.0

New model in OGS2.0DPOGS212384 
Genomic Positionscaffold1571:- 12350-17650
See gene structure
CDS Length1956
Paired RNAseq reads  1603
Single RNAseq reads  3562
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012071 (5e-45)
Best Drosophila hit  lethal (1) G0007, isoform A (2e-130)
Best Human hitpre-mRNA-splicing factor ATP-dependent RNA helicase PRP16 (3e-118)
Best NR hit (blastp)  PREDICTED: similar to pre-mRNA splicing factor ATP-dependent RNA helicase PRP16 [Acyrthosiphon pisum] (7e-180)
Best NR hit (blastx)  pre-mRNA-splicing factor ATP-dependent RNA helicase prp16, putative [Pediculus humanus corporis] (3e-146)
GeneOntology terms






  
GO:0008026 ATP-dependent helicase activity
GO:0004004 ATP-dependent RNA helicase activity
GO:0000398 nuclear mRNA splicing, via spliceosome
GO:0005681 spliceosomal complex
GO:0003676 nucleic acid binding
GO:0005524 ATP binding
GO:0000381 regulation of alternative nuclear mRNA splicing, via spliceosome
GO:0002121 inter-male aggressive behavior
InterPro families

  
IPR002464 DNA/RNA helicase, ATP-dependent, DEAH-box type, conserved site
IPR014001 DEAD-like helicase
IPR001482 Type II secretion system protein E
Orthology groupND

Nucleotide sequence:

ATGTCGGACAGTGAAGAAAACCTGCATCGTTTGGAGGGAACTCCAAACAATGCTCCTGGA
GGTCTTATTATCAAGAAAAAAGATAAACCGCCTGATTTCCAATTTGCCAAACCGTCCTTG
TTAGGTCTAGATAAGTTGGCCGCTGCTAAAAGAATGCAGAACCGCTTAATTTCTTTTCAA
AACGAAGACAACTATGGAGATGATATTGAGGATAAAAGTGGTTCCAGTGGTGTAAGGGAG
CGGAAATATAGGAAGCATAATGAAGAAACCCCGACGTACACCGGTGGCTTATCAGAACAA
GCCCGAGCAAGGATGTTGGAAAGATTAGAAATGAAAGAAAAGAAAGCGAGAGAAAAAGGT
GTGCATAATTCAACACTAGAAGAGAAAAAGTCACATTCCAAAGATGAAGAAAACAGCCGA
TTTCACAATTATGGTCGTGGGCATCGTGACAAGGACAGAAGACGTGACTATGATAGAAGA
GACAGAGACAGATATAGGGATCGGAATGACAGGAATAAGAGCGAAAGGGATAGAAGGAGA
GATTCAGAGAGGAGGGAAAGAGACCGCGACACTGATAGAGATAGCTCACGACGGAGTTAC
TATGAACCCAGGTTTAAAGATGAACCAAGAACACCAAGCATAAAAGCTCTCAAACCAACC
GACAAAACAGCTTGGGATGATGACGACGATGACCCTAAGGCCGTAAGGAAGTCAAGTTGG
GACTTCCCCACACCACTGCCCAGAGACTTGGCGGATAGATCAGCTCGGAGTGAACGTAAA
CCCACCAGAGACTATAAAGGAAGAGCATACGAAGATACCATTAGAGCTACACCTCATAAA
TGGGTGAGCTCACGTCGAGGACTGGATGTAGACGACCCCGAGTGGCAGGAAGCAGAGAAG
AAATTAGATCGAGCCTGGTACAACATGGGAGAGGGTGAAACCGACGAATCGGATCCGTTC
GCCGGCACCAGCGCTGAGTATATAGCGAAGAAAGAAGAACAGATAGAGAAGAGAAGGAAC
CGGAAGGTGTCAGCGGCGAGGCAGCAGATAGACAGAGACAACGAGCTGTGGGAGAGGAAC
CGCATGCTCACCAGCGGGGTGGTGCACTCCATCAACGTCAACAACGACCTCGACGAGGAG
AACGTAGACCGAGTACATCTTCTAGTACACAATATCGTTCCGCCATTCCTCGACGGCAGG
ATAGTGTTCACTAAACAACCTGAGCCTGTTATACCGGTCAAAGACCCGACATCGGACATG
GCGATAAATGCTAGGAAAGGGTCCGCTTTAGTGAAGGCGTTCAGAGAACAGAAGGAAAGA
AGAAGGGCACAGAAGAAACATTGGAAGTTGGAGGGAACGAAGATTGGTAACATAATGGGC
ATACAGAAACAAGAGGAAGAAATAGAAGACGGACCCACGAAACAGGCGTACAAATACGCC
GAGCACTTGGATAAAGCGGGCGAGGAAGCGGAGTCCAAATCAGATTTCGTCAAGAAGTTG
TCTATAACGGAGCAGAGACGTTTCTTGCCCGTGTTCGCTGTCAGAGAACAACTCATGCAG
GTGGTGAGGGAGAACAACGTTATCATTATAGTCGGTGAAACCGGAAGTGGTAAGACGACC
CAACTGACACAATACCTCCACGAGGACGGTTACAGTAAGATGGGCGCCATCGGCTGTACG
CAACCCAGACGCGTGGCCGCCATGTCCGTGGCCAAGAGAGTTGCTGATGAAATGGGAGTT
AAATTAGGTGAGGAAGTTGGTTACGCGATACGTTTCGAGGACTGCACCAACCCGTCCACG
GTCATCAAGTACATGACGGACGGGATCCTGCTGCGGGAGGGTCTGCGGGACCCCGACCTC
GACCAGTACAGCGCCATCATCATGGACGAGGCGCACGAGAGGTCTCTCTCCACTGACATG
CTGTTCGGACTCCTGAGAGAGGTAAGTTACCTTTAA

Protein sequence:

MSDSEENLHRLEGTPNNAPGGLIIKKKDKPPDFQFAKPSLLGLDKLAAAKRMQNRLISFQ
NEDNYGDDIEDKSGSSGVRERKYRKHNEETPTYTGGLSEQARARMLERLEMKEKKAREKG
VHNSTLEEKKSHSKDEENSRFHNYGRGHRDKDRRRDYDRRDRDRYRDRNDRNKSERDRRR
DSERRERDRDTDRDSSRRSYYEPRFKDEPRTPSIKALKPTDKTAWDDDDDDPKAVRKSSW
DFPTPLPRDLADRSARSERKPTRDYKGRAYEDTIRATPHKWVSSRRGLDVDDPEWQEAEK
KLDRAWYNMGEGETDESDPFAGTSAEYIAKKEEQIEKRRNRKVSAARQQIDRDNELWERN
RMLTSGVVHSINVNNDLDEENVDRVHLLVHNIVPPFLDGRIVFTKQPEPVIPVKDPTSDM
AINARKGSALVKAFREQKERRRAQKKHWKLEGTKIGNIMGIQKQEEEIEDGPTKQAYKYA
EHLDKAGEEAESKSDFVKKLSITEQRRFLPVFAVREQLMQVVRENNVIIIVGETGSGKTT
QLTQYLHEDGYSKMGAIGCTQPRRVAAMSVAKRVADEMGVKLGEEVGYAIRFEDCTNPST
VIKYMTDGILLREGLRDPDLDQYSAIIMDEAHERSLSTDMLFGLLREVSYL