DPGLEAN19783 in OGS1.0

New model in OGS2.0DPOGS215089 
Genomic Positionscaffold1700:+ 9395-17869
See gene structure
CDS Length2211
Paired RNAseq reads  840
Single RNAseq reads  1940
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA007191 (0.0)
Best Drosophila hit  xeroderma pigmentosum D, isoform C (0.0)
Best Human hitTFIIH basal transcription factor complex helicase XPD subunit isoform 1 (0.0)
Best NR hit (blastp)  PREDICTED: similar to Xeroderma pigmentosum D CG9433-PA [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to Xeroderma pigmentosum D CG9433-PA [Tribolium castaneum] (0.0)
GeneOntology terms















  
GO:0000075 cell cycle checkpoint
GO:0005675 holo TFIIH complex
GO:0006283 transcription-coupled nucleotide-excision repair
GO:0006366 transcription from RNA polymerase II promoter
GO:0006917 induction of apoptosis
GO:0006979 response to oxidative stress
GO:0008022 protein C-terminus binding
GO:0033683 nucleotide-excision repair, DNA incision
GO:0035315 hair cell differentiation
GO:0043139 5'-3' DNA helicase activity
GO:0045944 positive regulation of transcription from RNA polymerase II promoter
GO:0047485 protein N-terminus binding
GO:0019907 cyclin-dependent protein kinase activating kinase holoenzyme complex
GO:0004672 protein kinase activity
GO:0008094 DNA-dependent ATPase activity
GO:0008353 RNA polymerase II carboxy-terminal domain kinase activity
GO:0016563 transcription activator activity
InterPro families






  
IPR013020 DNA helicase (DNA repair), Rad3 type
IPR001945 Xeroderma pigmentosum group D protein
IPR014013 Helicase, superfamily 1/2, ATP-binding domain, DinG/Rad3-type
IPR002464 DNA/RNA helicase, ATP-dependent, DEAH-box type, conserved site
IPR010643 Domain of unknown function DUF1227
IPR010614 DEAD2
IPR006554 Helicase-like, DEXD box c2 type
IPR006555 Helicase, ATP-dependent, c2 type
Orthology groupMCL11531

Nucleotide sequence:

ATGCTGGAACTTAAACGAGCCCTTGATGCTAAGGGCCACGGATTACTTGAAATGCCTTCA
GGGACTGGTAAAACTATATCCTTGTTATCGCTTATTGTGGCTTACATGATACAAAACCCA
CATCACGTCAGAAAACTCATCTATTGTTCCCGAACTGTACCTGAAATAGAAAAAGTCTTA
GAGGAACTTAAGAATCTTATAAAATATTATGAAAAGTCTCAAGGTGAGAAGCCGAGCTTG
ACGGGCGTTGTGCTCAGTTCAAGGAAAAACTTGTGCATACATCCAGAGGTATCAAGAGAG
CGTGAGGGGAAGCTGGTTGATGGGAAATGTCATTCGCTAACGGCCAGTTACATCAGAGAC
AGACACGAACAGGACCCTTCAGTGCCCATATGTCAATTCTATGAGGGTTTTAACCGTGAG
GGTCGCGAGTCCATGCTGCCGTATGGAGTGTACACTATGGATGACCTCAAACAATACGGA
GCTGACAGGAACTGGTGCCCCTACTTCCTGTCTAGATTCGCTATAATCCACGCTGAGATA
GTTGTGTACTCGTACCACTACTTATTAGATCCTAAGATAGCTGAAGTGGTATCAAAAGAA
CTGAACAAGGAGGCTGTGGTGGTGTTCGATGAGGCACATAATATAGATAATGTTTGTATC
GACTCTCTAAGTGTGAAGATCACGAGGCGGACTATCGATAAGAGCACGCAAGCACTACAG
ACGCTAGAAAAAGCTGTGTCACAATTAAAACAAGAGGACGAGGCGCGCCTGGCGCTGGAG
TACGAGCAGATGGTGGAGGGTCTGAGGGAGGCGGCGCAGCTGAGGGACAGTGACGTCATA
CTGGGCAACCCTGTACTACCTGATGAACTGCTCAACGAGGTGGTCCCTGGCAACATCAGG
AACGCGGTCCACTTCCTCGGGTTCTTGAAGCGGTTCATAGAATACTTGAAGACGAGGCTG
CGGATACAGCACGTGGTGCAGGAGTCGCCGGCCGGTTTCTTAAAGGACGTGTCGTCTCGC
GTGTGTATCGAGCGCAAGCCTCTCCGTTTCGTGTCGTCGCGGCTCCAGACCCTGATGAAG
ACCCTCCAGATCCCGGACCCCTCGAACTTCGGCTCCTTAACACTAGTGGCGCACCTGGCG
ACGCTCGTGTCCACGTACACCAAGGGCTTCGTCATCATCATAGAGCCCTTCGATGACAAA
ACCCCGACCGTCTCCAATCCAATACTACACTTCTCATGTATGGACTCGTCGATAGCCATG
CGGCCAGTGTTCGGTAGATTTCAAACTGTCATCATCACTTCCGGTACGCTATCTCCCCTG
GACATGTATCCCAAGATCCTGGACTTTAACCCCGTAGTAATGAGCTCCTTCACTATGACG
CTCGCCCGACCTTGCATACTGCCCATGATAGTGTCCAAAGGTAGCGACCAAGTGGCGATT
TCTTCAAAGTACGAGACACGAGAAGACGTCGCGGTGATAAGGAACTACGGACAACTACTA
GTAGAGATATCAGCCTGCGTGCCGGACGGGGTGGTGTGCTTCTTCACTTCGTATCTGTAC
CTGGAGAGCGTGGTCGGAGCTTGGTATGATCAGGGTGTCGTCGCCAATTTACAGAAACAC
AAGCTGCTGTTTATCGAGACGCAGGACTCGGCGGAGACCAGCTTCGCCTTAATAAACTAC
ATTAAGGCGTGCGAGAGCGGTCGTGGGGCGGTGTTGCTATCGGTGGCGCGCGGCAAGGTC
TCGGAGGGAGTGGACTTCGACCATCACCTCGGACGGGCGGTCCTCATGTTCGGGATACCT
TACGTGTTCACTCAGAGCAGGATATTAAAGGCCCGTCTAGAGTACCTGAGAGATCAGTTC
CAGATCCGTGAGAACGATTTCCTAACGTTCGACGCGATGCGTCACGCGGCTCAGTGTGTT
GGCCGAGCGTTGAGAGGCAAGACGGACTACGGTATAATGATATTCGCTGACAAGCGCTTC
AGTCGCTCGGACAAGAGAAGTAAGCTACCGCGGTGGATACAAGAACATCTGAGGGACTCG
CTCTGCAACCTCAGTACCGAGGAAGCCGTACAGATAAGTAAGCGTTGGCTCCGCCAGATG
TCGCAGCCGTTCAGCCGCGAGGACCAGCTGGGAGTGTCGCTGTTGACGCTCCAGCAGTTA
CAGAGCAAGGAGCAGCAGGAGAAGATCGAGAAGCAGGTCCTCCAGAAGTAG

Protein sequence:

MLELKRALDAKGHGLLEMPSGTGKTISLLSLIVAYMIQNPHHVRKLIYCSRTVPEIEKVL
EELKNLIKYYEKSQGEKPSLTGVVLSSRKNLCIHPEVSREREGKLVDGKCHSLTASYIRD
RHEQDPSVPICQFYEGFNREGRESMLPYGVYTMDDLKQYGADRNWCPYFLSRFAIIHAEI
VVYSYHYLLDPKIAEVVSKELNKEAVVVFDEAHNIDNVCIDSLSVKITRRTIDKSTQALQ
TLEKAVSQLKQEDEARLALEYEQMVEGLREAAQLRDSDVILGNPVLPDELLNEVVPGNIR
NAVHFLGFLKRFIEYLKTRLRIQHVVQESPAGFLKDVSSRVCIERKPLRFVSSRLQTLMK
TLQIPDPSNFGSLTLVAHLATLVSTYTKGFVIIIEPFDDKTPTVSNPILHFSCMDSSIAM
RPVFGRFQTVIITSGTLSPLDMYPKILDFNPVVMSSFTMTLARPCILPMIVSKGSDQVAI
SSKYETREDVAVIRNYGQLLVEISACVPDGVVCFFTSYLYLESVVGAWYDQGVVANLQKH
KLLFIETQDSAETSFALINYIKACESGRGAVLLSVARGKVSEGVDFDHHLGRAVLMFGIP
YVFTQSRILKARLEYLRDQFQIRENDFLTFDAMRHAAQCVGRALRGKTDYGIMIFADKRF
SRSDKRSKLPRWIQEHLRDSLCNLSTEEAVQISKRWLRQMSQPFSREDQLGVSLLTLQQL
QSKEQQEKIEKQVLQK