DPGLEAN06048 in OGS1.0

New model in OGS2.0DPOGS213517 
Genomic Positionscaffold271:+ 90664-100943
See gene structure
CDS Length3144
Paired RNAseq reads  1201
Single RNAseq reads  3372
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011797 (0.0)
Best Drosophila hit  haywire, isoform B (0.0)
Best Human hitTFIIH basal transcription factor complex helicase XPB subunit (0.0)
Best NR hit (blastp)  haywire, isoform A [Drosophila melanogaster] (0.0)
Best NR hit (blastx)  AGAP012169-PA [Anopheles gambiae str. PEST] (0.0)
GeneOntology terms






























  
GO:0005675 holo TFIIH complex
GO:0006366 transcription from RNA polymerase II promoter
GO:0006917 induction of apoptosis
GO:0006283 transcription-coupled nucleotide-excision repair
GO:0047485 protein N-terminus binding
GO:0008134 transcription factor binding
GO:0045944 positive regulation of transcription from RNA polymerase II promoter
GO:0008353 RNA polymerase II carboxy-terminal domain kinase activity
GO:0000717 nucleotide-excision repair, DNA duplex unwinding
GO:0043138 3'-5' DNA helicase activity
GO:0033683 nucleotide-excision repair, DNA incision
GO:0008094 DNA-dependent ATPase activity
GO:0006979 response to oxidative stress
GO:0045449 regulation of transcription
GO:0003677 DNA binding
GO:0016787 hydrolase activity
GO:0008104 protein localization
GO:0035315 hair cell differentiation
GO:0005524 ATP binding
GO:0006265 DNA topological change
GO:0006289 nucleotide-excision repair
GO:0008022 protein C-terminus binding
GO:0009411 response to UV
GO:0016887 ATPase activity
GO:0004672 protein kinase activity
GO:0000075 cell cycle checkpoint
GO:0009650 UV protection
GO:0004386 helicase activity
GO:0006281 DNA repair
GO:0005634 nucleus
GO:0004003 ATP-dependent DNA helicase activity
GO:0000166 nucleotide binding
InterPro families



  
IPR001161 Xeroderma pigmentosum group B protein (XP-B)
IPR014001 DEAD-like helicase
IPR001650 Helicase, C-terminal
IPR018607 Chromosome transmission fidelity protein 8
IPR006935 UvrABC complex, subunit B
Orthology groupMCL12081

Nucleotide sequence:

ATGGGACCCCCTAAAAAGTTCAGGAAATATGACTCCAAAGGTGGAAGCGACAGATCTGGT
AAAAAGAAAAAAGTAGATGAGGAAGTGACAATTGATTTAGTGGATGATGATAACCCTGAA
AGTTCTGGAGTGCCGGGAGCGGCCCTGCAGGATGCTGAGAAAAACGACCAGGTCCCTGAA
GATGAGTTTGGAGCTAAAGATTATAGAAATCAAATGGAACTCAAACCTGACAATGCCAGT
CGTCCGCTGTGGGTGGCTCCTAATGGTCATATATTTCTCGAGTCATTTTCGCCAGTCTAT
AAACATGCTCATGACTTTTTAATTGCTATCGCTGAGCCAGTGTCAAGGCCTCAGCACATT
CATGAGTATAAATTAACAGCGTACAGTTTATATGCAGCAGTTTCTGTGGGTTTGCAAACA
AATGACATAATTGAATATTTACAACGTCTCAGTAAGTGTAACGTTCCGGCCGGTATCATA
GAATTCATCACACTTTGTACTTTGTCTTATGGCAAAGTTAAGCTTGTACTGAAACATAAC
AGATATCTAGTGGAGAGTAAGCACGTGGATGTTCTTCAGAAGCTCCTCAAGGATCCCGTG
ATCCAGCAGTGTAGACTGAGACGAGACGGAGATGAAGAACTGGTTACCTCCGCTCTACCT
ACTACGGCGCCCGCACCGCCCGGGACCGCAGTTAAGACTGGTCGGGTGATACAAAAACGC
TGCATCGAGCTGGAGTATCCGCTGCTGGCGGAGTACGACTTCCGCAACGACGCCGTCAAC
CCTGACATTAACATTGATCTAAAGCCTACAGCAGTGCTGCGACCCTACCAGGAGAAGAGT
CTCAGGAAGATGTTCGGAAACGGCCGAGCGAGGTCAGGTGTGATAGTGTTGCCGTGCGGC
GCGGGCAAGTCCTTGGTGGGCGTGACGGCCGTGTGCACGGTCCGCAAGAGGGCGCTGGTG
CTGTGCAACTCAGGAGTCTCCGTGGAACAATGGAAACAGCAGTTCAAGTGTTGGTCCACC
GCCGACGACAGTATGATATGCAGGTTCACGTCGGAGGCCAAGGACAAGCCGATGGGCGCC
GGCATCCTGATCACGACCTACTCCATGATAACGCACGGCCAGCGCCGCTCGTGGGAGGCC
GAGCAGACCATGAAGTGGCTACAGGCGCAGGAGTGGGGGCTCGTGGTGCTGGACGAGGTG
CACACCATCCCCGCCAAGATGTTCCGCCGGGTGCTCACCATAGTGCACTCACACGCCAAG
CTGGGTTTGACGGCGACGCTACTCCGCGAGGACGACAAGATAGCCGACCTGAACTTCCTG
ATCGGCCCCAAGCTGTACGAAGCCAACTGGTTGGAGCTGCAAGCCAACGGCTACATCGCC
AGGGTCCAGTGCGCCGAGGTCTGGTGTCCCATGACGCCCGAGTTCTACCGGGAGTACCTC
GTGCAGAAGATCAATAAGAAAATGTTGCTGTATGTAATGAACCCGTCCAAATTCCGCGCC
TGCCAGTTCCTCGTCCGCTACCACGAACGCCGCGGGGACAAGACCATAGTGTTCTCGGAC
AACGTGTTCGCCCTCAGACACTACGCCGTCAAAATGAACAAGCCCTACATCTACGGCCCG
ACGTCCCAGAACGAGAGGATACAGATCCTTCAAAACTTCAAGTTCAACCCTAAAGTTAAC
ACGATTTTCGTCAGCAAAGTCGCCGACACCAGCTTCGACCTGCCCGAGGCGAACGTTCTC
ATACAAATCTCCTCGCACGGCGGCTCTAGGAGACAAGAAGCGCAACGTTTGGGTCGTATA
TTAAGAGCCAAGAAGGGTGCACTAGCGGAGGAGTACAACGCATTTTTCTACACACTAGTA
TCACAGGATACTTTGGAGATGGCGTACAGTCGCAAGAGACAGCGGTTCCTTGTGAACCAG
GGTTACAGTTACAAGGTTATTACAGAATTGAAGGGCATGGACCAGGAGCCCGATCTGTTG
TACGGAACTCGGGAGGAACAAGGGATGCTGCTGCAACAAGTTCTCGCGGCGTCAGAGACG
GACTGCGAGGAGGAGAGGGAGGGTGGAGCGGGCGGTGCGGGGAGCGCGGGCGGGGCGAGG
CGCACCGCGGGGTCATTGGCCTCGCTGGCGGGCGCCGACGACGCTCTGTACCTGGAACAC
AGGCGCTCCTCGCACCACAACAAGCACCCACTGTTCAAGAACGAGAACGAGACGGGCGGA
ATATCAGAGTGGGCGATAGTGGAACTGCAGGGTCTCGTGCAGGTGGAGGGAGACGATCGC
GGCGGACCCGCGGTGGTGGGGGACCTGCATTACTTCAAAAGAAACCGACATCCCGTGCTC
GTGCTCGGCCACCACGTGCTCACCGGCAAGGAGGTCAAGCTGGAGCAACCTATGGCAGTC
ATGGAGAAGACTGTGGACGGAGGTCAAACTTCGTACAGAGTCAAGGCGATCGTTAGAAAG
AAACTACTCTTTAAATCGAGACCTAAACCCATCATATCAAACGAGGAAAGTGAGCGAAGT
CTTCCCCGGTGCGAGGACGGCGAGGCGTGCTCGGTGTTGCTGCGCCGCTACTGGCGCCCC
CCGGCCCTGGTGCGGCTCTGCCGGTGCTCCCGACGCACGCGCTGCGATAAGATCGCGTCA
GGAGACAGACTGGTGGAACTGAACAACCGATCGGACTTGCAGGTGCGAGACAATATCACT
CTCAAAGTGGTTGTAAACCGGCGTGAATGGCCGGAGTGTTCCATCAACGAAGCGCCTCTC
AAGATCGAAACAGCGTACGAGCGCATGAGTCCCGATGAGATCGAACTATTGCACCGCCAG
AGCATACAGCTCGCGCCGCCGAGGATACGCCTCCGGTGCCTTTGCCCGAAACCGAACTAT
TGGAAATTAAAAACCGAAGACAGCGACACAAACCTAACGTATCGCTGCTCGTCTCTGCCG
CTCTGTAAAACCGGTGACGTCTGCGGGAACGTGGACGACGTCCTGCTATCTCTGTATCAG
TCGTGCCTGTGTCCCAAAAACCACATCTGCGTGCACAGCGGCGGAAGAACGCAGATCCAG
ATCTCGGAGCCGCTGTATCGAGGGAGGGGCTGGCGTGCCCGCTGTCAAGCTCTAAGTGAC
GAGGATAGCTACGAGGATTACTGA

Protein sequence:

MGPPKKFRKYDSKGGSDRSGKKKKVDEEVTIDLVDDDNPESSGVPGAALQDAEKNDQVPE
DEFGAKDYRNQMELKPDNASRPLWVAPNGHIFLESFSPVYKHAHDFLIAIAEPVSRPQHI
HEYKLTAYSLYAAVSVGLQTNDIIEYLQRLSKCNVPAGIIEFITLCTLSYGKVKLVLKHN
RYLVESKHVDVLQKLLKDPVIQQCRLRRDGDEELVTSALPTTAPAPPGTAVKTGRVIQKR
CIELEYPLLAEYDFRNDAVNPDINIDLKPTAVLRPYQEKSLRKMFGNGRARSGVIVLPCG
AGKSLVGVTAVCTVRKRALVLCNSGVSVEQWKQQFKCWSTADDSMICRFTSEAKDKPMGA
GILITTYSMITHGQRRSWEAEQTMKWLQAQEWGLVVLDEVHTIPAKMFRRVLTIVHSHAK
LGLTATLLREDDKIADLNFLIGPKLYEANWLELQANGYIARVQCAEVWCPMTPEFYREYL
VQKINKKMLLYVMNPSKFRACQFLVRYHERRGDKTIVFSDNVFALRHYAVKMNKPYIYGP
TSQNERIQILQNFKFNPKVNTIFVSKVADTSFDLPEANVLIQISSHGGSRRQEAQRLGRI
LRAKKGALAEEYNAFFYTLVSQDTLEMAYSRKRQRFLVNQGYSYKVITELKGMDQEPDLL
YGTREEQGMLLQQVLAASETDCEEEREGGAGGAGSAGGARRTAGSLASLAGADDALYLEH
RRSSHHNKHPLFKNENETGGISEWAIVELQGLVQVEGDDRGGPAVVGDLHYFKRNRHPVL
VLGHHVLTGKEVKLEQPMAVMEKTVDGGQTSYRVKAIVRKKLLFKSRPKPIISNEESERS
LPRCEDGEACSVLLRRYWRPPALVRLCRCSRRTRCDKIASGDRLVELNNRSDLQVRDNIT
LKVVVNRREWPECSINEAPLKIETAYERMSPDEIELLHRQSIQLAPPRIRLRCLCPKPNY
WKLKTEDSDTNLTYRCSSLPLCKTGDVCGNVDDVLLSLYQSCLCPKNHICVHSGGRTQIQ
ISEPLYRGRGWRARCQALSDEDSYEDY