New model in OGS2.0 | DPOGS215089  |
---|---|
Genomic Position | scaffold1700:+ 9395-17869 |
CDS Length | 2211 |
Paired RNAseq reads   | 840 |
Single RNAseq reads   | 1940 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA007191 (0.0) |
Best Drosophila hit   | xeroderma pigmentosum D, isoform C (0.0) |
Best Human hit | TFIIH basal transcription factor complex helicase XPD subunit isoform 1 (0.0) |
Best NR hit (blastp)   | PREDICTED: similar to Xeroderma pigmentosum D CG9433-PA [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to Xeroderma pigmentosum D CG9433-PA [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0000075 cell cycle checkpoint; GO:0005675 holo TFIIH complex; GO:0006283 transcription-coupled nucleotide-excision repair; GO:0006366 transcription from RNA polymerase II promoter; GO:0006917 induction of apoptosis; GO:0006979 response to oxidative stress; GO:0008022 protein C-terminus binding; GO:0033683 nucleotide-excision repair, DNA incision; GO:0035315 hair cell differentiation; GO:0043139 5'-3' DNA helicase activity; GO:0045944 positive regulation of transcription from RNA polymerase II promoter; GO:0047485 protein N-terminus binding; GO:0019907 cyclin-dependent protein kinase activating kinase holoenzyme complex; GO:0004672 protein kinase activity; GO:0008094 DNA-dependent ATPase activity; GO:0008353 RNA polymerase II carboxy-terminal domain kinase activity; GO:0016563 transcription activator activity |
InterPro families    | IPR013020 DNA helicase (DNA repair), Rad3 type; IPR001945 Xeroderma pigmentosum group D protein; IPR014013 Helicase, superfamily 1/2, ATP-binding domain, DinG/Rad3-type; IPR002464 DNA/RNA helicase, ATP-dependent, DEAH-box type, conserved site; IPR010643 Domain of unknown function DUF1227; IPR010614 DEAD2; IPR006554 Helicase-like, DEXD box c2 type; IPR006555 Helicase, ATP-dependent, c2 type |
Orthology group | MCL11531 |
Nucleotide sequence:
ATGCTGGAACTTAAACGAGCCCTTGATGCTAAGGGCCACGGATTACTTGAAATGCCTTCA
GGGACTGGTAAAACTATATCCTTGTTATCGCTTATTGTGGCTTACATGATACAAAACCCA
CATCACGTCAGAAAACTCATCTATTGTTCCCGAACTGTACCTGAAATAGAAAAAGTCTTA
GAGGAACTTAAGAATCTTATAAAATATTATGAAAAGTCTCAAGGTGAGAAGCCGAGCTTG
ACGGGCGTTGTGCTCAGTTCAAGGAAAAACTTGTGCATACATCCAGAGGTATCAAGAGAG
CGTGAGGGGAAGCTGGTTGATGGGAAATGTCATTCGCTAACGGCCAGTTACATCAGAGAC
AGACACGAACAGGACCCTTCAGTGCCCATATGTCAATTCTATGAGGGTTTTAACCGTGAG
GGTCGCGAGTCCATGCTGCCGTATGGAGTGTACACTATGGATGACCTCAAACAATACGGA
GCTGACAGGAACTGGTGCCCCTACTTCCTGTCTAGATTCGCTATAATCCACGCTGAGATA
GTTGTGTACTCGTACCACTACTTATTAGATCCTAAGATAGCTGAAGTGGTATCAAAAGAA
CTGAACAAGGAGGCTGTGGTGGTGTTCGATGAGGCACATAATATAGATAATGTTTGTATC
GACTCTCTAAGTGTGAAGATCACGAGGCGGACTATCGATAAGAGCACGCAAGCACTACAG
ACGCTAGAAAAAGCTGTGTCACAATTAAAACAAGAGGACGAGGCGCGCCTGGCGCTGGAG
TACGAGCAGATGGTGGAGGGTCTGAGGGAGGCGGCGCAGCTGAGGGACAGTGACGTCATA
CTGGGCAACCCTGTACTACCTGATGAACTGCTCAACGAGGTGGTCCCTGGCAACATCAGG
AACGCGGTCCACTTCCTCGGGTTCTTGAAGCGGTTCATAGAATACTTGAAGACGAGGCTG
CGGATACAGCACGTGGTGCAGGAGTCGCCGGCCGGTTTCTTAAAGGACGTGTCGTCTCGC
GTGTGTATCGAGCGCAAGCCTCTCCGTTTCGTGTCGTCGCGGCTCCAGACCCTGATGAAG
ACCCTCCAGATCCCGGACCCCTCGAACTTCGGCTCCTTAACACTAGTGGCGCACCTGGCG
ACGCTCGTGTCCACGTACACCAAGGGCTTCGTCATCATCATAGAGCCCTTCGATGACAAA
ACCCCGACCGTCTCCAATCCAATACTACACTTCTCATGTATGGACTCGTCGATAGCCATG
CGGCCAGTGTTCGGTAGATTTCAAACTGTCATCATCACTTCCGGTACGCTATCTCCCCTG
GACATGTATCCCAAGATCCTGGACTTTAACCCCGTAGTAATGAGCTCCTTCACTATGACG
CTCGCCCGACCTTGCATACTGCCCATGATAGTGTCCAAAGGTAGCGACCAAGTGGCGATT
TCTTCAAAGTACGAGACACGAGAAGACGTCGCGGTGATAAGGAACTACGGACAACTACTA
GTAGAGATATCAGCCTGCGTGCCGGACGGGGTGGTGTGCTTCTTCACTTCGTATCTGTAC
CTGGAGAGCGTGGTCGGAGCTTGGTATGATCAGGGTGTCGTCGCCAATTTACAGAAACAC
AAGCTGCTGTTTATCGAGACGCAGGACTCGGCGGAGACCAGCTTCGCCTTAATAAACTAC
ATTAAGGCGTGCGAGAGCGGTCGTGGGGCGGTGTTGCTATCGGTGGCGCGCGGCAAGGTC
TCGGAGGGAGTGGACTTCGACCATCACCTCGGACGGGCGGTCCTCATGTTCGGGATACCT
TACGTGTTCACTCAGAGCAGGATATTAAAGGCCCGTCTAGAGTACCTGAGAGATCAGTTC
CAGATCCGTGAGAACGATTTCCTAACGTTCGACGCGATGCGTCACGCGGCTCAGTGTGTT
GGCCGAGCGTTGAGAGGCAAGACGGACTACGGTATAATGATATTCGCTGACAAGCGCTTC
AGTCGCTCGGACAAGAGAAGTAAGCTACCGCGGTGGATACAAGAACATCTGAGGGACTCG
CTCTGCAACCTCAGTACCGAGGAAGCCGTACAGATAAGTAAGCGTTGGCTCCGCCAGATG
TCGCAGCCGTTCAGCCGCGAGGACCAGCTGGGAGTGTCGCTGTTGACGCTCCAGCAGTTA
CAGAGCAAGGAGCAGCAGGAGAAGATCGAGAAGCAGGTCCTCCAGAAGTAG
Protein sequence:
MLELKRALDAKGHGLLEMPSGTGKTISLLSLIVAYMIQNPHHVRKLIYCSRTVPEIEKVL
EELKNLIKYYEKSQGEKPSLTGVVLSSRKNLCIHPEVSREREGKLVDGKCHSLTASYIRD
RHEQDPSVPICQFYEGFNREGRESMLPYGVYTMDDLKQYGADRNWCPYFLSRFAIIHAEI
VVYSYHYLLDPKIAEVVSKELNKEAVVVFDEAHNIDNVCIDSLSVKITRRTIDKSTQALQ
TLEKAVSQLKQEDEARLALEYEQMVEGLREAAQLRDSDVILGNPVLPDELLNEVVPGNIR
NAVHFLGFLKRFIEYLKTRLRIQHVVQESPAGFLKDVSSRVCIERKPLRFVSSRLQTLMK
TLQIPDPSNFGSLTLVAHLATLVSTYTKGFVIIIEPFDDKTPTVSNPILHFSCMDSSIAM
RPVFGRFQTVIITSGTLSPLDMYPKILDFNPVVMSSFTMTLARPCILPMIVSKGSDQVAI
SSKYETREDVAVIRNYGQLLVEISACVPDGVVCFFTSYLYLESVVGAWYDQGVVANLQKH
KLLFIETQDSAETSFALINYIKACESGRGAVLLSVARGKVSEGVDFDHHLGRAVLMFGIP
YVFTQSRILKARLEYLRDQFQIRENDFLTFDAMRHAAQCVGRALRGKTDYGIMIFADKRF
SRSDKRSKLPRWIQEHLRDSLCNLSTEEAVQISKRWLRQMSQPFSREDQLGVSLLTLQQL
QSKEQQEKIEKQVLQK
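
Consistency check (illustrative sketch, not part of the original record): the listed CDS length and genomic coordinates can be sanity-checked with a few lines of Python. The function names below are hypothetical. The sketch assumes the CDS length includes the stop codon and that the genomic coordinates are 1-based and inclusive; neither convention is stated in the record.

```python
# Standard stop codons; the nucleotide sequence above ends in TAG.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def expected_protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS whose length includes the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


def looks_like_complete_cds(cds: str) -> bool:
    """True if the sequence starts with ATG, ends with a stop codon,
    and has a length that is a multiple of 3."""
    cds = cds.upper()
    return (
        len(cds) >= 6
        and len(cds) % 3 == 0
        and cds.startswith("ATG")
        and cds[-3:] in STOP_CODONS
    )


cds_length = 2211                 # "CDS Length" field of the record
genomic_span = 17869 - 9395 + 1   # assumes 1-based inclusive coordinates

print(expected_protein_length(cds_length))  # 736
print(genomic_span)                         # 8475
```

Under these assumptions, a 2211 nt CDS corresponds to 737 codons, that is 736 residues plus a stop codon. The genomic span of 8475 nt is larger than the CDS, which would be consistent with a multi-exon gene model.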