DPGLEAN14814 in OGS1.0

New model in OGS2.0  DPOGS208948
Genomic Position  scaffold31:+ 276416-283424
CDS Length  3840
Paired RNAseq reads  293
Single RNAseq reads  748
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA002418 (4e-174)
Best Drosophila hit  mutagen-sensitive 308 (0.0)
Best Human hit  DNA polymerase theta (9e-178)
Best NR hit (blastp)  PREDICTED: similar to DNA polymerase theta [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to DNA polymerase theta [Tribolium castaneum] (0.0)
GeneOntology terms
GO:0006281 DNA repair
GO:0004386 helicase activity
GO:0003887 DNA-directed DNA polymerase activity
GO:0005524 ATP binding
GO:0006260 DNA replication
GO:0008026 ATP-dependent helicase activity
GO:0003677 DNA binding
GO:0006289 nucleotide-excision repair
GO:0006303 double-strand break repair via nonhomologous end joining
InterPro families
IPR002298 DNA polymerase A
IPR014001 DEAD-like helicase
IPR001650 Helicase, C-terminal
IPR001098 DNA-directed DNA polymerase, family A, palm domain
IPR019760 DNA-directed DNA polymerase, family A, conserved site
IPR011545 DNA/RNA helicase, DEAD/DEAH box type, N-terminal
Orthology group  MCL14343

Nucleotide sequence:

ATGTTCGATTGGCAAGTTGAATGTCTCAGCAATCCAAAAGTGCTTATAGATTGTCAAAAT
CTGTTATATTCGGCACCAACATCTGCTGGTAAGACACTTGTTGCTGAATTATTGACCATT
AAGACTGTTCTGGAAAGACAGAAAAAAGTCATAATCATATTACCCTTTGTATCAATTGTG
AGAGAGAAAATGTTTTATTTGCAAGACATATTATCTAGTTCAGGTATCAGGGTAGAAGGA
TTCATGGGCTCCCAGACTCCACCTGGTGGTTTACAGGCAGTACACATTGCGATATGTACA
ATTGAAAAAGCGAATAGTTTAATCAATAAACTTTTAGATGAAGGAAATATATCAGAATTG
GGTGCTGTAGTTGTTGATGAATTACATTTACTTGGAGATCCACATAGAGGATATATTCTG
GAGCTTCTTTTAACTAAAATTAAATATACAGCATCTAAATTAAATGATCTCTCAATACAA
ATAATAGGAATGTCTGCAACTTTACCAAATTTAAAAATGTTGGCGGATTGGTTGGAAGCT
CATTTATTTATAACAGAATTTCGGCCCATACCTCTAATTGAATCATGTTTGGTCGGAGAC
AAGTATTATAATAAAAAAGGTGAACACATAGGCATGCTGTGTAAGTCAAATTTAAAAGAA
ATTGATGATGATAGTGTCCTTTTGATTTGTCTGGAAACAATAAAAAGCAGTTGTTCTGTT
CTTATATTTTGTATGACTAAGAATAGATGTGAAAACTTAGCACAGAGCATTGCATCATCA
TTTTTTAAATTGGGTTGTATGAATAATGAACAAGGTATGATTTTAAGAGAACAATTAAAG
ACTTCAAGTATTCTCGAAGTTTTAGAACAATTGAAAGGTTGTCCTGTTGGTTTGGATCCA
GTATTAAAAAATATTATCTCATTTGGAGTTGCATATCATCATGCTGGACTTACATTCGAT
GAGAGGGACATAATAGAAGGGGCATTCAAATCTGGTGCTGTGAGAGTACTCGTTGCTACA
TCCACCTTGAGTTCCGGTGTTAATTTACCTGCTAGAAAAGTAATCATCAGGTGCCCCATG
TTCCAGAAGCAACCAATTAATATTTTGACCTATAAACAAATGGTTGGCAGAGCTGGGCGT
ATGGGAAAAGATACAAAGGGAGAAAGTATTCTAATATGCACTCCAAATGAACAAAAAATT
GGATTTGATCTGATGATGGGGGATCTGGATCCTGTAAAAAGTTGCATAGAGACTGAAGAT
AAATTTATGAGAGCTGTATTAGAAATGATTGCTAGTCAAGATGTTTGTACGGAAGAACAG
TTAGATTTGTACTCTAAAAGTACACTATTATTTAGCCAACAAAGTCTCCATCCATCCCAA
AACTTTTTATTAAATGACACTCTAAAGGAACTCGTCAATTATGAACTTGTGAGAATACAA
AAAGATGGAGAAGAAATAAGATATGTAGCCACTTCATTAGGGAAAGCCTGTTTGTCATCT
TCCATGTCGCCAAACGATGGAATATCTTTGTTTTGTGAGTTACAAAAAGCTCGACAATGT
TTAGTCTTAGAAACAGACTTACATCTTATTTATTTAGTGACGCCATATAGCGTTAGTAAT
CAATGGAATAATATAGATTGGTTACATCTGCTCACTCTTTGGGAAAGTCTCACATCCGCC
ATGAAAAGAGTTGGCGAGCTTGTTGGTGTCCAAGAGAGTTTTATAATTCGTTGCTTAAGG
GGAACAAACAAAAATAATAATAACCAAAATAAACTTAATATACATAAGAGATTTTATACA
GCACTAGCATTACAGGATTTAGTGAATGAAGTGCCACTCTCTGAAGTTGCTGGTAAATTT
CAGTGTGCTAGAGGTTTCTTACAAGGTTTACAGCAAGCTTCCGCTACATTTGCCGGAATG
GTAACATCATTTTGTCATCAACTTGGGTGGAAAAACATGGAAATGATTATATCGCAATTT
CAAGATCGTTTGCATTTTGGTATACATTCAGAGTTATTAGAACTCATGAAACTATCCTCC
CTAAACGGCGTTCGAGCGAGAACTTTATTTAATGCGGGTTTTGAAACTGTTGCAAGCATT
GCATCAGCTGAAGTTAATGTTATAGAAAATGCACTTCATAAATCCGTACCATTCCAAAGT
GAAAAACAAAGAGACGAAGATGATATGAGCGATTTAAGAAAAAGGAATAAAATCAAGAAT
ATATGGATAACAGGCTACTGTGGCGAACACGAGCAGATATTTAAAACAAAGATGTCGGAG
ATTCTATCAAACGATTCCCTTCAGTTGGATATGCTGTCGATAAAGACGTATTACGCTGAA
ATCAAGAAATATTTTGGAGTTAATTTGTCTTATTGTAACGACGTGTCTTTAGCTGAGTGG
CTTCTAGATAGTGAGGAGAAAATATCGACAATCGCTGATCTGGCGTTCAAGTACTGTGAT
CTAGATTTACAAAAGATGGAAATAAAAATTGACAATCAGATAAAAAGTTACAAATCCTTG
AACATGCATGAGATGAATTGTTTAAGGGCATGGTGTTTATGCGATATAGTAAAACAACAG
GAGAAAAAAATATCGCAAGAAACATTGGTCATGGAGAAGATCTTAAATACAGAGATCCAA
GTTTGCAAGATCCTTGGGGATTGCGAGTATCACGGCATTACGGTGGATAAAGATCTCGTG
TCGAGATTTTTGATTGATGTGAAAAATTCTCAAGAGATCTTACAGAAGAAGGCATTTAAG
ATATGCGGATACCATTTTAATTTCAACTCATCCAAGGATGTAGCTAAAGTTTTAGGACTT
TACAAGGGTCGTAAGACCAGCACTAGGAAGAGTGTTCTTTCGGCGCACAACAGTCCTATG
TCTAGTATTATAATATACTGGCGGAAACTCAACTCCATACTCACTAAGAGTCTTTATCCC
ATCACTGAACAAGCCTGTGTATACACTGAAGATAATAGGATATCTCCATCTTATACCATG
TACACATGCACGGGACGCATTAGCATGCACGAGCCGAATTTGCAAAACTTACCGCGGAAA
TTCACGATACCGGCAAACTATTTATGTGATAATGAATCTTGTGACGACGTAATAGAGTTC
AATTGTAGGAAAATATTCAGAGCAGCGCCCGGTTACGTTTTCATATCGGCTGATTACTGC
CAGTTGGAAATGAGGATTCTGACACACTTTTCCAAGGACGTTACTCTAACTAGGATAATG
GGTTCGGATGTTGACGTTTTTAAATCGATTGCAGCGTCTTGGAGTGGTGTGCCCGAGCAC
GAGGTAGACGAAGATTTACGTCATAAAGCCAAGCAGCTTTGTTACGGTATATTATACGGA
ATGGGTAATAGGACTCTGTCTCAACATTTAAACGTTACAGAATTAGAGGCTGCATATTTT
ATGGATATGTTTTATAAGACCTATCCATCGATAAAGGTTTTTACAGCGAGTCTGATAGAG
GAGTGTAGGAAGAAAGGTTACGTGGAAACTTTGATGAAGAGGAGAAGATATCTTCCTAAC
ATCAACAGCAGTGTTCCTTCAAAGAGGAGTGCAGCTGAAAGGCAAGCTGTTAACACGACC
ATCCAAGGATCGGCCGCAGACATAGCGAAGTCAGCGATGTGTTCCATACAACAAAGCACT
TCATCACGTCTGATATTACAAATGCACGATGAACTTATATACGAAGTACCGGTTAATAAT
AAACAAGATTTTATAGTTATTTTAAAAAAATCTATGGAAAATACCGTCCGTCTGAACGTA
CCTTTACCGGTCAAAATAAAGTGTGGGCAGACCTGGGGTACAATGGAGGACGTCAAATAA

Protein sequence:

MFDWQVECLSNPKVLIDCQNLLYSAPTSAGKTLVAELLTIKTVLERQKKVIIILPFVSIV
REKMFYLQDILSSSGIRVEGFMGSQTPPGGLQAVHIAICTIEKANSLINKLLDEGNISEL
GAVVVDELHLLGDPHRGYILELLLTKIKYTASKLNDLSIQIIGMSATLPNLKMLADWLEA
HLFITEFRPIPLIESCLVGDKYYNKKGEHIGMLCKSNLKEIDDDSVLLICLETIKSSCSV
LIFCMTKNRCENLAQSIASSFFKLGCMNNEQGMILREQLKTSSILEVLEQLKGCPVGLDP
VLKNIISFGVAYHHAGLTFDERDIIEGAFKSGAVRVLVATSTLSSGVNLPARKVIIRCPM
FQKQPINILTYKQMVGRAGRMGKDTKGESILICTPNEQKIGFDLMMGDLDPVKSCIETED
KFMRAVLEMIASQDVCTEEQLDLYSKSTLLFSQQSLHPSQNFLLNDTLKELVNYELVRIQ
KDGEEIRYVATSLGKACLSSSMSPNDGISLFCELQKARQCLVLETDLHLIYLVTPYSVSN
QWNNIDWLHLLTLWESLTSAMKRVGELVGVQESFIIRCLRGTNKNNNNQNKLNIHKRFYT
ALALQDLVNEVPLSEVAGKFQCARGFLQGLQQASATFAGMVTSFCHQLGWKNMEMIISQF
QDRLHFGIHSELLELMKLSSLNGVRARTLFNAGFETVASIASAEVNVIENALHKSVPFQS
EKQRDEDDMSDLRKRNKIKNIWITGYCGEHEQIFKTKMSEILSNDSLQLDMLSIKTYYAE
IKKYFGVNLSYCNDVSLAEWLLDSEEKISTIADLAFKYCDLDLQKMEIKIDNQIKSYKSL
NMHEMNCLRAWCLCDIVKQQEKKISQETLVMEKILNTEIQVCKILGDCEYHGITVDKDLV
SRFLIDVKNSQEILQKKAFKICGYHFNFNSSKDVAKVLGLYKGRKTSTRKSVLSAHNSPM
SSIIIYWRKLNSILTKSLYPITEQACVYTEDNRISPSYTMYTCTGRISMHEPNLQNLPRK
FTIPANYLCDNESCDDVIEFNCRKIFRAAPGYVFISADYCQLEMRILTHFSKDVTLTRIM
GSDVDVFKSIAASWSGVPEHEVDEDLRHKAKQLCYGILYGMGNRTLSQHLNVTELEAAYF
MDMFYKTYPSIKVFTASLIEECRKKGYVETLMKRRRYLPNINSSVPSKRSAAERQAVNTT
IQGSAADIAKSAMCSIQQSTSSRLILQMHDELIYEVPVNNKQDFIVILKKSMENTVRLNV
PLPVKIKCGQTWGTMEDVK
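
Consistency check (illustrative, not part of the database record): the nucleotide and protein listings above can be compared by translating the CDS. The sketch below is a minimal Python example that assumes the standard genetic code applies to this nuclear gene; the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here, not anything provided by the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS (whitespace ignored) up to the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last 60 nt of the nucleotide sequence listed above.
first_line = "ATGTTCGATTGGCAAGTTGAATGTCTCAGCAATCCAAAAGTGCTTATAGATTGTCAAAAT"
last_line = "CCTTTACCGGTCAAAATAAAGTGTGGGCAGACCTGGGGTACAATGGAGGACGTCAAATAA"

print(translate(first_line))  # MFDWQVECLSNPKVLIDCQN
print(translate(last_line))   # PLPVKIKCGQTWGTMEDVK

# A 3840 nt CDS holds 1280 codons: 1279 residues plus one stop codon.
print(3840 // 3 - 1)          # 1279
```

The two translated fragments match the start and end of the protein sequence above, and the expected length of 1279 residues agrees with the protein listing (21 full lines of 60 residues plus a final line of 19).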