New model in OGS2.0 | DPOGS208948  |
---|---|
Genomic Position | scaffold31:+ 276416-283424 |
CDS Length | 3840 |
Paired RNAseq reads   | 293 |
Single RNAseq reads   | 748 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA002418 (4e-174) |
Best Drosophila hit   | mutagen-sensitive 308 (0.0) |
Best Human hit | DNA polymerase theta (9e-178) |
Best NR hit (blastp)   | PREDICTED: similar to DNA polymerase theta [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to DNA polymerase theta [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0006281 DNA repair; GO:0004386 helicase activity; GO:0003887 DNA-directed DNA polymerase activity; GO:0005524 ATP binding; GO:0006260 DNA replication; GO:0008026 ATP-dependent helicase activity; GO:0003677 DNA binding; GO:0006289 nucleotide-excision repair; GO:0006303 double-strand break repair via nonhomologous end joining |
InterPro families    | IPR002298 DNA polymerase A; IPR014001 DEAD-like helicase; IPR001650 Helicase, C-terminal; IPR001098 DNA-directed DNA polymerase, family A, palm domain; IPR019760 DNA-directed DNA polymerase, family A, conserved site; IPR011545 DNA/RNA helicase, DEAD/DEAH box type, N-terminal |
Orthology group | MCL14343 |
Nucleotide sequence:
ATGTTCGATTGGCAAGTTGAATGTCTCAGCAATCCAAAAGTGCTTATAGATTGTCAAAAT
CTGTTATATTCGGCACCAACATCTGCTGGTAAGACACTTGTTGCTGAATTATTGACCATT
AAGACTGTTCTGGAAAGACAGAAAAAAGTCATAATCATATTACCCTTTGTATCAATTGTG
AGAGAGAAAATGTTTTATTTGCAAGACATATTATCTAGTTCAGGTATCAGGGTAGAAGGA
TTCATGGGCTCCCAGACTCCACCTGGTGGTTTACAGGCAGTACACATTGCGATATGTACA
ATTGAAAAAGCGAATAGTTTAATCAATAAACTTTTAGATGAAGGAAATATATCAGAATTG
GGTGCTGTAGTTGTTGATGAATTACATTTACTTGGAGATCCACATAGAGGATATATTCTG
GAGCTTCTTTTAACTAAAATTAAATATACAGCATCTAAATTAAATGATCTCTCAATACAA
ATAATAGGAATGTCTGCAACTTTACCAAATTTAAAAATGTTGGCGGATTGGTTGGAAGCT
CATTTATTTATAACAGAATTTCGGCCCATACCTCTAATTGAATCATGTTTGGTCGGAGAC
AAGTATTATAATAAAAAAGGTGAACACATAGGCATGCTGTGTAAGTCAAATTTAAAAGAA
ATTGATGATGATAGTGTCCTTTTGATTTGTCTGGAAACAATAAAAAGCAGTTGTTCTGTT
CTTATATTTTGTATGACTAAGAATAGATGTGAAAACTTAGCACAGAGCATTGCATCATCA
TTTTTTAAATTGGGTTGTATGAATAATGAACAAGGTATGATTTTAAGAGAACAATTAAAG
ACTTCAAGTATTCTCGAAGTTTTAGAACAATTGAAAGGTTGTCCTGTTGGTTTGGATCCA
GTATTAAAAAATATTATCTCATTTGGAGTTGCATATCATCATGCTGGACTTACATTCGAT
GAGAGGGACATAATAGAAGGGGCATTCAAATCTGGTGCTGTGAGAGTACTCGTTGCTACA
TCCACCTTGAGTTCCGGTGTTAATTTACCTGCTAGAAAAGTAATCATCAGGTGCCCCATG
TTCCAGAAGCAACCAATTAATATTTTGACCTATAAACAAATGGTTGGCAGAGCTGGGCGT
ATGGGAAAAGATACAAAGGGAGAAAGTATTCTAATATGCACTCCAAATGAACAAAAAATT
GGATTTGATCTGATGATGGGGGATCTGGATCCTGTAAAAAGTTGCATAGAGACTGAAGAT
AAATTTATGAGAGCTGTATTAGAAATGATTGCTAGTCAAGATGTTTGTACGGAAGAACAG
TTAGATTTGTACTCTAAAAGTACACTATTATTTAGCCAACAAAGTCTCCATCCATCCCAA
AACTTTTTATTAAATGACACTCTAAAGGAACTCGTCAATTATGAACTTGTGAGAATACAA
AAAGATGGAGAAGAAATAAGATATGTAGCCACTTCATTAGGGAAAGCCTGTTTGTCATCT
TCCATGTCGCCAAACGATGGAATATCTTTGTTTTGTGAGTTACAAAAAGCTCGACAATGT
TTAGTCTTAGAAACAGACTTACATCTTATTTATTTAGTGACGCCATATAGCGTTAGTAAT
CAATGGAATAATATAGATTGGTTACATCTGCTCACTCTTTGGGAAAGTCTCACATCCGCC
ATGAAAAGAGTTGGCGAGCTTGTTGGTGTCCAAGAGAGTTTTATAATTCGTTGCTTAAGG
GGAACAAACAAAAATAATAATAACCAAAATAAACTTAATATACATAAGAGATTTTATACA
GCACTAGCATTACAGGATTTAGTGAATGAAGTGCCACTCTCTGAAGTTGCTGGTAAATTT
CAGTGTGCTAGAGGTTTCTTACAAGGTTTACAGCAAGCTTCCGCTACATTTGCCGGAATG
GTAACATCATTTTGTCATCAACTTGGGTGGAAAAACATGGAAATGATTATATCGCAATTT
CAAGATCGTTTGCATTTTGGTATACATTCAGAGTTATTAGAACTCATGAAACTATCCTCC
CTAAACGGCGTTCGAGCGAGAACTTTATTTAATGCGGGTTTTGAAACTGTTGCAAGCATT
GCATCAGCTGAAGTTAATGTTATAGAAAATGCACTTCATAAATCCGTACCATTCCAAAGT
GAAAAACAAAGAGACGAAGATGATATGAGCGATTTAAGAAAAAGGAATAAAATCAAGAAT
ATATGGATAACAGGCTACTGTGGCGAACACGAGCAGATATTTAAAACAAAGATGTCGGAG
ATTCTATCAAACGATTCCCTTCAGTTGGATATGCTGTCGATAAAGACGTATTACGCTGAA
ATCAAGAAATATTTTGGAGTTAATTTGTCTTATTGTAACGACGTGTCTTTAGCTGAGTGG
CTTCTAGATAGTGAGGAGAAAATATCGACAATCGCTGATCTGGCGTTCAAGTACTGTGAT
CTAGATTTACAAAAGATGGAAATAAAAATTGACAATCAGATAAAAAGTTACAAATCCTTG
AACATGCATGAGATGAATTGTTTAAGGGCATGGTGTTTATGCGATATAGTAAAACAACAG
GAGAAAAAAATATCGCAAGAAACATTGGTCATGGAGAAGATCTTAAATACAGAGATCCAA
GTTTGCAAGATCCTTGGGGATTGCGAGTATCACGGCATTACGGTGGATAAAGATCTCGTG
TCGAGATTTTTGATTGATGTGAAAAATTCTCAAGAGATCTTACAGAAGAAGGCATTTAAG
ATATGCGGATACCATTTTAATTTCAACTCATCCAAGGATGTAGCTAAAGTTTTAGGACTT
TACAAGGGTCGTAAGACCAGCACTAGGAAGAGTGTTCTTTCGGCGCACAACAGTCCTATG
TCTAGTATTATAATATACTGGCGGAAACTCAACTCCATACTCACTAAGAGTCTTTATCCC
ATCACTGAACAAGCCTGTGTATACACTGAAGATAATAGGATATCTCCATCTTATACCATG
TACACATGCACGGGACGCATTAGCATGCACGAGCCGAATTTGCAAAACTTACCGCGGAAA
TTCACGATACCGGCAAACTATTTATGTGATAATGAATCTTGTGACGACGTAATAGAGTTC
AATTGTAGGAAAATATTCAGAGCAGCGCCCGGTTACGTTTTCATATCGGCTGATTACTGC
CAGTTGGAAATGAGGATTCTGACACACTTTTCCAAGGACGTTACTCTAACTAGGATAATG
GGTTCGGATGTTGACGTTTTTAAATCGATTGCAGCGTCTTGGAGTGGTGTGCCCGAGCAC
GAGGTAGACGAAGATTTACGTCATAAAGCCAAGCAGCTTTGTTACGGTATATTATACGGA
ATGGGTAATAGGACTCTGTCTCAACATTTAAACGTTACAGAATTAGAGGCTGCATATTTT
ATGGATATGTTTTATAAGACCTATCCATCGATAAAGGTTTTTACAGCGAGTCTGATAGAG
GAGTGTAGGAAGAAAGGTTACGTGGAAACTTTGATGAAGAGGAGAAGATATCTTCCTAAC
ATCAACAGCAGTGTTCCTTCAAAGAGGAGTGCAGCTGAAAGGCAAGCTGTTAACACGACC
ATCCAAGGATCGGCCGCAGACATAGCGAAGTCAGCGATGTGTTCCATACAACAAAGCACT
TCATCACGTCTGATATTACAAATGCACGATGAACTTATATACGAAGTACCGGTTAATAAT
AAACAAGATTTTATAGTTATTTTAAAAAAATCTATGGAAAATACCGTCCGTCTGAACGTA
CCTTTACCGGTCAAAATAAAGTGTGGGCAGACCTGGGGTACAATGGAGGACGTCAAATAA
Protein sequence:
MFDWQVECLSNPKVLIDCQNLLYSAPTSAGKTLVAELLTIKTVLERQKKVIIILPFVSIV
REKMFYLQDILSSSGIRVEGFMGSQTPPGGLQAVHIAICTIEKANSLINKLLDEGNISEL
GAVVVDELHLLGDPHRGYILELLLTKIKYTASKLNDLSIQIIGMSATLPNLKMLADWLEA
HLFITEFRPIPLIESCLVGDKYYNKKGEHIGMLCKSNLKEIDDDSVLLICLETIKSSCSV
LIFCMTKNRCENLAQSIASSFFKLGCMNNEQGMILREQLKTSSILEVLEQLKGCPVGLDP
VLKNIISFGVAYHHAGLTFDERDIIEGAFKSGAVRVLVATSTLSSGVNLPARKVIIRCPM
FQKQPINILTYKQMVGRAGRMGKDTKGESILICTPNEQKIGFDLMMGDLDPVKSCIETED
KFMRAVLEMIASQDVCTEEQLDLYSKSTLLFSQQSLHPSQNFLLNDTLKELVNYELVRIQ
KDGEEIRYVATSLGKACLSSSMSPNDGISLFCELQKARQCLVLETDLHLIYLVTPYSVSN
QWNNIDWLHLLTLWESLTSAMKRVGELVGVQESFIIRCLRGTNKNNNNQNKLNIHKRFYT
ALALQDLVNEVPLSEVAGKFQCARGFLQGLQQASATFAGMVTSFCHQLGWKNMEMIISQF
QDRLHFGIHSELLELMKLSSLNGVRARTLFNAGFETVASIASAEVNVIENALHKSVPFQS
EKQRDEDDMSDLRKRNKIKNIWITGYCGEHEQIFKTKMSEILSNDSLQLDMLSIKTYYAE
IKKYFGVNLSYCNDVSLAEWLLDSEEKISTIADLAFKYCDLDLQKMEIKIDNQIKSYKSL
NMHEMNCLRAWCLCDIVKQQEKKISQETLVMEKILNTEIQVCKILGDCEYHGITVDKDLV
SRFLIDVKNSQEILQKKAFKICGYHFNFNSSKDVAKVLGLYKGRKTSTRKSVLSAHNSPM
SSIIIYWRKLNSILTKSLYPITEQACVYTEDNRISPSYTMYTCTGRISMHEPNLQNLPRK
FTIPANYLCDNESCDDVIEFNCRKIFRAAPGYVFISADYCQLEMRILTHFSKDVTLTRIM
GSDVDVFKSIAASWSGVPEHEVDEDLRHKAKQLCYGILYGMGNRTLSQHLNVTELEAAYF
MDMFYKTYPSIKVFTASLIEECRKKGYVETLMKRRRYLPNINSSVPSKRSAAERQAVNTT
IQGSAADIAKSAMCSIQQSTSSRLILQMHDELIYEVPVNNKQDFIVILKKSMENTVRLNV
PLPVKIKCGQTWGTMEDVK
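
The figures in this record can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original annotation: the helper names `parse_position` and `expected_protein_length` are hypothetical, and it assumes the genomic coordinates are 1-based and inclusive and that the CDS length counts the terminal stop codon.

```python
import re


def parse_position(pos: str):
    """Split a string like 'scaffold31:+ 276416-283424' into its parts."""
    m = re.fullmatch(r"(\S+):([+-])\s*(\d+)-(\d+)", pos.strip())
    if m is None:
        raise ValueError(f"unrecognized position: {pos!r}")
    scaffold, strand, start, end = m.groups()
    return scaffold, strand, int(start), int(end)


def expected_protein_length(cds_length: int) -> int:
    """Residue count implied by a CDS length that includes the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # the stop codon encodes no residue


# Values copied from the record above.
scaffold, strand, start, end = parse_position("scaffold31:+ 276416-283424")
genomic_span = end - start + 1  # assumes 1-based, inclusive coordinates
protein_length = expected_protein_length(3840)

print(scaffold, strand, genomic_span, protein_length)
```

Under these assumptions the locus spans 7009 bp for a 3840 bp CDS, so part of the span is non-coding, and the implied protein length of 1279 residues matches the protein sequence listed above (21 lines of 60 residues plus a final line of 19).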