New model in OGS2.0 | DPOGS211792  |
---|---|
Genomic Position | scaffold427:+ 31097-32881 |
See gene structure | |
CDS Length | 1785 |
Paired RNAseq reads   | 361 |
Single RNAseq reads   | 1100 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA004019 (4e-111) |
Best Drosophila hit   | inverted repeat-binding protein (8e-19) |
Best Human hit | X-ray repair cross-complementing protein 6 (3e-29) |
Best NR hit (blastp)   | hypothetical protein TRIADDRAFT_61626 [Trichoplax adhaerens] (3e-46) |
Best NR hit (blastx)   | human Ku70 autoantigen homologue [Xenopus laevis] (7e-44) |
GeneOntology terms    | GO:0003677 DNA binding GO:0004003 ATP-dependent DNA helicase activity GO:0005515 protein binding GO:0005958 DNA-dependent protein kinase-DNA ligase 4 complex GO:0006303 double-strand break repair via nonhomologous end joining |
InterPro families    | IPR006165 DNA helicase, ATP-dependent, Ku70 subunit IPR006164 DNA helicase, ATP-dependent, Ku type IPR005161 Ku70/Ku80, N-terminal alpha/beta IPR005160 Ku70/Ku80 C-terminal arm IPR016194 Spen Paralogue and Orthologue SPOC, C-terminal-like |
Orthology group | MCL16500 |
Nucleotide sequence:
ATGGAATCTGATGACGAATACGATGATGAACCTATGTGGAAAGGTACTCCTGGTACAATA
TTATTAATAAATATTTTTGAAAAGTCGAAATACAGTTCATCGTCAAATGTCCAAGTTGCA
ACTTGTCAATTGATAAGACAATATTTGCGATCACAGAGCTCACATTACATCAGCGTATGC
ATTTATGGAACCGAAGAATCAAATACATCAACATTTGATGTGAAACCTGTTGTACAATTA
TTCCCCTTATCTCCTCCGTCCTTAGAAAACTATGAGAAACTCAAAAACACCAATATATGT
AATCTTGTTGAAGCTAAAGAGTTTAACTTGTCTGATGTATTATGGCATTGCAGTAAAATG
TTCAATAGCTGTAAAAAGAAATTATCGTCTCGAACGGTGGTGATGTTAACACGCCTTGAT
GTCCCTCCTGCTGCTTCAGATAAAGATCAAACACTCAACCGGGCTATTGATTTAGTAGAC
TCGAACATTGATATTAAAATCATAAATATCTCGGAAACTGAATATGTCATTGACAAATAT
TATGAAAAATTATTAAAGATAGCAAATAGGGGTAGTGATTGTATTCCACCAAAACCTGTT
TGGAACATCAATGATATTGAAAAAATTGTCTTCCAGGAGACTCACAGAAACATAGCCATT
GCAAAAATAAATTTTGAAATAGGTGATACTTTTAATATTGGTGTCAGTGTTTACACTCTG
TTGAAAAAAGCTGGTCAAAATAATAAGAAAAACATTAATTTGGATAGAGAAAGTAATGCA
ATTGTAACAAGCATTAAAAACACATTAAAAGTTTCAAATGATGTCATTGATGAAGATAGT
CAAGATAGATCCAAAAGGAAAGTGCCTCTGTTGAAGTCAGAACTACTACATTATCAAGAA
TTTGGGGGAGAAAGAGTCGAATTTACAGATGAAGAAATGAAAATGATCAAAAATCCATTT
GGACCTCCAATGATGAAACTCCTTGGTTTCAAGCCAGCCAGTATCATTTGTAAGGAAAAG
TGGTATTTTAAAATTGGACAATTTTTATATCCAAATGAAAGTATTATAGAAGGTTCCACA
GTCGCTTTCAAAGCTTTACATGAAGCTTGCACTGTTATGAAAATGGTAGCACTTTGTATT
TTGTGTACTAGAGTCAATTCTAGACCCGTAATAGTTGCGCTGAGTCCTTGTGTGAAACCT
CTCAATCTTAACATTGATATTGGTTTTGACATTGTTAATATACCATTTGTTGAACATGTA
AGGGAACTTAATGTCGAAGAGGATGTTATCGAAGATGAGAGTCTAGTTGTAGAAAGTGCA
CACAAAGAGCTGATGAAGGGTATAATAAATAATACTATAATAGATTACCGACCCGATATG
TTTGAGGATCCCAAATTGCAATCTAAATACAGAGCGATTGAGGCTCTAGCATTGGACGAA
GATGAGACTGAACCTTTTGTAGATACAACCAAACCTAGCATTGAAAGATTTCAAAACTTA
CCAGACGATCTATTTGAGGAACTATTTGGACCCTTTGCATCTATGACTTTGAAGAGATCA
TGTCCTAAAGTGCCATCTCAACAGAACAAGAAGCCAAAAATTGAAAATTTTGATGAAGAA
CTTTTTAATACTAAATTGAAAGAAAAAAAGATTGAGTCATATACTGTGCCACAGTTAAAA
AACATATTAAAATATAAAAATATTCAGAATCTTCCAGCGTTAAATGGCTTAAAAAAGGCT
GAGCTTGTTAATTTAGTTTACACACATTGTGATGAAGAGAAATAA
Protein sequence:
MESDDEYDDEPMWKGTPGTILLINIFEKSKYSSSSNVQVATCQLIRQYLRSQSSHYISVC
IYGTEESNTSTFDVKPVVQLFPLSPPSLENYEKLKNTNICNLVEAKEFNLSDVLWHCSKM
FNSCKKKLSSRTVVMLTRLDVPPAASDKDQTLNRAIDLVDSNIDIKIINISETEYVIDKY
YEKLLKIANRGSDCIPPKPVWNINDIEKIVFQETHRNIAIAKINFEIGDTFNIGVSVYTL
LKKAGQNNKKNINLDRESNAIVTSIKNTLKVSNDVIDEDSQDRSKRKVPLLKSELLHYQE
FGGERVEFTDEEMKMIKNPFGPPMMKLLGFKPASIICKEKWYFKIGQFLYPNESIIEGST
VAFKALHEACTVMKMVALCILCTRVNSRPVIVALSPCVKPLNLNIDIGFDIVNIPFVEHV
RELNVEEDVIEDESLVVESAHKELMKGIINNTIIDYRPDMFEDPKLQSKYRAIEALALDE
DETEPFVDTTKPSIERFQNLPDDLFEELFGPFASMTLKRSCPKVPSQQNKKPKIENFDEE
LFNTKLKEKKIESYTVPQLKNILKYKNIQNLPALNGLKKAELVNLVYTHCDEEK