New model in OGS2.0 | DPOGS215098  |
---|---|
Genomic Position | scaffold2335:+ 412-11647 |
See gene structure | |
CDS Length | 3195 |
Paired RNAseq reads   | 376 |
Single RNAseq reads   | 1019 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA009588 (4e-56) |
Best Drosophila hit   | ND |
Best Human hit | serine/threonine-protein kinase SMG1 (1e-06) |
Best NR hit (blastp)   | DNA-dependent protein kinase subunit [Polysphondylium pallidum PN500] (5e-11) |
Best NR hit (blastx)   | PREDICTED: similar to Serine-protein kinase ATM (Ataxia telangiectasia mutated homolog) (A-T, mutated homolog) [Apis mellifera] (2e-10) |
GeneOntology terms    | GO:0006975 DNA damage induced protein phosphorylation GO:0004672 protein kinase activity GO:0005524 ATP binding GO:0005958 DNA-dependent protein kinase-DNA ligase 4 complex GO:0006302 double-strand break repair GO:0006310 DNA recombination GO:0006468 protein amino acid phosphorylation GO:0006974 response to DNA damage stimulus GO:0003677 DNA binding GO:0004677 DNA-dependent protein kinase activity GO:0005634 nucleus GO:0006303 double-strand break repair via nonhomologous end joining GO:0016301 kinase activity |
InterPro families    | IPR011009 Protein kinase-like domain IPR000403 Phosphatidylinositol 3-/4-kinase, catalytic IPR003152 PIK-related kinase, FATC |
Orthology group | MCL22687 |
Nucleotide sequence:
ATGTCGCATCGTATGACGTCACATCCTGTAACAGTGCAGAGGTCGTCCGAGGAGCCGAGT
ATAAAGCGCCCCGCCCGGCCGCAGCTTCCGGCGCACAGCACCCTGCAAGGACTCGCCCGG
GGGCTCGCTGCAGGGCTCCGGGGACTTCGCGGCCTCAGCTCGGACTTAGCGCTCAGACTG
TACGAGTGTCTGTTGTTGTCGCTGGTGGTCCCACTCGCTGACGGTCTCAGGGTGTCACTG
GCGGCGGCGGTGGAGCTCTTCCTGCAGGACATCCGGGAACAGGATACGTTCAGAAACGCC
AAATTAGATGTTATTGACATAAGCGCTCCACCGACCACAGTAACCGGAAGTCTGGCCAGT
ACAGATTCAAATAAAGATCTTGAGCAACATATAAGTTTAGAGGACGTCGTAGAGAACATA
CTGCAATTTGCCCGAACGGACGATGATTTCTGTACGTCCCTGATAACCCAATTCATACGT
CAGTGTGGTAAAATTAGTCCCGGGTTCGATTTCCGCAGCGCCATCTGTTCAGATGTAATG
AATTGCATCAAAATAAGTCCAGTAACCGGCAGTGTCATTGAAATTGCCAGGCAGATTGAT
ATTTTGCAGGACATCGACTTATATAAATTAACTGACTTAGTTAACTCGTCCCTGGGCTCT
GAAGGCGAGGGTGTGTGTCGACTTCTATATGAAGACGTGCTTCTGAGAAAGGGAGAGGAA
ATATTTGTTTTGGGACAAAATAAGCCCTGGGATGAAGAAATGTCCGTGGTCAATAACAAA
TTCTCAATCGAAGACGCGCCATTCTGGACGAGGTGTTTAGTGTCTAAATACAGGGAGCAA
GACATACACACATACCAAGTATTAAAAGAACTGGTCCGATGGCCGAAGAGGCAGTTCGAC
GTCGCCGCAGTACTAGGCTCCCAGCAGTGGGAGGTGACGCAAGACGATATGGACAATGAC
TGCCTCTCCAAGTGGGCCTTCAGGGCTATGATGGGATCACTGTGCAGTAACAGTTCCAAG
GGTCCGTCTCGCTCACAGGTGTCGTGGATGACGAGAGCCAATCACATGAAGATGTACAGC
CAGGTGTTGAGATGCTGCGAGGGGATCGACACAAATGATGATAAAATATCGCTAGACATG
AGACACCAGGAACTGCTGGCCCTTAGGGGGATGGCGTTACAATCAAACGATAGGCAGGCT
CTGGAGAATATTTTGGAGAGAACAGAGTCTATGATACAATCTGGAAGTGAGCACAGTTAC
GGAGATATGTTGCGCGTATACGAACTGACATTGCGTCTGAGACGAGATTTGAACTCCGTG
GAGCGAGTTAACGTGGAAGCGATCGTGAAACACGTGATGGGCGACGTTAGAAACGTTGAA
AACGACGTCACCAGGAACCTCGACACGTTGTGTTTGCTGGGAATAACTTGTCTGGAGACC
ATGTTTGAGAACAGCACAGATATCGACGAACGTTCTTCACTGATGCTGTCAATCTGTGAG
ACGATCTCCTATCAAAGCTCTCCATCAACAGACGTTGTTCTTGACAAATTGGACTCCTTC
GACCGGATCCTGGATGAGAATGTCTCCCAGAGGATACTAAACACGAGAGATATGTTCAGA
CAACGAAAATACACCAACACAAGTCAAGACACGATGGCTCAGTTTACCAAGCGACTGGAG
CGTTATGTACCGGAGGTCCCGGGGTCCAGACCCGTTGGTCTGCAACCCCTCCTGAGCACA
ATGGACCAGTATTCCGTGGACCAGTTGCTGTCCAAGCGCGAGCTGTTCTCTCCTCACCCC
GCGCTGCTCGAGCTCCATCAAAACCAGCTCAGCTCGGACAAATATAAGCAGTGTCTGAGC
TTAGTCTCTGACCCTGTGTATTTGCTGAAACAATACGTAGGAATGATGCTAGCAGCCGTG
CAAACAGACGATGTAACGAAATACAAACAGATATACAAAAATATGAGACAGAGGATTTTT
GAGAATCCGTACGTCGGCGCCGATTACATCGTCCTCAACAAGTACAGTAAGCAACTGGAA
GGCTGTGACGATTTTGAGACGGACGTACACACGTTGCAGCGGCTATTGAAAGACATACAC
GCAGACTTACAATCCAGTAAGAGCCGTCTCTCTCTAACAGACATCTGCCCGACTCTATTA
GAGGAGAAACAGAGTCGGGCACTCGACAGACTGCTGGCTCTCAAGGACGGAGTCCACTTC
ATTAAATTCATGGAAAATGTGTCCGTATACCGTGACGCCGTGACACGGCCGGTGTTACTG
AGCTATTTGTCGTCAGACGGATTCACACGTCGTTGTATAGTGAAGACAGAGGGTAAGGGA
CATGCGGCTGCGGTCAGGGTCCGGGGGGCCTTGGAGAAGGCCTGGGAGATCCCACGGGGA
TATAAGGTGACGCCGCTCACCTCCGACTGTCTTCTGATCGAGTACGTGGAGAATAACACG
CGACTCCGGGACATGGTGGACACCGGCGGTGGGGATGCCGGCGTCACTCGGACCGCTGAC
GAGAATCTCATACTAAACGTACCTCAAGCTATTTCCCAGTTGGAGTCCCTAGCAAAGAGC
GTGCCTGCCACTTCACTACGGTCATCCATTGAGTCGGGGTGTCTCACCTTGGAAGAGTTC
ATAAGGAAGAAGACGGCCTTCACCGAGTCCCTGGGTCATATGACAGCATTCAGCTTTATA
TGTGGTCTGTCAGACCGTCATTTACAGAACATCCTGTATGACCCGGTCCGAGGCACCGTC
TGCGCTGTGGACTGCGGAGCCCTACAGCCCCAGGAGATACCGCCCGCTAGACTCACGAGG
AACCTGCTGGCGGTCTGTCGCACCGACGTTCTCGAAGCTCGACTCCAAAGAATGTTGTCC
AGACTACGGGAATATCAAGGAATAATACTCCCAGCAGTAAATATATCGCTCAAGAGATCT
GGACACCTGGATAAGCTACCCGCCATCCGCGGCAAAATCCAGGGCCGCCTTCTCCAGCAC
CAGGTCACTAAGGAGTGGATACAAAGGTCAGAGGTCAAATACAAAGAGAAATATCTGGAG
CTGCTGGACGAGATCTTCGGCACAGATGACAAATGTTCATACACCGTCGAGGAACAGGTA
TCAAATCTGTTGCTCCAATCTACCGATCCAAGGATCCTCAGCGTGACCAGGTCAGGATGG
GAGCCCTGGATATAA
Protein sequence:
MSHRMTSHPVTVQRSSEEPSIKRPARPQLPAHSTLQGLARGLAAGLRGLRGLSSDLALRL
YECLLLSLVVPLADGLRVSLAAAVELFLQDIREQDTFRNAKLDVIDISAPPTTVTGSLAS
TDSNKDLEQHISLEDVVENILQFARTDDDFCTSLITQFIRQCGKISPGFDFRSAICSDVM
NCIKISPVTGSVIEIARQIDILQDIDLYKLTDLVNSSLGSEGEGVCRLLYEDVLLRKGEE
IFVLGQNKPWDEEMSVVNNKFSIEDAPFWTRCLVSKYREQDIHTYQVLKELVRWPKRQFD
VAAVLGSQQWEVTQDDMDNDCLSKWAFRAMMGSLCSNSSKGPSRSQVSWMTRANHMKMYS
QVLRCCEGIDTNDDKISLDMRHQELLALRGMALQSNDRQALENILERTESMIQSGSEHSY
GDMLRVYELTLRLRRDLNSVERVNVEAIVKHVMGDVRNVENDVTRNLDTLCLLGITCLET
MFENSTDIDERSSLMLSICETISYQSSPSTDVVLDKLDSFDRILDENVSQRILNTRDMFR
QRKYTNTSQDTMAQFTKRLERYVPEVPGSRPVGLQPLLSTMDQYSVDQLLSKRELFSPHP
ALLELHQNQLSSDKYKQCLSLVSDPVYLLKQYVGMMLAAVQTDDVTKYKQIYKNMRQRIF
ENPYVGADYIVLNKYSKQLEGCDDFETDVHTLQRLLKDIHADLQSSKSRLSLTDICPTLL
EEKQSRALDRLLALKDGVHFIKFMENVSVYRDAVTRPVLLSYLSSDGFTRRCIVKTEGKG
HAAAVRVRGALEKAWEIPRGYKVTPLTSDCLLIEYVENNTRLRDMVDTGGGDAGVTRTAD
ENLILNVPQAISQLESLAKSVPATSLRSSIESGCLTLEEFIRKKTAFTESLGHMTAFSFI
CGLSDRHLQNILYDPVRGTVCAVDCGALQPQEIPPARLTRNLLAVCRTDVLEARLQRMLS
RLREYQGIILPAVNISLKRSGHLDKLPAIRGKIQGRLLQHQVTKEWIQRSEVKYKEKYLE
LLDEIFGTDDKCSYTVEEQVSNLLLQSTDPRILSVTRSGWEPWI